Velocity field measurements in oblique static divergent vocal fold models
NASA Astrophysics Data System (ADS)
Erath, Byron
2005-11-01
During normal phonation, the vocal fold cycle is characterized by the glottal opening transitioning from a convergent to a divergent passage and then closing before the cycle is repeated. Under ordinary phonatory conditions, both vocal folds, which form the glottal passage, move in phase with each other, creating a time-varying symmetric opening. However, pathological conditions, such as unilateral paralysis and polyps, can result in geometrical asymmetries between the vocal folds throughout the phonatory cycle. This study investigates pulsatile flow fields through 7.5 times life-size vocal fold models with included divergence angles of 5 to 30 degrees, and obliquities between the vocal folds of up to 15 degrees. Flow conditions were scaled to match physiological parameters. Data were taken at the anterior-posterior mid-plane using phase-averaged Particle Image Velocimetry (PIV). Viscous flow phenomena including the Coanda effect, flow separation points, and jet "flapping" were investigated. The results are compared to previously reported work on flow through symmetric divergent vocal fold models.
Vocal fold contact patterns based on normal modes of vibration.
Smith, Simeon L; Titze, Ingo R
2018-05-17
The fluid-structure interaction and energy transfer from respiratory airflow to self-sustained vocal fold oscillation continue to be topics of interest in vocal fold research. Vocal fold vibration is driven by pressures on the vocal fold surface, which are determined by the shape of the glottis and the contact between vocal folds. Characterization of three-dimensional glottal shapes and contact patterns can lead to increased understanding of normal and abnormal physiology of the voice, as well as to development of improved vocal fold models, but a large inventory of shapes has not been directly studied previously. This study aimed to take an initial step toward characterizing vocal fold contact patterns systematically. Vocal fold motion and contact were modeled based on normal mode vibration, as it has been shown that vocal fold vibration can be almost entirely described by only the few lowest order vibrational modes. Symmetric and asymmetric combinations of the four lowest normal modes of vibration were superimposed on left and right vocal fold medial surfaces, for each of three prephonatory glottal configurations, according to a surface wave approach. Contact patterns were generated from the interaction of modal shapes at 16 normalized phases during the vibratory cycle. Eight major contact patterns were identified and characterized by the shape of the flow channel, with the following descriptors assigned: convergent, divergent, convergent-divergent, uniform, split, merged, island, and multichannel. Each contact pattern and its variations are described, and future work and applications are discussed. Copyright © 2018 Elsevier Ltd. All rights reserved.
Integrating perspectives on vocal performance and consistency
Sakata, Jon T.; Vehrencamp, Sandra L.
2012-01-01
Recent experiments in divergent fields of birdsong have revealed that vocal performance is important for reproductive success and under active control by distinct neural circuits. Vocal consistency, the degree to which the spectral properties (e.g. dominant or fundamental frequency) of song elements are produced consistently from rendition to rendition, has been highlighted as a biologically important aspect of vocal performance. Here, we synthesize functional, developmental and mechanistic (neurophysiological) perspectives to generate an integrated understanding of this facet of vocal performance. Behavioral studies in the field and laboratory have found that vocal consistency is affected by social context, season and development, and, moreover, positively correlated with reproductive success. Mechanistic investigations have revealed a contribution of forebrain and basal ganglia circuits and sex steroid hormones to the control of vocal consistency. Across behavioral, developmental and mechanistic studies, a convergent theme regarding the importance of vocal practice in juvenile and adult songbirds emerges, providing a basis for linking these levels of analysis. By understanding vocal consistency at these levels, we gain an appreciation for the various dimensions of song control and plasticity and argue that genes regulating the function of basal ganglia circuits and sex steroid hormones could be sculpted by sexual selection. PMID:22189763
Central pattern generators for social vocalization: Androgen-dependent neurophysiological mechanisms
Bass, Andrew H.; Remage-Healey, Luke
2008-01-01
Historically, most studies of vertebrate central pattern generators (CPGs) have focused on mechanisms for locomotion and respiration. Here, we highlight new results for ectothermic vertebrates, namely teleost fish and amphibians, showing how androgenic steroids can influence the temporal patterning of CPGs for social vocalization. Investigations of vocalizing teleosts show how androgens can rapidly (within minutes) modulate the neurophysiological output of the vocal CPG (fictive vocalizations that mimic the temporal properties of natural vocalizations) inclusive of their divergent actions between species, as well as intraspecific differences between male reproductive morphs. Studies of anuran amphibians (frogs) demonstrate that long-term steroid treatments (wks) can masculinize the fictive vocalizations of females, inclusive of its sensitivity to rapid modulation by serotonin. Given the conserved organization of vocal control systems across vertebrate groups, the vocal CPGs of fish and amphibians provide tractable models for identifying androgen-dependent events that are fundamental to the mechanisms of vocal motor patterning. These basic mechanisms can also inform our understanding of the more complex CPGs for vocalization, and social behaviors in general, that have evolved among birds and mammals. PMID:18262186
Divergent morphological and acoustic traits in sympatric communities of Asian barbets
Tamma, Krishnapriya
2016-01-01
The opposing effects of environmental filtering and competitive interactions may influence community assembly and coexistence of related species. Competition, both in the domain of ecological resources, and in the sensory domain (for example, acoustic interference) may also result in sympatric species evolving divergent traits and niches. Delineating these scenarios within communities requires understanding trait distributions and phylogenetic structure within the community, as well as patterns of trait evolution. We report that sympatric assemblages of Asian barbets (frugivorous canopy birds) consist of a random phylogenetic sample of species, but are divergent in both morphological and acoustic traits. Additionally, we find that morphology is more divergent than expected under Brownian evolution, whereas vocal frequency evolution is close to the pattern expected under Brownian motion (i.e. a random walk). Together, these patterns are consistent with a role for competition or competitive exclusion in driving community assembly. Phylogenetic patterns of morphological divergence between related species suggest that these traits are key in species coexistence. Because vocal frequency and size are correlated in barbets, we therefore hypothesize that frequency differences between sympatric barbets are a by-product of their divergent morphologies. PMID:27853589
Tchernichovski, Ofer; Marcus, Gary
2014-01-01
Studies of vocal learning in songbirds typically focus on the acquisition of sensory templates for song imitation and on the consequent process of matching song production to templates. However, functional vocal development also requires the capacity to adaptively diverge from sensory templates, and to flexibly assemble vocal units. Examples of adaptive divergence include the corrective imitation of abnormal songs, and the decreased tendency to copy overabundant syllables. Such frequency-dependent effects might mirror tradeoffs between the assimilation of group identity (culture) while establishing individual and flexibly expressive songs. Intriguingly, although the requirements for vocal plasticity vary across songbirds, and more so between birdsong and language, the capacity to flexibly assemble vocal sounds develops in a similar, stepwise manner across species. Therefore, universal features of vocal learning go well beyond the capacity to imitate. PMID:25005823
Flow-structure interaction simulation of voice production in a canine larynx
NASA Astrophysics Data System (ADS)
Jiang, Weili; Zheng, Xudong; Xue, Qian; Oren, Liran; Khosla, Sid
2017-11-01
Experimental measurements conducted on a hemi-larynx canine vocal fold showed that negative pressures formed in the glottis near the superior surface of the vocal fold in the closing phase even without a supra-glottal vocal tract. It was hypothesized that such negative pressures were due to intraglottal vortices caused by flow separation in a divergent vocal tract during the vocal fold closing phase. This work aims to test this hypothesis numerically. Flow-structure interaction simulations are performed in realistic canine laryngeal shapes. In the simulations, a sharp-interface immersed-boundary-method-based incompressible flow solver is utilized to model the air flow; a finite-element-based solid mechanics solver is utilized to model the vocal fold vibration. The geometric structure of the vocal fold and vocal tract is based on MRI scans of a mongrel canine. The vocal fold tissue is modeled as a transversely isotropic nonlinear material with a vertical stiffness gradient. Numerical indentation is first performed and compared with the experimental data to obtain the material properties. The inlet and outlet pressure settings follow the setup in the experiment. Simulation results, including the fundamental frequency, air flow rate, and divergent angle, will be compared with the experimental data to validate the simulation approach. The relationship between flow separation, intra-glottal vortices, divergent angle and flow rate will be comprehensively analyzed.
NASA Astrophysics Data System (ADS)
Erath, Byron D.; Plesniak, Michael W.
2005-09-01
In speech, sound production arises from fluid-structure interactions within the larynx as well as viscous flow phenomena that are most likely to occur during the divergent orientation of the vocal folds. Of particular interest are the flow mechanisms that influence the location of flow separation points on the vocal fold walls. Physiologically scaled pulsatile flow fields in 7.5 times real size static divergent glottal models were investigated. Three divergence angles were investigated using phase-averaged particle image velocimetry (PIV). The pulsatile glottal jet exhibited a bi-modal stability toward both glottal walls, although there was a significant amount of variance in the angle the jet deflected from the midline. The attachment of the Coanda effect to the glottal model walls occurred when the pulsatile velocity was a maximum, and the acceleration of the waveform was zero. The location of the separation and reattachment points of the flow from the glottal models was a function of the velocity waveform and divergence angle. Acoustic analogies show that a dipole sound source contribution arising from the fluid interaction (Coanda jet) with the vocal fold walls is expected. [Work funded by NIH Grant RO1 DC03577.]
Webster, Paula J.; Skipper-Kallal, Laura M.; Frum, Chris A.; Still, Hayley N.; Ward, B. Douglas; Lewis, James W.
2017-01-01
A major gap in our understanding of natural sound processing is knowledge of where or how in a cortical hierarchy differential processing leads to categorical perception at a semantic level. Here, using functional magnetic resonance imaging (fMRI) we sought to determine if and where cortical pathways in humans might diverge for processing action sounds vs. vocalizations as distinct acoustic-semantic categories of real-world sound when matched for duration and intensity. This was tested by using relatively less semantically complex natural sounds produced by non-conspecific animals rather than humans. Our results revealed a striking double-dissociation of activated networks bilaterally. This included a previously well described pathway preferential for processing vocalization signals directed laterally from functionally defined primary auditory cortices to the anterior superior temporal gyri, and a less well-described pathway preferential for processing animal action sounds directed medially to the posterior insulae. We additionally found that some of these regions and associated cortical networks showed parametric sensitivity to high-order quantifiable acoustic signal attributes and/or to perceptual features of the natural stimuli, such as the degree of perceived recognition or intentional understanding. Overall, these results supported a neurobiological theoretical framework for how the mammalian brain may be fundamentally organized to process acoustically and acoustic-semantically distinct categories of ethologically valid, real-world sounds. PMID:28111538
Titze, Ingo R
2014-04-01
The origin of vocal registers has generally been attributed to differential activation of cricothyroid and thyroarytenoid muscles in the larynx. Register shifts, however, have also been shown to be affected by glottal pressures exerted on vocal fold surfaces, which can change with loudness, pitch, and vowel. Here it is shown computationally and with empirical data that intraglottal pressures can change abruptly when glottal adductory geometry is changed relatively smoothly from convergent to divergent. An intermediate shape between large convergence and large divergence, namely, a nearly rectangular glottal shape with almost parallel vocal fold surfaces, is associated with mixed registration. It can be less stable than either of the highly angular shapes unless transglottal pressure is reduced and upper stiffness of vocal fold tissues is balanced with lower stiffness. This intermediate state of adduction is desirable because it leads to a low phonation threshold pressure with moderate vocal fold collision. Achieving mixed registration consistently across wide ranges of F0, lung pressure, and vocal tract shapes appears to be a balancing act of coordinating laryngeal muscle activation with vocal tract pressures. Surprisingly, a large transglottal pressure is not facilitative in this process, exacerbating the bi-stable condition and the associated register contrast.
Bi-stable vocal fold adduction: A mechanism of modal-falsetto register shifts and mixed registration
Titze, Ingo R.
2014-01-01
The origin of vocal registers has generally been attributed to differential activation of cricothyroid and thyroarytenoid muscles in the larynx. Register shifts, however, have also been shown to be affected by glottal pressures exerted on vocal fold surfaces, which can change with loudness, pitch, and vowel. Here it is shown computationally and with empirical data that intraglottal pressures can change abruptly when glottal adductory geometry is changed relatively smoothly from convergent to divergent. An intermediate shape between large convergence and large divergence, namely, a nearly rectangular glottal shape with almost parallel vocal fold surfaces, is associated with mixed registration. It can be less stable than either of the highly angular shapes unless transglottal pressure is reduced and upper stiffness of vocal fold tissues is balanced with lower stiffness. This intermediate state of adduction is desirable because it leads to a low phonation threshold pressure with moderate vocal fold collision. Achieving mixed registration consistently across wide ranges of F0, lung pressure, and vocal tract shapes appears to be a balancing act of coordinating laryngeal muscle activation with vocal tract pressures. Surprisingly, a large transglottal pressure is not facilitative in this process, exacerbating the bi-stable condition and the associated register contrast. PMID:25235006
Song evolution, speciation, and vocal learning in passerine birds.
Mason, Nicholas A; Burns, Kevin J; Tobias, Joseph A; Claramunt, Santiago; Seddon, Nathalie; Derryberry, Elizabeth P
2017-03-01
Phenotypic divergence can promote reproductive isolation and speciation, suggesting a possible link between rates of phenotypic evolution and the tempo of speciation at multiple evolutionary scales. To date, most macroevolutionary studies of diversification have focused on morphological traits, whereas behavioral traits, including vocal signals, are rarely considered. Thus, although behavioral traits often mediate mate choice and gene flow, we have a limited understanding of how behavioral evolution contributes to diversification. Furthermore, the developmental mode by which behavioral traits are acquired may affect rates of behavioral evolution, although this hypothesis is seldom tested in a phylogenetic framework. Here, we examine evidence for rate shifts in vocal evolution and speciation across two major radiations of codistributed passerines: one oscine clade with learned songs (Thraupidae) and one suboscine clade with innate songs (Furnariidae). We find that evolutionary bursts in rates of speciation and song evolution are coincident in both thraupids and furnariids. Further, overall rates of vocal evolution are higher among taxa with learned rather than innate songs. Taken together, these findings suggest an association between macroevolutionary bursts in speciation and vocal evolution, and that the tempo of behavioral evolution can be influenced by variation in developmental modes among lineages. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.
Patterns and causes of geographic variation in bat echolocation pulses.
Jiang, Tinglei; Wu, Hui; Feng, Jiang
2015-05-01
Evolutionary biologists have a long-standing interest in how acoustic signals in animals vary geographically, because divergent ecology and sensory perception play an important role in speciation. Geographic comparisons are valuable in determining the factors that influence divergence of acoustic signals. Bats are social mammals and they depend mainly on echolocation pulses to locate prey, to navigate and to communicate. Mounting evidence shows that geographic variation of bat echolocation pulses is common, with mean differences of 5-10 kHz in peak frequency, and a high level of individual variation may be nested in this geographical variation. However, understanding the geographic variation of echolocation pulses in bats is very difficult, because of differences in sampling and statistical analysis techniques as well as the variety of factors shaping the vocal geographic evolution. Geographic differences in echolocation pulses of bats generally lack latitudinal, longitudinal and elevational patterns, and little is known about vocal dialects. Evidence is accumulating that geographic variation in echolocation pulses of bats may be caused by genetic drift, cultural drift, ecological selection, sexual selection and social selection. Future studies could relate geographic differences in echolocation pulses to social adaptation, vocal learning strategies and patterns of dispersal. In addition, new statistical techniques and acoustic playback experiments may help to illustrate the causes and consequences of the geographic evolution of echolocation pulses in bats. © 2015 International Society of Zoological Sciences, Institute of Zoology/Chinese Academy of Sciences and Wiley Publishing Asia Pty Ltd.
Arterbery, Adam S; Deitcher, David L; Bass, Andrew H
2010-05-15
The vocalizing midshipman fish, Porichthys notatus, has two male morphs that exhibit alternative mating tactics. Only territorial males acoustically court females with long duration (minutes to >1h) calls, whereas sneaker males attempt to steal fertilizations. During the breeding season, morph-specific tactics are paralleled by a divergence in relative testis and vocal muscle size, plasma levels of the androgen 11-ketotestosterone (11KT) and the glucocorticoid cortisol, and mRNA expression levels in the central nervous system (CNS) of the steroid-synthesizing enzyme aromatase (estrogen synthase). Here, we tested the hypothesis that the midshipman's two male morphs would further differ in the CNS, as well as in the testis and vocal muscle, in mRNA abundance for the enzymes 11beta-hydroxylase (11betaH) and 11beta-hydroxysteroid dehydrogenase (11betaHSD) that directly regulate both 11KT and cortisol synthesis. Quantitative real-time PCR demonstrated male morph-specific profiles for both enzymes. Territorial males had higher 11betaH and 11betaHSD mRNA levels in testis and vocal muscle. By contrast, sneaker males had the higher CNS expression, especially for 11betaHSD, in the region containing an expansive vocal pacemaker circuit that directly determines the temporal attributes of natural calls. We propose for territorial males that higher enzyme expression in testis underlies its greater plasma 11KT levels, which in vocal muscle provides both gluconeogenic and androgenic support for its long duration calling. We further propose for sneaker males that higher enzyme expression in the vocal CNS contributes to known cortisol-specific effects on its vocal physiology. Copyright 2010 Elsevier Inc. All rights reserved.
Chen, Shiang-Fan; Jones, Gareth; Rossiter, Stephen J.
2009-01-01
The origin and maintenance of intraspecific variation in vocal signals is important for population divergence and speciation. Where vocalizations are transmitted by vertical cultural inheritance, similarity will reflect co-ancestry, and thus vocal divergence should reflect genetic structure. Horseshoe bats are characterized by echolocation calls dominated by a constant frequency component that is partly determined by maternal imprinting. Although previous studies showed that constant frequency calls are also influenced by some non-genetic factors, it is not known how frequency relates to genetic structure. To test this, we related constant frequency variation to genetic and non-genetic variables in the Formosan lesser horseshoe bat (Rhinolophus monoceros). Recordings of bats from across Taiwan revealed that females called at higher frequencies than males; however, we found no effect of environmental or morphological factors on call frequency. By comparison, variation showed clear population structure, with frequencies lower in the centre and east, and higher in the north and south. Within these regions, frequency divergence was directional and correlated with geographical distance, suggesting that call frequencies are subject to cultural drift. However, microsatellite clustering analysis showed that broad differences in constant frequency among populations corresponded to discontinuities in allele frequencies resulting from vicariant events. Our results provide evidence that the processes shaping genetic subdivision have concomitant consequences for divergence in echolocation call frequency. PMID:19692399
Fergus, Daniel J.; Bass, Andrew H.
2013-01-01
Estrogens play a salient role in the development and maintenance of both male and female nervous systems and behaviors. The plainfin midshipman (Porichthys notatus), a teleost fish, has two male reproductive morphs that follow alternative mating tactics and diverge in multiple somatic, hormonal and neural traits, including the central control of morph-specific vocal behaviors. After we identified duplicate estrogen receptors (ERβ1 and ERβ2) in midshipman, we developed antibodies to localize protein expression in the central vocal-acoustic networks and saccule, the auditory division of the inner ear. As in other teleost species, ERβ1 and ERβ2 were robustly expressed in the telencephalon and hypothalamus in vocal-acoustic and other brain regions shown previously to exhibit strong expression of ERα and aromatase (estrogen synthetase, CYP19) in midshipman. Like aromatase, ERβ1 label co-localized with glial fibrillary acidic protein (GFAP) in telencephalic radial glial cells. Quantitative PCR revealed similar patterns of transcript abundance across reproductive morphs for ERβ1, ERβ2, ERα and aromatase in the forebrain and saccule. In contrast, transcript abundance for ERs and aromatase varied significantly between morphs in and around the sexually polymorphic vocal motor nucleus (VMN). Together, the results suggest that VMN is the major estrogen target within the estrogen-sensitive hindbrain vocal network that directly determines the duration, frequency and amplitude of morph-specific vocalizations. Comparable regional differences in steroid receptor abundances likely regulate morph-specific behaviors in males and females of other species exhibiting alternative reproductive tactics. PMID:23460422
Paradigms and progress in vocal fold restoration.
Ford, Charles N
2008-09-01
Science advances occur through orderly steps, puzzle-solving leaps, or divergences from the accepted disciplinary matrix that occasionally result in a revolutionary paradigm shift. Key advances must overcome bias, criticism, and rejection. Examples in biological science include use of embryonic stem cells, recognition of Helicobacter pylori in the etiology of ulcer disease, and the evolution of species. Our work in vocal fold restoration reflects these patterns. We progressed through phases of tissue replacement with fillers and biological implants, to current efforts at vocal fold regeneration through tissue engineering, and face challenges of a new "systems biology" paradigm embracing genomics and proteomics.
An agent-based model of dialect evolution in killer whales.
Filatova, Olga A; Miller, Patrick J O
2015-05-21
The killer whale is one of the few animal species with vocal dialects that arise from socially learned group-specific call repertoires. We describe a new agent-based model of killer whale populations and test a set of vocal-learning rules to assess which mechanisms may lead to the formation of dialect groupings observed in the wild. We tested a null model with genetic transmission and no learning, and ten models with learning rules that differ by template source (mother or matriline), variation type (random errors or innovations) and type of call change (no divergence from kin vs. divergence from kin). The null model without vocal learning did not produce the pattern of group-specific call repertoires we observe in nature. Learning from either mother alone or the entire matriline with calls changing by random errors produced a graded distribution of the call phenotype, without the discrete call types observed in nature. Introducing occasional innovation or random error proportional to matriline variance yielded more or less discrete and stable call types. A tendency to diverge from the calls of related matrilines provided fast divergence of loose call clusters. A pattern resembling the dialect diversity observed in the wild arose only when rules were applied in combinations and similar outputs could arise from different learning rules and their combinations. Our results emphasize the lack of information on quantitative features of wild killer whale dialects and reveal a set of testable questions that can draw insights into the cultural evolution of killer whale dialects. Copyright © 2015 Elsevier Ltd. All rights reserved.
Erath, Byron D.; Peterson, Sean D.; Zañartu, Matías; Wodicka, George R.; Stewart, Kelley C.; Plesniak, Michael W.
2013-01-01
Hirschberg [J. Acoust. Soc. Am. 134, 9-12 (2013)] presents a commentary and criticisms of the viscous flow model presented by Erath et al. [J. Acoust. Soc. Am. 130, 389–403 (2011)] that solves for the asymmetric pressure loading on the vocal fold walls. This pressure loading arises from asymmetric flow attachment to one vocal fold wall when the glottal channel forms a divergent configuration. Hirschberg proposes an alternative model for the asymmetric loading based upon inviscid flow curvature at the glottal inlet. This manuscript provides further evidence in support of the model of Erath et al. and its underlying assumptions, and demonstrates that the primary criticisms presented by Hirschberg are unwarranted. The model presented by Hirschberg is compared with the model from the original paper by Erath et al., and it is shown that each model describes different and complementary aspects of divergent glottal flows. PMID:23927090
NASA Astrophysics Data System (ADS)
Soderstrom, Ken; Alalawi, Ali
KLFromRecordingDays allows measurement of Kullback-Leibler (KL) distances between 2D probability distributions of vocal acoustic features. Greater KL distance measures reflect increased phonological divergence across the vocalizations compared. The software has been used to compare *.wav file recordings made by Sound Analysis Recorder 2011 of songbird vocalizations pre- and post-drug and surgical manipulations. Recordings from individual animals in *.wav format are first organized into subdirectories by recording day and then segmented into individual syllables uttered and acoustic features of these syllables using Sound Analysis Pro 2011 (SAP). KLFromRecordingDays uses syllable acoustic feature data output by SAP to a MySQL table to generate and compare "template" (typically pre-treatment) and "target" (typically post-treatment) probability distributions. These distributions are a series of virtual 2D plots of the duration of each syllable (as x-axis) to each of 13 other acoustic features measured by SAP for that syllable (as y-axes). Differences between "template" and "target" probability distributions for each acoustic feature are determined by calculating KL distance, a measure of divergence of the target 2D distribution pattern from that of the template. KL distances and the mean KL distance across all acoustic features are calculated for each recording day and output to an Excel spreadsheet. Resulting data for individual subjects may then be pooled across treatment groups and graphically summarized and used for statistical comparisons. Because SAP-generated MySQL files are accessed directly, data limits associated with spreadsheet output are avoided, and the totality of vocal output over weeks may be objectively analyzed all at once. The software has been useful for measuring drug effects on songbird vocalizations and assessing recovery from damage to regions of vocal motor cortex. It may be useful in studies employing other species, and as part of speech therapies tracking progress in producing distinct speech sounds in isolation.
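The KL distance between a "template" and a "target" 2D distribution, as described in the abstract above, can be illustrated with a minimal sketch. This is not the KLFromRecordingDays implementation: the function names `histogram_2d` and `kl_distance_2d`, the bin count, the additive smoothing constant `eps`, and the direction of the divergence (target relative to template) are all assumptions made for illustration, since the abstract does not specify them.

```python
import math
from collections import Counter


def histogram_2d(points, x_range, y_range, bins):
    """Count (x, y) points on a bins-by-bins grid over the given ranges."""
    (x_lo, x_hi), (y_lo, y_hi) = x_range, y_range
    counts = Counter()
    for x, y in points:
        # Clamp so that values on the upper edge fall into the last bin.
        i = min(int((x - x_lo) / (x_hi - x_lo) * bins), bins - 1)
        j = min(int((y - y_lo) / (y_hi - y_lo) * bins), bins - 1)
        counts[(i, j)] += 1
    return counts


def kl_distance_2d(template, target, bins=10, eps=1e-6):
    """KL divergence of the target 2D histogram from the template histogram.

    template, target: sequences of (duration, feature) pairs, one per syllable.
    bins and eps are illustrative choices, not values from the software.
    Assumes the pooled data span a non-zero range on both axes.
    """
    pooled = list(template) + list(target)
    xs = [p[0] for p in pooled]
    ys = [p[1] for p in pooled]
    # Shared ranges so both histograms are defined on the same grid.
    x_range = (min(xs), max(xs))
    y_range = (min(ys), max(ys))
    t_counts = histogram_2d(template, x_range, y_range, bins)
    g_counts = histogram_2d(target, x_range, y_range, bins)
    cells = bins * bins
    t_total = len(template) + eps * cells
    g_total = len(target) + eps * cells
    kl = 0.0
    for i in range(bins):
        for j in range(bins):
            # Additive smoothing keeps empty cells from producing log(0).
            p = (g_counts[(i, j)] + eps) / g_total
            q = (t_counts[(i, j)] + eps) / t_total
            kl += p * math.log(p / q)
    return kl
```

In this sketch, identical template and target data give a distance of zero, and the distance grows as the target distribution moves away from the template. Averaging the result over the 13 duration-versus-feature pairings would correspond to the mean KL distance mentioned in the abstract.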
Computational Modeling of Fluid–Structure–Acoustics Interaction during Voice Production
Jiang, Weili; Zheng, Xudong; Xue, Qian
2017-01-01
The paper presented a three-dimensional, first-principle based fluid–structure–acoustics interaction computer model of voice production, which employed more realistic human laryngeal and vocal tract geometries. Self-sustained vibrations, the important convergent–divergent vibration pattern of the vocal folds, and entrainment of the two dominant vibratory modes were captured. Voice quality-associated parameters including the frequency, open quotient, skewness quotient, and flow rate of the glottal flow waveform were found to be well within the normal physiological ranges. The analogy between the vocal tract and a quarter-wave resonator was demonstrated. The acoustic perturbed flux and pressure inside the glottis were found to be of the same order as their incompressible counterparts, suggesting strong source–filter interactions during voice production. Such a high-fidelity computational model will be useful for investigating a variety of pathological conditions that involve complex vibrations, such as vocal fold paralysis, vocal nodules, and vocal polyps. The model is also an important step toward a patient-specific surgical planning tool that can serve as a no-risk trial and error platform for different procedures, such as injection of biomaterials and thyroplastic medialization. PMID:28243588
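The quarter-wave resonator analogy mentioned in the abstract above can be made concrete with the standard formula for a tube closed at one end and open at the other, f_n = (2n - 1) c / (4 L). The sketch below is illustrative only: the function name `quarter_wave_resonances`, the tube length of 0.175 m, and the sound speed of 350 m/s are textbook-style assumptions, not values taken from the paper.

```python
def quarter_wave_resonances(length_m, speed_of_sound=350.0, count=3):
    """Resonance frequencies (Hz) of an ideal tube closed at one end.

    Uses f_n = (2n - 1) * c / (4 * L) for n = 1..count.
    length_m and speed_of_sound are illustrative inputs, not measured data.
    """
    return [
        (2 * n - 1) * speed_of_sound / (4.0 * length_m)
        for n in range(1, count + 1)
    ]


# Assumed 17.5 cm tube: resonances fall near 500, 1500 and 2500 Hz.
resonances = quarter_wave_resonances(0.175)
```

The odd-multiple spacing of these resonances is the signature of a quarter-wave resonator; a real vocal tract departs from it because its cross-section is not uniform.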
Ortiz-Ramírez, Marco F; Andersen, Michael J; Zaldívar-Riverón, Alejandro; Ornelas, Juan Francisco; Navarro-Sigüenza, Adolfo G
2016-01-01
Montane barriers influence the evolutionary history of lineages by promoting isolation of populations. The effects of these historical processes are evident in patterns of differentiation among extant populations, which are often expressed as genetic and behavioral variation between populations. We investigated the effects of geographic barriers on the evolutionary history of a Mesoamerican bird by studying patterns of genetic and vocal variation in the Ruddy-capped Nightingale-Thrush (Turdidae: Catharus frantzii), a non-migratory oscine bird that inhabits montane forests from central Mexico to Panama. We reconstructed the phylogeographic history and estimated divergence times between populations using Bayesian and maximum likelihood methods. We found strong support for the existence of four mitochondrial lineages of C. frantzii corresponding to isolated mountain ranges: Sierra Madre Oriental; Sierra Madre del Sur; the highlands of Chiapas, Guatemala, and El Salvador; and the Talamanca Cordillera. Vocal features in C. frantzii were highly variable among the four observed clades, but vocal variation and genetic variation were uncorrelated. Song variation in C. frantzii suggests that sexual selection and cultural drift could be important factors driving song differentiation in C. frantzii. Copyright © 2015 Elsevier Inc. All rights reserved.
Inhibitory and modulatory inputs to the vocal central pattern generator of a teleost fish
Rosner, Elisabeth; Rohmann, Kevin N.; Bass, Andrew H.
2018-01-01
Vocalization is a behavioral feature that is shared among multiple vertebrate lineages, including fish. The temporal patterning of vocal communication signals is set, in part, by central pattern generators (CPGs). Toadfishes are well‐established models for CPG coding of vocalization at the hindbrain level. The vocal CPG comprises three topographically separate nuclei: pre‐pacemaker, pacemaker, motor. While the connectivity between these nuclei is well understood, their neurochemical profile remains largely unexplored. The highly vocal Gulf toadfish, Opsanus beta, has been the subject of previous behavioral, neuroanatomical and neurophysiological studies. Combining transneuronal neurobiotin‐labeling with immunohistochemistry, we map the distribution of inhibitory neurotransmitters and neuromodulators along with gap junctions in the vocal CPG of this species. Dense GABAergic and glycinergic label is found throughout the CPG, with labeled somata immediately adjacent to or within CPG nuclei, including a distinct subset of pacemaker neurons co‐labeled with neurobiotin and glycine. Neurobiotin‐labeled motor and pacemaker neurons are densely co‐labeled with the gap junction protein connexin 35/36, supporting the hypothesis that transneuronal neurobiotin‐labeling occurs, at least in part, via gap junction coupling. Serotonergic and catecholaminergic label is also robust within the entire vocal CPG, with additional cholinergic label in pacemaker and prepacemaker nuclei. Likely sources of these putative modulatory inputs are neurons within or immediately adjacent to vocal CPG neurons. Together with prior neurophysiological investigations, the results reveal potential mechanisms for generating multiple classes of social context‐dependent vocalizations with widely divergent temporal and spectral properties. PMID:29424431
Development of echolocation and communication vocalizations in the big brown bat, Eptesicus fuscus.
Monroy, Jenna A; Carter, Matthew E; Miller, Kimberly E; Covey, Ellen
2011-05-01
Big brown bats form large maternity colonies of up to 200 mothers and their pups. If pups are separated from their mothers, they can locate each other using vocalizations. The goal of this study was to systematically characterize the development of echolocation and communication calls from birth through adulthood to determine whether they develop from a common precursor at the same or different rates, or whether both types are present initially. Three females and their six pups were isolated from our captive breeding colony. We recorded vocal activity from postnatal day 1 to 35, both when the pups were isolated and when they were reunited with their mothers. At birth, pups exclusively emitted isolation calls, with a fundamental frequency range <20 kHz, and duration >30 ms. By the middle of week 1, different types of vocalizations began to emerge. Starting in week 2, pups in the presence of their mothers emitted sounds that resembled adult communication vocalizations, with a lower frequency range and longer durations than isolation calls or echolocation signals. During weeks 2 and 3, these vocalizations were extremely heterogeneous, suggesting that the pups went through a babbling stage before establishing a repertoire of stereotyped adult vocalizations around week 4. By week 4, vocalizations emitted when pups were alone were identical to adult echolocation signals. Echolocation and communication signals both appear to develop from the isolation call, diverging during week 2 and continuing to develop at different rates for several weeks until the adult vocal repertoire is established.
Higher songs of city birds may not be an individual response to noise.
Zollinger, Sue Anne; Slater, Peter J B; Nemeth, Erwin; Brumm, Henrik
2017-08-16
It has been observed in many songbird species that populations in noisy urban areas sing with a higher minimum frequency than do matched populations in quieter, less developed areas. However, why and how this divergence occurs is not yet understood. We experimentally tested whether chronic noise exposure during vocal learning results in songs with higher minimum frequencies in great tits (Parus major), the first species for which a correlation between anthropogenic noise and song frequency was observed. We also tested vocal plasticity of adult great tits in response to changing background noise levels by measuring song frequency and amplitude as we changed noise conditions. We show that noise exposure during ontogeny did not result in songs with higher minimum frequencies. In addition, we found that adult birds did not make any frequency or song usage adjustments when their background noise conditions were changed after song crystallization. These results challenge the common view of vocal adjustments by city birds, as they suggest that either noise itself is not the causal force driving the divergence of song frequency between urban and forest populations, or that noise induces population-wide changes over a time scale of several generations rather than causing changes in individual behaviour. © 2017 The Author(s).
Two organizing principles of vocal production: Implications for nonhuman and human primates.
Owren, Michael J; Amoss, R Toby; Rendall, Drew
2011-06-01
Vocal communication in nonhuman primates receives considerable research attention, with many investigators arguing for similarities between this calling and speech in humans. Data from development and neural organization show a central role of affect in monkey and ape sounds, however, suggesting that their calls are homologous to spontaneous human emotional vocalizations while having little relation to spoken language. Based on this evidence, we propose two principles that can be useful in evaluating the many and disparate empirical findings that bear on the nature of vocal production in nonhuman and human primates. One principle distinguishes production-first from reception-first vocal development, referring to the markedly different role of auditory-motor experience in each case. The second highlights a phenomenon dubbed dual neural pathways, specifically that when a species with an existing vocal system evolves a new functionally distinct vocalization capability, it occurs through emergence of a second parallel neural pathway rather than through expansion of the extant circuitry. With these principles as a backdrop, we review evidence of acoustic modification of calling associated with background noise, conditioning effects, audience composition, and vocal convergence and divergence in nonhuman primates. Although each kind of evidence has been interpreted to show flexible cognitively mediated control over vocal production, we suggest that most are more consistent with affectively grounded mechanisms. The lone exception is production of simple, novel sounds in great apes, which is argued to reveal at least some degree of volitional vocal control. If also present in early hominins, the cortically based circuitry surmised to be associated with these rudimentary capabilities likely also provided the substrate for later emergence of the neural pathway allowing volitional production in modern humans. © 2010 Wiley-Liss, Inc.
Cocroft, R.B.; McDiarmid, R.W.; Jaslow, A.P.; Ruiz-Carranza, P.M.
1990-01-01
Vocalizations of frogs of the genus Atelopus include three discrete types of signals: pulsed calls, pure tone calls, and short calls. Repertoire composition is conservative across species. Repertoires of most species whose calls have been recorded contain two or three of these identifiable call types. Within a call type, details of call structure are very similar across species. This apparent lack of divergence in calls may be related to the rarity of sympatry among species of Atelopus and to the relative importance of visual communication in their social interactions.
Current Understanding and Future Directions for Vocal Fold Mechanobiology
Li, Nicole Y.K.; Heris, Hossein K.; Mongeau, Luc
2013-01-01
The vocal folds, which are located in the larynx, are the main organ of voice production for human communication. The vocal folds are under continuous biomechanical stress similar to other mechanically active organs, such as the heart, lungs, tendons and muscles. During speech and singing, the vocal folds oscillate at frequencies ranging from 20 Hz to 3 kHz with amplitudes of a few millimeters. The biomechanical stress associated with accumulated phonation is believed to alter vocal fold cell activity and tissue structure in many ways. Excessive phonatory stress can damage tissue structure and induce a cell-mediated inflammatory response, resulting in a pathological vocal fold lesion. On the other hand, phonatory stress is one major factor in the maturation of the vocal folds into a specialized tri-layer structure. One specific form of vocal fold oscillation, which involves low impact and large amplitude excursion, is prescribed therapeutically for patients with mild vocal fold injuries. Although biomechanical forces affect vocal fold physiology and pathology, there is little understanding of how mechanical forces regulate these processes at the cellular and molecular level. Research into vocal fold mechanobiology has burgeoned over the past several years. Vocal fold bioreactors are being developed in several laboratories to provide a biomimic environment that allows the systematic manipulation of physical and biological factors on the cells of interest in vitro. Computer models have been used to simulate the integrated response of cells and proteins as a function of phonation stress. The purpose of this paper is to review current research on the mechanobiology of the vocal folds as it relates to growth, pathogenesis and treatment as well as to propose specific research directions that will advance our understanding of this subject. PMID:24812638
Flow separation in a computational oscillating vocal fold model
NASA Astrophysics Data System (ADS)
Alipour, Fariborz; Scherer, Ronald C.
2004-09-01
A finite-volume computational model that solves the time-dependent glottal airflow within a forced-oscillation model of the glottis was employed to study glottal flow separation. Tracheal input velocity was independently controlled with a sinusoidally varying parabolic velocity profile. Control parameters included flow rate (Reynolds number), oscillation frequency and amplitude of the vocal folds, and the phase difference between the superior and inferior glottal margins. Results for static divergent glottal shapes suggest that velocity increase caused glottal separation to move downstream, but reduction in velocity increase and velocity decrease moved the separation upstream. At the fixed frequency, an increase of amplitude of the glottal walls moved the separation further downstream during glottal closing. Increase of Reynolds number caused the flow separation to move upstream in the glottis. The flow separation cross-sectional ratio ranged from approximately 1.1 to 1.9 (average of 1.47) for the divergent shapes. Results suggest that there may be a strong interaction of rate of change of airflow, inertia, and wall movement. Flow separation appeared to be "delayed" during the vibratory cycle, leading to movement of the separation point upstream of the glottal end only after a significant divergent angle was reached, and to persist upstream into the convergent phase of the cycle.
Core and Shell Song Systems Unique to the Parrot Brain
Chakraborty, Mukta; Walløe, Solveig; Nedergaard, Signe; Fridel, Emma E.; Dabelsteen, Torben; Pakkenberg, Bente; Bertelsen, Mads F.; Dorrestein, Gerry M.; Brauth, Steven E.; Durand, Sarah E.; Jarvis, Erich D.
2015-01-01
The ability to imitate complex sounds is rare, and among birds has been found only in parrots, songbirds, and hummingbirds. Parrots exhibit the most advanced vocal mimicry among non-human animals. A few studies have noted differences in connectivity, brain position and shape in the vocal learning systems of parrots relative to songbirds and hummingbirds. However, only one parrot species, the budgerigar, has been examined and no differences in the presence of song system structures were found with other avian vocal learners. Motivated by questions of whether there are important differences in the vocal systems of parrots relative to other vocal learners, we used specialized constitutive gene expression, singing-driven gene expression, and neural connectivity tracing experiments to further characterize the song system of budgerigars and/or other parrots. We found that the parrot brain uniquely contains a song system within a song system. The parrot “core” song system is similar to the song systems of songbirds and hummingbirds, whereas the “shell” song system is unique to parrots. A core with only rudimentary shell regions was found in the New Zealand kea, representing one of the only living species at a basal divergence with all other parrots, implying that parrots evolved vocal learning systems at least 29 million years ago. Relative size differences in the core and shell regions occur among species, which we suggest could be related to species differences in vocal and cognitive abilities. PMID:26107173
Individual killer whale vocal variation during intra-group behavioral dynamics
NASA Astrophysics Data System (ADS)
Grebner, Dawn M.
The scientific goal of this dissertation was to carefully study the signal structure of killer whale communications and vocal complexity and link them to behavioral circumstances. The overall objective of this research was to provide insight into killer whale call content and usage which may be conveying information to conspecifics in order to maintain group cohesion. Data were collected in the summers of 2006 and 2007 in Johnstone Strait, British Columbia. For both individuals and small groups, vocalizations were isolated using a triangular hydrophone array and the behavioral movement patterns were captured by a theodolite and video camera positioned on a cliff overlooking the hydrophone locations. This dissertation is divided into four analysis chapters. In Chapter 3, discriminant analysis was used to validate the four N04 call subtypes which were originally parsed due to variations in slope segments. The first two functions of the discriminant analysis explained 97% of the variability. Most of the variability for the N04 call was found in the front convex and the terminal portions of the call, while very little variability was found in the center region of the call. This research revealed that individual killer whales produced multiple subtypes of the N04 call. No correlations of behaviors to acoustic parameters obtained were found. The aim of Chapter 4 was to determine if killer whale calling behavior varied prior to and after the animals had joined. Pulsed call rates were found to be greater pre- compared to post-joining events. Two-way vocal exchanges were more common, occurring 74% of the time during pre-joining events. In Chapter 5, initiated and first response to calls varied between age/sex class groups when mothers were separated from an offspring. Solo mothers and calves initiated pulsed calls more often than they responded. Most instances of no vocal response were due to mothers who were foraging.
Finally, observations of the frequency split in N04 calls discussed in Chapter 6 showed that the higher frequency component (HFC) was always associated with sideband 7 (SB7) of the lower frequency component (LFC). Insight into Northern Resident killer whale intra-group vocal dynamics would aid our understanding of vocal behaviors of many other marine mammal species that rely on vocal exchanges for prey capture, group movement or survival. This is the first study to focus on killer whale vocal content and usage as it pertains to intra-group dynamics for (1) mother and offspring separations and (2) for all individuals prior to joining events, as well as (3) individual usage in a diverging pulsed call. It is also the first time the N04 call has been parsed into subtypes.
Brain evolution by brain pathway duplication
Chakraborty, Mukta; Jarvis, Erich D.
2015-01-01
Understanding the mechanisms of evolution of brain pathways for complex behaviours is still in its infancy. Making further advances requires a deeper understanding of brain homologies, novelties and analogies. It also requires an understanding of how adaptive genetic modifications lead to restructuring of the brain. Recent advances in genomic and molecular biology techniques applied to brain research have provided exciting insights into how complex behaviours are shaped by selection of novel brain pathways and functions of the nervous system. Here, we review and further develop some insights to a new hypothesis on one mechanism that may contribute to nervous system evolution, in particular by brain pathway duplication. Like gene duplication, we propose that whole brain pathways can duplicate and the duplicated pathway diverge to take on new functions. We suggest that one mechanism of brain pathway duplication could be through gene duplication, although other mechanisms are possible. We focus on brain pathways for vocal learning and spoken language in song-learning birds and humans as example systems. This view presents a new framework for future research in our understanding of brain evolution and novel behavioural traits. PMID:26554045
Leininger, Elizabeth C.; Kelley, Darcy B.
2013-01-01
Independent or convergent evolution can underlie phenotypic similarity of derived behavioural characters. Determining the underlying neural and neuromuscular mechanisms sheds light on how these characters arose. One example of evolutionarily derived characters is a temporally simple advertisement call of male African clawed frogs (Xenopus) that arose at least twice independently from a more complex ancestral pattern. How did simplification occur in the vocal circuit? To distinguish shared from divergent mechanisms, we examined activity from the calling brain and vocal organ (larynx) in two species that independently evolved simplified calls. We find that each species uses distinct neural and neuromuscular strategies to produce the simplified calls. Isolated Xenopus borealis brains produce fictive vocal patterns that match temporal patterns of actual male calls; the larynx converts nerve activity faithfully into muscle contractions and single clicks. In contrast, fictive patterns from isolated Xenopus boumbaensis brains are short bursts of nerve activity; the isolated larynx requires stimulus bursts to produce a single click of sound. Thus, unlike X. borealis, the output of the X. boumbaensis hindbrain vocal pattern generator is an ancestral burst-type pattern, transformed by the larynx into single clicks. Temporally simple advertisement calls in genetically distant species of Xenopus have thus arisen independently via reconfigurations of central and peripheral vocal neuroeffectors. PMID:23407829
Lachlan, Robert F; van Heijningen, Caroline A A; Ter Haar, Sita M; Ten Cate, Carel
2016-01-01
Learned bird songs are often characterized by a high degree of variation between individuals and sometimes between populations, while at the same time maintaining species specificity. The evolution of such songs depends on the balance between plasticity and constraints. Captive populations provide an opportunity to examine signal variation and differentiation in detail, so we analyzed adult male zebra finch (Taeniopygia guttata) songs recorded from 13 populations across the world, including one sample of songs from wild-caught males in their native Australia. Cluster analysis suggested some, albeit limited, evidence that zebra finch song units belonged to universal, species-wide categories, linked to restrictions in vocal production and non-song parts of the vocal repertoire. Across populations, songs also showed some syntactical structure, although any song unit could be placed anywhere within the song. On the other hand, there was a statistically significant differentiation between populations, but the effect size was very small, and its communicative significance dubious. Our results suggest that variation in zebra finch songs within a population is largely determined by species-wide constraints rather than population-specific features. Although captive zebra finch populations have been sufficiently isolated to allow them to genetically diverge, there does not appear to have been any divergence in the genetically determined constraints that underlie song learning. Perhaps more surprising is the lack of locally diverged cultural traditions. Zebra finches serve as an example of a system where frequent learning errors may rapidly create within-population diversity, within broad phonological and syntactical constraints, and prevent the formation of long-term cultural traditions that allow populations to diverge.
González, Clementina; Ornelas, Juan Francisco; Gutiérrez-Rodríguez, Carla
2011-02-08
Mesoamerica is one of the most threatened biodiversity hotspots in the world, yet we are far from understanding the geologic history and the processes driving population divergence and speciation for most endemic taxa. In species with highly differentiated populations selective and/or neutral factors can induce rapid changes to traits involved in mate choice, promoting reproductive isolation between allopatric populations that can eventually lead to speciation. We present the results of genetic differentiation, and explore drift and selection effects in promoting acoustic and morphological divergence among populations of Campylopterus curvipennis, a lekking hummingbird with an extraordinary vocal variability across Mesoamerica. Analyses of two mitochondrial genes and ten microsatellite loci genotyped for 160 individuals revealed the presence of three lineages with no contemporary gene flow: C. c. curvipennis, C. c. excellens, and C. c. pampa disjunctly distributed in the Sierra Madre Oriental, the Tuxtlas region and the Yucatan Peninsula, respectively. Sequence mtDNA and microsatellite data were congruent with two diversification events: an old vicariance event at the Isthmus of Tehuantepec (c. 1.4 Ma), and a more recent Pleistocene split, isolating populations in the Tuxtlas region. Hummingbirds of the excellens group were larger, and those of the pampa group had shorter bills, and lineages that have been isolated the longest shared fewer syllables and differed in spectral and temporal traits of a shared syllable. Coalescent simulations showed that fixation of song types has occurred faster than expected under neutrality but the null hypothesis that morphological divergence resulted from drift was not rejected. Our phylogeographic analyses uncovered the presence of three Mesoamerican wedge-tailed sabrewing lineages, which diverged at different time scales. 
These results highlight the importance of the Isthmus of Tehuantepec and more recent Pleistocene climatic events in driving isolation and population divergence. Coalescent analyses of the evolution of phenotypic traits suggest that selection is driving song evolution in wedge-tailed sabrewings but drift could not be rejected as a possibility for morphological divergence.
Effects of Social Games on Infant Vocalizations
ERIC Educational Resources Information Center
Hsu, Hui-Chin; Iyer, Suneeti Nathani; Fogel, Alan
2014-01-01
The aim of the present study was to examine the contextual effects of social games on prelinguistic vocalizations. The two main goals were to (1) investigate the functions of vocalizations as symptoms of affective arousal and symbols of social understanding, and (2) explore form-function (de)coupling relations between vocalization types and game…
Recurrence plot analysis of nonstationary data: the understanding of curved patterns.
Facchini, A; Kantz, H; Tiezzi, E
2005-08-01
Recurrence plots of the calls of Nomascus concolor (Western black crested gibbon) and Hylobates lar (White-handed gibbon) show characteristic circular, curved, and hyperbolic patterns superimposed on the main temporal scale of the signal. It is shown that these patterns are related to particular nonstationarities in the signal. Some of them can be reproduced by artificial signals like frequency modulated sinusoids and sinusoids with time divergent frequency. These modulations are too faint to be resolved by conventional time-frequency analysis with similar precision. Therefore, recurrence plots act as a magnifying glass for the detection of multiple temporal scales in slightly modulated signals. The detected phenomena in these acoustic signals can be explained in the biomechanical context by taking into account the role of the muscles controlling the vocal folds.
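A recurrence plot of the kind discussed above marks every pair of time indices at which the signal returns close to an earlier state. The sketch below builds one for an artificial frequency-modulated sinusoid. It is a simplified illustration, not the authors' procedure: it compares scalar samples directly rather than delay-embedded vectors, and the function names, threshold, and modulation parameters are assumptions.

```python
import math


def fm_sinusoid(n_samples, base_cycles=8.0, mod_depth=0.5, mod_cycles=1.0):
    """Sinusoid whose phase carries a slow sinusoidal modulation.

    All parameter values are illustrative assumptions.
    """
    samples = []
    for k in range(n_samples):
        t = k / n_samples
        phase = (2.0 * math.pi * base_cycles * t
                 + mod_depth * math.sin(2.0 * math.pi * mod_cycles * t))
        samples.append(math.sin(phase))
    return samples


def recurrence_matrix(signal, threshold):
    """R[i][j] = 1 where |x_i - x_j| <= threshold, else 0.

    Scalar comparison without delay embedding (a simplification).
    """
    n = len(signal)
    return [[1 if abs(signal[i] - signal[j]) <= threshold else 0
             for j in range(n)]
            for i in range(n)]


signal = fm_sinusoid(200)
plot = recurrence_matrix(signal, threshold=0.1)
```

Rendering `plot` as an image (for example with any heat-map routine) shows lines that would be straight diagonals for a stationary sinusoid and bend when the frequency drifts, which is the kind of curved pattern the abstract attributes to nonstationarity.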
NASA Astrophysics Data System (ADS)
Westervelt, Andrea; Erath, Byron
2013-11-01
Voiced speech is produced by fluid-structure interactions that drive vocal fold motion. Viscous flow features influence the pressure in the gap between the vocal folds (i.e. glottis), thereby altering vocal fold dynamics and the sound that is produced. During the closing phases of the phonatory cycle, vortices form as a result of flow separation as air passes through the divergent glottis. It is hypothesized that the reduced pressure within a vortex core will alter the pressure distribution along the vocal fold surface, thereby aiding in vocal fold closure. The objective of this study is to determine the impact of intraglottal vortices on the fluid-structure interactions of voiced speech by investigating how the dynamics of a flexible plate are influenced by a vortex ring passing tangentially over it. A flexible plate, which models the medial vocal fold surface, is placed in a water-filled tank and positioned parallel to the exit of a vortex generator. The physical parameters of plate stiffness and vortex circulation are scaled with physiological values. As vortices propagate over the plate, particle image velocimetry measurements are captured to analyze the energy exchange between the fluid and flexible plate. The investigations are performed over a range of vortex formation numbers, and lateral displacements of the plate from the centerline of the vortex trajectory. Observations show plate oscillations with displacements directly correlated with the vortex core location.
Elie, Julie E.; Theunissen, Frédéric E.
2018-01-01
Although a universal code for the acoustic features of animal vocal communication calls may not exist, the thorough analysis of the distinctive acoustical features of vocalization categories is important not only to decipher the acoustical code for a specific species but also to understand the evolution of communication signals and the mechanisms used to produce and understand them. Here, we recorded more than 8,000 examples of almost all the vocalizations of the domesticated zebra finch, Taeniopygia guttata: vocalizations produced to establish contact, to form and maintain pair bonds, to sound an alarm, to communicate distress or to advertise hunger or aggressive intents. We characterized each vocalization type using complete representations that avoided any a priori assumptions on the acoustic code, as well as classical bioacoustics measures that could provide more intuitive interpretations. We then used these acoustical features to rigorously determine the potential information-bearing acoustical features for each vocalization type using both a novel regularized classifier and an unsupervised clustering algorithm. Vocalization categories are discriminated by the shape of their frequency spectrum and by their pitch saliency (noisy to tonal vocalizations) but not particularly by their fundamental frequency. Notably, the spectral shape of zebra finch vocalizations contains peaks or formants that vary systematically across categories and that would be generated by active control of both the vocal organ (source) and the upper vocal tract (filter). PMID:26581377
Primate vocal communication: a useful tool for understanding human speech and language evolution?
Fedurek, Pawel; Slocombe, Katie E
2011-04-01
Language is a uniquely human trait, and questions of how and why it evolved have been intriguing scientists for years. Nonhuman primates (primates) are our closest living relatives, and their behavior can be used to estimate the capacities of our extinct ancestors. As humans and many primate species rely on vocalizations as their primary mode of communication, the vocal behavior of primates has been an obvious target for studies investigating the evolutionary roots of human speech and language. By studying the similarities and differences between human and primate vocalizations, comparative research has the potential to clarify the evolutionary processes that shaped human speech and language. This review examines some of the seminal and recent studies that contribute to our knowledge regarding the link between primate calls and human language and speech. We focus on three main aspects of primate vocal behavior: functional reference, call combinations, and vocal learning. Studies in these areas indicate that despite important differences, primate vocal communication exhibits some key features characterizing human language. They also indicate, however, that some critical aspects of speech, such as vocal plasticity, are not shared with our primate cousins. We conclude that comparative research on primate vocal behavior is a very promising tool for deepening our understanding of the evolution of human speech and language, but much is still to be done as many aspects of monkey and ape vocalizations remain largely unexplored.
The effect of vocal fold vertical stiffness gradient on sound production
NASA Astrophysics Data System (ADS)
Geng, Biao; Xue, Qian; Zheng, Xudong
2015-11-01
It is observed in some experimental studies on canine vocal folds (VFs) that the inferior aspect of the vocal fold (VF) is much stiffer than the superior aspect under relatively large strain. Such vertical difference is supposed to promote the convergent-divergent shape during VF vibration and consequently facilitate the production of sound. In this study, we investigate the effect of vertical variation of VF stiffness on sound production using a numerical model. The vertical variation of stiffness is produced by linearly increasing the Young's modulus and shear modulus from the superior to inferior aspects in the cover layer, and its effect on phonation is examined in terms of aerodynamic and acoustic quantities such as flow rate, open quotient, skewness of flow wave form, sound intensity and vocal efficiency. The flow-induced vibration of the VF is solved with a finite element solver coupled with 1D Bernoulli equation, which is further coupled with a digital waveguide model. This study is designed to find out whether it's beneficial to artificially induce the vertical stiffness gradient by certain implanting material in VF restoring surgery, and if it is beneficial, what gradient is the most favorable.
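Note: the abstract above mentions coupling the solid model to a 1D Bernoulli equation but gives no details. As a rough illustration only, the sketch below applies the usual quasi-steady form, p(x) + ½ρ(Q/A(x))² = p_sub, along a glottal channel. The function name, the area values, the air density, and the assumption that flow separates at the minimum area (with zero pressure downstream) are all illustrative choices, not taken from the study.

```python
import math

AIR_DENSITY = 1.2  # kg/m^3, assumed value for air


def bernoulli_pressures(areas, p_sub, rho=AIR_DENSITY):
    """Return (flow_rate, pressures) along a channel of cross-sections `areas`.

    Assumptions (hypothetical, for illustration): inlet velocity is negligible,
    the jet separates at the minimum area, and pressure downstream of the
    separation point equals the outlet pressure, taken as 0 Pa.
    """
    a_sep = min(areas)
    i_sep = areas.index(a_sep)
    # Bernoulli between the subglottal region and the separation point.
    flow = a_sep * math.sqrt(2.0 * p_sub / rho)
    pressures = []
    for i, area in enumerate(areas):
        if i <= i_sep:
            pressures.append(p_sub - 0.5 * rho * (flow / area) ** 2)
        else:
            pressures.append(0.0)  # separated jet: no pressure recovery
    return flow, pressures


# Convergent-then-divergent channel, areas in m^2, subglottal pressure in Pa.
flow_rate, wall_pressures = bernoulli_pressures([2e-5, 1e-5, 1.5e-5], p_sub=800.0)
```

In this simplified picture the wall pressure drops as the channel narrows and vanishes from the narrowest section onward, which is why the glottal shape matters for the driving force on the folds.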
Vocal fold ion transport and mucin expression following acrolein exposure
Levendoski, Elizabeth Erickson; Sivasankar, M. Preeti
2014-01-01
The vocal fold epithelium is exposed to inhaled particulates including pollutants during breathing in everyday environments. Yet, our understanding of the effects of pollutants on vocal fold epithelial function is extremely limited. The objective of this study was to investigate the effect of the pollutant acrolein on two vocal fold epithelial mechanisms: ion transport and mucin synthesis. These mechanisms were chosen as each plays a critical role in vocal defense and in maintaining surface hydration, which is necessary for optimal voice production. Healthy, native porcine vocal folds (N=85) were excised and exposed to an acrolein or sham challenge. A 60-minute acrolein challenge, but not a sham challenge, significantly reduced ion transport and inhibited cyclic adenosine monophosphate-dependent increases in ion transport. Decreases in ion transport were associated with reduced sodium absorption. Within the same timeline, no significant acrolein-induced changes in mucin gene or protein expression were observed. These results improve our understanding of the effects of acrolein on key vocal fold epithelial functions and inform the development of future investigations that seek to elucidate the impact of a wide range of pollutant exposures on vocal fold health. PMID:24648011
Vocal learning in elephants: neural bases and adaptive context
Stoeger, Angela S; Manger, Paul
2014-01-01
In the last decade clear evidence has accumulated that elephants are capable of vocal production learning. Examples of vocal imitation are documented in African (Loxodonta africana) and Asian (Elephas maximus) elephants, but little is known about the function of vocal learning within the natural communication systems of either species. We are also just starting to identify the neural basis of elephant vocalizations. The African elephant diencephalon and brainstem possess specializations related to aspects of neural information processing in the motor system (affecting the timing and learning of trunk movements) and the auditory and vocalization system. Comparative interdisciplinary (from behavioral to neuroanatomical) studies are strongly warranted to increase our understanding of both vocal learning and vocal behavior in elephants. PMID:25062469
The Infant Monitor of Vocal Production: Simple Beginnings
ERIC Educational Resources Information Center
Moore, Robyn Cantle
2014-01-01
The Infant Monitor of Vocal Production (IMP) was conceived as an educational strategy to help parents understand the nature and pace of their baby's vocal development following neonatal diagnosis and amplification for hearing loss. The potential for other clinical applications emerged with use. The instrument presents as a series of…
Cortical representations of communication sounds.
Heiser, Marc A; Cheung, Steven W
2008-10-01
This review summarizes recent research into cortical processing of vocalizations in animals and humans. There has been a resurgent interest in this topic accompanied by an increased number of studies using animal models with complex vocalizations and new methods in human brain imaging. Recent results from such studies are discussed. Experiments have begun to reveal the bilateral cortical fields involved in communication sound processing and the transformations of neural representations that occur among those fields. Advances have also been made in understanding the neuronal basis of interaction between developmental exposures and behavioral experiences with vocalization perception. Exposure to sounds during the developmental period produces large effects on brain responses, as do a variety of specific trained tasks in adults. Studies have also uncovered a neural link between the motor production of vocalizations and the representation of vocalizations in cortex. Parallel experiments in humans and animals are answering important questions about vocalization processing in the central nervous system. This dual approach promises to reveal microscopic, mesoscopic, and macroscopic principles of large-scale dynamic interactions between brain regions that underlie the complex phenomenon of vocalization perception. Such advances will yield a greater understanding of the causes, consequences, and treatment of disorders related to speech processing.
Nonlinear dynamic mechanism of vocal tremor from voice analysis and model simulations
NASA Astrophysics Data System (ADS)
Zhang, Yu; Jiang, Jack J.
2008-09-01
Nonlinear dynamic analysis and model simulations are used to study the nonlinear dynamic characteristics of vocal folds with vocal tremor, which can typically be characterized by low-frequency modulation and aperiodicity. Tremor voices from patients with disorders such as paresis, Parkinson's disease, hyperfunction, and adductor spasmodic dysphonia show low-dimensional characteristics, differing from random noise. Correlation dimension analysis statistically distinguishes tremor voices from normal voices. Furthermore, a nonlinear tremor model is proposed to study the vibrations of the vocal folds with vocal tremor. Fractal dimensions and positive Lyapunov exponents demonstrate the evidence of chaos in the tremor model, where amplitude and frequency play important roles in governing vocal fold dynamics. Nonlinear dynamic voice analysis and vocal fold modeling may provide a useful set of tools for understanding the dynamic mechanism of vocal tremor in patients with laryngeal diseases.
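Note: the abstract above relies on correlation dimension analysis without describing the procedure. A minimal sketch of one common estimator (the Grassberger-Procaccia correlation sum) follows. The function names, the two radii, and the test data (evenly spaced points on a line, which should give a dimension close to 1) are assumptions made for illustration; the embedding and parameter choices used in the study are not stated in the abstract.

```python
import math
from itertools import combinations


def correlation_sum(points, r):
    """Fraction of distinct point pairs whose Euclidean distance is below r."""
    total = 0
    close = 0
    for p, q in combinations(points, 2):
        total += 1
        if math.dist(p, q) < r:
            close += 1
    return close / total


def correlation_dimension(points, r_small, r_large):
    """Two-radius slope estimate of log C(r) versus log r."""
    c_small = correlation_sum(points, r_small)
    c_large = correlation_sum(points, r_large)
    return math.log(c_large / c_small) / math.log(r_large / r_small)


# Illustrative data: 200 points on a straight line embedded in the plane.
line_points = [(i / 200.0, 0.0) for i in range(200)]
d2 = correlation_dimension(line_points, r_small=0.05, r_large=0.2)
```

For a real voice signal the points would come from a time-delay embedding of the waveform, and the slope would normally be fitted over a range of radii rather than read off two values.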
Prat, Yosef; Taub, Mor; Pratt, Ester; Yovel, Yossi
2017-10-03
Animal acoustic communication research depends on our ability to record the vocal behaviour of different species. Only rarely do we have the opportunity to continuously follow the vocal behaviour of a group of individuals of the same species for a long period of time. Here, we provide a database of Egyptian fruit bat vocalizations, which were continuously recorded in the lab in several groups simultaneously for more than a year. The dataset includes almost 300,000 files, a few seconds each, containing social vocalizations and representing the complete vocal repertoire used by the bats in the experiment period. Around 90,000 files are annotated with details about the individuals involved in the vocal interactions, their behaviours and the context. Moreover, the data include the complete vocal ontogeny of pups, from birth to adulthood, in different conditions (e.g., isolated or in a group). We hope that this comprehensive database will stimulate studies that will enhance our understanding of bat, and mammal, social vocal communication.
Optimal glottal configuration for ease of phonation.
Lucero, J C
1998-06-01
Recent experimental studies have shown the existence of optimal values of the glottal width and convergence angle, at which the phonation threshold pressure is minimum. These results indicate the existence of an optimal glottal configuration for ease of phonation, not predicted by the previous theory. In this paper, the origin of the optimal configuration is investigated using a low dimensional mathematical model of the vocal fold. Two phenomena of glottal aerodynamics are examined: pressure losses due to air viscosity, and air flow separation from a divergent glottis. The optimal glottal configuration seems to be a consequence of the combined effect of both factors. The results agree with the experimental data, showing that the phonation threshold pressure is minimum when the vocal folds are slightly separated in a near rectangular glottis.
Hasiniaina, Alida F; Scheumann, Marina; Rina Evasoa, Mamy; Braud, Diane; Rasoloharijaona, Solofonirina; Randrianambinina, Blanchard; Zimmermann, Elke
2018-05-02
The critically endangered Claire's mouse lemur, only found in the evergreen rain forest of the National Park Lokobe (LNP) and a few lowland evergreen rain forest fragments of northern Madagascar, was described recently. The present study provides the first quantified information on vocal acoustics of calls, sound associated behavioral context, acoustic niche, and vocal activity of this species. We recorded vocal and social behavior of six male-female and six male-male dyads in a standardized social-encounter paradigm in June and July 2016 at the LNP, Nosy Bé island. Over six successive nights per dyad, we audio recorded and observed behaviors for 3 hr at the beginning of the activity period. Based on the visual inspection of spectrograms and standardized multiparametric sound analysis, we identified seven different call types. Call types can be discriminated based on a combination of harmonicity, fundamental frequency variation, call duration, and degree of tonality. Acoustic features of tonal call types showed that for communication, mouse lemurs use the cryptic, high frequency/ultrasonic frequency niche. Two call types, the Tsak and the Grunt call, were emitted most frequently. Significant differences in vocal activity of the Tsak call were found between male-female and male-male dyads, linked primarily to agonistic conflicts. Dominant mouse lemurs vocalized more than subdominant ones, suggesting that signaling may present an honest indicator of fitness. A comparison of our findings of the Claire's mouse lemur with published findings of five bioacoustically studied mouse lemur species points to the notion that a complex interplay between ecology, predation pressure, and phylogenetic relatedness may shape the evolution of acoustic divergence between species in this smallest-bodied primate radiation. 
Thus, comparative bioacoustic studies, using standardized procedures, are promising to unravel the role of vocalization for primate species diversity and evolution and for identifying candidates for vocalization-based non-invasive monitoring for conservation purposes. © 2018 Wiley Periodicals, Inc.
Therapeutic potential of gel-based injectables for vocal fold regeneration
Bartlett, Rebecca S.; Thibeault, Susan L.; Prestwich, Glenn D.
2012-01-01
Vocal folds are anatomically and biomechanically unique, thus complicating the design and implementation of tissue engineering strategies for repair and regeneration. Integration of an enhanced understanding of tissue biomechanics, wound healing dynamics and innovative gel-based therapeutics has generated enthusiasm for the notion that an efficacious treatment for vocal fold scarring could be clinically attainable within several years. Fibroblast phenotype and gene expression are mediated by the three-dimensional mechanical and chemical microenvironment at an injury site. Thus, therapeutic approaches need to coordinate spatial and temporal aspects of the wound healing response in an injured vocal tissue to achieve an optimal clinical outcome. Successful gel-based injectables for vocal fold scarring will require a keen understanding of how the native inflammatory response sets into motion the later extracellular matrix remodeling, which in turn will determine the ultimate biomechanical properties of the tissue. We present an overview of the challenges associated with this translation as well as the proposed gel-based injectable solutions. PMID:22456756
Improvement of Vocal Pathologies Diagnosis Using High-Speed Videolaryngoscopy
Tsuji, Domingos Hiroshi; Hachiya, Adriana; Dajer, Maria Eugenia; Ishikawa, Camila Cristina; Takahashi, Marystella Tomoe; Montagnoli, Arlindo Neto
2014-01-01
Introduction The study of the dynamic properties of vocal fold vibration is important for understanding the vocal production mechanism and the impact of organic and functional changes. The advent of high-speed videolaryngoscopy (HSV) has provided the possibility of seeing the real cycle of vocal fold vibration in detail through high sampling rate of successive frames and adequate spatial resolution. Objective To describe the technique, advantages, and limitations of using HSV and digital videokymography in the diagnosis of vocal pathologies. Methods We used HSV and digital videokymography to evaluate one normophonic individual and four patients with vocal fold pathologies (nodules, unilateral paralysis of the left vocal fold, intracordal cyst, and adductor spasmodic dysphonia). The vocal fold vibration parameters (glottic closure, vibrational symmetry, periodicity, mucosal wave, amplitude, and glottal cycle phases) were assessed. Results Differences in the vocal vibration parameters were observed and correlated with the pathophysiology. Conclusion HSV is the latest diagnostic tool in visual examination of vocal behavior and has considerable potential to refine our knowledge regarding vocal fold vibration and voice production, as well as regarding the impact that pathologic conditions have on the mechanism of phonation. PMID:25992109
Effects of long-acting beta adrenergic agonists on vocal fold ion transport.
Sivasankar, Mahalakshmi; Blazer-Yost, Bonnie
2009-03-01
Inhaled medications prescribed for the hypersensitive airway typically combine corticosteroids and long-acting beta2 adrenergic agonists (LABAs). The phonatory side effects of these combination treatments are widely recognized. However, there is limited understanding of the physiological changes induced by these medications that underlie the phonatory side effects. The objective of this study was to investigate the distinct effects of corticosteroids and LABAs on vocal fold mucosal physiology. Understanding the physiological changes to the vocal folds after corticosteroid and LABA treatments is necessary to prevent the prevalent vocal decrement associated with these medications. Experimental in vitro design with treatment and control groups. Native porcine vocal fold mucosae (N = 38) were exposed to corticosteroid or LABA treatments. Ion transport was measured continuously at baseline and after treatment. To quantify the nature of ion transport, vocal folds were also treated with chloride and sodium channel inhibitors. Corticosteroid treatment did not alter ion transport. Conversely, exposure to LABAs significantly increased ion transport. This increase in ion transport was transient, observed immediately after treatment in all tissue and associated with increased chloride secretion. The distinct effects of corticosteroids and LABAs on vocal fold physiology have not been examined to date. This study demonstrates that short-term treatment with LABAs, but not corticosteroids, significantly increases ion transport. These findings suggest that one underlying physiological mechanism for phonatory changes associated with inhaled treatments may be related to acute alterations in vocal fold ion transport and surface hydration.
Meaning in the avian auditory cortex: Neural representation of communication calls
Elie, Julie E; Theunissen, Frédéric E
2014-01-01
Understanding how the brain extracts the behavioral meaning carried by specific vocalization types that can be emitted by various vocalizers and in different conditions is a central question in auditory research. This semantic categorization is a fundamental process required for acoustic communication and presupposes discriminative and invariance properties of the auditory system for conspecific vocalizations. Songbirds have been used extensively to study vocal learning, but the communicative function of all their vocalizations and their neural representation has yet to be examined. In our research, we first generated a library containing almost the entire zebra finch vocal repertoire and organized communication calls along 9 different categories based on their behavioral meaning. We then investigated the neural representations of these semantic categories in the primary and secondary auditory areas of 6 anesthetized zebra finches. To analyze how single units encode these call categories, we described neural responses in terms of their discrimination, selectivity and invariance properties. Quantitative measures for these neural properties were obtained using an optimal decoder based both on spike counts and spike patterns. Information theoretic metrics show that almost half of the single units encode semantic information. Neurons achieve higher discrimination of these semantic categories by being more selective and more invariant. These results demonstrate that computations necessary for semantic categorization of meaningful vocalizations are already present in the auditory cortex and emphasize the value of a neuro-ethological approach to understand vocal communication. PMID:25728175
Morphological properties of collagen fibers in porcine lamina propria
Johanes, Iecun; Mihelc, Elaine; Sivasankar, Mahalakshmi; Ivanisevic, Albena
2009-01-01
Objectives Collagen influences the biomechanical properties of vocal folds. Altered collagen morphology has been implicated in dysphonia associated with aging and scarring. Documenting the morphological properties of native collagen in healthy vocal folds is essential to understand the structural and functional alterations to collagen with aging and disease. Our primary objective was to quantify the morphological properties of collagen in the vocal fold lamina propria. Our secondary exploratory objective was to investigate the effects of pepsin exposure on the morphological properties of collagen in the lamina propria. Design Experimental, in vitro study with porcine model. Methods Lamina propria was dissected from 26 vocal folds and imaged with Atomic Force Microscopy (AFM). Morphological data on d-periodicity, diameter, and roughness of collagen fibers were obtained. To investigate the effects of pepsin exposure on collagen morphology, vocal fold surface was exposed to pepsin or sham challenge prior to lamina propria dissection and AFM imaging. Results The d-periodicity, diameter, and roughness values for native vocal fold collagen are consistent with literature reports for collagen fibers in other body tissue. Pepsin exposure on vocal fold surface did not appear to change the morphological properties of collagen fibers in the lamina propria. Conclusions Quantitative data on collagen morphology were obtained at nanoscale resolution. Documenting collagen morphology in healthy vocal folds is critical for understanding the physiological changes to collagen with aging and scarring, and for designing biomaterials that match the native topography of lamina propria. PMID:20171830
Williams, Jenevora; Welch, Graham; Howard, David M
2005-01-01
Currently, there is no existing published empirical longitudinal data on the singing behaviours and development of choristers who perform in UK cathedrals and major chapels. Longitudinal group data is needed to provide a baseline against which individual chorister development can be mapped. The choristers perform to a professional standard on a daily basis, usually with linked rehearsals, whilst also following a full school curriculum. The impact of this intensive schedule in relation to current vocal behaviour, health and future development requires investigation. Furthermore, it is also necessary to understand the relationship between the requirements of chorister singing behaviour and adolescent voice change. The paper will report the initial findings of a new longitudinal chorister study, based in one of London's cathedrals. Singing and vocal behaviours are being profiled on a six-monthly basis using data from a specially designed acoustic and behavioural instrument. The information obtained will enable us to understand better the effects of such training and performance on underlying vocal behaviour and vocal health. The findings will also have implications for singing teachers and choral directors in relation to particular methods of vocal education and rehearsal.
Chan, Roger W; Siegmund, Thomas; Zhang, Kai
2009-12-01
Accurate characterization of biomechanical characteristics of the vocal fold is critical for understanding the regulation of vocal fundamental frequency (F0), which depends on the active control of the intrinsic laryngeal muscles as well as the passive biomechanical response of the vocal fold lamina propria. Specifically, the tissue stress-strain response and viscoelastic properties under cyclic tensile deformation are relevant, when the vocal folds are subjected to length and tension changes due to posturing. This paper describes a constitutive modeling approach quantifying the relationship between vocal fold stress and strain (or stretch), and establishes predictions of F0 with the string model of phonation based on the constitutive parameters. Results indicated that transient and time-dependent changes in F0, including global declinations in declarative sentences, as well as local F0 overshoots and undershoots, can be partially attributed to the time-dependent viscoplastic response of the vocal fold cover.
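Note: the string model of phonation referred to above is usually written as F0 = (1 / 2L) · sqrt(σ / ρ), where L is the vibrating length of the vocal fold, σ the longitudinal tensile stress, and ρ the tissue density. The sketch below simply evaluates that formula. The function name and the numerical values (16 mm length, 20 kPa stress, 1040 kg/m³ density) are illustrative assumptions, not parameters reported in the paper.

```python
import math


def string_model_f0(length_m, stress_pa, density_kg_m3=1040.0):
    """Fundamental frequency (Hz) of an ideal string under tensile stress.

    F0 = (1 / (2 L)) * sqrt(sigma / rho). The default density is an assumed,
    commonly quoted value for soft tissue.
    """
    return math.sqrt(stress_pa / density_kg_m3) / (2.0 * length_m)


# Hypothetical inputs chosen only to show the order of magnitude.
f0 = string_model_f0(length_m=0.016, stress_pa=20_000.0)
f0_double_stress = string_model_f0(length_m=0.016, stress_pa=40_000.0)
```

Because F0 scales with the square root of stress, a time-dependent relaxation of tissue stress at fixed length translates directly into a drifting F0, which is the link the abstract draws between the constitutive response and F0 declination.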
Animal models of speech and vocal communication deficits associated with psychiatric disorders
Konopka, Genevieve; Roberts, Todd F.
2015-01-01
Disruptions in speech, language and vocal communication are hallmarks of several neuropsychiatric disorders, most notably autism spectrum disorders. Historically, the use of animal models to dissect molecular pathways and connect them to behavioral endophenotypes in cognitive disorders has proven to be an effective approach for developing and testing disease-relevant therapeutics. The unique aspects of human language when compared to vocal behaviors in other animals make such an approach potentially more challenging. However, the study of vocal learning in species with analogous brain circuits to humans may provide entry points for understanding this human-specific phenotype and diseases. Here, we review animal models of vocal learning and vocal communication, and specifically link phenotypes of psychiatric disorders to relevant model systems. Evolutionary constraints in the organization of neural circuits and synaptic plasticity result in similarities in the brain mechanisms for vocal learning and vocal communication. Comparative approaches and careful consideration of the behavioral limitations among different animal models can provide critical avenues for dissecting the molecular pathways underlying cognitive disorders that disrupt speech, language and vocal communication. PMID:26232298
Computation of the three-dimensional medial surface dynamics of the vocal folds.
Döllinger, Michael; Berry, David A
2006-01-01
To increase our understanding of pathological and healthy voice production, quantitative measurement of the medial surface dynamics of the vocal folds is significant, albeit rarely performed because of the inaccessibility of the vocal folds. Using an excised hemilarynx methodology, a new calibration technique, herein referred to as the linear approximate (LA) method, was introduced to compute the three-dimensional coordinates of fleshpoints along the entire medial surface of the vocal fold. The results were compared with results from the direct linear transform. An associated error estimation was presented, demonstrating the improved accuracy of the new method. A test on real data was reported including computation of quantitative measurements of vocal fold dynamics.
Vast assembly of vocal marine mammals from diverse species on fish spawning ground.
Wang, Delin; Garcia, Heriberto; Huang, Wei; Tran, Duong D; Jain, Ankita D; Yi, Dong Hoon; Gong, Zheng; Jech, J Michael; Godø, Olav Rune; Makris, Nicholas C; Ratilal, Purnima
2016-03-17
Observing marine mammal (MM) populations continuously in time and space over the immense ocean areas they inhabit is challenging but essential for gathering an unambiguous record of their distribution, as well as understanding their behaviour and interaction with prey species. Here we use passive ocean acoustic waveguide remote sensing (POAWRS) in an important North Atlantic feeding ground to instantaneously detect, localize and classify MM vocalizations from diverse species over an approximately 100,000 km² region. More than eight species of vocal MMs are found to spatially converge on fish spawning areas containing massive densely populated herring shoals at night-time and diffuse herring distributions during daytime. We find the vocal MMs divide the enormous fish prey field into species-specific foraging areas with varying degrees of spatial overlap, maintained for at least two weeks of the herring spawning period. The recorded vocalization rates are diel (24 h)-dependent for all MM species, with some significantly more vocal at night and others more vocal during the day. The four key baleen whale species of the region (fin, humpback, blue and minke) have vocalization rate trends that are highly correlated to trends in fish shoaling density and to each other over the diel cycle. These results reveal the temporospatial dynamics of combined multi-species MM foraging activities in the vicinity of an extensive fish prey field that forms a massive ecological hotspot, and would be unattainable with conventional methodologies. Understanding MM behaviour and distributions is essential for management of marine ecosystems and for assessing anthropogenic impacts on these protected marine species.
Tardif, Carole; Lainé, France; Rodriguez, Mélissa; Gepner, Bruno
2007-09-01
This study examined the effects of slowing down presentation of facial expressions and their corresponding vocal sounds on facial expression recognition and facial and/or vocal imitation in children with autism. Twelve autistic children and twenty-four normal control children were presented with emotional and non-emotional facial expressions on CD-Rom, under audio or silent conditions, and under dynamic visual conditions (slowly, very slowly, at normal speed) plus a static control. Overall, children with autism showed lower performance in expression recognition and more induced facial-vocal imitation than controls. In the autistic group, facial expression recognition and induced facial-vocal imitation were significantly enhanced in slow conditions. Findings may give new perspectives for understanding and intervention for verbal and emotional perceptive and communicative impairments in autistic populations.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bouchard, Kristofer E.; Conant, David F.; Anumanchipalli, Gopala K.
A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial, especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship across speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics.
Anumanchipalli, Gopala K.; Dichter, Benjamin; Chaisanguanthum, Kris S.; Johnson, Keith; Chang, Edward F.
2016-01-01
A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial—especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship across speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics. PMID:27019106
Bouchard, Kristofer E.; Conant, David F.; Anumanchipalli, Gopala K.; ...
2016-03-28
A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial, especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship across speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics.
Vocalizations associated with anxiety and fear in the common marmoset (Callithrix jacchus).
Kato, Yoko; Gokan, Hayato; Oh-Nishi, Arata; Suhara, Tetsuya; Watanabe, Shigeru; Minamimoto, Takafumi
2014-12-15
Vocalizations of the common marmoset (Callithrix jacchus) were examined under experimental situations related to fear or anxiety. When marmosets were isolated in an unfamiliar environment, they frequently vocalized "tsik-egg" calls, which are combination calls consisting of a 'tsik' followed by several 'egg' calls. Tsik-egg calls were also observed after treatment with the anxiogenic drug FG-7142 (20 mg/kg, sc). In contrast, when marmosets were exposed to predatory stimuli as fear-evoking situations, they frequently vocalized tsik solo calls as well as tsik-egg calls. These results suggest that marmosets dissociate the vocalization of tsik-egg and tsik calls under conditions related to fear/anxiety; tsik-egg solo vocalizations were emitted under anxiety-related conditions (e.g., isolation and anxiogenic drug treatment), whereas a mixed vocalization of tsik-egg and tsik was emitted when confronted with fear-provoking stimuli (i.e., threatening predatory stimuli). The tsik-egg call, with or without tsik, can be used as a specific vocal index of fear/anxiety in marmosets, which allows us to understand the neural mechanism of negative emotions in primates. Copyright © 2014 Elsevier B.V. All rights reserved.
Vocal contagion of emotions in non-human animals
2018-01-01
Communicating emotions to conspecifics (emotion expression) allows the regulation of social interactions (e.g. approach and avoidance). Moreover, when emotions are transmitted from one individual to the next, leading to state matching (emotional contagion), information transfer and coordination between group members are facilitated. Despite the high potential for vocalizations to influence the affective state of surrounding individuals, vocal contagion of emotions has been largely unexplored in non-human animals. In this paper, I review the evidence for discrimination of vocal expression of emotions, which is a necessary step for emotional contagion to occur. I then describe possible proximate mechanisms underlying vocal contagion of emotions, propose criteria to assess this phenomenon and review the existing evidence. The literature so far shows that non-human animals are able to discriminate and be affected by conspecific and also potentially heterospecific (e.g. human) vocal expression of emotions. Since humans heavily rely on vocalizations to communicate (speech), I suggest that studying vocal contagion of emotions in non-human animals can lead to a better understanding of the evolution of emotional contagion and empathy. PMID:29491174
Effects of social games on infant vocalizations.
Hsu, Hui-Chin; Iyer, Suneeti Nathani; Fogel, Alan
2014-01-01
The aim of the present study was to examine the contextual effects of social games on prelinguistic vocalizations. The two main goals were to (1) investigate the functions of vocalizations as symptoms of affective arousal and symbols of social understanding, and (2) explore form-function (de)coupling relations between vocalization types and game contexts. Seventy-one six-month-olds and sixty-four twelve-month-olds played with their mothers in normal and perturbed tickle and peek-a-boo games. The effects of infant age, game, game climax, and game perturbation on the frequency and types of infant vocalizations were examined. Results showed twelve-month-olds vocalized more mature canonical syllables during peek-a-boo and more primitive quasi-resonant nuclei during tickle than six-month-olds. Six- and twelve-month-olds increased their vocalizations from the set-up to climax during peek-a-boo, but they did not show such an increase during tickle. Findings support the symptom function of prelinguistic vocalizations reflecting affective arousal and the prevalence of form-function decoupling during the first year of life.
Vocal Fold Surface Hydration: A review
Leydon, Ciara; Sivasankar, Mahalakshmi; Falciglia, Danielle Lodewyck; Atkins, Christopher; Fisher, Kimberly V.
2009-01-01
Vocal fold surface liquid homeostasis contributes to optimal vocal physiology. In this paper we review emerging evidence that vocal fold surface liquid is maintained in part by salt and water fluxes across the epithelium. Based on recent immunolocalization and electrophysiological findings, we describe a transcellular pathway as one mechanism for regulating superficial vocal fold hydration. We propose that the pathway includes the sodium-potassium pump, sodium-potassium-chloride cotransporter, epithelial sodium channels, cystic fibrosis transmembrane regulator chloride channels, and aquaporin water channels. By integrating knowledge of the regulating mechanisms underlying ion and fluid transport with observations from hydration challenges and treatments using in vitro and in vivo studies, we provide a theoretical basis for understanding how environmental and behavioral challenges and clinical interventions may modify vocal fold surface liquid composition. We present converging evidence that clinical protocols directed at facilitating vocal fold epithelial ion and fluid transport may benefit healthy speakers, those with voice disorders, and those at risk for voice disorders. PMID:19111440
The Effect of Classroom Capacity on Vocal Fatigue as Quantified by the Vocal Fatigue Index.
Banks, Russell E; Bottalico, Pasquale; Hunter, Eric J
2017-01-01
Previous research has concluded that teachers are at a higher-than-normal risk for voice issues that can cause occupational limitations. While some risk factors have been identified, there are still many unknowns. A survey was distributed electronically with 506 female teacher respondents. The survey included questions to quantify three aspects of vocal fatigue as captured by the Vocal Fatigue Index (VFI): (1) general tiredness of voice (performance), (2) physical discomfort associated with voicing (pain), and (3) improvement of symptoms with rest (recovery). The effect of classroom capacity on US teachers' self-reported experience of vocal fatigue was analyzed. The results indicated that a classroom's capacity significantly affected teachers' reported amounts of vocal fatigue, while a teacher's age also appeared to significantly affect the reported amount of vocal fatigue. A quadratic rather than linear effect was seen, with the largest age effect occurring at around 40-45 years in all three factors of the VFI. Further factors which may affect vocal fatigue must be explored in future research. By understanding what increases the risk for vocal fatigue, educators and school administrators can take precautions to mitigate the occupational risk of short- and long-term vocal health issues in school teachers. © 2017 S. Karger AG, Basel.
The biology and evolution of music: a comparative perspective.
Fitch, W Tecumseh
2006-05-01
Studies of the biology of music (as of language) are highly interdisciplinary and demand the integration of diverse strands of evidence. In this paper, I present a comparative perspective on the biology and evolution of music, stressing the value of comparisons both with human language, and with those animal communication systems traditionally termed "song". A comparison of the "design features" of music with those of language reveals substantial overlap, along with some important differences. Most of these differences appear to stem from semantic, rather than structural, factors, suggesting a shared formal core of music and language. I next review various animal communication systems that appear related to human music, either by analogy (bird and whale "song") or potential homology (great ape bimanual drumming). A crucial comparative distinction is between learned, complex signals (like language, music and birdsong) and unlearned signals (like laughter, ape calls, or bird calls). While human vocalizations clearly build upon an acoustic and emotional foundation shared with other primates and mammals, vocal learning has evolved independently in our species since our divergence from chimpanzees. The convergent evolution of vocal learning in other species offers a powerful window into psychological and neural constraints influencing the evolution of complex signaling systems (including both song and speech), while ape drumming presents a fascinating potential homology with human instrumental music. I next discuss the archeological data relevant to music evolution, concluding on the basis of prehistoric bone flutes that instrumental music is at least 40,000 years old, and perhaps much older. I end with a brief review of adaptive functions proposed for music, concluding that no one selective force (e.g., sexual selection) is adequate to explain all aspects of human music. I suggest that questions about the past function of music are unlikely to be answered definitively and are thus a poor choice as a research focus for biomusicology. In contrast, a comparative approach to music promises rich dividends for our future understanding of the biology and evolution of music.
Adapted to Roar: Functional Morphology of Tiger and Lion Vocal Folds
Klemuk, Sarah A.; Riede, Tobias; Walsh, Edward J.; Titze, Ingo R.
2011-01-01
Vocal production requires active control of the respiratory system, larynx and vocal tract. Vocal sounds in mammals are produced by flow-induced vocal fold oscillation, which requires vocal fold tissue that can sustain the mechanical stress during phonation. Our understanding of the relationship between morphology and vocal function of vocal folds is very limited. Here we tested the hypothesis that vocal fold morphology and viscoelastic properties allow a prediction of fundamental frequency range of sounds that can be produced, and minimal lung pressure necessary to initiate phonation. We tested the hypothesis in lions and tigers who are well-known for producing low frequency and very loud roaring sounds that expose vocal folds to large stresses. In histological sections, we found that the Panthera vocal fold lamina propria consists of a lateral region with adipocytes embedded in a network of collagen and elastin fibers and hyaluronan. There is also a medial region that contains only fibrous proteins and hyaluronan but no fat cells. Young's moduli range between 10 and 2000 kPa for strains up to 60%. Shear moduli ranged between 0.1 and 2 kPa and differed between layers. Biomechanical and morphological data were used to make predictions of fundamental frequency and subglottal pressure ranges. Such predictions agreed well with measurements from natural phonation and phonation of excised larynges, respectively. We assume that fat shapes Panthera vocal folds into an advantageous geometry for phonation and it protects vocal folds. Its primary function is probably not to increase vocal fold mass as suggested previously. The large square-shaped Panthera vocal fold eases phonation onset and thereby extends the dynamic range of the voice. PMID:22073246
Insights Into the Role of Collagen in Vocal Fold Health and Disease.
Tang, Sharon S; Mohad, Vidisha; Gowda, Madhu; Thibeault, Susan L
2017-09-01
As one of the key fibrous proteins in the extracellular matrix, collagen plays a significant role in the structural and biomechanical characteristics of the vocal fold. Anchored fibrils of collagen create secure structural regions within the vocal folds and are strong enough to sustain vibratory impact and stretch during phonation. This contributes tensile strength, density, and organization to the vocal folds and influences health and pathogenesis. This review offers a comprehensive summary for a current understanding of collagen within normal vocal fold tissues throughout the life span as well as vocal pathology and wound repair. Further, collagen's molecular structure and biosynthesis are discussed. Finally, collagen alterations in tissue injury and repair and the incorporation of collagen-based biomaterials as a method of treating voice disorders are reviewed. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Histological changes in vocal fold growth and aging.
Kuhn, Maggie A
2014-12-01
Sophisticated descriptions of the highly specialized vocal fold microarchitecture have been available for over three decades, but how this anatomy evolves with growth and aging remains an area of active investigation and, at times, a source of controversy. As our aging population expands and interest in pediatric voice disorders blossoms, it is timely to consider our contemporary understanding of evolving vocal fold histology and its implications for voice production. Novel applications of existing and emerging biotechnology, development of animal models and skillful use of human specimens have afforded greater insights into the histologic vocal fold changes seen throughout the lifespan in health and disease. Burgeoning knowledge has laid the foundation for more comprehensive models of vocal fold histology and has led to the development of innovative therapies for challenging voice disorders.
Allen, Jacqui
2010-12-01
The prolonged debilitation, loss of income, and decrement in quality of life caused by vocal fold scar is exacerbated by our inability to successfully treat this difficult problem. As technology focuses on developing innovative treatments, we need to fully appreciate and understand the mechanisms giving rise to glottal scar, on both a macroscopic and microscopic level. This review examines recent literature pertaining to the gross and molecular mechanisms which give rise to vocal fold scar. Mechanisms of vocal fold scar production have been examined in both macroscopic and microscopic detail. Trauma and injury involving any aspect of the lamina propria, particularly the deeper layers, may result in epithelial tethering and scar formation. At the molecular level, early inflammatory cytokines activate and recruit fibroblasts which then drive the fibrotic cascade. Transforming growth factor-β enhances fibrosis and is balanced by tissue matrix metalloproteinases and hepatocyte growth factor activity. Molecular signaling offers novel opportunities to intervene in scar formation. New work investigating the cause of vocal fold scar identifies complex molecular processes leading to fibrosis in the lamina propria. Improved mechanistic understanding offers insight into prevention strategies and possible targets for antifibrotic therapies that may help prevent or treat this debilitating condition.
Tissue Engineering-based Therapeutic Strategies for Vocal Fold Repair and Regeneration
Li, Linqing; Stiadle, Jeanna M.; Lau, Hang K.; Zerdoum, Aidan B.; Jia, Xinqiao; Thibeault, Susan L.; Kiick, Kristi L.
2016-01-01
Vocal folds are soft laryngeal connective tissues with distinct layered structures and complex multicomponent matrix compositions that endow phonatory and respiratory functions. This delicate tissue is easily damaged by various environmental factors and pathological conditions, altering vocal biomechanics and causing debilitating vocal disorders that detrimentally affect the daily lives of suffering individuals. Modern techniques and advanced knowledge of regenerative medicine have led to a deeper understanding of the microstructure, microphysiology, and micropathophysiology of vocal fold tissues. State-of-the-art materials ranging from extracellular-matrix (ECM)-derived biomaterials to synthetic polymer scaffolds have been proposed for the prevention and treatment of voice disorders including vocal fold scarring and fibrosis. This review intends to provide a thorough overview of current achievements in the field of vocal fold tissue engineering, including the fabrication of injectable biomaterials to mimic in vitro cell microenvironments, novel designs of bioreactors that capture in vivo tissue biomechanics, and establishment of various animal models to characterize the in vivo biocompatibility of these materials. The combination of polymeric scaffolds, cell transplantation, biomechanical stimulation, and delivery of antifibrotic growth factors will lead to successful restoration of functional vocal folds and improved vocal recovery in animal models, facilitating the application of these materials and related methodologies in clinical practice. PMID:27619243
Vocal Features of Song and Speech: Insights from Schoenberg's Pierrot Lunaire
Merrill, Julia; Larrouy-Maestri, Pauline
2017-01-01
Similarities and differences between speech and song are often examined. However, the perceptual definition of these two types of vocalization is challenging. Indeed, the prototypical characteristics of speech or song support top-down processes, which influence listeners' perception of acoustic information. In order to examine vocal features associated with speaking and singing, we propose an innovative approach designed to facilitate bottom-up mechanisms in perceiving vocalizations by using material situated between speech and song: Speechsong. 25 participants were asked to evaluate 20 performances of a speechsong composition by Arnold Schoenberg, “Pierrot lunaire” op. 21 from 1912, evaluating 20 features of vocal-articulatory expression. Raters provided reliable judgments concerning the vocal features used by the performers and did not show strong appeal or specific expectations in reference to Schoenberg's piece. By examining the relationship between the vocal features and the impression of song or speech, the results confirm the importance of pitch (height, contour, range), but also point to the relevance of register, timbre, tension and faucal distance. Besides highlighting vocal features associated with speech and song, this study supports the relevance of the present approach of focusing on a theoretical middle category in order to better understand vocal expression in song and speech. PMID:28744233
Social behaviors and acoustic vocalizations in different strains of mice.
Faure, Alexis; Pittaras, Elsa; Nosjean, Anne; Chabout, Jonathan; Cressant, Arnaud; Granon, Sylvie
2017-03-01
Proposing a framework for the study of core functions is valuable for understanding how they are altered in multiple mental disorders involving prefrontal dysfunction, for understanding genetic influences and for testing therapeutic compounds. Social and communication disabilities are reported in several major psychiatric disorders, and social communication disorders also can occur independently. Being able to study social communication involving interactions and associated acoustic vocalizations in animal models is thus important. All rodents display extensive social behaviors, including interactions and acoustic vocalizations. It is therefore important to pinpoint potential genetic-related strain differences (and similarities) in social behavior and vocalization. One approach is to compare different mouse strains, and this may be useful in choosing which strains are most suitable for modeling psychiatric disorders where social and communication deficits are core symptoms. We compared social behavior and ultrasonic acoustic vocalization profiles in males of four mouse strains (129S2/Sv, C57BL/6J, DBA/2, and CD-1) using a social interaction task that we previously showed to rely on prefrontal network activity. Our social interaction task promotes a high level of ultrasonic vocalization with both social and acoustic parameters, and further allows other measures of social behaviors. The duration of social contact, dominance and aggressiveness varied with the mouse strains. Only C57BL/6J mice showed no attacks, with social contact being highly affiliative, whereas the other strains displayed aggressive attacks. C57BL/6J mice also exhibited a significantly higher rate of ultrasonic vocalizations (USV), especially during social interaction. Copyright © 2016 Elsevier B.V. All rights reserved.
Zhang, Zhaoyan
2016-01-01
The goal of this study is to better understand the cause-effect relation between vocal fold physiology and the resulting vibration pattern and voice acoustics. Using a three-dimensional continuum model of phonation, the effects of changes in vocal fold stiffness, medial surface thickness in the vertical direction, resting glottal opening, and subglottal pressure on vocal fold vibration and different acoustic measures are investigated. The results show that the medial surface thickness has dominant effects on the vertical phase difference between the upper and lower margins of the medial surface, closed quotient, H1-H2, and higher-order harmonics excitation. The main effects of vocal fold approximation or decreasing resting glottal opening are to lower the phonation threshold pressure, reduce noise production, and increase the fundamental frequency. Increasing subglottal pressure is primarily responsible for vocal intensity increase but also leads to significant increase in noise production and an increased fundamental frequency. Increasing AP stiffness significantly increases the fundamental frequency and slightly reduces noise production. The interaction among vocal fold thickness, stiffness, approximation, and subglottal pressure in the control of F0, vocal intensity, and voice quality is discussed. PMID:27106298
Döllinger, M; Rosanowski, F; Eysholdt, U; Lohscheller, J
2008-12-01
The understanding of normal and pathological vocal fold dynamics is the basis for a pathophysiologically motivated voice therapy. Crucial vocal fold dynamics concerning voice production occur at the medial part of the vocal fold, which is seen as the most critical region of mucosal wave propagation. Due to the limited size of the larynx, the possibilities of laryngeal imaging by endoscopic techniques are limited. This work describes an experimental set-up that enables quantification of the entire medial and superior vocal fold surface using excised human and in vivo canine larynges. The data obtained enable analysis of vocal fold deflections, velocities, and mucosal wave propagation. The reciprocal dependencies can be examined and different areas of vocal fold dynamics located. The vertical components obscured in clinical endoscopy can be visualized. This is not negligible. In particular it is shown that the vertical deflection, which cannot be observed by clinical examination, plays an important part in the dynamics and therefore cannot be omitted for therapeutic procedures. The theoretically assumed entrainment and influence of the two main vibration modes enabling normal phonation is confirmed.
Schmidt, Marc F.; McLean, Judith; Goller, Franz
2011-01-01
The production of vocalizations is intimately linked to the respiratory system. Despite our understanding of neural circuits that generate normal respiratory patterns, very little is understood regarding how these ponto-medullary circuits become engaged during vocal production. Songbirds offer a potentially powerful model system for addressing this relationship. Songs dramatically alter the respiratory pattern in ways that are often highly predictable and songbirds have a specialized telencephalic vocal motor circuit that provides massive innervation to a brainstem respiratory network that shares many similarities with its mammalian counterpart. In this review, we highlight interactions between the song motor circuit and the respiratory system, describing how both systems likely interact to produce the complex respiratory patterns that are observed during vocalization. We also discuss how the respiratory system, through its bilateral bottom-up projections to thalamus, might play a key role in sending precisely timed signals that synchronize premotor activity in both hemispheres. PMID:21984733
Başkent, Deniz; Fuller, Christina D; Galvin, John J; Schepel, Like; Gaudrain, Etienne; Free, Rolien H
2018-05-01
In adult normal-hearing musicians, perception of music, vocal emotion, and speech in noise has been previously shown to be better than in non-musicians, sometimes even with spectro-temporally degraded stimuli. In this study, melodic contour identification, vocal emotion identification, and speech understanding in noise were measured in young adolescent normal-hearing musicians and non-musicians listening to unprocessed or degraded signals. Unlike in adults, there was no musician effect for vocal emotion identification or speech in noise. Melodic contour identification with degraded signals was significantly better in musicians, suggesting potential benefits from music training for young cochlear-implant users, who experience similar spectro-temporal signal degradations.
Factors Associated with Problematic Vocalizations in Nursing Home Residents with Dementia
ERIC Educational Resources Information Center
Beck, Cornelia; Richards, Kathy; Lambert, Corinne; Doan, Rebecca; Landes, Reid D.; Whall, Ann; Algase, Donna; Kolanowski, Ann; Feldman, Zachary
2011-01-01
Purpose of the Study: Problematic vocalizations (PVs) are the most frequent and persistent disruptive behaviors exhibited by nursing home residents with dementia. Understanding factors associated with these behaviors is important to prevent or reduce them. We used the Need-Driven Dementia-Compromised Behavior model to identify the characteristics…
Vocal Changes Following Thyroid Surgery: Prospective Study of Objective and Subjective Parameters.
Delgado-Vargas, Beatriz; Lloris Romero-Salazar, Azucena; Cobeta, Ignacio
2017-10-19
Vocal changes are frequent following a surgical procedure to the thyroid gland. Even though they are a recognized morbidity, their bases are yet to be defined as well as their effect on vocal parameters. This study investigates the objective and subjective changes that occur after the surgery. This study is a prospective analysis of consecutive cases. This study was conducted in a single-center tertiary care facility. Patients scheduled for any thyroid procedure in Hospital Universitario Ramón y Cajal were enrolled consecutively and underwent vocal analysis before and after the surgery from April 2014 to April 2016. Patients were divided according to the vocal fold motility, and their vocal and aerodynamic parameters were obtained by means of electroglottography and phonatory aerodynamic system. Patients filled in the 10-item Voice Handicap Index (VHI-10) questionnaire. Statistical analysis was performed comparing vocal and aerodynamic parameters and quality of life before and after the surgery. 218 patients met inclusion criteria and completed the protocol. A total of 86.6% of the sample showed no vocal fold motility impairment, whereas the rest of the patients showed a paresis or a paralysis. Maximum phonatory time and VHI-10 questionnaire showed a statistically significant difference (P < 0.05) between groups. No differences were found regarding other vocal parameters. Efforts are still needed to understand the basis and magnitude of the vocal changes after a thyroid surgery. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Donahue, Erin N; Leborgne, Wendy D; Brehm, Susan Baker; Weinrich, Barbara D
2014-05-01
Collegiate-level musical theater performance students are a specialized group of vocal performers, who rely on frequent and optimal voice use for their academic advancement and ultimate livelihood. The purpose of this study was to gather information to develop a greater understanding of vocal health and practice patterns of incoming collegiate-level musical theater performers. Data obtained from questionnaires completed by freshman musical theater majors were retrospectively analyzed to gather information about baseline vocal habits of the participants. Results of a questionnaire were obtained from incoming freshman musical theater students at the Cincinnati Conservatory of Music over a period of 10 years (2002-2011). One hundred eighty-eight participants (female = 90) (male = 98) with an average age of 18.28 years (standard deviation = 0.726) were included. Results specifying participants' self-reported vocal training and practice habits, vocal health and hygiene practices, and current vocal symptoms or contributing factors to potential voice problems are provided. Data obtained from the participants revealed that the potential for vocal problems exists in this group of performers, as over half of the subjects reported at least one current negative vocal symptom. The findings from this study provide information that may be useful for individuals who are involved in the training of vocal performers. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Carey, Daniel; McGettigan, Carolyn
2017-04-01
The human vocal system is highly plastic, allowing for the flexible expression of language, mood and intentions. However, this plasticity is not stable throughout the life span, and it is well documented that adult learners encounter greater difficulty than children in acquiring the sounds of foreign languages. Researchers have used magnetic resonance imaging (MRI) to interrogate the neural substrates of vocal imitation and learning, and the correlates of individual differences in phonetic "talent". In parallel, a growing body of work using MR technology to directly image the vocal tract in real time during speech has offered primarily descriptive accounts of phonetic variation within and across languages. In this paper, we review the contribution of neural MRI to our understanding of vocal learning, and give an overview of vocal tract imaging and its potential to inform the field. We propose methods by which our understanding of speech production and learning could be advanced through the combined measurement of articulation and brain activity using MRI; specifically, we describe a novel paradigm, developed in our laboratory, that uses both MRI techniques to map directly, for the first time, between neural, articulatory and acoustic data in the investigation of vocalisation. This non-invasive, multimodal imaging method could be used to track central and peripheral correlates of spoken language learning, and speech recovery in clinical settings, as well as provide insights into potential sites for targeted neural interventions. Copyright © 2016 Elsevier Ltd. All rights reserved.
Soma, Masayo; Mori, Chihiro
2015-01-01
Music and dance are two remarkable human characteristics that are closely related. Communication through integrated vocal and motional signals is also common in the courtship displays of birds. The contribution of songbird studies to our understanding of vocal learning has already shed some light on the cognitive underpinnings of musical ability. Moreover, recent pioneering research has begun to show how animals can synchronize their behaviors with external stimuli, like metronome beats. However, few studies have applied such perspectives to unraveling how animals can integrate multimodal communicative signals that have natural functions. Additionally, studies have rarely asked how well these behaviors are learned. With this in mind, here we cast a spotlight on an unusual animal behavior: non-vocal sound production associated with singing in the Java sparrow (Lonchura oryzivora), a songbird. We show that male Java sparrows coordinate their bill-click sounds with the syntax of their song-note sequences, similar to percussionists. Analysis showed that they produced clicks frequently toward the beginning of songs and before/after specific song notes. We also show that bill-clicking patterns are similar between social fathers and their sons, suggesting that these behaviors might be learned from models or linked to learning-based vocalizations. Individuals untutored by conspecifics also exhibited stereotypical bill-clicking patterns in relation to song-note sequence, indicating that while the production of bill clicking itself is intrinsic, its syncopation appears to develop with songs. This paints an intriguing picture in which non-vocal sounds are integrated with vocal courtship signals in a songbird, a model that we expect will contribute to the further understanding of multimodal communication. PMID:25992841
Vocal Imitations of Non-Vocal Sounds
Houix, Olivier; Voisin, Frédéric; Misdariis, Nicolas; Susini, Patrick
2016-01-01
Imitative behaviors are widespread in humans, in particular whenever two persons communicate and interact. Several tokens of spoken languages (onomatopoeias, ideophones, and phonesthemes) also display different degrees of iconicity between the sound of a word and what it refers to. Thus, it probably comes as no surprise that human speakers use a lot of imitative vocalizations and gestures when they communicate about sounds, as sounds are notably difficult to describe. What is more surprising is that vocal imitations of non-vocal everyday sounds (e.g. the sound of a car passing by) are in practice very effective: listeners identify sounds better with vocal imitations than with verbal descriptions, despite the fact that vocal imitations are inaccurate reproductions of a sound created by a particular mechanical system (e.g. a car driving by) through a different system (the voice apparatus). The present study investigated the semantic representations evoked by vocal imitations of sounds by experimentally quantifying how well listeners could match sounds to category labels. The experiment used three different types of sounds: recordings of easily identifiable sounds (sounds of human actions and manufactured products), human vocal imitations, and computational “auditory sketches” (created by algorithmic computations). The results show that performance with the best vocal imitations was similar to the best auditory sketches for most categories of sounds, and even to the referent sounds themselves in some cases. More detailed analyses showed that the acoustic distance between a vocal imitation and a referent sound is not sufficient to account for such performance. Analyses suggested that instead of trying to reproduce the referent sound as accurately as vocally possible, vocal imitations focus on a few important features, which depend on each particular sound category.
These results offer perspectives for understanding how human listeners store and access long-term sound representations, and set the stage for the development of human-computer interfaces based on vocalizations. PMID:27992480
Nonlinear acoustics in the pant-hoot vocalization of common chimpanzees (Pan troglodytes)
NASA Astrophysics Data System (ADS)
Riede, Tobias; Arcadi, Adam Clark; Owren, Michael J.
2003-04-01
Pant-hoots produced by chimpanzees are multi-call vocalizations. While predominantly harmonically structured, pant-hoots can exhibit acoustic complexity that has recently been found to result from inherent nonlinearity in the vocal-fold dynamics. This complexity reflects abrupt shifts between qualitatively distinct vibration patterns (known as modes), which include but are not limited to simple, synchronous movements by the two vocal folds. Studies with humans in particular have shown that as the amplitude and vibration rate increase, vocal-fold action becomes increasingly susceptible to higher-order synchronizations, desynchronized movements, and irregular behavior. We examined the occurrence of these sorts of nonlinear phenomena in pant-hoots, contrasting quieter and lower-pitched introduction components with loud and high-pitched climax calls in the same sounds. Spectrographic evidence revealed four classic kinds of nonlinear phenomena: discrete frequency jumps, subharmonics, biphonation, and deterministic chaos. While these events were virtually never found in the introduction, they occurred in more than half of the climax calls. Biphonation was by far the most common. Individual callers varied in the degree to which their climax calls exhibited nonlinear phenomena, but were consistent in showing more biphonation than any of the other forms. These outcomes demonstrate that understanding these calls requires an understanding of such events.
The Neural Basis of Vocal Pitch Imitation in Humans.
Belyk, Michel; Pfordresher, Peter Q; Liotti, Mario; Brown, Steven
2016-04-01
Vocal imitation is a phenotype that is unique to humans among all primate species, and so an understanding of its neural basis is critical in explaining the emergence of both speech and song in human evolution. Two principal neural models of vocal imitation have emerged from a consideration of nonhuman animals. One hypothesis suggests that putative mirror neurons in the inferior frontal gyrus pars opercularis of Broca's area may be important for imitation. An alternative hypothesis derived from the study of songbirds suggests that the corticostriate motor pathway performs sensorimotor processes that are specific to vocal imitation. Using fMRI with a sparse event-related sampling design, we investigated the neural basis of vocal imitation in humans by comparing imitative vocal production of pitch sequences with both nonimitative vocal production and pitch discrimination. The strongest difference between these tasks was found in the putamen bilaterally, providing a striking parallel to the role of the analogous region in songbirds. Other areas preferentially activated during imitation included the orofacial motor cortex, Rolandic operculum, and SMA, which together outline the corticostriate motor loop. No differences were seen in the inferior frontal gyrus. The corticostriate system thus appears to be the central pathway for vocal imitation in humans, as predicted from an analogy with songbirds.
High-precision spatial localization of mouse vocalizations during social interaction.
Heckman, Jesse J; Proville, Rémi; Heckman, Gert J; Azarfar, Alireza; Celikel, Tansu; Englitz, Bernhard
2017-06-07
Mice display a wide repertoire of vocalizations that varies with age, sex, and context. Especially during courtship, mice emit ultrasonic vocalizations (USVs) of high complexity, whose detailed structure is poorly understood. As animals of both sexes vocalize, the study of social vocalizations requires attributing single USVs to individuals. The state of the art in sound localization for USVs allows spatial localization at centimeter resolution; however, animals interact at closer ranges, involving tactile, snout-snout exploration. Hence, improved algorithms are required to reliably assign USVs. We develop multiple solutions to USV localization, and derive an analytical solution for arbitrary vertical microphone positions. The algorithms are compared on wideband acoustic noise and single mouse vocalizations, and applied to social interactions with optically tracked mouse positions. A novel, (frequency) envelope weighted generalised cross-correlation outperforms classical cross-correlation techniques. It achieves a median error of ~1.4 mm for noise and ~4-8.5 mm for vocalizations. Using this algorithm in combination with a level criterion, we can improve the assignment for interacting mice. We report significant differences in mean USV properties between CBA mice of different sexes during social interaction. Hence, the improved USV attribution to individuals lays the basis for a deeper understanding of social vocalizations, in particular sequences of USVs.
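To illustrate the classical cross-correlation baseline that the abstract compares against, the sketch below estimates the arrival-time difference of one sound at two microphones. It is not the envelope weighted generalised cross-correlation described by the authors. The function names, the 250 kHz sample rate, the test signal, and the 343 m/s speed of sound are all assumptions made for this example.

```python
def cross_correlation_lag(x, y, max_lag):
    """Return the lag (in samples) that maximises the cross-correlation.

    A positive result means signal y is a delayed copy of signal x.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for n, xn in enumerate(x):
            m = n + lag
            if 0 <= m < len(y):
                score += xn * y[m]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def lag_to_path_difference_mm(lag, sample_rate_hz, speed_of_sound_m_s=343.0):
    """Convert a sample lag into a path-length difference in millimetres."""
    return lag / sample_rate_hz * speed_of_sound_m_s * 1000.0


# Hypothetical data: the same short pulse reaches microphone B three samples late.
pulse = [0.0, 1.0, -2.0, 3.0, -1.0, 0.5]
mic_a = pulse + [0.0] * 10
mic_b = [0.0] * 3 + pulse + [0.0] * 7

lag = cross_correlation_lag(mic_a, mic_b, max_lag=8)
print(lag, lag_to_path_difference_mm(lag, sample_rate_hz=250_000))
```

With several microphone pairs, such path differences constrain the source position. Plain cross-correlation of this kind is the baseline that the abstract reports being outperformed.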
A budget of energy transfer in a sustained vocal folds vibration in glottis
NASA Astrophysics Data System (ADS)
Zhang, Lucy; Yang, Jubiao; Krane, Michael
2016-11-01
A set of force and energy balance equations using the control volume approach is derived from first principles for sustained vocal fold vibration in the glottis. The control volume analysis is done for compressible airflow in a moving and deforming control volume in the vicinity of the vocal folds. The interaction between laryngeal airflow and the vocal folds is simulated using the modified Immersed Finite Element Method (mIFEM), a fully coupled approach to simulating fluid-structure interactions. Detailed mathematical terms are separated out for deeper physical understanding, and the utilization of mechanical energy is quantified with the derived equation. The results show that the majority of the energy input is consumed in driving laryngeal airflow, while a smaller portion compensates for viscous losses in the vocal folds and sustains their vibration. We acknowledge the funding support of NIH 2R01DC005642-10A1.
Cohen, Alex S; Dinzeo, Thomas J; Donovan, Neila J; Brown, Caitlin E; Morrison, Sean C
2015-03-30
Vocal expression reflects an integral component of communication that varies considerably within individuals across contexts and is disrupted in a range of neurological and psychiatric disorders. There is reason to suspect that variability in vocal expression reflects, in part, the availability of "on-line" resources (e.g., working memory, attention). Thus, understanding vocal expression is a potentially important biometric index of information processing, not only across but within individuals over time. A first step in this line of research involves establishing a link between vocal expression and information processing systems in healthy adults. The present study employed a dual attention experimental task where participants provided natural speech while simultaneously engaged in a baseline, medium or high nonverbal processing-load task. Objective, automated, and computerized analysis was employed to measure vocal expression in 226 adults. Increased processing load resulted in longer pauses, fewer utterances, greater silence overall and less variability in frequency and intensity levels. These results provide compelling evidence of a link between information processing resources and vocal expression, and provide important information for the development of an automated, inexpensive and uninvasive biometric measure of information processing. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Vocal fold tissue failure: preliminary data and constitutive modeling.
Chan, Roger W; Siegmund, Thomas
2004-08-01
In human voice production (phonation), linear small-amplitude vocal fold oscillation occurs only under restricted conditions. Physiologically, phonation more often involves large-amplitude oscillation associated with tissue stresses and strains beyond their linear viscoelastic limits, particularly in the lamina propria extracellular matrix (ECM). This study reports some preliminary measurements of tissue deformation and failure response of the vocal fold ECM under large-strain shear. The primary goal was to formulate and test a novel constitutive model for vocal fold tissue failure, based on a standard-linear cohesive-zone (SL-CZ) approach. Tissue specimens of the sheep vocal fold mucosa were subjected to torsional deformation in vitro, at constant strain rates corresponding to twist rates of 0.01, 0.1, and 1.0 rad/s. The vocal fold ECM demonstrated nonlinear stress-strain and rate-dependent failure response with a failure strain as low as 0.40 rad. A finite-element implementation of the SL-CZ model was capable of capturing the rate dependence in these preliminary data, demonstrating the model's potential for describing tissue failure. Further studies with additional tissue specimens and model improvements are needed to better understand vocal fold tissue failure.
Warren, Megan R; Sangiamo, Daniel T; Neunuebel, Joshua P
2018-03-01
An integral component in the assessment of vocal behavior in groups of freely interacting animals is the ability to determine which animal is producing each vocal signal. This process is facilitated by using microphone arrays with multiple channels. Here, we made important refinements to a state-of-the-art microphone array based system used to localize vocal signals produced by freely interacting laboratory mice. Key changes to the system included increasing the number of microphones as well as refining the methodology for localizing and assigning vocal signals to individual mice. We systematically demonstrate that the improvements in the methodology for localizing mouse vocal signals led to an increase in the number of signals detected as well as the number of signals accurately assigned to an animal. These changes facilitated the acquisition of larger and more comprehensive data sets that better represent the vocal activity within an experiment. Furthermore, this system will allow more thorough analyses of the role that vocal signals play in social communication. We expect that such advances will broaden our understanding of social communication deficits in mouse models of neurological disorders. Copyright © 2018 Elsevier B.V. All rights reserved.
Scattoni, Maria Luisa; Crawley, Jacqueline; Ricceri, Laura
2009-01-01
In neonatal mice ultrasonic vocalizations have been studied both as an early communicative behavior of the pup-mother dyad and as a sign of an aversive affective state. Adult mice of both sexes produce complex ultrasonic vocalization patterns in different experimental/social contexts. All these vocalizations are becoming an increasingly valuable assay for behavioral phenotyping throughout the mouse life-span and alterations of the ultrasound patterns have been reported in several mouse models of neurodevelopmental disorders. Here we also show that the modulation of vocalizations by maternal cues (maternal potentiation paradigm) – originally identified and investigated in rats - can be measured in C57Bl/6 mouse pups with appropriate modifications of the rat protocol and can likely be applied to mouse behavioral phenotyping. In addition we suggest that a detailed qualitative evaluation of neonatal calls together with analysis of adult mouse vocalization patterns in both sexes in social settings, may lead to a greater understanding of the communication value of vocalizations in mice. Importantly, both neonatal and adult USV altered patterns can be determined during the behavioural phenotyping of mouse models of human neurodevelopmental and neuropsychiatric disorders, starting from those in which deficits in communication are a primary symptom. PMID:18771687
On cortical coding of vocal communication sounds in primates
NASA Astrophysics Data System (ADS)
Wang, Xiaoqin
2000-10-01
Understanding how the brain processes vocal communication sounds is one of the most challenging problems in neuroscience. Our understanding of how the cortex accomplishes this unique task should greatly facilitate our understanding of cortical mechanisms in general. Perception of species-specific communication sounds is an important aspect of the auditory behavior of many animal species and is crucial for their social interactions, reproductive success, and survival. The principles of neural representations of these behaviorally important sounds in the cerebral cortex have direct implications for the neural mechanisms underlying human speech perception. Our progress in this area has been relatively slow, compared with our understanding of other auditory functions such as echolocation and sound localization. This article discusses previous and current studies in this field, with emphasis on nonhuman primates, and proposes a conceptual platform to further our exploration of this frontier. It is argued that the prerequisite condition for understanding cortical mechanisms underlying communication sound perception and production is an appropriate animal model. Three issues are central to this work: (i) neural encoding of statistical structure of communication sounds, (ii) the role of behavioral relevance in shaping cortical representations, and (iii) sensory-motor interactions between vocal production and perception systems.
On drawing a line through the spectrogram: how do we understand deficits of vocal pitch imitation?
Pfordresher, Peter Q.; Larrouy-Maestri, Pauline
2015-01-01
In recent years there has been a remarkable increase in research focusing on deficits of pitch production in singing. A critical concern has been the identification of “poor pitch singers,” which we refer to more generally as individuals having a “vocal pitch imitation deficit.” The present paper includes a critical assessment of the assumption that vocal pitch imitation abilities can be treated as a dichotomy. Though this practice may be useful for data analysis and may be necessary within educational practice, we argue that this approach is complicated by a series of problems. Moreover, we argue that a more informative (and less problematic) approach comes from analyzing vocal pitch imitation abilities on a continuum, referred to as effect magnitude regression, and offer examples concerning how researchers may analyze data using this approach. We also argue that the understanding of this deficit may be better served by focusing on the effects of experimental manipulations on different individuals, rather than attempt to treat values of individual measures, and isolated tasks, as absolute measures of ability. PMID:26029088
Evaluation of Synthetic Self-Oscillating Models of the Vocal Folds
NASA Astrophysics Data System (ADS)
Hubler, Elizabeth P.; Weiland, Kelley S.; Hancock, Adrienne B.; Plesniak, Michael W.
2013-11-01
Approximately 30% of people will suffer from a voice disorder at some point in their lives. The probability doubles for those who rely heavily on their voice, such as teachers and singers. Synthetic vocal fold (VF) models are fabricated and evaluated experimentally in a vocal tract simulator to replicate physiological conditions. Pressure measurements are acquired along the vocal tract and high-speed images are captured at varying flow rates during VF oscillation to facilitate understanding of the characteristics of healthy and damaged VFs. The images are analyzed using a videokymography line-scan technique that has been used to examine VF motion and mucosal wave dynamics in vivo. Clinically relevant parameters calculated from the volume-velocity output of a circumferentially-vented mask (Rothenberg mask) are compared to patient data. This study integrates speech science with engineering and flow physics to overcome current limitations of synthetic VF models to properly replicate normal phonation in order to advance the understanding of resulting flow features, progression of pathological conditions, and medical techniques. Supported by the GW Institute for Biomedical Engineering (GWIBE) and GW Center for Biomimetics and Bioinspired Engineering (COBRE).
Vocalizations of adult male Asian koels (Eudynamys scolopacea) in the breeding season.
Khan, Abdul Aziz; Qureshi, Irfan Zia
2017-01-01
Defining the vocal repertoire provides a basis for understanding the role of acoustic signals in sexual and social interactions of an animal. The Asian koel (Eudynamys scolopacea) is a migratory bird which spends its summer breeding season in the plains of Pakistan. The bird is typically wary and secretive but produces loud and distinct calls, making it easily detected when unseen. Like other birds in the wild, Asian koels presumably use their calls for social cohesion and coordination of different behaviors. To date, a description of the vocal repertoire of the male Asian koel has been lacking. Here we analyzed and described for the first time the vocalizations of the adult male Asian koel, recorded in two consecutive breeding seasons. Using 10 call parameters, we categorized the vocalizations into six different categories on the basis of spectrogram and statistical analyses, namely the "type 1 cooee call", "type 2 cooee call", "type 1 coegh call", "type 2 coegh call", "wurroo call" and "coe call". These names were assigned not on the basis of functional analysis and were therefore onomatopoeic. Stepwise cross-validated discriminant function analysis classified the vocalizations correctly (100%) into the predicted vocal categories that we initially classified on the basis of spectrographic examination. Our findings enrich the biological knowledge about vocalizations of the adult male Asian koel and provide a foundation for future acoustic monitoring of the species, as well as for comparative studies with vocalizations of other bird species of the cuckoo family. Further studies on the vocalizations of the Asian koel are required to unravel their functions in sexual selection and individual recognition.
Zeitels, Steven M; Hillman, Robert E; Desloge, Rosemary; Mauri, Marcello; Doyle, Patricia B
2002-12-01
Phonomicrosurgery in performing artists has historically been approached with great trepidation, and vocal outcome data are sparse. The vocal liability of surgically disturbing the superficial lamina propria (SLP) and epithelium must be balanced with the inherent detrimental vocal effect of the lesion(s). A prospective investigation was performed on 185 performing artists who underwent phonomicrosurgical resection of 365 lesions: 201 nodules, 71 polyps, 66 varices and ectasias, 13 cysts, 8 keratotic lesions, 2 granulomas, 2 Reinke's edema, and 2 papillomas. Nearly all patients with SLP lesions reported improvement in their postsurgical vocal function. This subjective result was supported by objective acoustic and aerodynamic measures. All postsurgical objective vocal function measures fell within normal limits, including a few that displayed presurgical abnormalities. However, given the relative insensitivity of standard objective measures to assess higher-level vocal performance-related factors, it is even more noteworthy that 8 of 24 objective measures displayed statistically significant postsurgical improvements in vocal function. Such changes in objective measures mostly reflect overall enhancement in the efficiency of voice production. Phonomicrosurgical resection of vocal fold lesions in performing artists is enjoying an expanding role because of a variety of improvements in diagnostic assessment, surgical instrumentation and techniques, and specialized rehabilitation. Most of these lesions are the result of phonotrauma and arise within the SLP. Successful management depends on prudent patient selection and counseling, ultraprecise technique, and vigorous vocal rehabilitation. Furthermore, an understanding of the vocal function and dysfunction of this high-performance population provides all otolaryngologists who manage laryngeal problems with valuable information that they can extrapolate for use in their practices.
Visual classification of feral cat Felis silvestris catus vocalizations.
Owens, Jessica L; Olsen, Mariana; Fontaine, Amy; Kloth, Christopher; Kershenbaum, Arik; Waller, Sara
2017-06-01
Cat vocal behavior, in particular the vocal and social behavior of feral cats, is poorly understood, as are the differences between feral and fully domestic cats. The relationship between feral cat social and vocal behavior is important because of the markedly different ecology of feral and domestic cats, and enhanced comprehension of the repertoire and potential information content of feral cat calls can provide both better understanding of the domestication and socialization process, and improved welfare for feral cats undergoing adoption. Previous studies have used conflicting classification schemes for cat vocalizations, often relying on onomatopoeic or popular descriptions of call types (e.g., "miow"). We studied the vocalizations of 13 unaltered domestic cats that complied with our behavioral definition used to distinguish feral cats from domestic. A total of 71 acoustic units were extracted and visually analyzed for the construction of a hierarchical classification of vocal sounds, based on acoustic properties. We identified 3 major categories (tonal, pulse, and broadband) that further break down into 8 subcategories, and show a high degree of reliability when sounds are classified blindly by independent observers (Fleiss' kappa, K = 0.863). Due to the limited behavioral contexts in this study, additional subcategories of cat vocalizations may be identified in the future, but our hierarchical classification system allows for the addition of new categories and new subcategories as they are described. This study shows that cat vocalizations are diverse and complex, and provides an objective and reliable classification system that can be used in future studies.
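The reliability figure quoted above is a Fleiss' kappa, which measures agreement among a fixed number of raters assigning items to categories, corrected for chance agreement. The following is a minimal sketch of the standard formula; the function name and the small rating table are hypothetical and are not data from the study.

```python
def fleiss_kappa(counts):
    """Compute Fleiss' kappa.

    counts[i][j] is the number of raters who assigned item i to category j.
    Every item must be rated by the same number of raters (at least two).
    """
    n_items = len(counts)
    n_raters = sum(counts[0])
    n_categories = len(counts[0])

    # Observed agreement: mean over items of the fraction of agreeing rater pairs.
    per_item = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(per_item) / n_items

    # Chance agreement from the overall share of ratings in each category.
    shares = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    p_expected = sum(s * s for s in shares)

    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical table: 4 sounds, 3 raters, categories (tonal, pulse, broadband).
ratings = [
    [3, 0, 0],
    [0, 3, 0],
    [0, 2, 1],
    [0, 0, 3],
]
print(round(fleiss_kappa(ratings), 3))
```

A value of 1 indicates perfect agreement and a value near 0 indicates agreement no better than chance.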
Cyclic Adenosine Monophosphate Regulation of Ion Transport in Porcine Vocal Fold Mucosae
Sivasankar, Mahalakshmi; Nofziger, Charity; Blazer-Yost, Bonnie
2012-01-01
Objectives/Hypothesis: Cyclic adenosine monophosphate (cAMP) is an important biological molecule that regulates ion transport and inflammatory responses in epithelial tissue. The present study examined whether the adenylyl cyclase activator, forskolin, would increase cAMP concentration in porcine vocal fold mucosa and whether the effects of increased cAMP would be manifested as a functional increase in transepithelial ion transport. Additionally, changes in cAMP concentrations following exposure to an inflammatory mediator, tumor necrosis factor-α (TNFα), were investigated. Study Design: In vitro experimental design with matched treatment and control groups. Methods: Porcine vocal fold mucosae (N = 30) and tracheal mucosae (N = 20) were exposed to forskolin, TNFα, or vehicle (dimethyl sulfoxide) treatment. cAMP concentrations were determined with enzyme-linked immunosorbent assay. Ion transport was measured using electrophysiological techniques. Results: Thirty-minute exposure to forskolin significantly increased cAMP concentration and ion transport in porcine vocal fold and tracheal mucosae. However, 30-minute and 2-hour exposure to TNFα did not significantly alter cAMP concentration. Conclusions: We demonstrate that forskolin-sensitive adenylyl cyclase is present in vocal fold mucosa, and further, that the product, cAMP, increases vocal fold ion transport. The results presented here contribute to our understanding of the intracellular mechanisms underlying vocal fold ion transport. As ion transport is important for maintaining superficial vocal fold hydration, data demonstrating forskolin-stimulated ion transport in vocal fold mucosa suggest opportunities for developing pharmacological treatments that increase surface hydration. PMID:18596479
Milutinović, Z; Vasiljević, J
1992-05-01
The etiological theories of vocal fold cysts can be divided into two basic groups: those of congenital and acquired cysts. In ongoing practice, the authors had noted that the greater number of cysts appeared at the functionally most active segment of the vocal folds which, on the other hand, has the least number of glands. Also, it had been noted that patients with vocal fold cysts tended to have hyperkinetic patterns of voice production. These observations indicated the possibility of a functional aspect in the etiology of vocal fold cysts, and consideration of such a possibility was the aim of this work. In 37 cases, the exact location of the cyst was established. In addition, the muscular activity of the phonatory apparatus was estimated, patient self-descriptions with respect to talkativeness were taken into account, and histological evaluations were made. The cysts were most frequently found in the area of the junction of the anterior and middle thirds of the free edge of the vocal fold. Muscular activity during speech and phonation was increased in study patients. Sixty-five percent of patients had epidermoid cysts and 35% had retention cysts of the vocal fold. According to study results, the functional aspect of cyst genesis has a marked role in the etiology of vocal fold cysts, which points to the great importance of functional care for cyst patients.
Readability and Understandability of Online Vocal Cord Paralysis Materials.
Balakrishnan, Vini; Chandy, Zachariah; Hseih, Amy; Bui, Thanh-Lan; Verma, Sunil P
2016-03-01
Patients use several online resources to learn about vocal cord paralysis (VCP). The objective of this study was to assess the readability and understandability of online VCP patient education materials (PEMs), with readability assessments and the Patient Education Materials Evaluation Tool (PEMAT), respectively. The relationship between readability and understandability was then analyzed. Descriptive and correlational design. Online PEMs were identified by performing a Google search with the term "vocal cord paralysis." After scientific webpages, news articles, and information for medical professionals were excluded, 29 articles from the first 50 search results were considered. Readability analysis was performed with 6 formulas. Four individuals with different educational backgrounds conducted understandability analysis with the PEMAT. Fleiss's Kappa interrater reliability analysis determined consistency among raters. Correlation between readability and understandability was determined with Pearson's correlation test. The reading level of the reviewed articles ranged from grades 9 to 17. Understandability ranged from 29% to 82%. Correlation analysis demonstrated a strong negative correlation between materials' readability and understandability (r = -0.462, P < .05). Online PEMs pertaining to VCP are written above the recommended reading levels. Overall, materials written at lower grade levels are more understandable. However, articles of identical grade levels had varying levels of understandability. The PEMAT may provide a more critical evaluation of the quality of a PEM when compared with readability formulas. Both readability and understandability should be used to evaluate PEMs. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
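The readability versus understandability relationship above is summarized with Pearson's correlation coefficient. The sketch below shows how such a coefficient is computed; the function name and the five data points are invented for illustration and are not the study's data.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    return covariance / math.sqrt(spread_x * spread_y)


# Hypothetical materials: reading grade level and PEMAT understandability (%).
grade_levels = [9, 11, 13, 15, 17]
understandability = [80, 70, 75, 50, 40]

print(round(pearson_r(grade_levels, understandability), 3))
```

A negative coefficient means that materials written at higher grade levels tend to score lower on understandability, which is the direction reported in the abstract.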
Neural Correlates of Vocal Production and Motor Control in Human Heschl's Gyrus
Oya, Hiroyuki; Nourski, Kirill V.; Kawasaki, Hiroto; Larson, Charles R.; Brugge, John F.; Howard, Matthew A.; Greenlee, Jeremy D.W.
2016-01-01
The present study investigated how pitch frequency, a perceptually relevant aspect of periodicity in natural human vocalizations, is encoded in Heschl's gyrus (HG), and how this information may be used to influence vocal pitch motor control. We recorded local field potentials from multicontact depth electrodes implanted in HG of 14 neurosurgical epilepsy patients as they vocalized vowel sounds and received brief (200 ms) pitch perturbations at 100 Cents in their auditory feedback. Event-related band power responses to vocalizations showed sustained frequency following responses that tracked voice fundamental frequency (F0) and were significantly enhanced in posteromedial HG during speaking compared with when subjects listened to the playback of their own voice. In addition to frequency following responses, a transient response component within the high gamma frequency band (75–150 Hz) was identified. When this response followed the onset of vocalization, the magnitude of the response was the same for the speaking and playback conditions. In contrast, when this response followed a pitch shift, its magnitude was significantly enhanced during speaking compared with playback. We also observed that, in anterolateral HG, the power of high gamma responses to pitch shifts correlated with the magnitude of compensatory vocal responses. These findings demonstrate a functional parcellation of HG with neural activity that encodes pitch in natural human voice, distinguishes between self-generated and passively heard vocalizations, detects discrepancies between the intended and heard vocalization, and contains information about the resulting behavioral vocal compensations in response to auditory feedback pitch perturbations. SIGNIFICANCE STATEMENT The present study is a significant contribution to our understanding of sensor-motor mechanisms of vocal production and motor control. 
The findings demonstrate distinct functional parcellation of core and noncore areas within human auditory cortex on Heschl's gyrus that process natural human vocalizations and pitch perturbations in the auditory feedback. In addition, our data provide evidence for distinct roles of high gamma neural oscillations and frequency following responses for processing periodicity in human vocalizations during vocal production and motor control. PMID:26888939
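The 100-cent perturbation used in this study corresponds to one equal-tempered semitone, because an octave (a doubling of frequency) spans 1200 cents. The short sketch below converts between cents and frequency; the function names and the 200 Hz starting pitch are assumptions chosen for illustration.

```python
import math


def shift_by_cents(frequency_hz, cents):
    """Return the frequency obtained by shifting frequency_hz by the given cents."""
    return frequency_hz * 2.0 ** (cents / 1200.0)


def cents_between(reference_hz, frequency_hz):
    """Return the interval from reference_hz to frequency_hz in cents."""
    return 1200.0 * math.log2(frequency_hz / reference_hz)


# Hypothetical voice fundamental frequency of 200 Hz, shifted up by 100 cents.
shifted = shift_by_cents(200.0, 100.0)
print(round(shifted, 2), round(cents_between(200.0, shifted), 1))
```

Because the scale is logarithmic, the same shift in cents corresponds to a larger change in hertz for a higher-pitched voice.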
Three-Dimensional Flow Separation Induced by a Model Vocal Fold Polyp
NASA Astrophysics Data System (ADS)
Stewart, Kelley C.; Erath, Byron D.; Plesniak, Michael W.
2012-11-01
The fluid-structure energy exchange process for normal speech has been studied extensively, but it is not well understood for pathological conditions. Polyps and nodules, which are geometric abnormalities that form on the medial surface of the vocal folds, can disrupt vocal fold dynamics and thus can have devastating consequences on a patient's ability to communicate. A recent in-vitro investigation of a model polyp in a driven vocal fold apparatus demonstrated that such a geometric abnormality considerably disrupts the glottal jet behavior and that this flow field adjustment was a likely reason for the severe degradation of the vocal quality in patients. Understanding of the formation and propagation of vortical structures from a geometric protuberance, and their subsequent impact on the aerodynamic loadings that drive vocal fold dynamics, is a critical component in advancing the treatment of this pathological condition. The present investigation concerns the three-dimensional flow separation induced by a wall-mounted prolate hemispheroid with a 2:1 aspect ratio in cross flow, i.e. a model vocal fold polyp. Unsteady three-dimensional flow separation and its impact on the wall pressure loading are examined using skin friction line visualization and wall pressure measurements. Supported by the National Science Foundation, Grant No. CBET-1236351 and GW Center for Biomimetics and Bioinspired Engineering (COBRE).
A duetting perspective on avian song learning.
Rivera-Cáceres, Karla D; Templeton, Christopher N
2017-12-25
Avian song learning has a rich history of study and has become the preeminent system for understanding the ontogeny of vocal communication in animals. Song learning in birds has many parallels with human language learning, ranging from the neural mechanisms involved to the importance of social factors in shaping signal acquisition. While much has been learned about the process of song learning, virtually all of the research done to date has focused on temperate species, where often only one sex (the male) sings. Duetting species, in which both males and females learn to sing and learn to combine their songs into temporally coordinated joint displays, could provide many insights into the processes by which vocal learning takes place. Here we highlight three key features of song learning (neuroendocrine control mechanisms, timing and life history stages of song acquisition, and the role of social factors in song selection and use) that have been elucidated from species where only males sing, and compare these with duetting species. We summarize what is known about song learning in duetting species and then provide several suggestions for fruitful directions for future research. We suggest that focusing research efforts on duetting species could significantly advance our understanding of vocal learning in birds and further cement the importance of avian species as models for understanding human conversations and the processes of vocal learning more broadly. Copyright © 2017 Elsevier B.V. All rights reserved.
Food-Associated Calling in Gorillas (Gorilla g. gorilla) in the Wild
Luef, Eva Maria; Breuer, Thomas; Pika, Simone
2016-01-01
Many nonhuman primates produce food-associated vocalizations upon encountering or ingesting particular food. Concerning the great apes, only food-associated vocalizations of chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) have been studied in detail, providing evidence that these vocalizations can be produced flexibly in relation to a variety of factors, such as the quantity and quality of food and/or the type of audience. Only anecdotal evidence exists of eastern (Gorilla beringei) and western gorillas (Gorilla gorilla) producing food-associated vocalizations, termed singing or humming. To enable a better understanding of the context in which these calls are produced, we investigated and compared the vocal behavior of two free-ranging groups of western lowland gorillas (Gorilla g. gorilla) at Mondika, Republic of Congo. Our results show that (a) food-associated call production occurs only during feeding and not in other contexts; (b) calling is not uniformly distributed across age and sex classes; (c) calls are only produced during feeding on specific foods; and (d) normally just one individual gives calls during group feeding sessions, however, certain food types elicit simultaneous calling of two or more individuals. Our findings provide new insight into the vocal abilities of gorillas but also carry larger implications for questions concerning vocal variability among the great apes. Food-associated calls of nonhuman primates have been shown to be flexible in terms of when they are used and who they are directed at, making them interesting vocalizations from the viewpoint of language evolution. Food-associated vocalizations in great apes can offer new opportunities to investigate the phylogenetic development of vocal communication within the primate lineage and can possibly contribute novel insights into the origins of human language. PMID:26909518
Vocal Qualities in Music Theater Voice: Perceptions of Expert Pedagogues.
Bourne, Tracy; Kenny, Dianna
2016-01-01
To gather qualitative descriptions of music theater vocal qualities including belt, legit, and mix from expert pedagogues to better define this voice type. This is a prospective, semistructured interview. Twelve expert teachers from United States, United Kingdom, Asia, and Australia were interviewed by Skype and asked to identify characteristics of music theater vocal qualities including vocal production, physiology, esthetics, pitch range, and pedagogical techniques. Responses were compared with published studies on music theater voice. Belt and legit were generally described as distinct sounds with differing physiological and technical requirements. Teachers were concerned that belt should be taught "safely" to minimize vocal health risks. There was consensus between teachers and published research on the physiology of the glottis and vocal tract; however, teachers were not in agreement about breathing techniques. Neither were teachers in agreement about the meaning of "mix." Most participants described belt as heavily weighted, thick folds, thyroarytenoid-dominant, or chest register; however, there was no consensus on an appropriate term. Belt substyles were named and generally categorized by weightedness or tone color. Descriptions of male belt were less clear than for female belt. This survey provides an overview of expert pedagogical perspectives on the characteristics of belt, legit, and mix qualities in the music theater voice. Although teacher responses are generally in agreement with published research, there are still many controversial issues and gaps in knowledge and understanding of this vocal technique. Breathing techniques, vocal range, mix, male belt, and vocal registers require continuing investigation so that we can learn more about efficient and healthy vocal function in music theater singing. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Auditory Signal Processing in Communication: Perception and Performance of Vocal Sounds
Prather, Jonathan F.
2013-01-01
Learning and maintaining the sounds we use in vocal communication require accurate perception of the sounds we hear performed by others and feedback-dependent imitation of those sounds to produce our own vocalizations. Understanding how the central nervous system integrates auditory and vocal-motor information to enable communication is a fundamental goal of systems neuroscience, and insights into the mechanisms of those processes will profoundly enhance clinical therapies for communication disorders. Gaining the high-resolution insight necessary to define the circuits and cellular mechanisms underlying human vocal communication is presently impractical. Songbirds are the best animal model of human speech, and this review highlights recent insights into the neural basis of auditory perception and feedback-dependent imitation in those animals. Neural correlates of song perception are present in auditory areas, and those correlates are preserved in the auditory responses of downstream neurons that are also active when the bird sings. Initial tests indicate that singing-related activity in those downstream neurons is associated with vocal-motor performance as opposed to the bird simply hearing itself sing. Therefore, action potentials related to auditory perception and action potentials related to vocal performance are co-localized in individual neurons. Conceptual models of song learning involve comparison of vocal commands and the associated auditory feedback to compute an error signal that is used to guide refinement of subsequent song performances, yet the sites of that comparison remain unknown. Convergence of sensory and motor activity onto individual neurons points to a possible mechanism through which auditory and vocal-motor signals may be linked to enable learning and maintenance of the sounds used in vocal communication. PMID:23827717
The evolution of speech: a comparative review.
Fitch
2000-07-01
The evolution of speech can be studied independently of the evolution of language, with the advantage that most aspects of speech acoustics, physiology and neural control are shared with animals, and thus open to empirical investigation. At least two changes were necessary prerequisites for modern human speech abilities: (1) modification of vocal tract morphology, and (2) development of vocal imitative ability. Despite an extensive literature, attempts to pinpoint the timing of these changes using fossil data have proven inconclusive. However, recent comparative data from nonhuman primates have shed light on the ancestral use of formants (a crucial cue in human speech) to identify individuals and gauge body size. Second, comparative analysis of the diverse vertebrates that have evolved vocal imitation (humans, cetaceans, seals and birds) provides several distinct, testable hypotheses about the adaptive function of vocal mimicry. These developments suggest that, for understanding the evolution of speech, comparative analysis of living species provides a viable alternative to fossil data. However, the neural basis for vocal mimicry and for mimesis in general remains unknown.
Visual classification of feral cat Felis silvestris catus vocalizations
Owens, Jessica L.; Olsen, Mariana; Fontaine, Amy; Kloth, Christopher; Kershenbaum, Arik
2017-01-01
Cat vocal behavior, in particular, the vocal and social behavior of feral cats, is poorly understood, as are the differences between feral and fully domestic cats. The relationship between feral cat social and vocal behavior is important because of the markedly different ecology of feral and domestic cats, and enhanced comprehension of the repertoire and potential information content of feral cat calls can provide both better understanding of the domestication and socialization process, and improved welfare for feral cats undergoing adoption. Previous studies have used conflicting classification schemes for cat vocalizations, often relying on onomatopoeic or popular descriptions of call types (e.g., “miow”). We studied the vocalizations of 13 unaltered domestic cats that complied with our behavioral definition used to distinguish feral cats from domestic. A total of 71 acoustic units were extracted and visually analyzed for the construction of a hierarchical classification of vocal sounds, based on acoustic properties. We identified 3 major categories (tonal, pulse, and broadband) that further break down into 8 subcategories, and show a high degree of reliability when sounds are classified blindly by independent observers (Fleiss’ Kappa K = 0.863). Due to the limited behavioral contexts in this study, additional subcategories of cat vocalizations may be identified in the future, but our hierarchical classification system allows for the addition of new categories and new subcategories as they are described. This study shows that cat vocalizations are diverse and complex, and provides an objective and reliable classification system that can be used in future studies. PMID:29491992
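The inter-observer reliability statistic cited in this abstract, Fleiss' kappa, can be illustrated with a minimal sketch of the standard formula. The rating table below is hypothetical and is not the study's data; the function name is an assumption for illustration.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa for a table where counts[i][j] is the number of raters
    who assigned item i to category j (same number of raters for every item)."""
    n_items = len(counts)
    n_raters = sum(counts[0])
    n_categories = len(counts[0])

    # Share of all assignments that fell into each category.
    p_j = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    # Observed agreement among raters for each item.
    p_i = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_i) / n_items          # mean observed agreement
    p_e = sum(p * p for p in p_j)       # agreement expected by chance
    return (p_bar - p_e) / (1.0 - p_e)


# Hypothetical example: 4 sounds, 3 observers, categories (tonal, pulse, broadband).
ratings = [
    [3, 0, 0],
    [0, 3, 0],
    [0, 2, 1],
    [0, 0, 3],
]
print(round(fleiss_kappa(ratings), 3))
```

Values near 1 indicate agreement well beyond chance, and values at or below 0 indicate agreement no better than chance.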
A meta-analysis of outcomes of hydration intervention on phonation threshold pressure.
Leydon, Ciara; Wroblewski, Marcin; Eichorn, Naomi; Sivasankar, Mahalakshmi
2010-11-01
Vocal fold hydration is purported to promote optimal biomechanical characteristics of vocal fold mucosa, increase efficiency of vocal fold oscillation, and enhance voice quality. The purpose of this work was to determine the magnitude and consistency of the effect of vocal fold hydration on vocal fold function across published clinical studies. We completed a comprehensive meta-analysis of the effects of superficial and systemic vocal fold hydration on phonation threshold pressure (PTP), a measure of efficiency of voice production. We identified 34 studies that examined the effects of hydration on vocal function. Of these studies, 14 examined the effects of hydration on PTP. Nine of these articles met the criteria for inclusion in this analysis. We observed an average effect size of 0.33, indicating that, overall, hydration treatment demonstrated a tendency to reduce PTP. However, this decrease in phonatory effort did not reach significance at the 95% confidence level. The effects of hydration intervention varied considerably across studies (-0.19 to 3.96). We considered that two factors, pitch level of the task and vocal health of participants, may have contributed to this variability in findings. However, our analysis found that these factors could not account for differences in effect size. To understand the variability in outcomes across studies, the role of factors that may impact the effects of hydration, such as the amount, type, and duration of intervention, must be determined. Only then can we obtain data to guide best clinical practice for protecting and rehabilitating vocal function. Copyright © 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
A Chinese alligator in heliox: formant frequencies in a crocodilian
Reber, Stephan A.; Nishimura, Takeshi; Janisch, Judith; Robertson, Mark; Fitch, W. Tecumseh
2015-01-01
Crocodilians are among the most vocal non-avian reptiles. Adults of both sexes produce loud vocalizations known as ‘bellows’ year round, with the highest rate during the mating season. Although the specific function of these vocalizations remains unclear, they may advertise the caller's body size, because relative size differences strongly affect courtship and territorial behaviour in crocodilians. In mammals and birds, a common mechanism for producing honest acoustic signals of body size is via formant frequencies (vocal tract resonances). To our knowledge, formants have to date never been documented in any non-avian reptile, and formants do not seem to play a role in the vocalizations of anurans. We tested for formants in crocodilian vocalizations by using playbacks to induce a female Chinese alligator (Alligator sinensis) to bellow in an airtight chamber. During vocalizations, the animal inhaled either normal air or a helium/oxygen mixture (heliox) in which the velocity of sound is increased. Although heliox allows normal respiration, it alters the formant distribution of the sound spectrum. An acoustic analysis of the calls showed that the source signal components remained constant under both conditions, but an upward shift of high-energy frequency bands was observed in heliox. We conclude that these frequency bands represent formants. We suggest that crocodilian vocalizations could thus provide an acoustic indication of body size via formants. Because birds and crocodilians share a common ancestor with all dinosaurs, a better understanding of their vocal production systems may also provide insight into the communication of extinct Archosaurians. PMID:26246611
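The reasoning behind the heliox test can be sketched with a textbook idealization: treating the vocal tract as a uniform tube closed at one end, the resonances are F_n = (2n - 1)c / (4L), so they scale with the speed of sound c while the source frequency does not. The tube length and the heliox sound speed below are assumed round numbers for illustration, not measurements from the study.

```python
def tube_resonances(speed_of_sound, tube_length, count=3):
    """Resonances (Hz) of a uniform tube closed at one end and open at the other."""
    return [
        (2 * n - 1) * speed_of_sound / (4.0 * tube_length)
        for n in range(1, count + 1)
    ]


C_AIR = 343.0      # m/s, approximate speed of sound in air at about 20 degrees C
C_HELIOX = 578.0   # m/s, assumed value; the real figure depends on the gas mixture
TRACT_LENGTH = 0.25  # m, hypothetical vocal tract length

air = tube_resonances(C_AIR, TRACT_LENGTH)
heliox = tube_resonances(C_HELIOX, TRACT_LENGTH)
for f_air, f_heliox in zip(air, heliox):
    # Each resonance moves up by the same factor, C_HELIOX / C_AIR.
    print(f"{f_air:8.1f} Hz in air -> {f_heliox:8.1f} Hz in heliox")
```

Under this idealization every resonance shifts upward by the same ratio, which is the signature that separates vocal tract resonances from source components.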
A Mutation Associated with Stuttering Alters Mouse Pup Ultrasonic Vocalizations.
Barnes, Terra D; Wozniak, David F; Gutierrez, Joanne; Han, Tae-Un; Drayna, Dennis; Holy, Timothy E
2016-04-13
A promising approach to understanding the mechanistic basis of speech is to study disorders that affect speech without compromising other cognitive or motor functions. Stuttering, also known as stammering, has been linked to mutations in the lysosomal enzyme-targeting pathway, but how this remarkably specific speech deficit arises from mutations in a family of general "cellular housekeeping" genes is unknown. To address this question, we asked whether a missense mutation associated with human stuttering causes vocal or other abnormalities in mice. We compared vocalizations from mice engineered to carry a mutation in the Gnptab (N-acetylglucosamine-1-phosphotransferase subunits alpha/beta) gene with wild-type littermates. We found significant differences in the vocalizations of pups with the human Gnptab stuttering mutation compared to littermate controls. Specifically, we found that mice with the mutation emitted fewer vocalizations per unit time and had longer pauses between vocalizations and that the entropy of the temporal sequence was significantly reduced. Furthermore, Gnptab missense mice were similar to wild-type mice on an extensive battery of non-vocal behaviors. We then used the same language-agnostic metrics for auditory signal analysis of human speech. We analyzed speech from people who stutter with mutations in this pathway and compared it to control speech and found abnormalities similar to those found in the mouse vocalizations. These data show that mutations in the lysosomal enzyme-targeting pathway produce highly specific effects in mouse pup vocalizations and establish the mouse as an attractive model for studying this disorder. Copyright © 2016 Elsevier Ltd. All rights reserved.
Kucinschi, Bogdan R; Scherer, Ronald C; DeWitt, Kenneth J; Ng, Terry T M
2006-06-01
Flow visualization with smoke particles illuminated by a laser sheet was used to obtain a qualitative description of the air flow structures through a dynamically similar 7.5x symmetric static scale model of the human larynx (divergence angle of 10 deg, minimal diameter of 0.04 cm real life). The acoustic level downstream of the vocal folds was measured by using a condenser microphone. False vocal folds (FVFs) were included. In general, the glottal flow was laminar and bistable. The glottal jet curvature increased with flow rate and decreased with the presence of the FVFs. The glottal exit flow for the lowest flow rate showed a curved jet which remained laminar for all geometries. For the higher flow rates, the jet flow patterns exiting the glottis showed a laminar jet core, transitioning to vortical structures, and leading spatially to turbulent dissipation. This structure was shortened and tightened with an increase in flow rate. The narrow FVF gap lengthened the flow structure and reduced jet curvature via acceleration of the flow. These results suggest that laryngeal flow resistance and the complex jet flow structure exiting the glottis are highly affected by flow rate and the presence of the false vocal folds. Acoustic consequences are discussed in terms of the quadrupole- and dipole-type sound sources due to ordered flow structures.
Singing and Vocal Instruction in Primary Schools: An Analysis from Six Case Studies in Spain
ERIC Educational Resources Information Center
Cuadrado, Albina; Rusinek, Gabriel
2016-01-01
This is an analysis of how specialist music teachers sing and teach how to sing, based on data collected from six case studies carried out in Spanish primary schools. The study aimed at understanding classroom singing practices, and in particular the provision or absence of vocal instruction in relation with teachers' singing models. The findings…
From Leisure to Work: Amateur Musicians Taking up Instrumental or Vocal Teaching as a Second Career
ERIC Educational Resources Information Center
Taylor, Angela; Hallam, Susan
2011-01-01
This article aims to increase our understanding of how amateur musicians become teachers as a change of career, how they use their musical and life skills in their teaching, and how their teaching impacts on their musical identity. The questionnaire responses of 67 career-change instrumental and vocal teachers showed evidence of their strong…
Miller, Nicola A; Gregory, Jennifer S; Aspden, Richard M; Stollery, Peter J; Gilbert, Fiona J
2014-09-01
The shape of the vocal tract and associated structures (eg, tongue and velum) is complicated and varies according to development and function. This variability challenges interpretation of voice experiments. Quantifying differences between shapes and understanding how vocal structures move in relation to each other is difficult using traditional linear and angle measurements. With statistical shape models, shape can be characterized in terms of independent modes of variation. Here, we build an active shape model (ASM) to assess morphologic and pitch-related functional changes affecting vocal structures and the airway. Using a cross-sectional study design, we obtained six midsagittal magnetic resonance images from 10 healthy adults (five men and five women) at rest, while breathing out, and while listening to, and humming low and high notes. Eighty landmark points were chosen to define the shape of interest and an ASM was built using these (60) images. Principal component analysis was used to identify independent modes of variation, and statistical analysis was performed using one-way repeated-measures analysis of variance. Twenty modes of variation were identified with modes 1 and 2 accounting for half the total variance. Modes 1 and 9 were significantly associated with humming low and high notes (P < 0.001) and showed coordinated changes affecting the cervical spine, vocal structures, and airway. Mode 2 highlighted wide structural variations between subjects. This study highlights the potential of active shape modeling to advance understanding of factors underlying morphologic and pitch-related functional variations affecting vocal structures and the airway in health and disease. Copyright © 2014 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
High-speed digital phonoscopy images analyzed by Nyquist plots
NASA Astrophysics Data System (ADS)
Yan, Yuling
2012-02-01
Vocal-fold vibration is a key dynamic event in voice production, and the vibratory characteristics of the vocal fold correlate closely with voice quality and health condition. Laryngeal imaging provides direct means to observe the vocal fold vibration; in the past, however, available modalities were either too slow or impractical to resolve the actual vocal fold vibrations. This limitation has now been overcome by high-speed digital imaging (HSDI) (or high-speed digital phonoscopy), which records images of the vibrating vocal folds at a rate of 2000 frames per second or higher, fast enough to resolve a specific, sustained phonatory vocal fold vibration. The subsequent image-based functional analysis of voice is essential to better understanding the mechanism underlying voice production, as well as assisting the clinical diagnosis of voice disorders. Our primary objective is to develop a comprehensive analytical platform for voice analysis using the HSDI recordings. So far, we have developed various analytical approaches for the HSDI-based voice analyses. These include Nyquist plots and associated analyses that are used along with FFT and Spectrogram in the analysis of the HSDI data representing normal voice and specific voice pathologies.
Vibrational dynamics of vocal folds using nonlinear normal modes.
Pinheiro, Alan P; Kerschen, Gaëtan
2013-08-01
Many previous works involving physical models, excised and in vivo larynges have pointed out nonlinear vibration in vocal folds during voice production. Moreover, theoretical studies involving mechanical modeling of these folds have tried to gain a profound understanding of the observed nonlinear phenomena. In this context, the present work uses the nonlinear normal mode theory to investigate the nonlinear modal behavior of 16 subjects using a two-mass mechanical modeling of the vocal folds. The free response of the conservative system at different energy levels is considered to assess the impact of the structural nonlinearity of the vocal fold tissues. The results show very interesting and complex nonlinear phenomena including frequency-energy dependence, subharmonic regimes and, in some cases, modal interactions, entrainment and bifurcations. Copyright © 2012 IPEM. Published by Elsevier Ltd. All rights reserved.
[Study of the supra-glottic pressure during partial constriction of the vocal tract].
Suares, M; Cayrayre, F; Ouaknine, M; de la Brèteque, B Amy; Giovanni, A
2004-01-01
Phonation in a small plastic tube 22 cm in length and 5 mm in diameter (basic exercise of the method of Dr Amy de la Brèteque) is current practice in vocal rehabilitation in France. This work aims to show the effects of this method on the glottic vibration. The hypothesis was that at the time of phonation in the tube with a strong flow as recommended in the method, the vocal cords vibrate without contact. This limits the mechanical trauma at this level. We analyzed the sound production in a tube in 11 trained and untrained subjects. We simultaneously collected the intra-oral air pressure and the vocal signal, which was subjected to a spectral analysis. Spectral analysis confirmed that the signal was produced correctly i.e. with a strong flow and without interruption of the sound less rich in harmonics. We interpreted these results in the light of our previous work on the glottic vibration and we show that this vocal production was of the sinusoidal type; this implies the absence of physical contact between the vocal cords, which validates our hypothesis. Further work is necessary to better understand the physical relations between the supra-glottic aerodynamic phenomena and the vibratory functioning of the vocal cords and also to analyze the therapeutic potential of the method within speech therapy rehabilitation.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wood, R.; Springston, S.; Mechoso, C. R.
2011-01-21
The VAMOS Ocean-Cloud-Atmosphere-Land Study Regional Experiment (VOCALS-REx) was an international field program designed to make observations of poorly understood but critical components of the coupled climate system of the southeast Pacific. This region is characterized by strong coastal upwelling, the coolest SSTs in the tropical belt, and is home to the largest subtropical stratocumulus deck on Earth. The field intensive phase of VOCALS-REx took place during October and November 2008 and constitutes a critical part of a broader CLIVAR program (VOCALS) designed to develop and promote scientific activities leading to improved understanding, model simulations, and predictions of the southeastern Pacific (SEP) coupled ocean-atmosphere-land system, on diurnal to interannual timescales. The other major components of VOCALS are a modeling program with a model hierarchy ranging from the local to global scales, and a suite of extended observations from regular research cruises, instrumented moorings, and satellites. The two central themes of VOCALS-REx focus upon (a) links between aerosols, clouds and precipitation and their impacts on marine stratocumulus radiative properties, and (b) physical and chemical couplings between the upper ocean and the lower atmosphere, including the role that mesoscale ocean eddies play. A set of hypotheses designed to be tested with the combined field, monitoring and modeling work in VOCALS is presented here. A further goal of VOCALS-REx is to provide datasets for the evaluation and improvement of large-scale numerical models. VOCALS-REx involved five research aircraft, two ships and two surface sites in northern Chile. We describe the instrument payloads and key mission strategies for these platforms and give a summary of the missions conducted.
Divergence of a stereotyped call in northern resident killer whales.
Grebner, Dawn M; Parks, Susan E; Bradley, David L; Miksis-Olds, Jennifer L; Capone, Dean E; Ford, John K B
2011-02-01
Northern resident killer whale pods (Orcinus orca) have distinctive stereotyped pulsed call repertoires that can be used to distinguish groups acoustically. Repertoires are generally stable, with the same call types comprising the repertoire of a given pod over a period of years to decades. Previous studies have shown that some discrete pulsed calls can be subdivided into variants or subtypes. This study suggests that new stereotyped calls may result from the gradual modification of existing call types through subtypes. Vocalizations of individuals and small groups of killer whales were collected using a bottom-mounted hydrophone array in Johnstone Strait, British Columbia in 2006 and 2007. Discriminant analysis of slope variations of a predominant call type, N4, revealed the presence of four distinct call subtypes. Similar to previous studies, there was a divergence of the N4 call between members of different matrilines of the same pod. However, this study reveals that individual killer whales produced multiple subtypes of the N4 call, indicating that divergence in the N4 call is not the result of individual differences, but rather may indicate the gradual evolution of a new stereotyped call.
von Merten, Sophie; Hoier, Svenja
2014-01-01
It has long been known that rodents emit signals in the ultrasonic range, but their role in social communication and mating is still under active exploration. While inbred strains of house mice have emerged as a favourite model to study ultrasonic vocalisation (USV) patterns, studies in wild animals and natural situations are still rare. We focus here on two wild derived mouse populations. We recorded them in dyadic encounters for extended periods of time to assess possible roles of USVs and their divergence between allopatric populations. We have analysed song frequency and duration, as well as spectral features of songs and syllables. We show that the populations have indeed diverged in several of these aspects and that USV patterns emitted in a mating context differ from those emitted in same sex encounters. We find that females do not vocalize less than males, and in encounters with another female they vocalize even more. This implies that the current focus of USVs being emitted mainly by males within the mating context needs to be reconsidered. Using a statistical syntax analysis we find complex temporal sequencing patterns that could suggest that the syntax conveys meaningful information to the receivers. We conclude that wild mice use USV for complex social interactions and that USV patterns can diverge fast between populations. PMID:24816836
Rhythmic synchronization tapping to an audio–visual metronome in budgerigars
Hasegawa, Ai; Okanoya, Kazuo; Hasegawa, Toshikazu; Seki, Yoshimasa
2011-01-01
In all ages and countries, music and dance have constituted a central part in human culture and communication. Recently, vocal-learning animals such as parrots and elephants have been found to share rhythmic ability with humans. Thus, we investigated the rhythmic synchronization of budgerigars, a vocal-mimicking parrot species, under controlled conditions and a systematically designed experimental paradigm as a first step in understanding the evolution of musical entrainment. We trained eight budgerigars to perform isochronous tapping tasks in which they pecked a key to the rhythm of audio–visual metronome-like stimuli. The budgerigars showed evidence of entrainment to external stimuli over a wide range of tempos. They seemed to be inherently inclined to tap at fast tempos, which have a similar time scale to the rhythm of budgerigars' natural vocalizations. We suggest that vocal learning might have contributed to their performance, which resembled that of humans. PMID:22355637
Rules of song development and their use in vocal interactions by birds with large repertoires.
Geberzahn, Nicole; Hultsch, Henrike
2004-06-01
Songbirds are well known for settling their disputes by vocal signals, and their singing plays a dominant role. Most studies on this issue have concentrated on bird species that develop and use small vocal repertoires. In this article we will go farther and focus on examples of how species with large song repertoires make use of their vocal competence. In particular, we will outline the study of interaction rules which have been elucidated by examining time- and pattern-specific relationships between signals exchanged by territorial neighbors. First we present an inquiry into the rules of song learning and development. In birds with large song repertoires, the ontogeny of such rules proceeds along a number of trajectories which help in understanding the often remarkable accomplishments of adult birds. In both approaches, our model species will be the Common Nightingale Luscinia megarhynchos that has been investigated intensively in the field and in the laboratory.
Kim, Geunyoung; Walden, Tedra A; Knieps, Linda J
2010-04-01
Studies of infant social referencing have indicated that infants might be more influenced by vocal information contained in emotional messages than by facial expression, especially during fearful message conditions. The present study investigated the characteristics of emotional channels that parents used during social referencing, and corresponding infants' behavioral changes. Results of Study 1 indicated that parents used more vocal information during positive message conditions. Unlike previous findings, infants' behavioral change was related to the frequency of vocal information during positive condition. For fearful messages, infants were more influenced by the number of multi-modal channels used and the frequency of visual information. Study 2 further showed that the intensity of vocal tone was related to infant regulation only during positive message conditions. The results imply that understanding of social context is important to make sense of parent-infant's emotional interaction. Copyright 2010 Elsevier Inc. All rights reserved.
Vocal Fold Pathologies and Three-Dimensional Flow Separation Phenomena
NASA Astrophysics Data System (ADS)
Apostoli, Adam G.; Weiland, Kelley S.; Plesniak, Michael W.
2013-11-01
Polyps and nodules are two different pathologies, which are geometric abnormalities that form on the medial surface of the vocal folds, and have been shown to significantly disrupt a person's ability to communicate. Although the mechanism by which the vocal folds self-oscillate and the three-dimensional nature of the glottal jet have been studied, the effect of irregularities caused by pathologies is not fully understood. Examining the formation and evolution of vortical structures created by a geometric protuberance is important, not only for understanding the aerodynamic forces exerted by these structures on the vocal folds, but also in the treatment of the above-mentioned pathological conditions. Using a wall-mounted prolate hemispheroid with a 2:1 aspect ratio in cross flow, the present investigation considers three-dimensional flow separation induced by a model vocal fold polyp. Building on previous work using skin friction line visualization, both the velocity flow field and wall pressure measurements around the model polyp are presented and compared. Supported by the National Science Foundation, Grant No. CBET-1236351 and GW Center for Biomimetics and Bioinspired Engineering (COBRE).
Biosimulation of Inflammation and Healing in Surgically Injured Vocal Folds
Li, Nicole Y. K.; Vodovotz, Yoram; Hebda, Patricia A.; Abbott, Katherine Verdolini
2010-01-01
Objectives The pathogenesis of vocal fold scarring is complex and remains to be deciphered. The current study is part of research endeavors aimed at applying systems biology approaches to address the complex biological processes involved in the pathogenesis of vocal fold scarring and other lesions affecting the larynx. Methods We developed a computational agent-based model (ABM) to quantitatively characterize multiple cellular and molecular interactions involved in inflammation and healing in vocal fold mucosa after surgical trauma. The ABM was calibrated with empirical data on inflammatory mediators (eg, tumor necrosis factor) and extracellular matrix components (eg, hyaluronan) from published studies on surgical vocal fold injury in the rat population. Results The simulation results reproduced and predicted trajectories seen in the empirical data from the animals. Moreover, the ABM studies suggested that hyaluronan fragments might be the clinical surrogate of tissue damage, a key variable that in these simulations both is enhanced by and further induces inflammation. Conclusions A relatively simple ABM such as the one reported in this study can provide new understanding of laryngeal wound healing and generate working hypotheses for further wet-lab studies. PMID:20583741
Glottal aerodynamics in compliant, life-sized vocal fold models
NASA Astrophysics Data System (ADS)
McPhail, Michael; Dowell, Grant; Krane, Michael
2013-11-01
This talk presents high-speed PIV measurements in compliant, life-sized models of the vocal folds. A clearer understanding of the fluid-structure interaction of voiced speech, how it produces sound, and how it varies with pathology is required to improve clinical diagnosis and treatment of vocal disorders. Physical models of the vocal folds can answer questions regarding the fundamental physics of speech, as well as the ability of clinical measures to detect the presence and extent of disorder. Flow fields were recorded in the supraglottal region of the models to estimate terms in the equations of fluid motion, and their relative importance. Experiments were conducted over a range of driving pressures with flow rates, given by a ball flowmeter, and subglottal pressures, given by a micro-manometer, reported for each case. Imaging of vocal fold motion, vector fields showing glottal jet behavior, and terms estimated by control volume analysis will be presented. The use of these results for a comparison with clinical measures, and for the estimation of aeroacoustic source strengths will be discussed. Acknowledge support from NIH R01 DC005642.
Maciej, Peter; Ndao, Ibrahima; Hammerschmidt, Kurt; Fischer, Julia
2013-09-23
To understand the evolution of acoustic communication in animals, it is important to distinguish between the structure and the usage of vocal signals, since both aspects are subject to different constraints. In terrestrial mammals, the structure of calls is largely innate, while individuals have a greater ability to actively initiate or withhold calls. In closely related taxa, one would therefore predict a higher flexibility in call usage compared to call structure. In the present study, we investigated the vocal repertoire of free living Guinea baboons (Papio papio) and examined the structure and usage of the animals' vocal signals. Guinea baboons live in a complex multi-level social organization and exhibit a largely tolerant and affiliative social style, contrary to most other baboon taxa. To classify the vocal repertoire of male and female Guinea baboons, cluster analyses were used and focal observations were conducted to assess the usage of vocal signals in the particular contexts. In general, the vocal repertoire of Guinea baboons largely corresponded to the vocal repertoire of other baboon taxa. The usage of calls, however, differed considerably from other baboon taxa and corresponded with the specific characteristics of the Guinea baboons' social behaviour. While Guinea baboons showed a diminished usage of contest and display vocalizations (a common pattern observed in chacma baboons), they frequently used vocal signals during affiliative and greeting interactions. Our study shows that the call structure of primates is largely unaffected by the species' social system (including grouping patterns and social interactions), while the usage of calls can be more flexibly adjusted, reflecting the quality of social interactions of the individuals. Our results support the view that the primary function of social signals is to regulate social interactions, and therefore the degree of competition and cooperation may be more important to explain variation in call usage than grouping patterns or group size.
2013-01-01
Background To understand the evolution of acoustic communication in animals, it is important to distinguish between the structure and the usage of vocal signals, since both aspects are subject to different constraints. In terrestrial mammals, the structure of calls is largely innate, while individuals have a greater ability to actively initiate or withhold calls. In closely related taxa, one would therefore predict a higher flexibility in call usage compared to call structure. In the present study, we investigated the vocal repertoire of free living Guinea baboons (Papio papio) and examined the structure and usage of the animals’ vocal signals. Guinea baboons live in a complex multi-level social organization and exhibit a largely tolerant and affiliative social style, contrary to most other baboon taxa. To classify the vocal repertoire of male and female Guinea baboons, cluster analyses were used and focal observations were conducted to assess the usage of vocal signals in the particular contexts. Results In general, the vocal repertoire of Guinea baboons largely corresponded to the vocal repertoire of other baboon taxa. The usage of calls, however, differed considerably from other baboon taxa and corresponded with the specific characteristics of the Guinea baboons’ social behaviour. While Guinea baboons showed a diminished usage of contest and display vocalizations (a common pattern observed in chacma baboons), they frequently used vocal signals during affiliative and greeting interactions. Conclusions Our study shows that the call structure of primates is largely unaffected by the species’ social system (including grouping patterns and social interactions), while the usage of calls can be more flexibly adjusted, reflecting the quality of social interactions of the individuals. Our results support the view that the primary function of social signals is to regulate social interactions, and therefore the degree of competition and cooperation may be more important to explain variation in call usage than grouping patterns or group size. PMID:24059742
NASA Astrophysics Data System (ADS)
Yang, Jubiao; Krane, Michael; Zhang, Lucy
2013-11-01
Vocal fold vibrations and the glottal jet are successfully simulated using the modified Immersed Finite Element method (mIFEM), a fully coupled dynamics approach to model fluid-structure interactions. A self-sustained and steady vocal fold vibration is captured given a constant pressure input at the glottal entrance. The flow rates at different axial locations in the glottis are calculated, showing small variations among them due to the vocal fold motion and deformation. To further facilitate the understanding of the phonation process, two control volume analyses, specifically with Bernoulli's equation and Newton's 2nd law, are carried out for the glottal flow based on the simulation results. A generalized Bernoulli's equation is derived to interpret the correlations between the velocity and pressure temporally and spatially along the center line which is a streamline using a half-space model with symmetry boundary condition. A specialized Newton's 2nd law equation is developed and divided into terms to help understand the driving mechanism of the glottal flow.
Using statistical deformable models to reconstruct vocal tract shape from magnetic resonance images.
Vasconcelos, M J M; Rua Ventura, S M; Freitas, D R S; Tavares, J M R S
2010-10-01
The mechanisms involved in speech production are complex and have thus been subject to growing attention by the scientific community. It has been demonstrated that magnetic resonance imaging (MRI) is a powerful means in the understanding of the morphology of the vocal tract. Over the last few years, statistical deformable models have been successfully used to identify and characterize bones and organs in medical images and point distribution models (PDMs) have gained particular relevance. In this work, the suitability of these models has been studied to characterize and further reconstruct the shape of the vocal tract in the articulation of European Portuguese (EP) speech sounds, one of the most spoken languages worldwide, with the aid of MR images. Therefore, a PDM has been built from a set of MR images acquired during the artificially sustained articulation of 25 EP speech sounds. Following this, the capacity of this statistical model to characterize the shape deformation of the vocal tract during the production of sounds was analysed. Next, the model was used to reconstruct five EP oral vowels and the EP fricative consonants. As far as a study on speech production is concerned, this study is considered to be the first approach to characterize and reconstruct the vocal tract shape from MR images by using PDMs. In addition, the findings achieved permit one to conclude that this modelling technique compels an enhanced understanding of the dynamic speech events involved in sustained articulations based on MRI, which are of particular interest for speech rehabilitation and simulation.
Effects of dehydration on the viscoelastic properties of vocal folds in large deformations.
Miri, Amir K; Barthelat, François; Mongeau, Luc
2012-11-01
Dehydration may alter vocal fold viscoelastic properties, thereby hampering phonation. The effects of water loss induced by an osmotic pressure potential on vocal fold tissue viscoelastic properties were investigated. Porcine vocal folds were dehydrated by immersion in a hypertonic solution, and quasi-static and low-frequency dynamic traction tests were performed for elongations of up to 50%. Digital image correlation was used to determine local strains from surface deformations. The elastic modulus and the loss factor were then determined for normal and dehydrated tissues. An eight-chain hyperelastic model was used to describe the observed nonlinear stress-stretch behavior. Contrary to the expectations, the mass history indicated that the tissue absorbed water during cyclic extension when submerged in a hypertonic solution. During loading history, the elastic modulus was increased for dehydrated tissues as a function of strain. The response of dehydrated tissues was much less affected when the load was released. This observation suggests that hydration should be considered in micromechanical models of the vocal folds. The internal hysteresis, which is often linked to phonation effort, increased significantly with water loss. The effects of dehydration on the viscoelastic properties of vocal fold tissue were quantified in a systematic way. A better understanding of the role of hydration on the mechanical properties of vocal fold tissue may help to establish objective dehydration and phonotrauma criteria. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
The Vocal Tract Organ: A New Musical Instrument Using 3-D Printed Vocal Tracts.
Howard, David M
2017-10-27
The advent and now increasingly widespread availability of 3-D printers is transforming our understanding of the natural world by enabling observations to be made in a tangible manner. This paper describes the use of 3-D printed models of the vocal tract for different vowels that are used to create an acoustic output when stimulated with an appropriate sound source in a new musical instrument: the Vocal Tract Organ. The shape of each printed vocal tract is recovered from magnetic resonance imaging. It sits atop a loudspeaker to which is provided an acoustic L-F model larynx input signal that is controlled by the notes played on a musical instrument digital interface device such as a keyboard. The larynx input is subject to vibrato with extent and frequency adjustable as desired within the ranges usually found for human singing. Polyphonic inputs for choral singing textures can be applied via a single loudspeaker and vocal tract, invoking the approximation of linearity in the voice production system, thereby making multiple vowel stops a possibility while keeping the complexity of the instrument in reasonable check. The Vocal Tract Organ offers a much more human and natural sounding result than the traditional Vox Humana stops found in larger pipe organs, offering the possibility of enhancing pipe organs of the future as well as becoming the basis for a "multi-vowel" chamber organ in its own right. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Auditory responses in the amygdala to social vocalizations
NASA Astrophysics Data System (ADS)
Gadziola, Marie A.
The underlying goal of this dissertation is to understand how the amygdala, a brain region involved in establishing the emotional significance of sensory input, contributes to the processing of complex sounds. The general hypothesis is that communication calls of big brown bats (Eptesicus fuscus) transmit relevant information about social context that is reflected in the activity of amygdalar neurons. The first specific aim analyzed social vocalizations emitted under a variety of behavioral contexts, and related vocalizations to an objective measure of internal physiological state by monitoring the heart rate of vocalizing bats. These experiments revealed a complex acoustic communication system among big brown bats in which acoustic cues and call structure signal the emotional state of a sender. The second specific aim characterized the responsiveness of single neurons in the basolateral amygdala to a range of social syllables. Neurons typically respond to the majority of tested syllables, but effectively discriminate among vocalizations by varying the response duration. This novel coding strategy underscores the importance of persistent firing in the general functioning of the amygdala. The third specific aim examined the influence of acoustic context by characterizing both the behavioral and neurophysiological responses to natural vocal sequences. Vocal sequences differentially modify the internal affective state of a listening bat, with lower aggression vocalizations evoking the greatest change in heart rate. Amygdalar neurons employ two different coding strategies: low background neurons respond selectively to very few stimuli, whereas high background neurons respond broadly to stimuli but demonstrate variation in response magnitude and timing. Neurons appear to discriminate the valence of stimuli, with aggression sequences evoking robust population-level responses across all sound levels. 
Further, vocal sequences show improved discrimination among stimuli compared to isolated syllables, and this improved discrimination is expressed in part by the timing of action potentials. Taken together, these data support the hypothesis that big brown bat social vocalizations transmit relevant information about the social context that is encoded within the discharge pattern of amygdalar neurons ultimately responsible for coordinating appropriate social behaviors. I further propose that vocalization-evoked amygdalar activity will have significant impact on subsequent sensory processing and plasticity.
Sex hormones and the female voice.
Abitbol, J; Abitbol, P; Abitbol, B
1999-09-01
In the following, the authors examine the relationship between hormonal climate and the female voice through discussion of hormonal biochemistry and physiology and informal reporting on a study of 197 women with either premenstrual or menopausal voice syndrome. These facts are placed in a larger historical and cultural context, which is inextricably bound to the understanding of the female voice. The female voice evolves from childhood to menopause, under the varied influences of estrogens, progesterone, and testosterone. These hormones are the dominant factor in determining voice changes throughout life. For example, a woman's voice always develops masculine characteristics after an injection of testosterone. Such a change is irreversible. Conversely, male castrati had feminine voices because they lacked the physiologic changes associated with testosterone. The vocal instrument is comprised of the vibratory body, the respiratory power source and the oropharyngeal resonating chambers. Voice is characterized by its intensity, frequency, and harmonics. The harmonics are hormonally dependent. This is illustrated by the changes that occur during male and female puberty: In the female, the impact of estrogens at puberty, in concert with progesterone, produces the characteristics of the female voice, with a fundamental frequency one third lower than that of a child. In the male, androgens released at puberty are responsible for the male vocal frequency, an octave lower than that of a child. Premenstrual vocal syndrome is characterized by vocal fatigue, decreased range, a loss of power and loss of certain harmonics. The syndrome usually starts some 4-5 days before menstruation in some 33% of women. Vocal professionals are particularly affected. Dynamic vocal exploration by televideoendoscopy shows congestion, microvarices, edema of the posterior third of the vocal folds and a loss of its vibratory amplitude. 
The authors studied 97 premenstrual women who were prescribed a treatment of multivitamins, venous tone stimulants (phlebotonics), and anti-edematous drugs. We obtained symptomatic improvement in 84 patients. The menopausal vocal syndrome is characterized by lowered vocal intensity, vocal fatigue, a decreased range with loss of the high tones and a loss of vocal quality. In a study of 100 menopausal women, 17 presented with a menopausal vocal syndrome. To rehabilitate their voices, and thus their professional lives, patients were prescribed hormone replacement therapy and multi-vitamins. All 97 women showed signs of vocal muscle atrophy, reduction in the thickness of the mucosa and reduced mobility in the cricoarytenoid joint. Multi-factorial therapy (hormone replacement therapy and multi-vitamins) has to be individually adjusted to each case depending on body type, vocal needs, and other factors.
Ghate, Virendra P.; Albrecht, Bruce A.; Miller, Mark A.; ...
2014-01-13
Observations made during a 24-h period as part of the Variability of the American Monsoon Systems (VAMOS) Ocean–Cloud–Atmosphere–Land Study Regional Experiment (VOCALS-REx) are analyzed to study the radiation and turbulence associated with the stratocumulus-topped marine boundary layer (BL). The first 14 h exhibited a well-mixed (coupled) BL with an average cloud-top radiative flux divergence of ~130 W m⁻²; the BL was decoupled during the last 10 h with negligible radiative flux divergence. The averaged radiative cooling very close to the cloud top was -9.04 K h⁻¹ in coupled conditions and -3.85 K h⁻¹ in decoupled conditions. This is the first study that combined data from a vertically pointing Doppler cloud radar and a Doppler lidar to yield the vertical velocity structure of the entire BL. The averaged vertical velocity variance and updraft mass flux during coupled conditions were higher than those during decoupled conditions at all levels by a factor of 2 or more. The vertical velocity skewness was negative in the entire BL during coupled conditions, whereas it was weakly positive in the lower third of the BL and negative above during decoupled conditions. A formulation of velocity scale is proposed that includes the effect of cloud-top radiative cooling in addition to the surface buoyancy flux. When scaled by the velocity scale, the vertical velocity variance and coherent downdrafts had similar magnitude during the coupled and decoupled conditions. Finally, the coherent updrafts that exhibited a constant profile in the entire BL during both the coupled and decoupled conditions scaled well with the convective velocity scale to a value of ~0.5.
Riede, Tobias; Goller, Franz
2010-10-01
Song production in songbirds is a model system for studying learned vocal behavior. As in humans, bird phonation involves three main motor systems (respiration, vocal organ and vocal tract). The avian respiratory mechanism uses pressure regulation in air sacs to ventilate a rigid lung. In songbirds sound is generated with two independently controlled sound sources, which reside in a uniquely avian vocal organ, the syrinx. However, the physical sound generation mechanism in the syrinx shows strong analogies to that in the human larynx, such that both can be characterized as myoelastic-aerodynamic sound sources. Similarities include active adduction and abduction, oscillating tissue masses which modulate flow rate through the organ and a layered structure of the oscillating tissue masses giving rise to complex viscoelastic properties. Differences in the functional morphology of the sound producing system between birds and humans require specific motor control patterns. The songbird vocal apparatus is adapted for high speed, suggesting that temporal patterns and fast modulation of sound features are important in acoustic communication. Rapid respiratory patterns determine the coarse temporal structure of song and maintain gas exchange even during very long songs. The respiratory system also contributes to the fine control of airflow. Muscular control of the vocal organ regulates airflow and acoustic features. The upper vocal tract of birds filters the sounds generated in the syrinx, and filter properties are actively adjusted. Nonlinear source-filter interactions may also play a role. The unique morphology and biomechanical system for sound production in birds presents an interesting model for exploring parallels in control mechanisms that give rise to highly convergent physical patterns of sound generation. More comparative work should provide a rich source for our understanding of the evolution of complex sound producing systems. Copyright © 2009 Elsevier Inc. 
All rights reserved.
Vocal handicap index in popular and erudite professional singers.
Loiola-Barreiro, Camila Miranda; Silva, Marta Assumpção de Andrada E
To compare the voice handicap index of popular and erudite professional singers according to gender, age, professional experience time, and presence or absence of self-reported vocal complaints. One hundred thirty-two professional singers, 74 popular and 58 erudite, who responded to a questionnaire with regards to identification, age, gender, professional experience time in singing, musical genres (for popular singers), vocal classification (for erudite singers), presence of self-reported vocal complaints, and the specific protocols for popular (Modern Singing Handicap Index - MSHI) and erudite (Classical Singing Handicap Index - CSHI) singing. Higher proportion of women and higher incidence of vocal complaints were observed in the popular singers compared with the erudite singers. Most of the popular singers belonged to the genre of Brazilian Popular Music. Regarding the classification of erudite singers, there was greater participation of sopranos and tenors. No statistical differences were observed with respect to age and professional experience time between the groups. Comparison of the MSHI and CSHI scores showed no statistically significant difference between these scores and genre or age in both groups of singers. Professional experience time was related to the total score and the subscales disability and impairment in the MSHI, only for popular singers with vocal complaints. There was no correlation between these variables and the CSHI for erudite singers. The impact of vocal difficulty/problem interferes differently in these two musical genres when related to vocal complaint and professional experience time. The MSHI and CSHI protocols proved to be important tools not only for the identification of problems, but also for the understanding of how these individuals relate their voices with this occupational activity.
Life Experience of Patients With Unilateral Vocal Fold Paralysis.
Francis, David O; Sherman, Ariel E; Hovis, Kristen L; Bonnet, Kemberlee; Schlundt, David; Garrett, C Gaelyn; Davies, Louise
2018-05-01
Clinicians and patients benefit when they have a clear understanding of how medical conditions influence patients' life experiences. Patients' perspectives on life with unilateral vocal fold paralysis have not been well described. To promote patient-centered care by characterizing the patient experiences of living with unilateral vocal fold paralysis. This study used mixed methods: surveys using the voice and dysphagia handicap indexes (VHI and DHI) and semistructured interviews with adults with unilateral vocal cord paralysis recruited from a tertiary voice center. Recorded interviews were transcribed, coded using a hierarchical coding system, and analyzed using an iterative inductive-deductive approach. Symptom domains of the patient experience. In 36 patients (26 [72%] were female, and the median age and interquartile range [IQR] were 63 years [48-68 years]; median interview duration, 42 minutes), median VHI and DHI scores were 96 (IQR, 77-108) and 55.5 (IQR, 35-89) at the time of interviews, respectively. Frustration, isolation, fear, and altered self-identity were primary themes permeating patients' experiences. Frustrations related to limitations in communication, employment, and the medical system. Sources of fear included a loss of control, fear of further dysfunction or permanent disability, concern for health consequences (eg, aspiration pneumonia), and/or an inability to call for help in emergency situations. These experiences were modified by the following factors: resilience, self-efficacy, perceived sense of control, and social support systems. Effects of unilateral vocal fold paralysis extend beyond impaired voice and other somatic symptoms. Awareness of the extent to which these patients experience frustration, isolation, fear, and altered self-identity is important. 
A patient-centered approach to optimizing unilateral vocal fold paralysis treatment is enhanced by an understanding of both the physical dimension of this condition and how patients cope with the considerable emotional and social consequences. Recognizing the psychosocial dimensions of disease allows clinicians to communicate more effectively, be more empathetic, and to better personalize treatment plans, which may lead to improved patient care and patient satisfaction.
Traser, Louisa; Burdumy, Michael; Richter, Bernhard; Vicari, Marco; Echternach, Matthias
2014-01-01
Magnetic Resonance Imaging (MRI) of subjects in a supine position can be used to evaluate the configuration of the vocal tract during phonation. However, studies of speech phonation have shown that gravity can affect vocal tract shape and bias measurements. This is one of the reasons that MRI studies of singing phonation have used professionally trained singers as subjects, because they are generally considered to be less affected by the supine body position and environmental distractions. A study of untrained singers might not only contribute to the understanding of intuitive singing function and aid the evaluation of potential hazards for vocal health, but also provide insights into the effect of the supine position on singers in general. In the present study, an open configuration 0.25 T MRI system with a rotatable examination bed was used to study the effect of body position in 20 vocally untrained subjects. The subjects were asked to sing sustained tones in both supine and upright body positions on different pitches and in different register conditions. Morphometric measurements were taken from the acquired images of a sagittal slice depicting the vocal tract. The analysis concerning the vocal tract configuration in the two body positions revealed differences in 5 out of 10 measured articulatory parameters. In the upright position the jaw was less protruded, the uvula was elongated, the larynx more tilted and the tongue was positioned more to the front of the mouth than in the supine position. The findings presented are in agreement with several studies on gravitational effects in speech phonation, but contrast with the results of a previous study on professional singers of our group where only minor differences between upright and supine body posture were observed. The present study demonstrates that imaging of the vocal tract using weight-bearing MR imaging is a feasible tool for the study of sustained phonation in singing for vocally untrained subjects. 
PMID:25379885
Patient-Specific Computational Modeling of Human Phonation
NASA Astrophysics Data System (ADS)
Xue, Qian; Zheng, Xudong; University of Maine Team
2013-11-01
Phonation is a common biological process resulting from the complex nonlinear coupling between glottal aerodynamics and vocal fold vibrations. In the past, simplified, symmetric, straight geometric models were commonly employed for experimental and computational studies. However, the shapes of the larynx lumen and vocal folds are highly three-dimensional, and this complex, realistic geometry has a profound impact on both glottal flow and vocal fold vibrations. To elucidate the effect of geometric complexity on voice production and improve the fundamental understanding of human phonation, a full flow-structure interaction simulation is carried out on a patient-specific larynx model. To the best of our knowledge, this is the first patient-specific flow-structure interaction study of human phonation. The simulation results compare well with established human data. The effects of realistic geometry on glottal flow and vocal fold dynamics are investigated. It is found that both glottal flow and vocal fold dynamics differ markedly from those of the previous simplified models. This study also represents an important step toward the development of computer models for voice disease diagnosis and surgical planning. The project described was supported by Grant Number R01DC007125 from the National Institute on Deafness and Other Communication Disorders (NIDCD).
Insights into the role of elastin in vocal fold health and disease
Moore, Jaime
2011-01-01
Elastic fibers are large, complex and surprisingly poorly understood extracellular matrix (ECM) macromolecules. The elastin fiber, generated from a single human gene, elastin (ELN), is a self-assembling integral protein that endows elastic tissues and organs such as the skin, lungs, and arteries with critical mechanical properties. The biology of elastic fibers is complex because they have multiple components, a tightly regulated developmental deposition, a multi-step hierarchical assembly and unique biomechanical functions. Elastin is present in vocal folds, where it plays a pivotal role in the quality of phonation. This review article provides an overview of the genesis of elastin and its wide-ranging structure and function. Specific distribution within the vocal fold lamina propria across the lifespan in normal and pathological states and its contribution to vocal fold biomechanics will be examined. Elastin and elastin-derived molecules are increasingly investigated for their application in tissue engineering. The properties of various elastin-based materials will be discussed and their current and future applications evaluated. A new level of understanding of the biomechanical properties of vocal fold elastin composites and their molecular basis should lead to new strategies for elastic fiber repair and regeneration in aging and disease. PMID:21708449
An Automated Procedure for Evaluating Song Imitation
Mandelblat-Cerf, Yael; Fee, Michale S.
2014-01-01
Songbirds have emerged as an excellent model system to understand the neural basis of vocal and motor learning. Like humans, songbirds learn to imitate the vocalizations of their parents or other conspecific “tutors.” Young songbirds learn by comparing their own vocalizations to the memory of their tutor song, slowly improving until over the course of several weeks they can achieve an excellent imitation of the tutor. Because of the slow progression of vocal learning, and the large amounts of singing generated, automated algorithms for quantifying vocal imitation have become increasingly important for studying the mechanisms underlying this process. However, methodologies for quantifying song imitation are complicated by the highly variable songs of either juvenile birds or those that learn poorly because of experimental manipulations. Here we present a method for the evaluation of song imitation that incorporates two innovations: First, an automated procedure for selecting pupil song segments, and, second, a new algorithm, implemented in Matlab, for computing both song acoustic and sequence similarity. We tested our procedure using zebra finch song and determined a set of acoustic features for which the algorithm optimally differentiates between similar and non-similar songs. PMID:24809510
Cicadas impact bird communication in a noisy tropical rainforest
Hall, Robert; Ray, William; Beck, Angela; Zook, James
2015-01-01
Many animals communicate through acoustic signaling, and “acoustic space” may be viewed as a limited resource that organisms compete for. If acoustic signals overlap, the information in them is masked, so there should be selection toward strategies that reduce signal overlap. The extent to which animals are able to partition acoustic space in acoustically diverse habitats such as tropical forests is poorly known. Here, we demonstrate that a single cicada species plays a major role in the frequency and timing of acoustic communication in a neotropical wet forest bird community. Using an automated acoustic monitor, we found that cicadas vary the timing of their signals throughout the day and that the frequency range and timing of bird vocalizations closely track these signals. Birds significantly avoid temporal overlap with cicadas by reducing and often shutting down vocalizations at the onset of cicada signals that utilize the same frequency range. When birds do vocalize at the same time as cicadas, the vocalizations primarily occur at nonoverlapping frequencies with cicada signals. Our results greatly improve our understanding of the community dynamics of acoustic signaling and reveal how patterns in biotic noise shape the frequency and timing of bird vocalizations in tropical forests. PMID:26023277
Medial surface dynamics of the vocal folds in an in vivo canine model
NASA Astrophysics Data System (ADS)
Doellinger, Michael; Berke, Gerald S.; Chhetri, Dinesh K.; Berry, David A.
2004-05-01
Quantitative measurement of the medial surface dynamics of the vocal folds is important for understanding how sound is generated in the larynx. However, such data are hard to gather because of the inaccessibility of the vocal folds. Recent studies have applied hemi-larynx methodology to excised human larynges, to visualize these dynamics. The present study extends this methodology to obtain similar quantitative measurements using an in vivo canine hemi-larynx setup, with varying levels of stimulation to the recurrent laryngeal nerve. Use of an in vivo model allows us to examine effects of intrinsic muscle contraction on the medial surface of the vocal folds, to provide greater insight into mechanisms of vocal control. Data were collected using digital high-speed imaging with a sampling frequency of up to 4000 Hz, and a spatial resolution of up to 1024×1024 pixels. Three-dimensional motion will be extracted, computed, visualized, and contrasted as a function of the level of stimulation to the recurrent laryngeal nerve. Results will also be compared to patterns of vibration in excised larynges. Finally, commonly applied quantitative analyses will be performed to investigate the underlying modes of vibration. [Work supported by NIH/NIDCD.]
Inferring speaker attributes in adductor spasmodic dysphonia: ratings from unfamiliar listeners.
Isetti, Derek; Xuereb, Linnea; Eadie, Tanya L
2014-05-01
This study aimed to determine whether unfamiliar listeners' perceptions of speakers with adductor spasmodic dysphonia (ADSD) differ from their perceptions of control speakers on the parameters of relative age, confidence, tearfulness, and vocal effort, and whether these perceptions are related to speaker-rated vocal effort or voice-specific quality of life. Twenty speakers with ADSD (including 6 speakers with ADSD plus tremor) and 20 age- and sex-matched controls provided speech recordings, completed a voice-specific quality-of-life instrument (Voice Handicap Index; Jacobson et al., 1997), and rated their own vocal effort. Twenty listeners evaluated speech samples for relative age, confidence, tearfulness, and vocal effort using rating scales. Listeners judged speakers with ADSD as sounding significantly older, less confident, more tearful, and more effortful than control speakers (p < .01). Increased vocal effort was strongly associated with decreased speaker confidence (rs = .88-.89) and sounding more tearful (rs = .83-.85). Self-rated speaker effort was moderately related (rs = .45-.52) to listener impressions. Listeners' perceptions of confidence and tearfulness were also moderately associated with higher Voice Handicap Index scores (rs = .65-.70). Unfamiliar listeners judge speakers with ADSD more negatively than control speakers, with judgments extending beyond typical clinical measures. The results have implications for counseling and understanding the psychosocial effects of ADSD.
Wolf Howling Is Mediated by Relationship Quality Rather Than Underlying Emotional Stress
Mazzini, Francesco; Townsend, Simon W.; Virányi, Zsófia; Range, Friederike
2013-01-01
While considerable research has addressed the function of animal vocalizations, the proximate mechanisms driving call production remain surprisingly unclear. Vocalizations may be driven by emotions and the physiological state evoked by changes in the social-ecological environment [1, 2], or animals may have more control over their vocalizations, using them in flexible ways mediated by the animal’s understanding of its surrounding social world [3, 4]. While both explanations are plausible and neither excludes the other, to date no study has attempted to experimentally investigate the influence of both emotional and cognitive factors on animal vocal usage. We aimed to disentangle the relative contribution of both mechanisms by examining howling in captive wolves. Using a separation experiment and by measuring cortisol levels, we specifically investigated whether howling is a physiological stress response to group fragmentation [5] and whether it is driven by social factors, particularly relationship quality [6, 7]. Results showed that relationship quality between the howler and the leaving individual better predicted howling than did the current physiological state. Our findings shed important light on the degree to which animal vocal production can be considered as voluntary. PMID:23973297
Effects of Dehydration on the Viscoelastic Properties of Vocal Folds in Large Deformations
Miri, Amir K.; Barthelat, François; Mongeau, Luc
2012-01-01
Dehydration may alter vocal fold viscoelastic properties, which may hamper phonation. The effects of water loss induced by an osmotic-pressure potential on vocal fold tissue viscoelastic properties were investigated. Porcine vocal folds were dehydrated by immersion in a hypertonic solution, and quasi-static and low-frequency dynamic traction tests were performed for elongations of up to 50%. Digital image correlation was used to determine local strains from surface deformations. The elastic modulus and the loss factor were then determined for normal and dehydrated tissues. An eight-chain hyperelastic model was used to describe the observed nonlinear stress-stretch behavior. Contrary to expectations, the mass history indicated that the tissue absorbed water during cyclic extension when submerged in a hypertonic solution. During loading, the elastic modulus of dehydrated tissues increased as a function of strain. The response of dehydrated tissues was much less affected during unloading. This calls for more attention to the micromechanical modeling of vocal folds. The internal hysteresis, which is often linked to phonation effort, increased significantly with water loss. The effects of dehydration on the viscoelastic properties of vocal fold tissue were quantified in a systematic way. The results will contribute to a better understanding of the basic biomechanics of voice production and ultimately will help establish objective dehydration and phonotrauma criteria. PMID:22483778
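As an illustration of the "internal hysteresis" mentioned in the abstract above, the following minimal Python sketch computes a hysteresis ratio from one loading-unloading cycle: the energy dissipated (area between the loading and unloading stress-strain curves) divided by the energy supplied during loading. The function names, the trapezoidal integration, and the sample values are assumptions made for illustration only; the abstract does not state how the authors computed hysteresis.

```python
from typing import Sequence


def trapezoid_area(x: Sequence[float], y: Sequence[float]) -> float:
    """Area under y(x) by the trapezoidal rule (x must be increasing)."""
    return sum((x[i + 1] - x[i]) * (y[i + 1] + y[i]) / 2.0
               for i in range(len(x) - 1))


def hysteresis_ratio(strain: Sequence[float],
                     loading_stress: Sequence[float],
                     unloading_stress: Sequence[float]) -> float:
    """Fraction of loading energy dissipated over one cycle.

    Both stress curves are sampled at the same, increasing strain values
    (hypothetical layout chosen for this sketch).
    """
    energy_in = trapezoid_area(strain, loading_stress)
    energy_out = trapezoid_area(strain, unloading_stress)
    return (energy_in - energy_out) / energy_in


# Made-up sample cycle up to 50% strain (stress in kPa), not measured data.
strain = [0.0, 0.25, 0.5]
loading = [0.0, 10.0, 20.0]
unloading = [0.0, 5.0, 20.0]
ratio = hysteresis_ratio(strain, loading, unloading)
```

A larger ratio means that more of the work done in stretching the tissue is lost per cycle, which is the sense in which the abstract links hysteresis to phonation effort.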
Lima, Alice; Lemasson, Alban; Boye, Martin; Hausberger, Martine
2017-12-01
Under natural conditions bottlenose dolphins (Tursiops truncatus) spend their time mostly feeding and then travelling, socializing, or resting. These activities are not randomly distributed, with feeding being higher in early morning and late afternoon. Social activities and vocal behavior seem to be very important in dolphin daily activity. This study aimed to describe the activity time-budget and its relation to vocal behavior for dolphins in a zoological park. We recorded behaviors and vocalizations of six dolphins over 2 months. All subjects performed more non-agonistic social interactions and play in the morning than in the afternoon. The different categories of vocalizations were distributed non-randomly throughout the day, with more chirps in the afternoon, when the animals were "less social." The most striking result was the strong correlation between activities and the categories of vocalizations produced. The results confirm the association between burst pulses and whistles with social activities, but also reveal that both are also associated with solitary play. More chirps were produced when dolphins were engaged in socio-sexual behaviors, emphasizing the need for further questioning about the function of this vocal category. This study reveals that: (i) in a group kept in zoological management, social activities are mostly present in the morning; and (ii) the acoustic signals produced by dolphins may give a reliable representation of their current activities. While more studies on the context of signal production are needed, our findings provide a useful tool for understanding free ranging dolphin behavior when they are not visible. © 2017 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Benboujja, Fouzi; Garcia, Jordan; Beaudette, Kathy; Strupler, Mathias; Hartnick, Christopher J.; Boudoux, Caroline
2016-02-01
Excessive and repetitive force applied on vocal fold tissue can induce benign vocal fold lesions. Affected children suffer from chronic hoarseness. In this instance, the vibratory ability of the folds, a complex layered microanatomy, becomes impaired. Histological findings have shown that lesions produce a remodeling of sub-epithelial vocal fold layers. However, our understanding of lesion features and development is still limited. Indeed, conventional imaging techniques do not allow a non-invasive assessment of sub-epithelial integrity of the vocal fold. Furthermore, it remains challenging to differentiate these sub-epithelial lesions (such as bilateral nodules, polyps and cysts) from a clinical perspective, as their outer surfaces are relatively similar. As treatment strategy differs for each lesion type, it is critical to efficiently differentiate sub-epithelial alterations involved in benign lesions. In this study, we developed an optical coherence tomography (OCT) based handheld probe suitable for pediatric laryngological imaging. The probe allows for rapid three-dimensional imaging of vocal fold lesions. The system is adapted to allow for high-resolution intra-operative imaging. We imaged 20 patients undergoing direct laryngoscopy during which we looked at different benign pediatric pathologies such as bilateral nodules, cysts and laryngeal papillomatosis and compared them to healthy tissue. We qualitatively and quantitatively characterized laryngeal pathologies and demonstrated the added advantage of using 3D OCT imaging for lesion discrimination and margin assessment. OCT evaluation of the integrity of the vocal cord could lead to better pediatric management of laryngeal diseases.
Human Non-linguistic Vocal Repertoire: Call Types and Their Meaning.
Anikin, Andrey; Bååth, Rasmus; Persson, Tomas
2018-01-01
Recent research on human nonverbal vocalizations has led to considerable progress in our understanding of vocal communication of emotion. However, in contrast to studies of animal vocalizations, this research has focused mainly on the emotional interpretation of such signals. The repertoire of human nonverbal vocalizations as acoustic types, and the mapping between acoustic and emotional categories, thus remain underexplored. In a cross-linguistic naming task (Experiment 1), verbal categorization of 132 authentic (non-acted) human vocalizations by English-, Swedish- and Russian-speaking participants revealed the same major acoustic types: laugh, cry, scream, moan, and possibly roar and sigh. The association between call type and perceived emotion was systematic but non-redundant: listeners associated every call type with a limited, but in some cases relatively wide, range of emotions. The speed and consistency of naming the call type predicted the speed and consistency of inferring the caller's emotion, suggesting that acoustic and emotional categorizations are closely related. However, participants preferred to name the call type before naming the emotion. Furthermore, nonverbal categorization of the same stimuli in a triad classification task (Experiment 2) was more compatible with classification by call type than by emotion, indicating the former's greater perceptual salience. These results suggest that acoustic categorization may precede attribution of emotion, highlighting the need to distinguish between the overt form of nonverbal signals and their interpretation by the perceiver. Both within- and between-call acoustic variation can then be modeled explicitly, bringing research on human nonverbal vocalizations more in line with the work on animal communication.
Northrup, Jessie B.; Iverson, Jana M.
2015-01-01
This study examined vocal coordination during mother-infant interactions in the infant siblings (high risk infants; HR) of children with autism spectrum disorder (ASD), a population at heightened risk for developing language delays. Vocal coordination between mothers and HR infants was compared to a group of low risk (LR; no first- or second-degree relative with ASD) dyads, and used to predict later language development. Nine-month-old infants were videotaped at home playing with their mothers, and interactions were coded for the frequency and timing of vocalizations. Percent infant simultaneous speech was predictive of later language delay (LD), and dyads with LD infants were less coordinated with one another in average latency to respond than dyads with non-delayed (ND) infants. The degree of coordination between mothers and infants on this variable predicted a continuous measure of language development in the third year. This research underscores the importance of understanding early development in the context of interaction. PMID:26345517
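The two coordination measures named in the abstract above (percent infant simultaneous speech and average latency to respond) can be made concrete with a short Python sketch. The definitions used here are assumptions for illustration: an infant vocalization counts as simultaneous if it overlaps any maternal vocalization in time, and a response latency is the gap between the end of one partner's vocalization and the next onset of the other partner within a hypothetical 3-second window. The study's actual coding scheme is not given in the abstract and may differ.

```python
from typing import List, Optional, Tuple

Interval = Tuple[float, float]  # (onset_s, offset_s) of one vocalization


def percent_simultaneous(infant: List[Interval],
                         mother: List[Interval]) -> float:
    """Percentage of infant vocalizations that overlap a maternal one."""
    if not infant:
        return 0.0
    overlapping = sum(
        1 for i_on, i_off in infant
        if any(i_on < m_off and m_on < i_off for m_on, m_off in mother)
    )
    return 100.0 * overlapping / len(infant)


def mean_response_latency(speaker: List[Interval],
                          responder: List[Interval],
                          max_gap_s: float = 3.0) -> Optional[float]:
    """Mean gap from a speaker's offset to the responder's next onset.

    Only gaps between 0 and max_gap_s (an assumed window) count as
    responses. Returns None when no response is found.
    """
    latencies = []
    for _, offset in speaker:
        gaps = [on - offset for on, _ in responder
                if 0.0 <= on - offset <= max_gap_s]
        if gaps:
            latencies.append(min(gaps))
    return sum(latencies) / len(latencies) if latencies else None


# Made-up timings in seconds, for illustration only.
infant = [(0.0, 1.0), (5.0, 6.0), (10.0, 11.0), (20.0, 21.0)]
mother = [(0.5, 2.0), (6.5, 7.5), (12.0, 13.0)]
pct = percent_simultaneous(infant, mother)
latency = mean_response_latency(infant, mother)
```

In this toy data, one of the four infant vocalizations overlaps the mother, and the mother responds within the window to two of them.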
Radford, Craig A.; Ghazali, Shahriman M.; Montgomery, John C.; Jeffs, Andrew G.
2016-01-01
Fish vocalisation is often a major component of underwater soundscapes. Therefore, interpretation of these soundscapes requires an understanding of the vocalisation characteristics of common soniferous fish species. This study of captive female bluefin gurnard, Chelidonichthys kumu, aims to formally characterise their vocalisation sounds and daily pattern of sound production. Four types of sound were produced and characterised, twice as many as previously reported in this species. These sounds fit two aural categories, grunt and growl, the mean peak frequencies for which ranged from 129 to 215 Hz. This species vocalized throughout the 24 hour period at an average rate of 18.5 ± 2.0 sounds fish⁻¹ h⁻¹, with an increase in vocalization rate at dawn and dusk. Competitive feeding did not elevate vocalisation as has been found in other gurnard species. Bluefin gurnard are common in coastal waters of New Zealand, Australia and Japan and, given their vocalization rate, are likely to be significant contributors to ambient underwater soundscape in these areas. PMID:26890124
Vocal Fold Mucus Aggregation in Persons with Voice Disorders
Bonilha, Heather Shaw; White, Lisa; Kuckhahn, Kelsey; Gerlach, Terri Treman; Deliyski, Dimitar D.
2012-01-01
Mucus aggregation on the vocal folds is a common finding from laryngeal endoscopy. Patients with voice disorders report the presence of mucus aggregation. Patients also report that mucus aggregation causes them to clear their throat, a behavior believed to be harmful to vocal fold mucosa. Even though clinicians and patients report and discuss mucus aggregation, we have a limited understanding of mucus aggregation in persons with voice disorders. The primary goal of this study was to provide an initial assessment of the presence and features of mucus aggregation in persons with voice disorders. The secondary goal of this study was to determine if there are differences in mucus aggregation between persons with and without voice disorders. To address these goals, four features of mucus aggregation were judged from laryngeal endoscopy recordings from 54 speakers with voice disorders and compared to judgments of these same features in persons without voice disorders. The results from this study showed: (1) 100% of dysphonic speakers had visible mucus aggregation on their vocal folds. (2) Persons with hyperfunctional voice disorders had different mucus characteristics than persons with hypofunctional voice disorders (p=0.002). (3) Dysphonic speakers did not differ from non-dysphonic speakers in the frequency of mucus identified on the vocal folds. However, the two groups had different mucus characteristics (p=0.001). Future studies are warranted to determine if these differences in mucus aggregation between persons with and without voice disorders relate to specific aspects of laryngeal pathology or patient characteristics, such as age and gender. Once we understand these relationships, we may be able to use this information to improve our diagnosis and treatment of patients with atypical laryngeal mucus aggregation. PMID:22510352
Duband, S; Timoshenko, A P; Mohammedi, R; Prades, J-M; Barral, F-G; Debout, M; Péoc'h, M
2009-11-20
This study aimed to evaluate laryngoscopic findings in hanging cases and to compare them with magnetic resonance imaging (MRI) and forensic autopsy results. Postmortem nasolaryngofibroscopy and MRI of five people who died from hanging were performed. Three people who died from causes other than hanging were also examined with a flexible laryngofibroscope. The results were compared with injuries discovered during forensic autopsy. In all five hanging cases, laryngofibroscopic investigation showed a vocal fold position in complete adduction confirmed by MRI. This position did not seem to be influenced by the intensity of the forces applied to the neck or by postmortem delay and cadaveric phenomena. The vocal cords of the three non-hanging deceased were found in the intermediate position. These findings could suggest that pressure applied to the cervical nervous and cartilaginous structures or their elongation during hanging could lead to closure of the glottis with vocal cord adduction maintained after death. Laryngofibroscopic examination in hanging cases could be very useful in confirming the vital character of the hanging and understanding asphyxial phenomena in incomplete suspension without laryngeal crush.
Avian vocal mimicry: a unified conceptual framework.
Dalziell, Anastasia H; Welbergen, Justin A; Igic, Branislav; Magrath, Robert D
2015-05-01
Mimicry is a classical example of adaptive signal design. Here, we review the current state of research into vocal mimicry in birds. Avian vocal mimicry is a conspicuous and often spectacular form of animal communication, occurring in many distantly related species. However, the proximate and ultimate causes of vocal mimicry are poorly understood. In the first part of this review, we argue that progress has been impeded by conceptual confusion over what constitutes vocal mimicry. We propose a modified version of Vane-Wright's (1980) widely used definition of mimicry. According to our definition, a vocalisation is mimetic if the behaviour of the receiver changes after perceiving the acoustic resemblance between the mimic and the model, and the behavioural change confers a selective advantage on the mimic. Mimicry is therefore specifically a functional concept where the resemblance between heterospecific sounds is a target of selection. It is distinct from other forms of vocal resemblance including those that are the result of chance or common ancestry, and those that have emerged as a by-product of other processes such as ecological convergence and selection for large song-type repertoires. Thus, our definition provides a general and functionally coherent framework for determining what constitutes vocal mimicry, and takes account of the diversity of vocalisations that incorporate heterospecific sounds. In the second part we assess and revise hypotheses for the evolution of avian vocal mimicry in the light of our new definition. Most of the current evidence is anecdotal, but the diverse contexts and acoustic structures of putative vocal mimicry suggest that mimicry has multiple functions across and within species. There is strong experimental evidence that vocal mimicry can be deceptive, and can facilitate parasitic interactions. There is also increasing support for the use of vocal mimicry in predator defence, although the mechanisms are unclear. 
Less progress has been made in explaining why many birds incorporate heterospecific sounds into their sexual displays, and in determining whether these vocalisations are functionally mimetic or by-products of sexual selection for other traits such as repertoire size. Overall, this discussion reveals a more central role for vocal mimicry in the behavioural ecology of birds than has previously been appreciated. The final part of this review identifies important areas for future research. Detailed empirical data are needed on individual species, including on the structure of mimetic signals, the contexts in which mimicry is produced, how mimicry is acquired, and the ecological relationships between mimic, model and receiver. At present, there is little information and no consensus about the various costs of vocal mimicry for the protagonists in the mimicry complex. The diversity and complexity of vocal mimicry in birds raises important questions for the study of animal communication and challenges our view of the nature of mimicry itself. Therefore, a better understanding of avian vocal mimicry is essential if we are to account fully for the diversity of animal signals. © 2014 The Authors. Biological Reviews © 2014 Cambridge Philosophical Society.
Reproduction of mouse-pup ultrasonic vocalizations by nanocrystalline silicon thermoacoustic emitter
NASA Astrophysics Data System (ADS)
Kihara, Takashi; Harada, Toshihiro; Kato, Masahiro; Nakano, Kiyoshi; Murakami, Osamu; Kikusui, Takefumi; Koshida, Nobuyoshi
2006-01-01
One functional property of an ultrasound generator based on efficient thermal transfer at the nanocrystalline silicon (nc-Si) layer surface, its potential as an ultrasonic simulator of vocalization signals, is demonstrated using acoustic data of mouse-pup calls. The device, composed of a surface-heating thin-film electrode, an nc-Si layer, and a single-crystalline silicon (c-Si) wafer, exhibits an almost completely flat frequency response over a wide range without any mechanical surface vibration systems. It is shown that the fabricated emitter can reproduce digitally recorded ultrasonic mouse-pup vocalizations very accurately in terms of call duration, frequency dispersion, and sound pressure level. The thermoacoustic nc-Si device provides a powerful physical means for understanding ultrasonic communication mechanisms in various living animals.
Birds and anthropogenic noise: are urban songs adaptive?
Nemeth, Erwin; Brumm, Henrik
2010-10-01
In cities with intense low-frequency traffic noise, birds have been observed to sing louder and at a higher pitch. Several studies argue that higher song pitch is an adaptation to reduce masking from noise, and it has even been suggested that the song divergence between urban and nonurban songs might lead to reproductive isolation. Here we present models of signal transmission to compare the benefits of raised song amplitude and song pitch in terms of sound transmission. We chose two bird species that sing with higher pitch in urban areas, the great tit (Parus major) and the blackbird (Turdus merula). For both species, we calculated communication distances in response to different levels of urban noise and in their natural forest habitats. We found that an increase in vocal pitch increased communication distance only marginally. In contrast, vocal amplitude adjustments had a strong and significantly larger effect. Our results indicate that frequency changes of urban songs are not very effective in mitigating masking from traffic noise. Increased song pitch might not be an adaptation to reduce signal masking but a physiological side effect of singing at high amplitudes or an epiphenomenon of urbanization that is not related to signal transmission.
Affective divergence: automatic responses to others' emotions depend on group membership.
Weisbuch, Max; Ambady, Nalini
2008-11-01
Extant research suggests that targets' emotion expressions automatically evoke similar affect in perceivers. The authors hypothesized that the automatic impact of emotion expressions depends on group membership. In Experiments 1 and 2, an affective priming paradigm was used to measure immediate and preconscious affective responses to same-race or other-race emotion expressions. In Experiment 3, spontaneous vocal affect was measured as participants described the emotions of an ingroup or outgroup sports team fan. In these experiments, immediate and spontaneous affective responses depended on whether the emotional target was ingroup or outgroup. Positive responses to fear expressions and negative responses to joy expressions were observed in outgroup perceivers, relative to ingroup perceivers. In Experiments 4 and 5, discrete emotional responses were examined. In a lexical decision task (Experiment 4), facial expressions of joy elicited fear in outgroup perceivers, relative to ingroup perceivers. In contrast, facial expressions of fear elicited less fear in outgroup than in ingroup perceivers. In Experiment 5, felt dominance mediated emotional responses to ingroup and outgroup vocal emotion. These data support a signal-value model in which emotion expressions signal environmental conditions. (c) 2008 APA, all rights reserved.
Soderstrom, Ken; Wilson, Ashley R
2013-11-01
Zebra finch song is a learned behavior dependent upon successful progress through a sensitive period of late-postnatal development. This learning is associated with maturation of distinct brain nuclei and the fiber tract interconnections between them. We have previously found remarkably distinct and dense CB1 cannabinoid receptor expression within many of these song control brain regions, implying a normal role for endocannabinoid signaling in vocal learning. Activation of CB1 receptors via daily treatments with exogenous agonist during sensorimotor stages of song learning (but not in adulthood) results in persistent alteration of song patterns. Now we are working to understand physiological changes responsible for this cannabinoid-altered vocal learning. We have found that song-altering developmental treatments are associated with changes in expression of endocannabinoid signaling elements, including CB1 receptors and the principal CNS endogenous agonist, 2-AG. Within CNS, 2-AG is produced largely through activity of the α isoform of the enzyme diacylglycerol lipase (DAGLα). To better appreciate the role of 2-AG production in normal vocal development we have determined the spatial distribution of DAGLα expression within zebra finch CNS during vocal development. Early during vocal development at 25 days, DAGLα staining is typically light and of fibroid processes. Staining peaks late in the sensorimotor stage of song learning at 75 days and is characterized by fiber, neuropil and some staining of both small and large cell somata. Results provide insight to the normal role for endocannabinoid signaling in the maturation of brain regions responsible for song learning and vocal-motor output, and suggest mechanisms by which exogenous cannabinoid exposure alters acquisition of this form of vocal communication. Copyright © 2013 Elsevier B.V. All rights reserved.
Nonlinear laser scanning microscopy of human vocal folds.
Miri, Amir K; Tripathy, Umakanta; Mongeau, Luc; Wiseman, Paul W
2012-02-01
The purpose of this work was to apply nonlinear laser scanning microscopy (NLSM) for visualizing the morphology of extracellular matrix proteins within human vocal folds. This technique may potentially assist clinicians in making rapid diagnoses of vocal fold tissue disease or damage. Microstructural characterization based on NLSM provides valuable information for better understanding molecular mechanisms and tissue structure. Experimental, ex vivo human vocal fold. A custom-built multimodal nonlinear laser scanning microscope was used to scan fibrillar proteins in three 4% formaldehyde-fixed cadaveric samples. Collagen and elastin, key extracellular matrix proteins in the vocal fold lamina propria, were imaged by two nonlinear microscopy modalities: second harmonic generation (SHG) and two-photon fluorescence (TPF), respectively. An experimental protocol was introduced to characterize the geometrical properties of the imaged fibrous proteins. NLSM revealed the biomorphology of the human vocal fold fibrous proteins. No photobleaching was observed for the incident laser power of ∼60 mW before the excitation objective. Types I and III fibrillar collagen were imaged without label in the tissue by intrinsic SHG. Imaging while rotating the incident laser light-polarization direction confirmed a helical shape for the collagen fibers. The amplitude, periodicity, and overall orientation were then computed for the helically distributed collagen network. The elastin network was simultaneously imaged via TPF and found to have a basket-like structure. In some regions, particularly close to the epithelium, colocalization of both extracellular matrix components was observed. A benchmark study is presented for quantitative real-time, ex vivo, NLSM imaging of the extracellular macromolecules in human vocal fold lamina propria. The results are promising for clinical applications. Copyright © 2011 The American Laryngological, Rhinological, and Otological Society, Inc.
Matthews, Leanna P; Blades, Brittany; Parks, Susan E
2018-01-01
During the breeding season, male harbor seals (Phoca vitulina) make underwater acoustic displays using vocalizations known as roars. These roars have been shown to function in territory establishment in some breeding areas and have been hypothesized to be important for female choice, but the function of these sounds remains unresolved. This study consisted of a series of playback experiments in which captive female harbor seals were exposed to recordings of male roars to determine if females respond to recordings of male vocalizations and whether or not they respond differently to roars from categories with different acoustic characteristics. The categories included roars with characteristics of dominant males (longest duration, lowest frequency), subordinate males (shortest duration, highest frequency), combinations of call parameters from dominant and subordinate males (long duration, high frequency and short duration, low frequency), and control playbacks of water noise and water noise with tonal signals in the same frequency range as male signals. Results indicate that overall females have a significantly higher level of response to playbacks that imitate male vocalizations when compared to control playbacks of water noise. Specifically, there was a higher level of response to playbacks representing dominant male vocalization when compared to the control playbacks. For most individuals, there was a greater response to playbacks representing dominant male vocalizations compared to playbacks representing subordinate male vocalizations; however, there was no statistical difference between those two playback types. Additionally, there was no difference between the playbacks of call parameter combinations and the controls. Investigating female preference for male harbor seal vocalizations is a critical step in understanding the harbor seal mating system and further studies expanding on this captive study will help shed light on this important issue. PMID:29607261
Advances in our understanding of the Reinke space.
Thibeault, Susan L
2005-06-01
Normal vocal fold vibration depends critically upon the composition of the Reinke space or the lamina propria extracellular matrix. Alterations in the normal composition of the extracellular matrix result in a loss of normal vibratory function. In this article, the present literature on the Reinke space in normal and disease states is reviewed including publications in the multidisciplinary fields of biomechanics, histology, molecular biology, and tissue engineering. With recent technology advances, the etiology for benign lesions has been investigated with computer models and bioreactors. Particular extracellular matrix constituents in various benign vocal fold lesions--fibronectin, fibromodulin and hyaluronan--appear to be involved in altering the viscoelastic properties of the Reinke space. Significant basic science approaches to characterizing the Reinke space in vocal fold scarring have produced several potential future treatment avenues. Tissue-engineering approaches for regeneration of the Reinke space are the most recent addition to the literature showing promising research directions. Voice disorders represent a significant clinical problem. Research attempting to discover the underlying molecular and genetic regulation and homeostasis of the extracellular matrix of the Reinke space is essential. Effective future clinical interventions must be based upon the knowledge of how genetic and biologic features are disturbed in vocal diseases and how they relate to vocal symptoms.
3D analysis of the movements of the laryngeal cartilages during singing.
Unteregger, Fabian; Honegger, Flurin; Potthast, Silke; Zwicky, Salome; Schiwowa, Julia; Storck, Claudio
2017-07-01
The vocal range of untrained singers rarely exceeds one and a half octaves, but professional singers have a range of at least two and a half octaves. The aim of this study was to better understand the muscle and cartilage movements responsible for the control of vocal pitch in singing. Prospective study. We recruited 49 female professional singers (25 sopranos and 24 altos) and analyzed laryngeal three-dimensional images derived from high-resolution computed tomography scans obtained at the mean speaking fundamental frequency (F0) and at one (F1) and two octaves (F2) above this pitch. From F0 to F1, the only observable movement was a backward cricoid tilting caused by the cricothyroid muscles (CTMs), leading to vocal fold stretching. Above F1, a medial rotation and inward rocking of the arytenoid cartilages was observed, caused by the lateral cricoarytenoid muscles (LCAMs) and leading to inferior displacement of the vocal process of the arytenoid cartilage, and thus to further vocal fold stretching. Trained singers achieve the first octave of pitch elevation by simple cricothyroid approximation. Further pitch elevation necessitates a complex movement of the arytenoids, first by CTM contraction and second by LCAM contraction. 4. Laryngoscope, 127:1639-1643, 2017. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Contextual effects of noise on vocalization encoding in primary auditory cortex
Ni, Ruiye; Bender, David A.; Shanechi, Amirali M.; Gamble, Jeffrey R.; Barbour, Dennis L.
2017-02-01
Robust auditory perception plays a pivotal function for processing behaviorally relevant sounds, particularly with distractions from the environment. The neuronal coding enabling this ability, however, is still not well understood. In this study, we recorded single-unit activity from the primary auditory cortex (A1) of awake marmoset monkeys (Callithrix jacchus) while delivering conspecific vocalizations degraded by two different background noises: broadband white noise and vocalization babble. Noise effects on neural representation of target vocalizations were quantified by measuring the responses' similarity to those elicited by natural vocalizations as a function of signal-to-noise ratio. A clustering approach was used to describe the range of response profiles by reducing the population responses to a summary of four response classes (robust, balanced, insensitive, and brittle) under both noise conditions. This clustering approach revealed that, on average, approximately two-thirds of the neurons change their response class when encountering different noises. Therefore, the distortion induced by one particular masking background in single-unit responses is not necessarily predictable from that induced by another, suggesting the low likelihood of a unique group of noise-invariant neurons across different background conditions in A1. Regarding noise influence on neural activities, the brittle response group showed addition of spiking activity both within and between phrases of vocalizations relative to clean vocalizations, whereas the other groups generally showed spiking activity suppression within phrases, and the alteration between phrases was noise dependent. Overall, the variable single-unit responses, yet consistent response types, imply that primate A1 performs scene analysis through the collective activity of multiple neurons.
NEW & NOTEWORTHY The understanding of where and how auditory scene analysis is accomplished is of broad interest to neuroscientists. In this paper, we systematically investigated neuronal coding of multiple vocalizations degraded by two distinct noises at various signal-to-noise ratios in nonhuman primates. In the process, we uncovered heterogeneity of single-unit representations for different auditory scenes yet homogeneity of responses across the population. Copyright © 2017 the American Physiological Society. PMID:27881720
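The abstract above refers to vocalizations degraded by background noise at various signal-to-noise ratios but does not say how the stimuli were mixed. As a minimal sketch, assuming the conventional definition SNR(dB) = 10 log10(P_signal / P_noise) with P the mean squared amplitude, a noise waveform can be rescaled to reach a target SNR before being added to the signal. The function names and the toy waveforms below are illustrative assumptions, not taken from the study.

```python
import math
import random


def mean_power(samples):
    """Mean squared amplitude of a waveform given as a list of floats."""
    return sum(x * x for x in samples) / len(samples)


def scale_noise_for_snr(signal, noise, snr_db):
    """Rescale `noise` so that 10*log10(P_signal / P_noise) equals snr_db."""
    p_signal = mean_power(signal)
    p_noise = mean_power(noise)
    if p_signal == 0 or p_noise == 0:
        raise ValueError("signal and noise must both have non-zero power")
    gain = math.sqrt(p_signal / (p_noise * 10 ** (snr_db / 10)))
    return [gain * n for n in noise]


def mix_at_snr(signal, noise, snr_db):
    """Add rescaled noise to the signal, sample by sample."""
    scaled = scale_noise_for_snr(signal, noise, snr_db)
    return [s + n for s, n in zip(signal, scaled)]


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy stand-ins: a pure tone for the call, Gaussian samples for the noise.
    tone = [math.sin(2 * math.pi * 0.01 * i) for i in range(2000)]
    noise = [rng.gauss(0.0, 1.0) for _ in range(2000)]
    for snr_db in (-10, 0, 10):
        scaled = scale_noise_for_snr(tone, noise, snr_db)
        achieved = 10 * math.log10(mean_power(tone) / mean_power(scaled))
        print(snr_db, round(achieved, 6))
```

Scaling the noise rather than the signal keeps the target waveform identical across conditions, which is one common design choice when comparing responses across SNR levels.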
Vocal development and auditory perception in CBA/CaJ mice
NASA Astrophysics Data System (ADS)
Radziwon, Kelly E.
Mice are useful laboratory subjects because of their small size, their modest cost, and the fact that researchers have created many different strains to study a variety of disorders. In particular, researchers have found nearly 100 naturally occurring mouse mutations with hearing impairments. For these reasons, mice have become an important model for studies of human deafness. Although much is known about the genetic makeup and physiology of the laboratory mouse, far less is known about mouse auditory behavior. To fully understand the effects of genetic mutations on hearing, it is necessary to determine the hearing abilities of these mice. Two experiments here examined various aspects of mouse auditory perception using CBA/CaJ mice, a commonly used mouse strain. The frequency difference limens experiment tested the mouse's ability to discriminate one tone from another based solely on the frequency of the tone. The mice had similar thresholds as wild mice and gerbils but needed a larger change in frequency than humans and cats. The second psychoacoustic experiment sought to determine which cue, frequency or duration, was more salient when the mice had to identify various tones. In this identification task, the mice overwhelmingly classified the tones based on frequency instead of duration, suggesting that mice are using frequency when differentiating one mouse vocalization from another. The other two experiments were more naturalistic and involved both auditory perception and mouse vocal production. Interest in mouse vocalizations is growing because of the potential for mice to become a model of human speech disorders. These experiments traced mouse vocal development from infant to adult, and they tested the mouse's preference for various vocalizations. This was the first known study to analyze the vocalizations of individual mice across development. 
Results showed large variation in calling rates among the three cages of adult mice but results were highly consistent across all infant vocalizations. Although the preference experiment did not reveal significant differences between various mouse vocalizations, suggestions are given for future attempts to identify mouse preferences for auditory stimuli.
Tilsen, Sam; Spincemaille, Pascal; Xu, Bo; Doerschuk, Peter; Luh, Wen-Ming; Feldman, Elana; Wang, Yi
2016-01-01
Models of speech production typically assume that control over the timing of speech movements is governed by the selection of higher-level linguistic units, such as segments or syllables. This study used real-time magnetic resonance imaging of the vocal tract to investigate the anticipatory movements speakers make prior to producing a vocal response. Two factors were varied: preparation (whether or not speakers had foreknowledge of the target response) and pre-response constraint (whether or not speakers were required to maintain a specific vocal tract posture prior to the response). In prepared responses, many speakers were observed to produce pre-response anticipatory movements with a variety of articulators, showing that speech movements can be readily dissociated from higher-level linguistic units. Substantial variation was observed across speakers with regard to the articulators used for anticipatory posturing and the contexts in which anticipatory movements occurred. The findings of this study have important consequences for models of speech production and for our understanding of the normal range of variation in anticipatory speech behaviors. PMID:26760511
Using nonlinear methods to quantify changes in infant limb movements and vocalizations
Abney, Drew H.; Warlaumont, Anne S.; Haussman, Anna; Ross, Jessica M.; Wallot, Sebastian
2014-01-01
The pairing of dynamical systems theory and complexity science brings novel concepts and methods to the study of infant motor development. Accordingly, this longitudinal case study presents a new approach to characterizing the dynamics of infant limb and vocalization behaviors. A single infant's vocalizations and limb movements were recorded from 51-days to 305-days of age. On each recording day, accelerometers were placed on all four of the infant's limbs and an audio recorder was worn on the child's chest. Using nonlinear time series analysis methods, such as recurrence quantification analysis and Allan factor, we quantified changes in the stability and multiscale properties of the infant's behaviors across age as well as how these dynamics relate across modalities and effectors. We observed that particular changes in these dynamics preceded or coincided with the onset of various developmental milestones. For example, the largest changes in vocalization dynamics preceded the onset of canonical babbling. The results show that nonlinear analyses can help to understand the functional co-development of different aspects of infant behavior. PMID:25161629
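The abstract above names the Allan factor without defining it. As a minimal sketch, assuming the standard definition AF(T) = E[(N_{k+1} - N_k)^2] / (2 E[N_k]), where N_k is the number of events (for example, vocalization onsets) in the k-th consecutive window of length T, the statistic can be computed as follows. The function name, its parameters, and the toy event trains are illustrative assumptions and do not come from the study.

```python
import math
import random


def allan_factor(event_times, window, t_start, t_end):
    """Allan factor of an event train for one counting-window length.

    Counts events in consecutive non-overlapping windows of length `window`
    between t_start and t_end, then returns the mean squared difference of
    adjacent counts divided by twice the mean count.
    """
    n_windows = int(math.floor((t_end - t_start) / window + 1e-9))
    if n_windows < 2:
        return float("nan")  # need at least two windows to form a difference
    counts = [0] * n_windows
    for t in event_times:
        k = int((t - t_start) // window)
        if 0 <= k < n_windows:
            counts[k] += 1
    mean_count = sum(counts) / n_windows
    if mean_count == 0:
        return float("nan")  # undefined for an empty event train
    sq_diffs = [(b - a) ** 2 for a, b in zip(counts, counts[1:])]
    return (sum(sq_diffs) / len(sq_diffs)) / (2.0 * mean_count)


if __name__ == "__main__":
    # A perfectly regular train has identical counts in every window.
    regular = [k + 0.5 for k in range(100)]
    print(allan_factor(regular, 1.0, 0.0, 100.0))

    # Uniformly scattered events approximate a Poisson process.
    rng = random.Random(0)
    scattered = [rng.uniform(0.0, 10000.0) for _ in range(10000)]
    for window in (1.0, 10.0, 100.0):
        print(window, allan_factor(scattered, window, 0.0, 10000.0))
```

For a Poisson process the Allan factor is close to 1 at every window length, while values that grow with window length indicate clustering of events across time scales; evaluating the statistic over a range of window lengths is what makes it a multiscale measure.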
Evolution of vocal patterns: tuning hindbrain circuits during species divergence.
Barkan, Charlotte L; Zornik, Erik; Kelley, Darcy B
2017-03-01
The neural circuits underlying divergent courtship behaviors of closely related species provide a framework for insight into the evolution of motor patterns. In frogs, male advertisement calls serve as unique species identifiers and females prefer conspecific to heterospecific calls. Advertisement calls of three relatively recently (∼8.5 Mya) diverged species - Xenopus laevis, X. petersii and X. victorianus - include rapid trains of sound pulses (fast trills). We show that while fast trills are similar in pulse rate (∼60 pulses s⁻¹) across the three species, they differ in call duration and period (time from the onset of one call to the onset of the following call). Previous studies of call production in X. laevis used an isolated brain preparation in which the laryngeal nerve produces compound action potentials that correspond to the advertisement call pattern (fictive calling). Here, we show that serotonin evokes fictive calling in X. petersii and X. victorianus as it does in X. laevis. As in X. laevis, fictive fast trill in X. petersii and X. victorianus is accompanied by an N-methyl-d-aspartate receptor-dependent local field potential wave in a rostral hindbrain nucleus, DTAM. Across the three species, wave duration and period are strongly correlated with species-specific fast trill duration and period, respectively. When DTAM is isolated from the more rostral forebrain and midbrain and/or more caudal laryngeal motor nucleus, the wave persists at species-typical durations and periods. Thus, intrinsic differences within DTAM could be responsible for the evolutionary divergence of call patterns across these related species. © 2017. Published by The Company of Biologists Ltd.
Tobias, Joseph A; Seddon, Nathalie
2009-12-01
Natural selection is known to produce convergent phenotypes through mimicry or ecological adaptation. It has also been proposed that social selection--i.e., selection exerted by social competition--may drive convergent evolution in signals mediating interspecific communication, yet this idea remains controversial. Here, we use color spectrophotometry, acoustic analyses, and playback experiments to assess the hypothesis of adaptive signal convergence in two competing nonsister taxa, Hypocnemis peruviana and H. subflava (Aves: Thamnophilidae). We show that the structure of territorial songs in males overlaps in sympatry, with some evidence of convergent character displacement. Conversely, nonterritorial vocal and visual signals in males are strikingly diagnostic, in line with 6.8% divergence in mtDNA sequences. The same pattern of variation applies to females. Finally, we show that songs in both sexes elicit strong territorial responses within and between species, whereas songs of a third, allopatric and more closely related species (H. striata) are structurally divergent and elicit weaker responses. Taken together, our results provide compelling evidence that social selection can act across species boundaries to drive convergent or parallel evolution in taxa competing for space and resources.
Acoustics of the trained versus untrained singing voice.
Howard, David M
2009-06-01
Acoustic voice analysis is now widely available on today's multimedia computers and knowledge of the acoustics of the trained and untrained singing voice has advanced dramatically in recent years. New techniques have emerged that are providing clearer representations of aspects of the physiology of voice function and a greater understanding of the differences between the voices of untrained and trained singers. Improvements in endoscope technology are changing understanding of vocal fold function and videokymography provides a new way of interpreting the output; some new and interesting possibilities are emerging. Larynx height variation is a feature of untrained singing and of singing in different styles, and its measurement has been inaccurate hitherto; perhaps the laryngoaltimeter will provide a solution. Magnetic resonance imaging is now a vital tool for vocal tract shape measurement, but a new bio-inspired computing approach is offering a possible alternative. Differences between an untrained and trained singing voice lie in one or more of breathing technique, larynx settings or vocal tract settings. Measurement techniques in each of these areas are important to provide data on the singing voice, and accurate data are essential for natural personalized electronic voice synthesis in the future.
Blend in Singing Ensemble Performance: Vibrato Production in a Vocal Quartet.
Daffern, Helena
2017-05-01
"Blend" is a defining characteristic of good vocal ensemble performance. To achieve this, directors often consider vibrato as a feature to be controlled and consequently restrict its use. Analysis of individual voices in ensemble situations presents several challenges, including the isolation of voices for analysis from recordings. This study considers vibrato production as a feature that contributes to blend through an ecological study of a vocal quartet. A vocal ensemble was recorded using head-worn microphones and electrolaryngograph electrodes to enable fundamental frequency analysis of the individual voices. The same four-part material was recorded over several weeks of rehearsal to allow analysis of conscious and subconscious changes to vibrato production over time. Alongside the recording of their rehearsal discussions, singers were also asked for opinions on vibrato production in connection with blend. The results indicate that vibrato is adjusted to some extent by individual singers to improve blend, with some instances of synchrony between voice parts. Some conscious alterations to vibrato were made to improve blend; however, these are not always evident in the data, suggesting that singers' own perceptions of their performance may be influenced by other factors. These findings indicate a need for further studies of vibrato as a feature of blend, particularly in terms of the synergies between expectation and actual production, and potential synchronicity between singers; increased understanding of vibrato in an ensemble setting will lead to more efficient rehearsal techniques and vocal training, and could prevent vocal misuse leading to pathology in the future. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Synthetic, multi-layer, self-oscillating vocal fold model fabrication.
Murray, Preston R; Thomson, Scott L
2011-12-02
Sound for the human voice is produced via flow-induced vocal fold vibration. The vocal folds consist of several layers of tissue, each with differing material properties. Normal voice production relies on healthy tissue and vocal folds, and occurs as a result of complex coupling between aerodynamic, structural dynamic, and acoustic physical phenomena. Voice disorders affect up to 7.5 million annually in the United States alone and often result in significant financial, social, and other quality-of-life difficulties. Understanding the physics of voice production has the potential to significantly benefit voice care, including clinical prevention, diagnosis, and treatment of voice disorders. Existing methods for studying voice production include in vivo experimentation using human and animal subjects, in vitro experimentation using excised larynges and synthetic models, and computational modeling. Owing to hazardous and difficult instrument access, in vivo experiments are severely limited in scope. Excised larynx experiments have the benefit of anatomical and some physiological realism, but parametric studies involving geometric and material property variables are limited. Further, they are typically only able to be vibrated for relatively short periods of time (typically on the order of minutes). Overcoming some of the limitations of excised larynx experiments, synthetic vocal fold models are emerging as a complementary tool for studying voice production. Synthetic models can be fabricated with systematic changes to geometry and material properties, allowing for the study of healthy and unhealthy human phonatory aerodynamics, structural dynamics, and acoustics. For example, they have been used to study left-right vocal fold asymmetry, clinical instrument development, laryngeal aerodynamics, vocal fold contact pressure, and subglottal acoustics (a more comprehensive list can be found in Kniesburges et al.) 
Existing synthetic vocal fold models, however, have either been homogenous (one-layer models) or have been fabricated using two materials of differing stiffness (two-layer models). This approach does not allow for representation of the actual multi-layer structure of the human vocal folds that plays a central role in governing vocal fold flow-induced vibratory response. Consequently, one- and two-layer synthetic vocal fold models have exhibited disadvantages such as higher onset pressures than what are typical for human phonation (onset pressure is the minimum lung pressure required to initiate vibration), unnaturally large inferior-superior motion, and lack of a "mucosal wave" (a vertically-traveling wave that is characteristic of healthy human vocal fold vibration). In this paper, fabrication of a model with multiple layers of differing material properties is described. The model layers simulate the multi-layer structure of the human vocal folds, including epithelium, superficial lamina propria (SLP), intermediate and deep lamina propria (i.e., ligament; a fiber is included for anterior-posterior stiffness), and muscle (i.e., body) layers. Results are included that show that the model exhibits improved vibratory characteristics over prior one- and two-layer synthetic models, including onset pressure closer to human onset pressure, reduced inferior-superior motion, and evidence of a mucosal wave.
A Mozart is not a Pavarotti: singers outperform instrumentalists on foreign accent imitation
Christiner, Markus; Reiterer, Susanne Maria
2015-01-01
Recent findings have shown that people with higher musical aptitude were also better in oral language imitation tasks. However, whether singing capacity and instrument playing contribute differently to the imitation of speech has been ignored so far. Research has only recently started to recognize that instrumentalists develop quite distinct skills when compared to vocalists. In the same vein, the role of the vocal motor system in language acquisition processes has been poorly investigated, as most investigations (neurobiological and behavioral) favor examining speech perception. We set out to test whether the vocal motor system can influence the ability to learn, produce and perceive new languages by contrasting instrumentalists and vocalists. Therefore, we investigated 96 participants: 27 instrumentalists, 33 vocalists and 36 non-musicians/non-singers. They were tested for their abilities to imitate foreign speech: unknown language (Hindi), second language (English) and their musical aptitude. Results revealed that both instrumentalists and vocalists have a higher ability to imitate unintelligible speech and foreign accents than non-musicians/non-singers. Within the musician group, vocalists outperformed instrumentalists significantly. Conclusion: First, adaptive plasticity for speech imitation is not reliant on audition alone but also on vocal-motor induced processes. Second, vocal flexibility of singers goes together with higher speech imitation aptitude. Third, vocal motor training, as of singers, may speed up foreign language acquisition processes. PMID:26379537
Vocal learning in the functionally referential food grunts of chimpanzees.
Watson, Stuart K; Townsend, Simon W; Schel, Anne M; Wilke, Claudia; Wallace, Emma K; Cheng, Leveda; West, Victoria; Slocombe, Katie E
2015-02-16
One standout feature of human language is our ability to reference external objects and events with socially learned symbols, or words. Exploring the phylogenetic origins of this capacity is therefore key to a comprehensive understanding of the evolution of language. While non-human primates can produce vocalizations that refer to external objects in the environment, it is generally accepted that their acoustic structure is fixed and a product of arousal states. Indeed, it has been argued that the apparent lack of flexible control over the structure of referential vocalizations represents a key discontinuity with language. Here, we demonstrate vocal learning in the acoustic structure of referential food grunts in captive chimpanzees. We found that, following the integration of two groups of adult chimpanzees, the acoustic structure of referential food grunts produced for a specific food converged over 3 years. Acoustic convergence arose independently of preference for the food, and social network analyses indicated this only occurred after strong affiliative relationships were established between the original subgroups. We argue that these data represent the first evidence of non-human animals actively modifying and socially learning the structure of a meaningful referential vocalization from conspecifics. Our findings indicate that primate referential call structure is not simply determined by arousal and that the socially learned nature of referential words in humans likely has ancient evolutionary origins. Copyright © 2015 Elsevier Ltd. All rights reserved.
Van Stan, Jarrad H.; Mehta, Daryush D.; Zeitels, Steven M.; Burns, James A.; Barbu, Anca M.; Hillman, Robert E.
2015-01-01
Objectives: Clinical management of phonotraumatic vocal fold lesions (nodules, polyps) is based largely on assumptions that abnormalities in habitual levels of sound pressure level (SPL), fundamental frequency (f0), and/or amount of voice use play a major role in lesion development and chronic persistence. This study used ambulatory voice monitoring to evaluate if significant differences in voice use exist between patients with phonotraumatic lesions and normal matched controls. Methods: Subjects were 70 adult females: 35 with vocal fold nodules or polyps and 35 age-, sex-, and occupation-matched normal individuals. Weeklong summary statistics of voice use were computed from anterior neck surface acceleration recorded using a smartphone-based ambulatory voice monitor. Results: Paired t-tests and Kolmogorov-Smirnov tests resulted in no statistically significant differences between patients and matched controls regarding average measures of SPL, f0, vocal dose measures, and voicing/voice rest periods. Paired t-tests comparing f0 variability between the groups resulted in statistically significant differences with moderate effect sizes. Conclusions: Individuals with phonotraumatic lesions did not exhibit differences in average ambulatory measures of vocal behavior when compared with matched controls. More refined characterizations of underlying phonatory mechanisms and other potentially contributing causes are warranted to better understand risk factors associated with phonotraumatic lesions. PMID:26024911
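As an illustration of the paired comparison named in the abstract above, the following is a minimal sketch of a paired t statistic and a paired-samples effect size (Cohen's d_z). The function name `paired_t` and the numbers are hypothetical and are not taken from the study; the abstract does not say which effect size measure the authors used, so d_z is an assumption.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t(patients, controls):
    """Return (t statistic, Cohen's d_z, degrees of freedom) for matched pairs."""
    # Each patient is compared with their own matched control.
    diffs = [p - c for p, c in zip(patients, controls)]
    n = len(diffs)
    sd = stdev(diffs)                 # sample standard deviation of the differences
    t = mean(diffs) / (sd / sqrt(n))  # paired t statistic
    d_z = mean(diffs) / sd            # effect size for paired designs (assumed choice)
    return t, d_z, n - 1


# Hypothetical f0 variability values (Hz), one per matched pair; not study data.
patients = [21.0, 24.0, 26.0, 29.0]
controls = [20.0, 22.0, 23.0, 25.0]
t_stat, effect_size, dof = paired_t(patients, controls)
print(f"t({dof}) = {t_stat:.3f}, d_z = {effect_size:.3f}")
```

The p-value would then be read from a t distribution with `dof` degrees of freedom; that step is left out here to keep the sketch free of third-party dependencies.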
Cornez, Gilles; Madison, Farrah N; Van der Linden, Annemie; Cornil, Charlotte; Yoder, Kathleen M; Ball, Gregory F; Balthazart, Jacques
2017-09-01
Perineuronal nets (PNN) are aggregations of chondroitin sulfate proteoglycans surrounding the soma and proximal processes of neurons, mostly GABAergic interneurons expressing parvalbumin. They limit the plasticity of their afferent synaptic connections. In zebra finches PNN develop in an experience-dependent manner in the song control nuclei HVC and RA (nucleus robustus arcopallialis) when young birds crystallize their song. Because songbird species that are open-ended learners tend to recapitulate each year the different phases of song learning until their song crystallizes at the beginning of the breeding season, we tested whether seasonal changes in PNN expression would be found in the song control nuclei of a seasonally breeding species such as the European starling. Only minimal changes in PNN densities and total number of cells surrounded by PNN were detected. However, comparison of the density of PNN and of PNN surrounding parvalbumin-positive cells revealed that these structures are far less numerous in starlings that show extensive adult vocal plasticity, including learning of new songs throughout the year, than in the closed-ended learner zebra finches. Canaries that also display some vocal plasticity across season but were never formally shown to learn new songs in adulthood were intermediate in this respect. Together these data suggest that establishment of PNN around parvalbumin-positive neurons in song control nuclei has diverged during evolution to control the different learning capacities observed in songbird species. This differential expression of PNN in different songbird species could represent a key cellular mechanism mediating species variation between closed-ended and open-ended learning strategies. © 2017 Wiley Periodicals, Inc. Develop Neurobiol 77: 975-994, 2017. © 2017 Wiley Periodicals, Inc.
Ey, E; Yang, M; Katz, A M; Woldeyohannes, L; Silverman, J L; Leblond, C S; Faure, P; Torquet, N; Le Sourd, A-M; Bourgeron, T; Crawley, J N
2012-11-01
Mutations in NLGN4X have been identified in individuals with autism spectrum disorders and other neurodevelopmental disorders. A previous study reported that adult male mice lacking neuroligin4 (Nlgn4) displayed social approach deficits in the three-chambered test, altered aggressive behaviors and reduced ultrasonic vocalizations. To replicate and extend these findings, independent comprehensive analyses of autism-relevant behavioral phenotypes were conducted in later generations of the same line of Nlgn4 mutant mice at the National Institute of Mental Health in Bethesda, MD, USA and at the Institut Pasteur in Paris, France. Adult social approach was normal in all three genotypes of Nlgn4 mice tested at both sites. Reciprocal social interactions in juveniles were similarly normal across genotypes. No genotype differences were detected in ultrasonic vocalizations in pups separated from the nest or in adults during reciprocal social interactions. Anxiety-like behaviors, self-grooming, rotarod and open field exploration did not differ across genotypes, and measures of developmental milestones and general health were normal. Our findings indicate an absence of autism-relevant behavioral phenotypes in subsequent generations of Nlgn4 mice tested at two locations. Testing environment and methods differed from the original study in some aspects, although the presence of normal sociability was seen in all genotypes when methods taken from Jamain et al. (2008) were used. The divergent results obtained from this study indicate that phenotypes may not be replicable across breeding generations, and highlight the significant roles of environmental, generational and/or procedural factors on behavioral phenotypes. Published 2012. This article is a U.S. Government work and is in the public domain in the USA.
NASA Astrophysics Data System (ADS)
Schwalm, Afton Leigh
California sea lions (Zalophus californianus) are a highly popular and easily recognized marine mammal in zoos, aquariums, and circuses, and are often seen by ocean visitors. They are highly vocal and gregarious on land. Surprisingly, little research has been performed on the vocalization types, source levels, acoustic properties, and functions of airborne sounds used by California sea lions. This research on airborne vocalizations of California sea lions will advance the understanding of this aspect of California sea lion communication, as well as examine the relationship between health condition and acoustic behavior. Using a Phillips® digital recorder with attached microphone and a calibrated RadioShack® sound pressure level meter, acoustical data were recorded opportunistically on California sea lions during rehabilitation at The Marine Mammal Center (TMMC) in Sausalito, CA. Vocalizations were analyzed using frequency, time, and amplitude variables with Raven Pro: Interactive Sound Analysis Software Version 1.4 (The Cornell Lab of Ornithology, Ithaca, NY). Five frequency, three time, and four amplitude variables were analyzed for each vocalization. Differences in frequency, time, and amplitude variables were not significant by sex. The older California sea lion group produced vocalizations that were significantly lower in four frequency variables, significantly longer in two time variables, significantly higher in calibrated maximum and minimum amplitude variables, and significantly lower in frequency at maximum and minimum amplitude compared with pups. Six call types were identified: bark, goat, growl/grumble, bark/grumble, bark/growl, and grumble/moan. The growl/grumble call was higher in dominant beginning, ending, and minimum frequency, as well as in the frequency at maximum amplitude, compared with the bark, goat, and bark/grumble calls in the first versus last vocalization sample.
The goat call was significantly higher in first harmonic interval than any other call type in the all vocalizations sample. The "fate" of a sea lion was categorized as: released, placed at another facility, remained at TMMC, euthanized, or died. To determine if acoustic features could be used to assess the recovery of a pup, the acoustic features of a pup's first recorded vocalization were compared with the frequency, time, and amplitude of the last vocalization recorded (i.e., before it was released or placed at another facility). In addition, all first vocalizations were pooled and all last vocalizations were pooled for acoustic analysis, regardless of their fate. Released pups had shorter duration calls, a greater first harmonic interval, and a higher dominant maximum frequency than either pups that died or pups remaining at TMMC. Released pups had a higher frequency at maximum and minimum amplitude compared to dead and remaining pups. Pups that died had significantly lower dominant ending frequency and a lower dominant minimum frequency than released or remaining pups. These results were supported by other studies on different species of otariids, phocids, and cetaceans. The preliminary analyses presented in this thesis hold promise that, with additional data, acoustic features of California sea lion airborne vocalizations could indicate sex, age, and possibly health condition or the potential for release.
Neural imaging in songbirds using fiber optic fluorescence microscopy
NASA Astrophysics Data System (ADS)
Nooshabadi, Fatemeh; Hearn, Gentry; Lints, Thierry; Maitland, Kristen C.
2012-02-01
The song control system of juvenile songbirds is an important model for studying the developmental acquisition and generation of complex learned vocal motor sequences, two processes that are fundamental to human speech and language. To understand the neural mechanisms underlying song production, it is critical to characterize the activity of identified neurons in the song control system when the bird is singing. Neural imaging in unrestrained singing birds, although technically challenging, will advance our understanding of neural ensemble coding mechanisms in this system. We are exploring the use of a fiber optic microscope for functional imaging in the brain of behaving and singing birds in order to better understand the contribution of a key brain nucleus (high vocal center nucleus; HVC) to temporal aspects of song motor control. We have constructed a fluorescence microscope with LED illumination, a fiber bundle for transmission of fluorescence excitation and emission light, a ~2x GRIN lens, and a CCD for image acquisition. The system has 2 μm resolution, 375 μm field of view, 200 μm working distance, and 1 mm outer diameter. As an initial characterization of this setup, neurons in HVC were imaged using the fiber optic microscope after injection of quantum dots or fluorescent retrograde tracers into different song nuclei. A Lucid Vivascope confocal microscope was used to confirm the imaging results. Long-term imaging of the activity of these neurons in juvenile birds during singing may lead us to a better understanding of the central motor codes for song and the central mechanism by which auditory experience modifies song motor commands to enable vocal learning and imitation.
Impaired perception of facial emotion in developmental prosopagnosia.
Biotti, Federica; Cook, Richard
2016-08-01
Developmental prosopagnosia (DP) is a neurodevelopmental condition characterised by difficulties recognising faces. Despite severe difficulties recognising facial identity, expression recognition is typically thought to be intact in DP; case studies have described individuals who are able to correctly label photographic displays of facial emotion, and no group differences have been reported. This pattern of deficits suggests a locus of impairment relatively late in the face processing stream, after the divergence of expression and identity analysis pathways. To date, however, there has been little attempt to investigate emotion recognition systematically in a large sample of developmental prosopagnosics using sensitive tests. In the present study, we describe three complementary experiments that examine emotion recognition in a sample of 17 developmental prosopagnosics. In Experiment 1, we investigated observers' ability to make binary classifications of whole-face expression stimuli drawn from morph continua. In Experiment 2, observers judged facial emotion using only the eye-region (the rest of the face was occluded). Analyses of both experiments revealed diminished ability to classify facial expressions in our sample of developmental prosopagnosics, relative to typical observers. Imprecise expression categorisation was particularly evident in those individuals exhibiting apperceptive profiles, associated with problems encoding facial shape accurately. Having split the sample of prosopagnosics into apperceptive and non-apperceptive subgroups, only the apperceptive prosopagnosics were impaired relative to typical observers. In our third experiment, we examined the ability of observers to classify the emotion present within segments of vocal affect. Despite difficulties judging facial emotion, the prosopagnosics exhibited excellent recognition of vocal affect.
Contrary to the prevailing view, our results suggest that many prosopagnosics do experience difficulties classifying expressions, particularly those with apperceptive profiles. These individuals may have difficulties forming view-invariant structural descriptions at an early stage in the face processing stream, before identity and expression pathways diverge. Copyright © 2016 Elsevier Ltd. All rights reserved.
Understanding The Neural Mechanisms Involved In Sensory Control Of Voice Production
Parkinson, Amy L.; Flagmeier, Sabina G.; Manes, Jordan L.; Larson, Charles R.; Rogers, Bill; Robin, Donald A.
2012-01-01
Auditory feedback is important for the control of voice fundamental frequency (F0). In the present study we used neuroimaging to identify regions of the brain responsible for sensory control of the voice. We used a pitch-shift paradigm where subjects respond to an alteration, or shift, of voice pitch auditory feedback with a reflexive change in F0. To determine the neural substrates involved in these audio-vocal responses, subjects underwent fMRI scanning while vocalizing with or without pitch-shifted feedback. The comparison of shifted and unshifted vocalization revealed activation bilaterally in the superior temporal gyrus (STG) in response to the pitch shifted feedback. We hypothesize that the STG activity is related to error detection by auditory error cells located in the superior temporal cortex and efference copy mechanisms whereby this region is responsible for the coding of a mismatch between actual and predicted voice F0. PMID:22406500
Hogikyan, N D; Appel, S; Guinn, L W; Haxer, M J
1999-03-01
This study was undertaken to better understand current regional opinions regarding vocal fold nodules in adult singers. A questionnaire was sent to 298 persons representing the 3 professional groups most involved with the care of singers with vocal nodules: otolaryngologists, speech pathologists, and teachers of singing. The questionnaire queried respondents about their level of experience with this problem, and their beliefs about causative factors, career impact, and optimum treatment. Responses within and between groups were similar, with differences between groups primarily in the magnitude of positive or negative responses, rather than in the polarity of the responses. Prevailing opinions included: recognition of causative factors in both singing and speaking voice practices, optimism about responsiveness to appropriate treatment, enthusiasm for coordinated voice therapy and voice training as first-line treatment, and acceptance of microsurgical management as appropriate treatment if behavioral management fails.
In vitro experimental investigation of voice production
Horáček, Jaromír; Brücker, Christoph; Becker, Stefan
2012-01-01
The process of human phonation involves a complex interaction between the physical domains of structural dynamics, fluid flow, and acoustic sound production and radiation. Given the high degree of nonlinearity of these processes, even small anatomical or physiological disturbances can significantly affect the voice signal. In the worst cases, patients can lose their voice and hence the normal mode of speech communication. To improve medical therapies and surgical techniques it is very important to understand better the physics of the human phonation process. Due to the limited experimental access to the human larynx, alternative strategies, including artificial vocal folds, have been developed. The following review gives an overview of experimental investigations of artificial vocal folds within the last 30 years. The models are sorted into three groups: static models, externally driven models, and self-oscillating models. The focus is on the different models of the human vocal folds and on the ways in which they have been applied. PMID:23181007
Garland, Ellen C; Goldizen, Anne W; Lilley, Matthew S; Rekdahl, Melinda L; Garrigue, Claire; Constantine, Rochelle; Hauser, Nan Daeschler; Poole, M Michael; Robbins, Jooke; Noad, Michael J
2015-08-01
For cetaceans, population structure is traditionally determined by molecular genetics or photographically identified individuals. Acoustic data, however, has provided information on movement and population structure with less effort and cost than traditional methods in an array of taxa. Male humpback whales (Megaptera novaeangliae) produce a continually evolving vocal sexual display, or song, that is similar among all males in a population. The rapid cultural transmission (the transfer of information or behavior between conspecifics through social learning) of different versions of this display between distinct but interconnected populations in the western and central South Pacific region presents a unique way to investigate population structure based on the movement dynamics of a song (acoustic) display. Using 11 years of data, we investigated an acoustically based population structure for the region by comparing stereotyped song sequences among populations and years. We used the Levenshtein distance technique to group previously defined populations into (vocally based) clusters based on the overall similarity of their song display in space and time. We identified the following distinct vocal clusters: western cluster, 1 population off eastern Australia; central cluster, populations around New Caledonia, Tonga, and American Samoa; and eastern region, either a single cluster or 2 clusters, one around the Cook Islands and the other off French Polynesia. These results are consistent with the hypothesis that each breeding aggregation represents a distinct population (each occupied a single, terminal node) in a metapopulation, similar to the current understanding of population structure based on genetic and photo-identification studies. However, the central vocal cluster had higher levels of song-sharing among populations than the other clusters, indicating that levels of vocal connectivity varied within the region. 
Our results demonstrate the utility and value of using culturally transmitted vocal patterns as a way of defining connectivity to infer population structure. We suggest vocal patterns be incorporated by the International Whaling Commission in conjunction with traditional methods in the assessment of structure. © 2015, Society for Conservation Biology.
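The Levenshtein distance mentioned in the abstract above counts the minimum number of insertions, deletions, and substitutions needed to turn one sequence into another. The following is a minimal sketch of that idea applied to sequences of song units. The theme labels, the `similarity` normalization by the longer sequence length, and the function names are assumptions made for illustration; they are not taken from the study and may differ from the authors' implementation.

```python
def levenshtein(a, b):
    """Minimum number of insertions, deletions, and substitutions turning a into b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute x with y (or match)
        prev = curr
    return prev[-1]


def similarity(a, b):
    """Distance scaled to [0, 1], where 1 means identical (assumed normalization)."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


# Hypothetical theme sequences for two populations; not data from the study.
song_a = ["A", "B", "C", "D"]
song_b = ["A", "C", "D", "E"]
print(levenshtein(song_a, song_b), similarity(song_a, song_b))
```

A matrix of such pairwise similarities, computed across populations and years, is the kind of input a clustering step could then group into vocal clusters.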
Oral and vocal fold diadochokinesis in dysphonic women.
Louzada, Talita; Beraldinelle, Roberta; Berretin-Felix, Giédre; Brasolotto, Alcione Ghedini
2011-01-01
The evaluation of oral and vocal fold diadochokinesis (DDK) in individuals with voice disorders may contribute to the understanding of factors that affect balanced vocal production. Scientific studies that use this assessment tool advance knowledge in this area and support the development of more appropriate therapeutic planning. This study aimed to compare the results of oral and vocal fold DDK in dysphonic women and in women without vocal disorders. For this study, 28 voice recordings of women from 19 to 54 years old, diagnosed with dysphonia and assessed by a speech pathologist and an otorhinolaryngologist, were used. The control group included 30 nondysphonic women evaluated in prior research on normal adults. Analysis parameters such as the number and duration of emissions, as well as the regularity of repetition of the syllables "pa", "ta", "ka" and the vowels "a" and "i," were provided by the Advanced Motor Speech Profile program (MSP) Model-5141, version-2.5.2 (KayPentax). The DDK sequence "pataka" was analyzed quantitatively with the Sound Forge 7.0 program, as well as manually with the audio-visual help of sound waves. Average values of oral and vocal fold DDK in dysphonic and nondysphonic women were compared using Student's t-test, and differences were considered significant when p<0.05. The findings showed no significant differences between populations; however, the average coefficient of variation of period (CvP) and jitter of period (JittP) of the "ka," "a" and "i" emissions, respectively, were higher in dysphonic women (CvP=10.42%, 12.79%, 12.05%; JittP=2.05%, 6.05%, 3.63%) than in the control group (CvP=8.86%, 10.95%, 11.20%; JittP=1.82%, 2.98%, 3.15%).
Although the results do not indicate any difficulties in oral and laryngeal motor control in the dysphonic group, the greater instability in vocal fold DDK in the experimental group should be considered, and studies of this ability in individuals with communication disorders must be intensified.
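To make the two regularity measures in the abstract above concrete, here is a minimal sketch of how a coefficient of variation of period and a period jitter can be computed from a list of repetition periods. The formulas are common textbook definitions (sample standard deviation over mean, and mean absolute difference between consecutive periods over mean period); the abstract does not give the exact formulas used by the Motor Speech Profile program, so these are assumptions, and the period values are invented.

```python
from statistics import mean, stdev


def coefficient_of_variation(periods):
    """Sample standard deviation of the periods as a percentage of their mean."""
    return 100.0 * stdev(periods) / mean(periods)


def period_jitter(periods):
    """Mean absolute cycle-to-cycle period change as a percentage of the mean period."""
    diffs = [abs(later - earlier) for earlier, later in zip(periods, periods[1:])]
    return 100.0 * mean(diffs) / mean(periods)


# Hypothetical syllable repetition periods in milliseconds; not data from the study.
periods_ms = [200.0, 210.0, 190.0, 200.0]
cvp = coefficient_of_variation(periods_ms)
jitt = period_jitter(periods_ms)
print(f"CvP = {cvp:.2f}%, jitter = {jitt:.2f}%")
```

Both values rise as repetitions become less regular: the first reflects overall spread of the periods, while the second reflects change from one repetition to the next.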
Gender Differences in the Recognition of Vocal Emotions
Lausen, Adi; Schacht, Annekathrin
2018-01-01
The conflicting findings from the few studies conducted with regard to gender differences in the recognition of vocal expressions of emotion have left the exact nature of these differences unclear. Several investigators have argued that a comprehensive understanding of gender differences in vocal emotion recognition can only be achieved by replicating these studies while accounting for influential factors such as stimulus type, gender-balanced samples, number of encoders, decoders, and emotional categories. This study aimed to account for these factors by investigating whether emotion recognition from vocal expressions differs as a function of both listeners' and speakers' gender. A total of N = 290 participants were randomly and equally allocated to two groups. One group listened to words and pseudo-words, while the other group listened to sentences and affect bursts. Participants were asked to categorize the stimuli with respect to the expressed emotions in a fixed-choice response format. Overall, females were more accurate than males when decoding vocal emotions; however, when testing for specific emotions, these differences were small in magnitude. Speakers' gender had a significant impact on how listeners judged emotions from the voice. The group listening to words and pseudo-words had higher identification rates for emotions spoken by male than by female actors, whereas in the group listening to sentences and affect bursts the identification rates were higher when emotions were uttered by female than male actors. The mixed pattern for emotion-specific effects, however, indicates that, in the vocal channel, the reliability of emotion judgments is not systematically influenced by speakers' gender and the related stereotypes of emotional expressivity. Together, these results extend previous findings by showing effects of listeners' and speakers' gender on the recognition of vocal emotions.
They stress the importance of distinguishing these factors to explain recognition ability in the processing of emotional prosody. PMID:29922202
Mechanism of and Threshold Biomechanical Conditions for Falsetto Voice Onset
Deguchi, Shinji
2011-01-01
The sound source of a voice is produced by the self-excited oscillation of the vocal folds. In modal voice production, a drastic increase in transglottal pressure after vocal fold closure works as a driving force that develops self-excitation. Another type of vocal fold oscillation with less pronounced glottal closure observed in falsetto voice production has been accounted for by the mucosal wave theory. The classical theory assumes a quasi-steady flow, and the expected driving force onto the vocal folds under wavelike motion is derived from the Bernoulli effect. However, wavelike motion is not always observed during falsetto voice production. More importantly, the application of the quasi-steady assumption to a falsetto voice with a fundamental frequency of several hundred hertz is unsupported by experiments. These considerations suggested that the mechanism of falsetto voice onset may be essentially different from that explained by the mucosal wave theory. In this paper, an alternative mechanism is submitted that explains how self-excitation reminiscent of the falsetto voice could be produced independent of the glottal closure and wavelike motion. This new explanation is derived through analytical procedures by employing only general unsteady equations of motion for flow and solids. The analysis demonstrated that a convective acceleration of a flow induced by rapid wall movement functions as a negative damping force, leading to the self-excitation of the vocal folds. The critical subglottal pressure and volume flow are expressed as functions of vocal fold biomechanical properties, geometry, and voice fundamental frequency. The analytically derived conditions are qualitatively and quantitatively reasonable in view of reported measurement data of the thresholds required for falsetto voice onset. 
Understanding of the voice onset mechanism and the explicit mathematical descriptions of thresholds would be beneficial for the diagnosis and treatment of voice diseases and the development of artificial vocal folds. PMID:21408178
Vocal clans in sperm whales (Physeter macrocephalus).
Rendell, L E; Whitehead, H
2003-01-01
Cultural transmission may be a significant source of variation in the behaviour of whales and dolphins, especially as regards their vocal signals. We studied variation in the vocal output of 'codas' by sperm whale social groups. Codas are patterns of clicks used by female sperm whales in social circumstances. The coda repertoires of all known social units (n = 18, each consisting of about 11 females and immatures with long-term relationships) and 61 out of 64 groups (about two social units moving together for periods of days) that were recorded in the South Pacific and Caribbean between 1985 and 2000 can be reliably allocated into six acoustic 'clans', five in the Pacific and one in the Caribbean. Clans have ranges that span thousands of kilometres, are sympatric, contain many thousands of whales and most probably result from cultural transmission of vocal patterns. Units seem to form groups preferentially with other units of their own clan. We suggest that this is a rare example of sympatric cultural variation on an oceanic scale. Culture may thus be a more important determinant of sperm whale population structure than genes or geography, a finding that has major implications for our understanding of the species' behavioural and population biology. PMID:12614570
Systemic Hydration: Relating Science to Clinical Practice in Vocal Health
Hartley, Naomi A.; Thibeault, Susan L.
2014-01-01
Objectives: To examine the current state of the science regarding the role of systemic hydration in vocal function and health. Study Design: Literature review. Methods: Literature search spanning multiple disciplines, including speech-language pathology, nutrition and dietetics, medicine, sports and exercise science, physiology and biomechanics. Results: The relationship between hydration and physical function is an area of common interest amongst multiple professions. Each discipline provides valuable insight into the connection between performance and water balance, as well as complementary methods of investigation. Existing voice literature suggests a relationship between hydration and voice production; however, the underlying mechanisms are not yet defined and a treatment effect for systemic hydration remains to be demonstrated. Literature from other disciplines sheds light on methodological shortcomings and in some cases offers an alternative explanation for observed phenomena. Conclusions: A growing body of literature in the field of voice science is documenting a relationship between hydration and vocal function; however, greater understanding is required to guide best practice in the maintenance of vocal health and management of voice disorders. Integration of knowledge and technical expertise from multiple disciplines facilitates analysis of existing literature and provides guidance as to future research. PMID:24880674
Is killer whale dialect evolution random?
Filatova, Olga A; Burdin, Alexandr M; Hoyt, Erich
2013-10-01
The killer whale is among the few species in which cultural change accumulates over many generations, leading to cumulative cultural evolution. Killer whales have group-specific vocal repertoires which are thought to be learned rather than being genetically coded. It is supposed that divergence between vocal repertoires of sister groups increases gradually over time due to random learning mistakes and innovations. In this case, the similarity of calls across groups must be correlated with pod relatedness and, consequently, with each other. In this study we tested this prediction by comparing the patterns of call similarity between matrilines of resident killer whales from Eastern Kamchatka. We calculated the similarity of seven components from three call types across 14 matrilines. In contrast to the theoretical predictions, matrilines formed different clusters on the dendrograms made by different calls and even by different components of the same call. We suggest three possible explanations for this phenomenon. First, the lack of agreement between similarity patterns of different components may be the result of constraints in the call structure. Second, it is possible that call components change in time with different speed and/or in different directions. Third, horizontal cultural transmission of call features may occur between matrilines. Copyright © 2013 Elsevier B.V. All rights reserved.
Gwee, Chyi Yin; Christidis, Les; Eaton, James A; Norman, Janette A; Trainor, Colin R; Verbelen, Philippe; Rheindt, Frank E
2017-04-01
Known for their rich biodiversity and high level of endemism, the islands of Wallacea serve as natural laboratories for the study of spatio-temporal evolution and patterns of species diversification. Our study focuses on the owl genus Ninox, particularly the Southern Boobook (N. novaeseelandiae) and Moluccan Boobook (N. squamipila) complexes, which are widely distributed across Australasia. We conducted bioacoustic and multi-locus DNA analyses of 24 Ninox owl taxa to evaluate relationships and levels of divergence within the two complexes and ultimately assess the relationship between patterns of taxonomic differentiation and bioclimatic factors. We found that taxa that are vocally and/or genetically distinct from populations on the Australian mainland are found on islands that are significantly larger and higher in altitude than taxa that are vocally and/or genetically indistinct from populations on the Australian mainland. This pattern suggests that taxa occurring on small, low-lying Wallacean islands are likely to be recent colonisers that have dispersed from Australia. Overall, our observations demonstrate that the genus Ninox is likely to have colonised the Wallacean region multiple times as small, low-lying islands undergo frequent extinction, whereas populations on large and high-altitude islands are more resilient. Copyright © 2017 Elsevier Inc. All rights reserved.
Laryngeal evidence for the first and second passaggio in professionally trained sopranos
Burk, Fabian; Köberlein, Marie; Selamtzis, Andreas; Döllinger, Michael; Burdumy, Michael; Richter, Bernhard
2017-01-01
Introduction: Due to a lack of empirical data, the current understanding of the laryngeal mechanics in the passaggio regions (i.e., the fundamental frequency ranges where vocal registration events usually occur) of the female singing voice is still limited. Material and methods: In this study the first and second passaggio regions of 10 professionally trained female classical soprano singers were analyzed. The sopranos performed pitch glides from A3 (ƒo = 220 Hz) to A4 (ƒo = 440 Hz) and from A4 (ƒo = 440 Hz) to A5 (ƒo = 880 Hz) on the vowel [iː]. Vocal fold vibration was assessed with trans-nasal high speed videoendoscopy at 20,000 fps, complemented by simultaneous electroglottographic (EGG) and acoustic recordings. Register breaks were perceptually rated by 12 voice experts. Voice stability was documented with the EGG-based sample entropy. Glottal opening and closing patterns during the passaggi were analyzed, supplemented with open quotient data extracted from the glottal area waveform. Results: In both the first and the second passaggio, variations of vocal fold vibration patterns were found. Four distinct patterns emerged: smooth transitions with either increasing or decreasing durations of glottal closure, abrupt register transitions, and intermediate loss of vocal fold contact. Audible register transitions (in both the first and second passaggi) generally coincided with higher sample entropy values and higher open quotient variance through the respective passaggi. Conclusions: Noteworthy vocal fold oscillatory registration events occur in both the first and the second passaggio even in professional sopranos. The respective transitions are hypothesized to be caused by either (a) a change of laryngeal biomechanical properties; or by (b) vocal tract resonance effects, constituting level 2 source-filter interactions. PMID:28467509
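The abstract does not state how the EGG-based sample entropy was parameterized. As a minimal sketch, assuming the standard definition of sample entropy (negative log of the ratio of template matches of length m+1 to matches of length m, self-matches excluded), the code below shows why an irregular signal scores higher than a regular one. The embedding length `m`, the absolute tolerance `r`, and the test signals are illustrative choices, not values from the study.

```python
from math import log


def sample_entropy(signal, m=2, r=0.2):
    """Sample entropy with embedding length m and absolute tolerance r."""
    n = len(signal)

    def count_matches(length):
        # Use the same number of templates (n - m) for both lengths.
        templates = [signal[i:i + length] for i in range(n - m)]
        matches = 0
        for i in range(len(templates)):
            for j in range(i + 1, len(templates)):
                # Chebyshev distance between the two templates.
                dist = max(abs(a - b) for a, b in zip(templates[i], templates[j]))
                if dist <= r:
                    matches += 1
        return matches

    b = count_matches(m)
    a = count_matches(m + 1)
    if a == 0 or b == 0:
        return float("inf")  # undefined when no matches are found
    return -log(a / b)


# A perfectly periodic toy signal is fully predictable.
regular = [0.0, 1.0] * 20
print(sample_entropy(regular))
```

A strictly alternating signal yields zero entropy because every match of length m also matches at length m+1; less regular signals lose matches at the longer length and score higher.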
Experiments on Analysing Voice Production: Excised (Human, Animal) and In Vivo (Animal) Approaches
Döllinger, Michael; Kobler, James; Berry, David A.; Mehta, Daryush D.; Luegmair, Georg; Bohr, Christopher
2015-01-01
Experiments on human and on animal excised specimens as well as in vivo animal preparations are so far the most realistic approaches to simulate the in vivo process of human phonation. These experiments do not have the disadvantage of limited space within the neck and enable studies of the actual organ necessary for phonation, i.e., the larynx. The studies additionally allow the analysis of flow, vocal fold dynamics, and resulting acoustics in relation to well-defined laryngeal alterations. Purpose of Review: This paper provides an overview of the applications and usefulness of excised (human/animal) specimen and in vivo animal experiments in voice research. These experiments have enabled visualization and analysis of dehydration effects, vocal fold scarring, bifurcation and chaotic vibrations, three-dimensional vibrations, aerodynamic effects, and mucosal wave propagation along the medial surface. Quantitative data will be shown to give an overview of measured laryngeal parameter values. As yet, a full understanding of all existing interactions in voice production has not been achieved, and thus, where possible, we try to indicate areas needing further study. Recent Findings: A further motivation behind this review is to highlight recent findings and technologies related to the study of vocal fold dynamics and its applications. For example, studies of interactions between vocal tract airflow and generation of acoustics have recently shown that airflow superior to the glottis is governed not only by vocal fold dynamics but also by subglottal and supraglottal structures. In addition, promising new methods to investigate kinematics and dynamics have been reported recently, including dynamic optical coherence tomography, X-ray stroboscopy and three-dimensional reconstruction with laser projection systems. Finally, we touch on the relevance of vocal fold dynamics to clinical laryngology and to clinically-oriented research. PMID:26581597
Lamina propria of the human vocal fold: histomorphometric study of collagen fibers.
Prades, Jean-Michel; Dumollard, Jean Marc; Duband, Sébastien; Timoshenko, Andrei; Richard, Céline; Dubois, Marie Dominique; Martin, Christian; Peoc'h, Michel
2010-04-01
Since the seminal work of M. Hirano, which defined the three-layered lamina propria of the human vocal fold, there has been confusion in the labeling of each layer. Recent studies described the composition of fibers and interstitial molecules within the lamina propria leading to various biomechanical properties. However, collagen fibers appear to be the most important structural component. We used an optical analysis and the picrosirius-polarization method to describe collagen fibers from six adult and two fetal human larynges fixed in formalin and frontally sectioned in the middle part of the vocal fold. The deep layer of the lamina propria is the most densely organized band of collagen fibers penetrating the superficial muscle bundles of the vocal muscle. The mean thickness of this layer is about 36% of the lamina propria and shows a network of strongly birefringent fibers (collagen type I and III). The superficial layer of the lamina propria is a narrow band of collagen fibers immediately below the basement membrane of the epithelium. The mean thickness of this layer is about 13% of the lamina propria and shows strongly birefringent fibers. The intermediate layer is the least densely organized band, lying between the deep and superficial layers. The mean thickness of this layer is about 51% of the lamina propria and shows clear, green, weakly birefringent fibers characterized as collagen type III. The fetal lamina propria contains only a monolayer distribution of loose collagen fibers between the epithelium and the vocal muscle. These results help describe the distribution of collagen fibers within the lamina propria of the human vocal fold and have implications for understanding the cover-body theory of voice production in both the adult and the newborn.
Waaramaa, Teija
2015-10-01
The present study focused on the identification of emotions in cross-cultural conditions on different continents and among subjects with divergent language backgrounds. The aim was to investigate whether the perception of the basic emotions from nonsense vocal samples was universal, dependent on voice quality, musicality, and/or gender. Listening tests for 350 participants were conducted on location in a variety of cultures: China, Egypt, Estonia, Finland, Russia, Sweden, and the USA. The results suggested that the voice quality parameters played a role in the identification of emotions without the linguistic content. Cultural background may affect the interpretation of the emotions more than the presumed universality. Musical interest tended to facilitate emotion identification. No gender differences were found.
Chen, Wenli; Woo, Peak; Murry, Thomas
2017-09-01
High-speed videoendoscopy captures the cycle-to-cycle vibratory motion of each individual vocal fold in normal and severely disordered phonation. Therefore, it provides a direct method to examine the specific vibratory changes following vocal fold surgery. The purpose of this study was to examine the vocal fold vibratory pattern changes in the surgically treated pathologic vocal fold and the contralateral vocal fold in three vocal pathologies: vocal polyp (n = 3), paresis or paralysis (n = 3), and scar (n = 3). Digital kymography was used to extract high-speed kymographic vocal fold images at the mid-membranous region of the vocal fold. Spectral analysis was subsequently applied to the digital kymography to quantify the cycle-to-cycle movements of each vocal fold, expressed as a spectrum. Surgical modification resulted in significantly improved spectral power of the treated pathologic vocal fold. Furthermore, the contralateral vocal fold also presented with improved spectral power irrespective of vocal pathology. In comparison with normal vocal fold spectrum, postsurgical vocal fold vibrations continued to demonstrate decreased vibratory amplitude in both vocal folds. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Favaro, Livio; Gamba, Marco; Alfieri, Chiara; Pessani, Daniela; McElligott, Alan G
2015-11-25
The African penguin is a nesting seabird endemic to southern Africa. In penguins of the genus Spheniscus vocalisations are important for social recognition. However, it is not clear which acoustic features of calls can encode individual identity information. We recorded contact calls and ecstatic display songs of 12 adult birds from a captive colony. For each vocalisation, we measured 31 spectral and temporal acoustic parameters related to both source and filter components of calls. For each parameter, we calculated the Potential of Individual Coding (PIC). The acoustic parameters showing PIC ≥ 1.1 were used to perform a stepwise cross-validated discriminant function analysis (DFA). The DFA correctly classified 66.1% of the contact calls and 62.5% of display songs to the correct individual. The DFA also resulted in the further selection of 10 acoustic features for contact calls and 9 for display songs that were important for vocal individuality. Our results suggest that studying the anatomical constraints that influence nesting penguin vocalisations from a source-filter perspective, can lead to a much better understanding of the acoustic cues of individuality contained in their calls. This approach could be further extended to study and understand vocal communication in other bird species. PMID:26602001
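The abstract does not spell out how the Potential of Individual Coding is computed. A minimal sketch, assuming a definition commonly used in bioacoustics (the coefficient of variation of a parameter across all calls pooled, divided by the mean of the within-individual coefficients of variation), is given below. The function names and the call measurements are hypothetical, and published PIC calculations often add a small-sample correction that is omitted here.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Coefficient of variation in percent."""
    return 100.0 * stdev(values) / mean(values)


def potential_of_individual_coding(calls_by_individual):
    """PIC for one acoustic parameter (assumed: CV between / mean CV within)."""
    pooled = [v for calls in calls_by_individual.values() for v in calls]
    cv_between = coefficient_of_variation(pooled)
    cv_within = mean(
        [coefficient_of_variation(calls) for calls in calls_by_individual.values()]
    )
    return cv_between / cv_within


# Hypothetical fundamental-frequency measurements (Hz) for two birds.
f0_by_bird = {"bird_a": [100.0, 102.0, 98.0], "bird_b": [200.0, 204.0, 196.0]}
print(potential_of_individual_coding(f0_by_bird) > 1.1)
```

Under this definition, a value above 1 means the parameter varies more between individuals than within them, which is why a threshold such as PIC ≥ 1.1 can be used to select candidate features for the discriminant analysis.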
Tsantani, Maria S; Belin, Pascal; Paterson, Helena M; McAleer, Phil
2016-08-01
Vocal pitch has been found to influence judgments of perceived trustworthiness and dominance from a novel voice. However, the majority of findings arise from using only male voices and in context-specific scenarios. In two experiments, we first explore the influence of average vocal pitch on first-impression judgments of perceived trustworthiness and dominance, before establishing the existence of an overall preference for high or low pitch across genders. In Experiment 1, pairs of high- and low-pitched temporally reversed recordings of male and female vocal utterances were presented in a two-alternative forced-choice task. Results revealed a tendency to select the low-pitched voice over the high-pitched voice as more trustworthy, for both genders, and more dominant, for male voices only. Experiment 2 tested an overall preference for low-pitched voices, and whether judgments were modulated by speech content, using forward and reversed speech to manipulate context. Results revealed an overall preference for low pitch, irrespective of direction of speech, in male voices only. No such overall preference was found for female voices. We propose that an overall preference for low pitch is a default prior in male voices irrespective of context, whereas pitch preferences in female voices are more context- and situation-dependent. The present study confirms the important role of vocal pitch in the formation of first-impression personality judgments and advances understanding of the impact of context on pitch preferences across genders.
A coevolutionary arms race causes ecological speciation in crossbills.
Smith, Julie W; Benkman, Craig W
2007-04-01
We examined three ecological factors potentially causing premating reproductive isolation to determine whether divergent selection as a result of coevolution between South Hills crossbills (Loxia curvirostra complex) and Rocky Mountain lodgepole pine (Pinus contorta latifolia) promotes ecological speciation. One factor was habitat isolation arising because of enhanced seed defenses of lodgepole pine in the South Hills. This caused the crossbill call types (morphologically and vocally differentiated forms) adapted to alternative resources to be rare. Another occurred when crossbills of other call types moved into the South Hills late in the breeding season and feeding conditions were deteriorating so that relatively few non-South Hills crossbills bred ("immigrant infecundity"). Finally, among those crossbills that bred, pairing was strongly assortative by call type (behavioral isolation). Total reproductive isolation between South Hills crossbills and the two other crossbills most common in the South Hills (call types 2 and 5) summed to .9975 and .9998, respectively, on a scale of 0 (no reproductive isolation) to 1 (complete reproductive isolation). These extremely high levels of reproductive isolation indicate that the divergent selection resulting from the coevolutionary arms race between crossbills and lodgepole pine is causing the South Hills crossbill to speciate.
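The abstract reports totals for reproductive isolation but not the formula that combines the three barriers. One widely used approach, assumed here and not confirmed by the abstract, treats barriers as acting sequentially: each barrier can only remove gene flow that earlier barriers left intact, and the absolute contributions are then summed. The component strengths below are invented for illustration.

```python
def total_isolation(barrier_strengths):
    """Combine sequential barriers, each scored from 0 to 1.

    Returns the total isolation and the absolute contribution of each
    barrier (its strength applied to the gene flow still remaining).
    """
    total = 0.0
    contributions = []
    for strength in barrier_strengths:
        contribution = strength * (1.0 - total)
        contributions.append(contribution)
        total += contribution
    return total, contributions


# Hypothetical strengths for habitat isolation, immigrant infecundity,
# and behavioral isolation (not the values measured in the study).
total, parts = total_isolation([0.5, 0.5, 0.5])
print(total, parts)  # 0.875 [0.5, 0.25, 0.125]
```

Because each barrier acts only on what remains, several individually incomplete barriers can add up to a total very close to 1, as in the values reported for call types 2 and 5.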
A New Species of Frog (Anura: Dicroglossidae) Discovered from the Mega City of Dhaka
Howlader, Mohammad Sajid Ali; Nair, Abhilash; Merilä, Juha
2016-01-01
We describe a new species of frog of the genus Zakerana discovered from the urban core of Dhaka, Bangladesh, one of the most densely populated cities in the world. Although the new species is morphologically similar to the geographically proximate congeners in the Bangladeshi cricket frog group, we show that it can be distinguished from all congeners on the basis of morphological characters, advertisement calls and variation in two mitochondrial DNA genes (12S rRNA and 16S rRNA). Apart from several diagnostic differences in body proportions, the new species differs from other Zakerana species in having a flattened snout (from ventral view) projecting over the lower jaw, and diagnostic trapezoid-shaped red markings on the vocal sac in males. Molecular genetic analyses show that the new species is highly divergent (3.1–20.1% sequence divergence) from all congeneric species, and forms a well-supported clade with its sister species, Zakerana asmati. The discovery of a new amphibian species from the urban core of Dhaka together with several recent descriptions of new amphibian species from Bangladesh may indicate that more amphibian species remain to be discovered from this country. PMID:26934699
Artificially lengthened and constricted vocal tract in vocal training methods.
Bele, Irene Velsvik
2005-01-01
It is common practice in vocal training to make use of vocal exercise techniques that involve partial occlusion of the vocal tract. Various techniques are used; some of them form an occlusion within the front part of the oral cavity or at the lips. Another vocal exercise technique involves lengthening the vocal tract; for example, the method of phonation into small tubes. This essay presents some studies made on the effects of various vocal training methods that involve an artificially lengthened and constricted vocal tract. The influence of sufficient acoustic impedance on vocal fold vibration and economical voice production is presented.
Translating birdsong: songbirds as a model for basic and applied medical research.
Brainard, Michael S; Doupe, Allison J
2013-07-08
Songbirds, long of interest to basic neuroscientists, have great potential as a model system for translational neuroscience. Songbirds learn their complex vocal behavior in a manner that exemplifies general processes of perceptual and motor skill learning, and more specifically resembles human speech learning. Song is subserved by circuitry that is specialized for vocal learning and production, but that has strong similarities to mammalian brain pathways. The combination of a highly quantifiable behavior and discrete neural substrates facilitates understanding links between brain and behavior, both normally and in disease. Here we highlight 1) behavioral and mechanistic parallels between birdsong and aspects of speech and social communication, including insights into mirror neurons, the function of auditory feedback, and genes underlying social communication disorders, and 2) contributions of songbirds to understanding cortical-basal ganglia circuit function and dysfunction, including the possibility of harnessing adult neurogenesis for brain repair. PMID:23750515
Wetlands explain most in the genetic divergence pattern of Oncomelania hupensis.
Liang, Lu; Liu, Yang; Liao, Jishan; Gong, Peng
2014-10-01
Understanding the divergence patterns of hosts could shed light on the prediction of their parasite transmission. No effort has been devoted to understanding the drivers of the genetic divergence pattern of Oncomelania hupensis, the only intermediate host of Schistosoma japonicum. Based on a compilation of two O. hupensis gene datasets covering a wide geographic range in China and an array of geographical distance and environmental dissimilarity metrics built from earth observation data and ecological niche modeling, we conducted causal modeling analysis via simple and partial Mantel tests and local polynomial fitting to understand the interactions among isolation-by-distance, isolation-by-environment, and genetic divergence. We found that geography contributes more to genetic divergence than environmental isolation, and among all variables involved, wetland showed the strongest correlation with the genetic pairwise distances. These results suggested that in China, O. hupensis dispersal is strongly linked to the distribution of wetlands, and the current divergence pattern of both O. hupensis and schistosomiasis might be altered due to the changed wetland pattern with the completion of the Three Gorges Dam and the South-to-North water transfer project. Copyright © 2014 Elsevier B.V. All rights reserved.
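For readers unfamiliar with the Mantel test mentioned in this abstract, the following is a minimal sketch of the simple (not partial) version: it correlates the off-diagonal entries of two distance matrices and assesses significance by jointly permuting the rows and columns of one of them. The function names, the permutation count, and the toy matrices are illustrative assumptions and do not reproduce the authors' analysis.

```python
import random
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def upper_triangle(matrix, order):
    """Upper-triangle entries after reordering rows and columns by `order`."""
    n = len(order)
    return [matrix[order[i]][order[j]] for i in range(n) for j in range(i + 1, n)]


def mantel(dist_a, dist_b, permutations=999, seed=0):
    """Simple Mantel test: returns (correlation, one-sided p-value)."""
    rng = random.Random(seed)
    identity = list(range(len(dist_a)))
    flat_b = upper_triangle(dist_b, identity)
    observed = pearson(upper_triangle(dist_a, identity), flat_b)
    hits = 1  # the observed arrangement counts as one permutation
    for _ in range(permutations):
        order = identity[:]
        rng.shuffle(order)
        if pearson(upper_triangle(dist_a, order), flat_b) >= observed:
            hits += 1
    return observed, hits / (permutations + 1)


# Toy example: "genetic" distance is exactly twice the "geographic" distance.
sites = [0, 1, 3, 7, 12]
geo = [[abs(p - q) for q in sites] for p in sites]
gen = [[2 * abs(p - q) for q in sites] for p in sites]
print(mantel(geo, gen, permutations=99))
```

Permuting whole rows and columns together, rather than shuffling individual entries, preserves the dependence among distances that share a site, which is the reason an ordinary correlation test is not used on distance matrices.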
White, Stephanie A.
2009-01-01
Could a mutation in a single gene be the evolutionary lynchpin supporting the development of human language? A rare mutation in the molecule known as FOXP2 discovered in a human family seemed to suggest so, and its sequence phylogeny reinforced a Chomskian view that language emerged wholesale in humans. Spurred by this discovery, research in primates, rodents and birds suggests that FoxP2 and other language-related genes are interactors in the neuromolecular networks that underlie subsystems of language, such as symbolic understanding, vocal learning and theory of mind. The whole picture will only come together through comparative and integrative study into how the human language singularity evolved. PMID:19913899
The siren song of vocal fundamental frequency for romantic relationships.
Weusthoff, Sarah; Baucom, Brian R; Hahlweg, Kurt
2013-01-01
A multitude of factors contribute to why and how romantic relationships are formed as well as whether they ultimately succeed or fail. Drawing on evolutionary models of attraction and speech production as well as integrative models of relationship functioning, this review argues that paralinguistic cues (more specifically the fundamental frequency of the voice) that are initially a strong source of attraction also increase couples' risk for relationship failure. Conceptual similarities and differences between the multiple operationalizations and interpretations of vocal fundamental frequency are discussed and guidelines are presented for understanding both convergent and non-convergent findings. Implications for clinical practice and future research are discussed.
Warren, Victoria E; Marques, Tiago A; Harris, Danielle; Thomas, Len; Tyack, Peter L; Aguilar de Soto, Natacha; Hickmott, Leigh S; Johnson, Mark P
2017-03-01
Passive acoustic monitoring has become an increasingly prevalent tool for estimating density of marine mammals, such as beaked whales, which vocalize often but are difficult to survey visually. Counts of acoustic cues (e.g., vocalizations), when corrected for detection probability, can be translated into animal density estimates by applying an individual cue production rate multiplier. It is essential to understand variation in these rates to avoid biased estimates. The most direct way to measure cue production rate is with animal-mounted acoustic recorders. This study utilized data from sound recording tags deployed on Blainville's (Mesoplodon densirostris, 19 deployments) and Cuvier's (Ziphius cavirostris, 16 deployments) beaked whales, in two locations per species, to explore spatial and temporal variation in click production rates. No spatial or temporal variation was detected within the average click production rate of Blainville's beaked whales when calculated over dive cycles (including silent periods between dives); however, spatial variation was detected when averaged only over vocal periods. Cuvier's beaked whales exhibited significant spatial and temporal variation in click production rates within vocal periods and when silent periods were included. This evidence of variation emphasizes the need to utilize appropriate cue production rates when estimating density from passive acoustic data.
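The conversion from cue counts to animal density described in this abstract can be sketched as a short calculation. The formula below follows the general cue-counting form (detected cues, corrected for false positives and detection probability, divided by monitored area, monitoring time, and cue production rate). The function and parameter names are hypothetical, and any numbers passed in would be illustrative rather than taken from the study.

```python
import math

def cue_rate(n_cues, duration_s):
    """Average cue (click) production rate in cues per second.

    The choice of `duration_s` matters: dividing by vocal time only gives a
    higher rate than dividing by the full dive cycle (silent periods included).
    """
    return n_cues / duration_s

def density_from_cues(n_detected, false_positive_prop, n_sensors,
                      radius_km, p_detect, duration_s, rate_per_s):
    """Animals per square km from a cue count (cue-counting sketch).

    n_detected          -- cues counted on all sensors
    false_positive_prop -- proportion of detections that are false positives
    n_sensors           -- number of recorders
    radius_km           -- radius beyond which cues are assumed undetectable
    p_detect            -- mean detection probability within that radius
    duration_s          -- monitoring time per sensor, in seconds
    rate_per_s          -- cue production rate per animal, cues per second
    """
    monitored_area = n_sensors * math.pi * radius_km ** 2
    true_cues = n_detected * (1.0 - false_positive_prop)
    return true_cues / (monitored_area * p_detect * duration_s * rate_per_s)
```

Because the rate sits in the denominator, a rate computed over vocal periods only yields a lower density than a rate computed over whole dive cycles for the same cue count, so the rate should be matched to how the monitoring time is defined.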
Larrouy-Maestri, Pauline; Magis, David; Morsomme, Dominique
2014-05-01
The operatic singing technique is frequently used in classical music. Several acoustical parameters of this specific technique have been studied but how these parameters combine remains unclear. This study aims to further characterize the Western operatic singing technique by observing the effects of melody and technique on acoustical and musical parameters of the singing voice. Fifty professional singers performed two contrasting melodies (popular song and romantic melody) with two vocal techniques (with and without operatic singing technique). The common quality parameters (energy distribution, vibrato rate, and extent), perturbation parameters (standard deviation of the fundamental frequency, signal-to-noise ratio, jitter, and shimmer), and musical features (fundamental frequency of the starting note, average tempo, and sound pressure level) of the 200 sung performances were analyzed. The results regarding the effect of melody and technique on the acoustical and musical parameters show that the choice of melody had a limited impact on the parameters observed, whereas a particular vocal profile appeared depending on the vocal technique used. This study confirms that vocal technique affects most of the parameters examined. In addition, the observation of quality, perturbation, and musical parameters contributes to a better understanding of the Western operatic singing technique. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Acoustic characteristics of simulated respiratory-induced vocal tremor.
Lester, Rosemary A; Story, Brad H
2013-05-01
The purpose of this study was to investigate the relation of respiratory forced oscillation to the acoustic characteristics of vocal tremor. Acoustical analyses were performed to determine the characteristics of the intensity and fundamental frequency (F0) for speech samples obtained by Farinella, Hixon, Hoit, Story, and Jones (2006) using a respiratory forced oscillation paradigm with 5 healthy adult males to simulate vocal tremor involving respiratory pressure modulation. The analyzed conditions were sustained productions of /a/ with amplitudes of applied pressure of 0, 1, 2, and 4 cmH2O and a rate of 5 Hz. Forced oscillation of the respiratory system produced modulation of the intensity and F0 for all participants. Variability was observed between participants and conditions in the change in intensity and F0 per unit of pressure change, as well as in the mean intensity and F0. However, the extent of modulation of intensity and F0 generally increased as the applied pressure increased, as would be expected. These findings suggest that individuals develop idiosyncratic adaptations to pressure modulations, which are important to understanding aspects of variability in vocal tremor, and highlight the need to assess all components of the speech mechanism that may be directly or indirectly affected by tremor.
A new measure of child vocal reciprocity in children with autism spectrum disorder.
Harbison, Amy L; Woynaroski, Tiffany G; Tapp, Jon; Wade, Joshua W; Warlaumont, Anne S; Yoder, Paul J
2018-06-01
Children's vocal development occurs in the context of reciprocal exchanges with a communication partner who models "speechlike" productions. We propose a new measure of child vocal reciprocity, which we define as the degree to which an adult vocal response increases the probability of an immediately following child vocal response. Vocal reciprocity is likely to be associated with the speechlikeness of vocal communication in young children with autism spectrum disorder (ASD). Two studies were conducted to test the utility of the new measure. The first used simulated vocal samples with randomly sequenced child and adult vocalizations to test the accuracy of the proposed index of child vocal reciprocity. The second was an empirical study of 21 children with ASD who were preverbal or in the early stages of language development. Daylong vocal samples collected in the natural environment were computer analyzed to derive the proposed index of child vocal reciprocity, which was highly stable when derived from two daylong vocal samples and was associated with speechlikeness of vocal communication. This association was significant even when controlling for chance probability of child vocalizations to adult vocal responses, probability of adult vocalizations, or probability of child vocalizations. A valid measure of children's vocal reciprocity might eventually improve our ability to predict which children are on track to develop useful speech and/or are most likely to respond to language intervention. A link to a free, publicly-available software program to derive the new measure of child vocal reciprocity is provided. Autism Res 2018, 11: 903-915. © 2018 International Society for Autism Research, Wiley Periodicals, Inc. Children and adults often engage in back-and-forth vocal exchanges. The extent to which they do so is believed to support children's early speech and language development. 
Two studies tested a new measure of child vocal reciprocity using computer-generated and real-life vocal samples of young children with autism collected in natural settings. The results provide initial evidence of accuracy, test-retest reliability, and validity of the new measure of child vocal reciprocity. A sound measure of children's vocal reciprocity might improve our ability to predict which children are on track to develop useful speech and/or are most likely to respond to language intervention. A free, publicly-available software program and manuals are provided. © 2018 International Society for Autism Research, Wiley Periodicals, Inc.
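The definition of vocal reciprocity given above (the degree to which an adult vocal response increases the probability of an immediately following child vocal response) can be illustrated with a simplified contingency calculation. This is not the published index or the authors' software. The event codes ("C" for child vocalization, "A" for adult vocal response, anything else for other events) and the difference-of-probabilities form are assumptions made for illustration.

```python
def reciprocity_sketch(events):
    """Difference in probability of a child vocalization after an adult
    response versus after any other event.

    `events` is a time-ordered sequence of codes: "C" (child vocalization),
    "A" (adult vocal response), or any other code for other events.
    A value near 0 means the adult response does not change the chance of
    an immediately following child vocalization.
    """
    pairs = list(zip(events, events[1:]))
    after_adult = [nxt for cur, nxt in pairs if cur == "A"]
    after_other = [nxt for cur, nxt in pairs if cur != "A"]
    if not after_adult or not after_other:
        raise ValueError("need at least one adult and one non-adult antecedent")
    p_after_adult = after_adult.count("C") / len(after_adult)
    p_after_other = after_other.count("C") / len(after_other)
    return p_after_adult - p_after_other
```

A randomly sequenced sample, like the simulated samples used in the first study, should give a value close to zero under this sketch, which is one way to check that such an index does not simply reflect how often each party vocalizes.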
Perszyk, Danielle R; Waxman, Sandra R
2016-08-01
Well before they understand their first words, infants have begun to link language and cognition. This link is initially broad: At 3 months, listening to both human and nonhuman primate vocalizations supports infants' object categorization, a building block of cognition. But by 6 months, the link has narrowed: Only human vocalizations support categorization. What mechanisms underlie this rapid tuning process? Here, we document the crucial role of infants' experience as infants tune this link to cognition. Merely exposing infants to nonhuman primate vocalizations permits them to preserve, rather than sever, the link between these signals and categorization. Exposing infants to backward speech, a signal that fails to support categorization in the first year of life, does not have this advantage. This new evidence illuminates the central role of early experience as infants specify which signals, from an initially broad set, they will continue to link to core cognitive capacities. Copyright © 2016 Elsevier B.V. All rights reserved.
The perception of self in birds.
Derégnaucourt, Sébastien; Bovet, Dalila
2016-10-01
The perception of self is an important topic in several disciplines such as ethology, behavioral ecology, psychology, developmental and cognitive neuroscience. Self-perception is investigated by experimentally exposing different species of animals to self-stimuli such as their own image, smell or vocalizations. Here we review more than one hundred studies using these methods in birds, a taxonomic group that exhibits a rich diversity regarding ecology and behavior. Exposure to self-image is the main method for studying self-recognition, while exposing birds to their own smell is generally used for the investigation of homing or odor-based kin discrimination. Self-produced vocalizations - especially in oscine songbirds - are used as stimuli for understanding the mechanisms of vocal coding/decoding both at the neural and at the behavioral levels. With this review, we highlight the necessity to study the perception of self in animals cross-modally and to consider the role of experience and development, aspects that can be easily monitored in captive populations of birds. Copyright © 2016 Elsevier Ltd. All rights reserved.
Medial surface dynamics of an in vivo canine vocal fold during phonation
NASA Astrophysics Data System (ADS)
Döllinger, Michael; Berry, David A.; Berke, Gerald S.
2005-05-01
Quantitative measurement of the medial surface dynamics of the vocal folds is important for understanding how sound is generated within the larynx. Building upon previous excised hemilarynx studies, the present study extended the hemilarynx methodology to the in vivo canine larynx. Through use of an in vivo model, the medial surface dynamics of the vocal fold were examined as a function of active thyroarytenoid muscle contraction. Data were collected using high-speed digital imaging at a sampling frequency of 2000 Hz, and a spatial resolution of 1024×1024 pixels. Chest-like and fry-like vibrations were observed, but could not be distinguished based on the input stimulation current to the recurrent laryngeal nerve. The subglottal pressure did distinguish the registers, as did an estimate of the thyroarytenoid muscle activity. Upon quantification of the three-dimensional motion, the method of Empirical Eigenfunctions was used to extract the underlying modes of vibration, and to investigate mechanisms of sustained oscillation. Results were compared with previous findings from excised larynx experiments and theoretical models.
Neuromuscular control of fundamental frequency and glottal posture at phonation onset
Chhetri, Dinesh K.; Neubauer, Juergen; Berry, David A.
2012-01-01
The laryngeal neuromuscular mechanisms for modulating glottal posture and fundamental frequency are of interest in understanding normal laryngeal physiology and treating vocal pathology. The intrinsic laryngeal muscles in an in vivo canine model were electrically activated in a graded fashion to investigate their effects on onset frequency, phonation onset pressure, vocal fold strain, and glottal distance at the vocal processes. Muscle activation plots for these laryngeal parameters were evaluated for the interaction of the following pairs of muscle activation conditions: (1) cricothyroid (CT) versus all laryngeal adductors (TA/LCA/IA), (2) CT versus LCA/IA, (3) CT versus thyroarytenoid (TA), and (4) TA versus LCA/IA (LCA: lateral cricoarytenoid muscle, IA: interarytenoid). Increases in onset frequency and strain were primarily affected by CT activation. Onset pressure correlated with activation of all adductors in activation condition 1, but primarily with CT activation in conditions 2 and 3. TA and CT were antagonistic for strain. LCA/IA activation primarily closed the cartilaginous glottis while TA activation closed the mid-membranous glottis. PMID:22352513
Lerman, Dorothea C; Parten, Mandy; Addison, Laura R; Vorndran, Christina M; Volkert, Valerie M; Kodak, Tiffany
2005-01-01
An approach based on Skinner's (1957) theory of verbal behavior has been developed to understand and teach elementary communication skills to children with autism and developmental disabilities (Sundberg & Partington, 1998). However, few studies have directly examined the characteristics of emerging language in children with developmental disabilities. The purpose of this study was to develop and evaluate an assessment for identifying the elementary functions of vocal speech in children. Participants were 4 children with developmental disabilities, aged 6 years to 12 years, who exhibited at least one distinguishable vocal response (word or phrase) frequently in the natural environment. The assessment focused on three verbal operants delineated by Skinner (mand, tact, and intraverbal). One or more functions were identified for at least one vocal response of each child. Results suggested that this assessment would be useful for (a) evaluating Skinner's theory, (b) guiding decisions about language training for individual children, and (c) studying the nature of expressive language development in children with developmental disabilities. PMID:16270841
Paradoxical vocal fold motion in children and adolescents.
Sandage, Mary J; Zelazny, Sherri K
2004-10-01
Paradoxical vocal fold motion (PVFM) is a complex adductory disorder of the vocal folds that frequently is mistaken for asthma. PVFM typically requires behavioral intervention by a trained speech-language pathologist for complete resolution of the symptoms. Once thought to be limited to adults, PVFM has been increasingly documented and successfully treated in the child and adolescent population. Understanding PVFM requires thorough knowledge of the differential diagnoses, the clinical features of PVFM, the differentiation of PVFM from asthma, the medical professionals involved in the diagnosis and treatment, and the behavioral interventions that are commonly prescribed. Teachers, school nurses, and coaches may be the first professionals to see the symptoms in children and assume that they have asthma. Successful referral, diagnosis, and behavioral treatment require a team of individuals in the child's community, including the school speech-language pathologist, who can work together to ensure identification and resolution of the symptoms. This article discusses etiologies, differential diagnosis, referral, medical management, evaluation, and behavioral treatment of the child or adolescent with PVFM.
Understanding the intentional acoustic behavior of humpback whales: a production-based approach.
Cazau, Dorian; Adam, Olivier; Laitman, Jeffrey T; Reidenberg, Joy S
2013-09-01
Following a production-based approach, this paper deals with the acoustic behavior of humpback whales. This approach investigates various physical factors, which are either internal (e.g., physiological mechanisms) or external (e.g., environmental constraints) to the respiratory tractus of the whale, for their implications in sound production. This paper aims to describe a functional scenario of this tractus for the generation of vocal sounds. To do so, a division of this tractus into three different configurations is proposed, based on the air recirculation process which determines air sources and laryngeal valves. Then, assuming a vocal function (in sound generation or modification) for several specific anatomical components, an acoustic characterization of each of these configurations is proposed to link different spectral features, namely, fundamental frequencies and formant structures, to specific vocal production mechanisms. A discussion around the question of whether the whale is able to fully exploit the acoustic potential of its respiratory tractus is eventually provided.
Bhatt, Y M; Hans, P S; Belloso, A
2010-05-01
Vagus nerve stimulators are devices used in the management of patients with drug-refractory epilepsy unsuitable for resective or disconnective surgery. Implanted usually by neurosurgeons, these devices are infrequently encountered by otolaryngologists. Despite significant anti-seizure efficacy, side effects related to laryngopharyngeal stimulation are not uncommon. A 28-year-old man with a history of effective vagus nerve stimulator use presented with a cluster of seizures and respiratory distress associated with intermittent stridor. The duration of stridor corresponded to the period of vagus nerve stimulation. Endoscopy revealed forced adduction of the left vocal fold against a medialised right vocal fold. The device was switched off and the stridor immediately resolved. Airway compromise is an under-recognised side effect of vagus nerve stimulation. We describe the first known case of stridor and contralateral vocal fold palsy in a vagus nerve stimulator user. We highlight the need for better understanding amongst otolaryngologists of the laryngopharyngeal side effects of this technology.
Broaddus-Lawrence, P L; Treole, K; McCabe, R B; Allen, R L; Toppin, L
2000-03-01
The purpose of the present study was to determine the effects of vocal hygiene education on the vocal hygiene behaviors and perceptual vocal characteristics of untrained singers. Eleven adult untrained singers served as subjects. They attended four 1-hour class sessions on vocal hygiene, including anatomy and physiology of the phonatory mechanism, vocally abusive behaviors, voice disorders commonly seen in singers, and measures to prevent voice disorders. Pre- and postinstruction surveys were used to record subjects' vocal abuses and their perceptions of their speaking and singing voice. They also rated their perceived value of vocal hygiene education. Results revealed minimal changes in vocal hygiene behaviors and perceptual voice characteristics. The subjects did report a high degree of benefit and learning, however.
Acoustic Analysis and Electroglottography in Elite Vocal Performers.
Villafuerte-Gonzalez, Rocio; Valadez-Jimenez, Victor M; Sierra-Ramirez, Jose A; Ysunza, Pablo Antonio; Chavarria-Villafuerte, Karen; Hernandez-Lopez, Xochiquetzal
2017-05-01
Acoustic analysis of voice (AAV) and electroglottography (EGG) have been used for assessing vocal quality in patients with voice disorders. The effectiveness of these procedures for detecting mild disturbances in vocal quality in elite vocal performers has been controversial. To compare acoustic parameters obtained by AAV and EGG before and after vocal training to determine the effectiveness of these procedures for detecting vocal improvements in elite vocal performers. Thirty-three elite vocal performers were studied. The study group included 14 males and 19 females, ages 18-40 years, without a history of voice disorders. Acoustic parameters were obtained through AAV and EGG before and after vocal training using the Linklater method. Nonsignificant differences (P > 0.05) were found between values of fundamental frequency (F0), shimmer, and jitter obtained by both procedures before vocal training. Mean F0 was similar after vocal training. Jitter percentage as measured by AAV showed nonsignificant differences (P > 0.05) before and after vocal training. Shimmer percentage as measured by AAV demonstrated a significant reduction (P < 0.05) after vocal training. As measured by EGG after vocal training, shimmer and jitter were significantly reduced (P < 0.05); open quotient was significantly increased (P < 0.05); and irregularity was significantly reduced (P < 0.05). AAV and EGG were effective for detecting improvements in vocal function after vocal training in male and female elite vocal performers undergoing vocal training. EGG demonstrated better efficacy for detecting improvements and provided additional parameters as compared to AAV. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Oral and vocal fold diadochokinesis in dysphonic women
LOUZADA, Talita; BERALDINELLE, Roberta; BERRETIN-FELIX, Giédre; BRASOLOTTO, Alcione Ghedini
2011-01-01
The evaluation of oral and vocal fold diadochokinesis (DDK) in individuals with voice disorders may contribute to the understanding of factors that affect balanced vocal production. Scientific studies that make use of this assessment tool support the advancement of knowledge in this area, reflecting the development of more appropriate therapeutic planning. Objective: To compare the results of oral and vocal fold DDK in dysphonic women and in women without vocal disorders. Material and methods: For this study, 28 voice recordings of women from 19 to 54 years old, diagnosed with dysphonia and submitted to a voice assessment by a speech pathologist and an otorhinolaryngologist, were used. The control group included 30 nondysphonic women evaluated in prior research on normal adults. The analysis parameters, such as the number and duration of emissions, as well as the regularity of the repetition of the syllables "pa", "ta", "ka" and the vowels "a" and "i", were provided by the Advanced Motor Speech Profile program (MSP) Model-5141, version-2.5.2 (KayPentax). The DDK sequence "pataka" was analyzed quantitatively through the Sound Forge 7.0 program, as well as manually with the audio-visual help of sound waves. Average values of oral and vocal fold DDK in dysphonic and nondysphonic women were compared using Student's t test and were considered significant when p<0.05. Results: The findings showed no significant differences between populations; however, the coefficient of variation of period (CvP) and jitter of period (JittP) average of the "ka," "a" and "i" emissions, respectively, were higher in dysphonic women (CvP=10.42%, 12.79%, 12.05%; JittP=2.05%, 6.05%, 3.63%) compared to the control group (CvP=8.86%, 10.95%, 11.20%; JittP=1.82%, 2.98%, 3.15%). 
Conclusion: Although the results do not indicate any difficulties in oral and laryngeal motor control in the dysphonic group, the greater instability in vocal fold DDK in the experimental group should be considered, and studies of this ability in individuals with communication disorders must be intensified. PMID:22230989
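The two regularity measures reported here, the coefficient of variation of period (CvP) and the jitter of period (JittP), can be sketched from a list of repetition periods. The exact definitions used by the Motor Speech Profile program are not given in the abstract, so the sketch assumes common textbook forms: CvP as the population standard deviation divided by the mean, and JittP as the mean absolute difference between consecutive periods divided by the mean, both expressed as percentages. The function names are hypothetical.

```python
import statistics

def coefficient_of_variation_pct(periods):
    """Population standard deviation of the periods as a percentage of the mean."""
    return 100.0 * statistics.pstdev(periods) / statistics.fmean(periods)

def local_jitter_pct(periods):
    """Mean absolute difference between consecutive periods, as a percentage
    of the mean period (cycle-to-cycle variability)."""
    diffs = [abs(b - a) for a, b in zip(periods, periods[1:])]
    return 100.0 * statistics.fmean(diffs) / statistics.fmean(periods)
```

For example, with illustrative periods of 0.20, 0.22, 0.18 and 0.20 s, these definitions give a CvP of about 7.07% and a jitter of about 13.33%. The first measure reflects overall spread, while the second is sensitive to the order of the periods.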
Can Birds Perceive Rhythmic Patterns? A Review and Experiments on a Songbird and a Parrot Species
ten Cate, Carel; Spierings, Michelle; Hubert, Jeroen; Honing, Henkjan
2016-01-01
While humans can easily entrain their behavior with the beat in music, this ability is rare among animals. Yet, comparative studies in non-human species are needed if we want to understand how and why this ability evolved. Entrainment requires two abilities: (1) recognizing the regularity in the auditory stimulus and (2) the ability to adjust one's own motor output to the perceived pattern. It has been suggested that beat perception and entrainment are linked to the ability for vocal learning. The presence of some bird species showing beat induction, and also the existence of vocal learning as well as vocal non-learning bird taxa, make them relevant models for comparative research on rhythm perception and its link to vocal learning. Also, some bird vocalizations show strong regularity in rhythmic structure, suggesting that birds might perceive rhythmic structures. In this paper we review the available experimental evidence for the perception of regularity and rhythms by birds, like the ability to distinguish regular from irregular stimuli over tempo transformations and report data from new experiments. While some species show a limited ability to detect regularity, most evidence suggests that birds attend primarily to absolute and not relative timing of patterns and to local features of stimuli. We conclude that, apart from some large parrot species, there is limited evidence for beat and regularity perception among birds and that the link to vocal learning is unclear. We next report the new experiments in which zebra finches and budgerigars (both vocal learners) were first trained to distinguish a regular from an irregular pattern of beats and then tested on various tempo transformations of these stimuli. The results showed that both species reduced the discrimination after tempo transformations. This suggests that, as was found in earlier studies, they attended mainly to local temporal features of the stimuli, and not to their overall regularity. 
However, some individuals of both species showed an additional sensitivity to the more global pattern if some local features were left unchanged. Altogether our study indicates both between and within species variation, in which birds attend to a mixture of local and to global rhythmic features. PMID:27242635
Fluid-Structure Interactions with Flexible and Rigid Bodies
NASA Astrophysics Data System (ADS)
Daily, David Jesse
Fluid structure interactions occur to some extent in nearly every type of fluid flow. Understanding how structures interact with fluids and vice versa is of vital importance in many engineering applications. The purpose of this research is to explore how fluids interact with flexible and rigid structures. A computational model was used to model the fluid structure interactions of vibrating synthetic vocal folds. The model simulated the coupling of the fluid and solid domains using a fluid-structure interface boundary condition. The fluid domain used a slightly compressible flow solver to allow for the possibility of acoustic coupling with the subglottal geometry and vibration of the vocal fold model. As the subglottis lengthened, the frequency of vibration decreased until a new acoustic mode could form in the subglottis. Synthetic aperture particle image velocimetry (SAPIV) is a three-dimensional particle tracking technique. SAPIV was used to image the jet of air that emerges from vibrating human vocal folds (glottal jet) during phonation. The three-dimensional reconstruction of the glottal jet found faint evidence of flow characteristics seen in previous research, such as axis-switching, but did not have sufficient resolution to detect small features. SAPIV was further applied to reconstruct the smaller flow characteristics of the glottal jet of vibrating synthetic vocal folds. Two- and four-layer synthetic vocal fold models were used to determine how the glottal jet from the synthetic models compared to the glottal jet from excised human vocal folds. The two- and four-layer models clearly exhibited axis-switching which has been seen in other 3D analyses of the glottal jet. Cavitation in a quiescent fluid can break a rigid structure such as a glass bottle. A new cavitation number was derived to include acceleration and pressure head at cavitation onset. 
A cavitation stick was used to validate the cavitation number by filling it with different depths and hitting the stick to cause fluid cavitation. Acceleration was measured using an accelerometer and cavitation bubbles were detected using a high-speed camera. Cavitation in an accelerating fluid occurred at a cavitation number of 1. Keywords: Fluid structure interaction, vocal folds, acoustics, SAPIV, cavitation, slightly compressible
Exploring Attitudes of Indian Classical Singers Toward Seeking Vocal Health Care.
Gunjawate, Dhanshree R; Aithal, Venkataraja U; Guddattu, Vasudeva; Kishore, Amrutha; Bellur, Rajashekhar
2016-11-01
The attitude of Indian classical singers toward seeking vocal health care is a dimension yet to be explored. The current study aimed to determine the attitudes of these singers toward seeking vocal health care and further understand the influence of age and gender. Cross-sectional. A 10-item self-report questionnaire adapted from a study on contemporary commercial music singers was used. An additional question was added to ask if the singer was aware of the profession and role of speech-language pathologists (SLPs). The questionnaire was administered to 55 randomly selected self-identified trained Indian classical singers who rated the items using a five-point Likert scale. Demographic variables were summarized using descriptive statistics and the t test was used to compare the mean scores between genders and age groups. Of the singers, 78.2% were likely to see a doctor for health-related problems, whereas 81.8% were unlikely to seek medical care for voice-related problems; the difference was statistically significant (P < 0.001). Responses for the questions assessing the attitudes toward findings from medical examination by a specialist revealed a statistically significant difference (P = 0.02) between the genders. Age did not have a significant influence on the responses. Only 23.6% of the respondents were aware of the profession and the role of SLPs. The findings are in tune with western literature reporting hesitation of singers toward seeking vocal health care and draw the attention of SLPs to promoting their role in vocal health awareness and management. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Differential Expression of Glutamate Receptors in Avian Neural Pathways for Learned Vocalization
WADA, KAZUHIRO; SAKAGUCHI, HIRONOBU; JARVIS, ERICH D.; HAGIWARA, MASATOSHI
2008-01-01
Learned vocalization, the substrate for human language, is a rare trait. It is found in three distantly related groups of birds: parrots, hummingbirds, and songbirds. These three groups contain cerebral vocal nuclei for learned vocalization not found in their more closely related vocal nonlearning relatives. Here, we cloned 21 receptor subunits/subtypes of all four glutamate receptor families (AMPA, kainate, NMDA, and metabotropic) and examined their expression in vocal nuclei of songbirds. We also examined expression of a subset of these receptors in vocal nuclei of hummingbirds and parrots, as well as in the brains of dove species as examples of close vocal nonlearning relatives. Among the 21 subunits/subtypes, 19 showed higher and/or lower prominent differential expression in songbird vocal nuclei relative to the surrounding brain subdivisions in which the vocal nuclei are located. This included relatively lower levels of all four AMPA subunits in lMAN, strikingly higher levels of the kainate subunit GluR5 in the robust nucleus of the arcopallium (RA), higher and lower levels respectively of the NMDA subunits NR2A and NR2B in most vocal nuclei and lower levels of the metabotropic group I subtypes (mGluR1 and -5) in most vocal nuclei and the group II subtype (mGluR2), showing a unique expression pattern of very low levels in RA and very high levels in HVC. The splice variants of AMPA subunits showed further differential expression in vocal nuclei. Some of the receptor subunits/subtypes also showed differential expression in hummingbird and parrot vocal nuclei. The magnitude of differential expression in vocal nuclei of all three vocal learners was unique compared with the smaller magnitude of differences found for nonvocal areas of vocal learners and vocal nonlearners. Our results suggest that evolution of vocal learning was accompanied by differential expression of a conserved gene family for synaptic transmission and plasticity in vocal nuclei.
They also suggest that neural activity and signal transduction in vocal nuclei of vocal learners will be different relative to the surrounding brain areas. PMID:15236466
Neural Correlates of the Lombard Effect in Primate Auditory Cortex
Eliades, Steven J.
2012-01-01
Speaking is a sensory-motor process that involves constant self-monitoring to ensure accurate vocal production. Self-monitoring of vocal feedback allows rapid adjustment to correct perceived differences between intended and produced vocalizations. One important behavior in vocal feedback control is a compensatory increase in vocal intensity in response to noise masking during vocal production, commonly referred to as the Lombard effect. This behavior requires mechanisms for continuously monitoring auditory feedback during speaking. However, the underlying neural mechanisms are poorly understood. Here we show that when marmoset monkeys vocalize in the presence of masking noise that disrupts vocal feedback, the compensatory increase in vocal intensity is accompanied by a shift in auditory cortex activity toward neural response patterns seen during vocalizations under normal feedback conditions. Furthermore, we show that neural activity in auditory cortex during a vocalization phrase predicts vocal intensity compensation in subsequent phrases. These observations demonstrate that the auditory cortex participates in self-monitoring during the Lombard effect, and may play a role in the compensation of noise masking during feedback-mediated vocal control. PMID:22855821
The perceptual features of vocal fatigue as self-reported by a group of actors and singers.
Kitch, J A; Oates, J
1994-09-01
Performers (10 actors/10 singers) rated via a self-report questionnaire the severity of their voice-related changes when vocally fatigued. Similar frequency patterns and perceptual features of vocal fatigue were found across subjects. Actors rated "power" aspects (e.g., voice projection) and singers rated vocal dynamic aspects (e.g., pitch range) of their voices as most affected when vocally fatigued. Vocal fatigue was evidenced by changes in kinesthetic/proprioceptive sensations and vocal dynamics. The causes and context of vocal fatigue were vocal misuse, being "run down," high performance demands, and using high pitch/volume levels. Further research is needed to delineate the perceptual features of "normal" levels of vocal fatigue and its possible causes.
Drinking Songs: Alcohol Effects on Learned Song of Zebra Finches
Olson, Christopher R.; Owen, Devin C.; Ryabinin, Andrey E.; Mello, Claudio V.
2014-01-01
Speech impairment is one of the most intriguing and least understood effects of alcohol on cognitive function, largely due to the lack of data on alcohol effects on vocalizations in the context of an appropriate experimental model organism. Zebra finches, a representative songbird and a premier model for understanding the neurobiology of vocal production and learning, learn song in a manner analogous to how humans learn speech. Here we show that when allowed access, finches readily drink alcohol, increase their blood ethanol concentrations (BEC) significantly, and sing a song with altered acoustic structure. The most pronounced effects were decreased amplitude and increased entropy, the latter likely reflecting a disruption in the birds’ ability to maintain the spectral structure of song under alcohol. Furthermore, specific syllables, which have distinct acoustic structures, were differentially influenced by alcohol, likely reflecting a diversity in the neural mechanisms required for their production. Remarkably, these effects on vocalizations occurred without overt effects on general behavioral measures, and importantly, they occurred within a range of BEC that can be considered risky for humans. Our results suggest that the variable effects of alcohol on finch song reflect differential alcohol sensitivity of the brain circuitry elements that control different aspects of song production. They also point to finches as an informative model for understanding how alcohol affects the neuronal circuits that control the production of learned motor behaviors. PMID:25536524
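The abstract does not state which entropy measure was used. A minimal sketch, assuming Wiener (spectral-flatness) entropy, which is common in birdsong analysis: the log of the ratio of the geometric mean to the arithmetic mean of a power spectrum. Values near 0 indicate a flat, noise-like spectrum, and strongly negative values indicate energy concentrated in a few frequencies. The function name and the toy spectra in the usage note are hypothetical.

```python
import math


def wiener_entropy(power_spectrum):
    """Return log(geometric mean / arithmetic mean) of a power spectrum.

    The result is 0 for a perfectly flat spectrum and negative otherwise.
    """
    if not power_spectrum or any(p <= 0 for p in power_spectrum):
        raise ValueError("power spectrum must contain only positive values")
    n = len(power_spectrum)
    # The log of the geometric mean is the mean of the logs.
    mean_log = sum(math.log(p) for p in power_spectrum) / n
    log_mean = math.log(sum(power_spectrum) / n)
    return mean_log - log_mean
```

With a flat toy spectrum such as `[1.0] * 64` the function returns 0, while a spectrum dominated by one bin, such as `[1e-6] * 63 + [1.0]`, gives a strongly negative value. Under this assumed measure, "increased entropy" means a shift toward 0, that is, toward a less structured, more noise-like spectrum.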
Bennur, Sharath; Tsunada, Joji; Cohen, Yale E; Liu, Robert C
2013-11-01
Acoustic communication between animals requires them to detect, discriminate, and categorize conspecific or heterospecific vocalizations in their natural environment. Laboratory studies of the auditory-processing abilities that facilitate these tasks have typically employed a broad range of acoustic stimuli, ranging from natural sounds like vocalizations to "artificial" sounds like pure tones and noise bursts. However, even when using vocalizations, laboratory studies often test abilities like categorization in relatively artificial contexts. Consequently, it is not clear whether neural and behavioral correlates of these tasks reflect (1) extensive operant training, which drives plastic changes in auditory pathways, or (2) the innate capacity of the animal and its auditory system. Here, we review a number of recent studies, which suggest that adopting more ethological paradigms utilizing natural communication contexts is scientifically important for elucidating how the auditory system normally processes and learns communication sounds. Additionally, since learning the meaning of communication sounds generally involves social interactions that engage neuromodulatory systems differently than laboratory-based conditioning paradigms, we argue that scientists need to pursue more ethological approaches to more fully inform our understanding of how the auditory system is engaged during acoustic communication. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives". Copyright © 2013 Elsevier B.V. All rights reserved.
De Carli Monteiro, Juliane Petry; Condez, Thais Helena; De Anchietta Garcia, Paulo Christiano; Comitti, Estevão Jasper; Amaral, Ivan Borel; Haddad, Célio Fernando Baptista
2018-04-12
We describe a new species of Brachycephalus from the municipalities of São Francisco do Sul and Itapoá, in the state of Santa Catarina, Brazil, southern Atlantic Forest. The new species is known from six localities from near sea level up to 250 meters and represents the first record of a "pumpkin-toadlet" occurring in the lowlands. Morphological traits and phylogenetic analysis of a fragment of the 16S mtDNA gene place the new species in the Brachycephalus pernix group. The new species is supported by external morphology, osteology, advertisement call, and mtDNA divergence. It is characterized, among other traits, by a dorsal body color dark green with a dark brown vertebral stripe, and an orange background; snout-vent length of 9.2-10.8 mm in males and 11.1-12.4 mm in females; and advertisement call short (0.02-0.03 seconds), composed of one high-frequency note (dominant frequency 6.6-7.3 kHz). We observed synchronized alternation in the emission of vocalizations among neighboring males, indicating that males of the new species are able to hear and use vocalizations to interact with each other. We provide descriptions of clutch, eggs, and juvenile and observations on parental care. The new species has not been recorded within any protected area and may be threatened by human-induced habitat loss and modification.
Tong, Zhixiang; Duncan, Randall L.
2013-01-01
We are interested in the in vitro engineering of artificial vocal fold tissues via the strategic combination of multipotent mesenchymal stem cells (MSCs), physiologically relevant mechanical stimulations, and biomimetic artificial matrices. We have constructed a vocal fold bioreactor that is capable of imposing vibratory stimulations on the cultured cells at human phonation frequencies. Separately, fibrous poly(ε-caprolactone) (PCL) scaffolds emulating the ligamentous structure of the vocal fold were prepared by electrospinning, were incorporated in the vocal fold bioreactor, and were driven into a wave-like motion in an axisymmetrical fashion by the oscillating air. MSC-laden PCL scaffolds were subjected to vibrations at 200 Hz with a normal center displacement of ∼40 μm for a total of 7 days. A continuous (CT) or a 1 h-on-1 h-off (OF) regime with a total dynamic culture time of 12 h per day was applied. The dynamic loading did not cause any physiological trauma to the cells. Immunohistochemical staining revealed the reinforcement of the actin filament and the enhancement of α5β1 integrin expression under selected dynamic culture conditions. Cellular expression of essential vocal fold extracellular matrix components, such as elastin, hyaluronic acid, and matrix metalloproteinase-1, was significantly elevated as compared with the static controls, and the OF regime was more conducive to matrix production than the CT vibration mode. Analyses of genes of typical fibroblast hallmarks (tenascin-C, collagen III, and procollagen I) as well as markers for MSC differentiation into nonfibroblastic lineages confirmed MSCs' adoption of fibroblastic behaviors. Overall, the high-frequency vibratory stimulation, when combined with a synthetic fibrous scaffold, serves as a potent modulator of MSC functions.
The novel bioreactor system presented here, as a versatile, yet well-controlled model, offers an in vitro platform for understanding vibration-induced mechanotransduction and for engineering of functional vocal fold tissues. PMID:23516973
Reinforcement of Infant Vocalizations through Contingent Vocal Imitation
ERIC Educational Resources Information Center
Pelaez, Martha; Virues-Ortega, Javier; Gewirtz, Jacob L.
2011-01-01
Maternal vocal imitation of infant vocalizations is highly prevalent during face-to-face interactions of infants and their caregivers. Although maternal vocal imitation has been associated with later verbal development, its potentially reinforcing effect on infant vocalizations has not been explored experimentally. This study examined the…
Gultekin, Yasemin B; Hage, Steffen R
2018-04-01
Human vocal development is dependent on learning by imitation through social feedback between infants and caregivers. Recent studies have revealed that vocal development is also influenced by parental feedback in marmoset monkeys, suggesting vocal learning mechanisms in nonhuman primates. Marmoset infants that experience more contingent vocal feedback than their littermates develop vocalizations more rapidly, and infant marmosets with limited parental interaction exhibit immature vocal behavior beyond infancy. However, it is yet unclear whether direct parental interaction is an obligate requirement for proper vocal development because all monkeys in the aforementioned studies were able to produce the adult call repertoire after infancy. Using quantitative measures to compare distinct call parameters and vocal sequence structure, we show that social interaction has a direct impact not only on the maturation of the vocal behavior but also on acoustic call structures during vocal development. Monkeys with limited parental interaction during development show systematic differences in call entropy, a measure for maturity, compared with their normally raised siblings. In addition, different call types were occasionally uttered in motif-like sequences similar to those exhibited by vocal learners, such as birds and humans, in early vocal development. These results indicate that a lack of parental interaction leads to long-term disturbances in the acoustic structure of marmoset vocalizations, suggesting an imperative role for social interaction in proper primate vocal development. PMID:29651461
Histopathologic study of human vocal fold mucosa unphonated over a decade.
Sato, Kiminori; Umeno, Hirohito; Ono, Takeharu; Nakashima, Tadashi
2011-12-01
Mechanotransduction caused by vocal fold vibration could possibly be an important factor in the maintenance of extracellular matrices and layered structure of the human adult vocal fold mucosa as a vibrating tissue after the layered structure has been completed. Vocal fold stellate cells (VFSCs) in the human maculae flavae of the vocal fold mucosa are inferred to be involved in the metabolism of extracellular matrices of the vocal fold mucosa. Maculae flavae are also considered to be an important structure in the growth and development of the human vocal fold mucosa. Tension caused by phonation (vocal fold vibration) is hypothesized to stimulate the VFSCs to accelerate production of extracellular matrices. A human adult vocal fold mucosa unphonated over a decade was investigated histopathologically. Vocal fold mucosa unphonated for 11 years and 2 months of a 64-year-old male with cerebral hemorrhage was investigated by light and electron microscopy. The vocal fold mucosae (including maculae flavae) were atrophic. The vocal fold mucosa did not have a vocal ligament, Reinke's space or a layered structure. The lamina propria appeared as a uniform structure. Morphologically, the VFSCs synthesized fewer extracellular matrices, such as fibrous protein and glycosaminoglycan. Consequently, VFSCs appeared to decrease their level of activity.
Coos, booms, and hoots: The evolution of closed-mouth vocal behavior in birds.
Riede, Tobias; Eliason, Chad M; Miller, Edward H; Goller, Franz; Clarke, Julia A
2016-08-01
Most birds vocalize with an open beak, but vocalization with a closed beak into an inflating cavity occurs in territorial or courtship displays in disparate species throughout birds. Closed-mouth vocalizations generate resonance conditions that favor low-frequency sounds. By contrast, open-mouth vocalizations cover a wider frequency range. Here we describe closed-mouth vocalizations of birds from functional and morphological perspectives and assess the distribution of closed-mouth vocalizations in birds and related outgroups. Ancestral-state optimizations of body size and vocal behavior indicate that closed-mouth vocalizations are unlikely to be ancestral in birds and have evolved independently at least 16 times within Aves, predominantly in large-bodied lineages. Closed-mouth vocalizations are rare in the small-bodied passerines. In light of these results and body size trends in nonavian dinosaurs, we suggest that the capacity for closed-mouth vocalization was present in at least some extinct nonavian dinosaurs. As in birds, this behavior may have been limited to sexually selected vocal displays, and hence would have co-occurred with open-mouthed vocalizations. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.
The Vocal Repertoire of Adult and Neonate Giant Otters (Pteronura brasiliensis)
Mumm, Christina A. S.; Knörnschild, Mirjam
2014-01-01
Animals use vocalizations to exchange information about external events, their own physical or motivational state, or about individuality and social affiliation. Infant babbling can enhance the development of the full adult vocal repertoire by providing ample opportunity for practice. Giant otters are very social and frequently vocalizing animals. They live in highly cohesive groups, generally including a reproductive pair and their offspring born in different years. This basic social structure may vary in the degree of relatedness of the group members. Individuals engage in shared group activities and different social roles and thus, the social organization of giant otters provides a basis for complex and long-term individual relationships. We recorded and analysed the vocalizations of adult and neonate giant otters from wild and captive groups. We classified the adult vocalizations according to their acoustic structure, and described their main behavioural context. Additionally, we present the first description of vocalizations uttered in babbling bouts of new born giant otters. We expected to find 1) a sophisticated vocal repertoire that would reflect the species’ complex social organisation, 2) that giant otter vocalizations have a clear relationship between signal structure and function, and 3) that the vocal repertoire of new born giant otters would comprise age-specific vocalizations as well as precursors of the adult repertoire. We found a vocal repertoire with 22 distinct vocalization types produced by adults and 11 vocalization types within the babbling bouts of the neonates. A comparison within the otter subfamily suggests a relation between vocal and social complexity, with the giant otters being the socially and vocally most complex species. PMID:25391142
Doellinger, Michael; Lohscheller, Joerg; McWhorter, Andrew; Kunduk, Melda
2009-03-01
We investigate the potential of the high-speed digital imaging technique (HSI) and phonovibrogram (PVG) analysis in normal vocal fold dynamics by studying the effects of continuous voice use (vocal loading) during the workday. One healthy subject was recorded at sustained phonation 13 times within 2 consecutive days, in the morning before and in the afternoon after vocal loading, respectively. Vocal fold dynamics were extracted and visualized by PVGs. The characteristic PVG patterns representing vocal fold vibration types were extracted. The parameter values were then analyzed statistically with regard to vocal load, left-right PVG asymmetries, anterior-posterior PVG asymmetries, and opening-closing differences. For the first time, the direct impact of vocal load could be determined by analyzing vocal fold dynamics. For the same vocal loading conditions, equal dynamical behavior of the vocal folds was confirmed. Comparison of recordings performed in the morning with the recordings after work revealed significant changes in vibration behavior, indicating an impact of the vocal load. Left-right asymmetries in vocal fold dynamics were found, confirming earlier assumptions. Different dynamics between the opening and closing procedures as well as between anterior and posterior parts were found. Constant voice usage stresses the vocal folds even in healthy subjects, and this can be detected by applying the PVG technique. Furthermore, left-right PVG asymmetries do occur in healthy voice to a certain extent. HSI in combination with PVG analysis seems to be a promising tool for investigation of vocal fold fatigue and pathologies resulting in small forms of dynamical changes.
Garcia, Elisângela Zacanti; Yamashita, Hélio Kiitiro; Garcia, Davi Sousa; Padovani, Marina Martins Pereira; Azevedo, Renata Rangel; Chiari, Brasília Maria
2016-01-01
Cone beam computed tomography (CBCT), which represents an alternative to traditional computed tomography and magnetic resonance imaging, may be a useful instrument to study vocal tract physiology related to vocal exercises. This study aims to evaluate the applicability of CBCT to the assessment of variations in the vocal tract of healthy individuals before and after vocal exercises. Voice recordings and CBCT images before and after vocal exercises performed by 3 speech-language pathologists without vocal complaints were collected and compared. Each participant performed 1 type of exercise, i.e., Finnish resonance tube technique, prolonged consonant "b" technique, or chewing technique. The analysis consisted of an acoustic analysis and tomographic imaging. Modifications of the vocal tract settings following vocal exercises were properly detected by CBCT, and changes in the acoustic parameters were, for the most part, compatible with the variations detected in image measurements. CBCT was shown to be capable of properly assessing the changes in vocal tract settings promoted by vocal exercises. © 2017 S. Karger AG, Basel.
Iconicity can ground the creation of vocal symbols.
Perlman, Marcus; Dale, Rick; Lupyan, Gary
2015-08-01
Studies of gestural communication systems find that they originate from spontaneously created iconic gestures. Yet, we know little about how people create vocal communication systems, and many have suggested that vocalizations do not afford iconicity beyond trivial instances of onomatopoeia. It is unknown whether people can generate vocal communication systems through a process of iconic creation similar to gestural systems. Here, we examine the creation and development of a rudimentary vocal symbol system in a laboratory setting. Pairs of participants generated novel vocalizations for 18 different meanings in an iterative 'vocal' charades communication game. The communicators quickly converged on stable vocalizations, and naive listeners could correctly infer their meanings in subsequent playback experiments. People's ability to guess the meanings of these novel vocalizations was predicted by how close the vocalization was to an iconic 'meaning template' we derived from the production data. These results strongly suggest that the meaningfulness of these vocalizations derived from iconicity. Our findings illuminate a mechanism by which iconicity can ground the creation of vocal symbols, analogous to the function of iconicity in gestural communication systems.
Calcium-activated potassium (BK) channels are encoded by duplicate slo1 genes in teleost fishes.
Rohmann, Kevin N; Deitcher, David L; Bass, Andrew H
2009-07-01
Calcium-activated, large conductance potassium (BK) channels in tetrapods are encoded by a single slo1 gene, which undergoes extensive alternative splicing. Alternative splicing generates a high level of functional diversity in BK channels that contributes to the wide range of frequencies electrically tuned by the inner ear hair cells of many tetrapods. To date, the role of BK channels in hearing among teleost fishes has not been investigated at the molecular level, although teleosts account for approximately half of all extant vertebrate species. We identified slo1 genes in teleost and nonteleost fishes using polymerase chain reaction and genetic sequence databases. In contrast to tetrapods, all teleosts examined were found to express duplicate slo1 genes in the central nervous system, whereas nonteleosts that diverged prior to the teleost whole-genome duplication event express a single slo1 gene. Phylogenetic analyses further revealed that whereas other slo1 duplicates were the result of a single duplication event, an independent duplication occurred in a basal teleost (Anguilla rostrata) following the slo1 duplication in teleosts. A third, independent slo1 duplication (autotetraploidization) occurred in salmonids. Comparison of teleost slo1 genomic sequences to their tetrapod orthologue revealed a reduced number of alternative splice sites in both slo1 co-orthologues. For the teleost Porichthys notatus, a focal study species that vocalizes with maximal spectral energy in the range electrically tuned by BK channels in the inner ear, peripheral tissues show the expression of either one (e.g., vocal muscle) or both (e.g., inner ear) slo1 paralogues with important implications for both auditory and vocal physiology. Additional loss of expression of one slo1 paralogue in nonneural tissues in P. notatus suggests that slo1 duplicates were retained via subfunctionalization. 
Together, the results predict that teleost fish achieve a diversity of BK channel subfunction via gene duplication, rather than increased alternative splicing as witnessed for the tetrapod and invertebrate orthologue. PMID:19321796
Dependence of phonation threshold pressure on vocal tract acoustics and vocal fold tissue mechanics.
Chan, Roger W; Titze, Ingo R
2006-04-01
Analytical and computer simulation studies have shown that the acoustic impedance of the vocal tract as well as the viscoelastic properties of vocal fold tissues are critical for determining the dynamics and the energy transfer mechanism of vocal fold oscillation. In the present study, a linear, small-amplitude oscillation theory was revised by taking into account the propagation of a mucosal wave and the inertive reactance (inertance) of the supraglottal vocal tract as the major energy transfer mechanisms for flow-induced self-oscillation of the vocal fold. Specifically, analytical results predicted that phonation threshold pressure (Pth) increases with the viscous shear properties of the vocal fold, but decreases with vocal tract inertance. This theory was empirically tested using a physical model of the larynx, where biological materials (fat, hyaluronic acid, and fibronectin) were implanted into the vocal fold cover to investigate the effect of vocal fold tissue viscoelasticity on Pth. A uniform-tube supraglottal vocal tract was also introduced to examine the effect of vocal tract inertance on Pth. Results showed that Pth decreased with the inertive impedance of the vocal tract and increased with the viscous shear modulus (G") or dynamic viscosity (eta') of the vocal fold cover, consistent with theoretical predictions. These findings supported the potential biomechanical benefits of hyaluronic acid as a surgical bioimplant for repairing voice disorders involving the superficial layer of the lamina propria, such as scarring, sulcus vocalis, atrophy, and Reinke's edema.
ERIC Educational Resources Information Center
Remijn, Gerard B.; Kikuchi, Mitsuru; Yoshimura, Yuko; Shitamichi, Kiyomi; Ueno, Sanae; Tsubokawa, Tsunehisa; Kojima, Haruyuki; Higashida, Haruhiro; Minabe, Yoshio
2017-01-01
Purpose: The purpose of this study was to assess cortical hemodynamic response patterns in 3- to 7-year-old children listening to two speech modes: normally vocalized and whispered speech. Understanding whispered speech requires processing of the relatively weak, noisy signal, as well as the cognitive ability to understand the speaker's reason for…
Van Lierde, Kristiane M; D'haeseleer, Evelien; Wuyts, Floris L; De Ley, Sophia; Geldof, Ruben; De Vuyst, Julie; Sofie, Claeys
2010-09-01
The purpose of the present cross-sectional study was to determine the objective vocal quality and the vocal characteristics (vocal risk factors, vocal and corporal complaints) in 197 female students in speech-language pathology during the 4 years of study. The objective vocal quality was measured by means of the Dysphonia Severity Index (DSI). Perceptual voice assessment, the Voice Handicap Index (VHI), questionnaires addressing vocal risks, and vocal and corporal complaints during and/or after voice usage were performed. Speech-language pathology (SLP) students have a borderline vocal quality corresponding to a DSI% of 68. The analysis of variance revealed no significant change of the objective vocal quality between the first bachelor year and the master year. No psychosocial handicapping effect of the voice was observed by means of the VHI total, though there was an effect at the functional VHI level in addition to some vocal complaints. Ninety-three percent of the student SLPs reported the presence of corporal pain during and/or after speaking. In particular, sore throat and headache were mentioned as the prevalent corporal pain symptoms. A longitudinal study of the objective vocal quality of the same subjects during their career as an SLP might provide new insights. 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
University Vocal Training and Vocal Health of Music Educators and Music Therapists
ERIC Educational Resources Information Center
Baker, Vicki D.; Cohen, Nicki
2017-01-01
The purpose of this study was to describe the university vocal training and vocal health of music educators and music therapists. The participants (N = 426), music educators (n = 351) and music therapists (n = 75), completed a survey addressing demographics, vocal training, voice usage, and vocal health. Both groups reported singing at least 50%…
Monkey vocal tracts are speech-ready.
Fitch, W Tecumseh; de Boer, Bart; Mathur, Neil; Ghazanfar, Asif A
2016-12-01
For four decades, the inability of nonhuman primates to produce human speech sounds has been claimed to stem from limitations in their vocal tract anatomy, a conclusion based on plaster casts made from the vocal tract of a monkey cadaver. We used x-ray videos to quantify vocal tract dynamics in living macaques during vocalization, facial displays, and feeding. We demonstrate that the macaque vocal tract could easily produce an adequate range of speech sounds to support spoken language, showing that previous techniques based on postmortem samples drastically underestimated primate vocal capabilities. Our findings imply that the evolution of human speech capabilities required neural changes rather than modifications of vocal anatomy. Macaques have a speech-ready vocal tract but lack a speech-ready brain to control it.
Vocal Dose Measures: Quantifying Accumulated Vibration Exposure in Vocal Fold Tissues
Titze, Ingo R.; Švec, Jan G.; Popolo, Peter S.
2011-01-01
To measure the exposure to self-induced tissue vibration in speech, three vocal doses were defined and described: distance dose, which accumulates the distance that tissue particles of the vocal folds travel in an oscillatory trajectory; energy dissipation dose, which accumulates the total amount of heat dissipated over a unit volume of vocal fold tissues; and time dose, which accumulates the total phonation time. These doses were compared to a previously used vocal dose measure, the vocal loading index, which accumulates the number of vibration cycles of the vocal folds. Empirical rules for viscosity and vocal fold deformation were used to calculate all the doses from the fundamental frequency (F0) and sound pressure level (SPL) values of speech. Six participants were asked to read in normal, monotone, and exaggerated speech and the doses associated with these vocalizations were calculated. The results showed that large F0 and SPL variations in speech affected the dose measures, suggesting that accumulation of phonation time alone is insufficient. The vibration exposure of the vocal folds in normal speech was related to the industrial limits for hand-transmitted vibration, in which the safe distance dose was derived to be about 500 m. This limit was found rather low for vocalization; it was related to a comparable time dose of about 17 min of continuous vocalization, or about 35 min of continuous reading with normal breathing and unvoiced segments. The voicing pauses in normal speech and dialogue effectively prolong the safe time dose. The derived safety limits for vocalization will likely require refinement based on a more detailed knowledge of the differences in hand and vocal fold tissue morphology and their response to vibrational stress, and on the effect of recovery of the vocal fold tissue during voicing pauses. PMID:12959470
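As a rough illustration of the two simplest measures described in the abstract above, the following Python sketch accumulates a time dose (total voiced time) and a cycle count (the quantity the vocal loading index accumulates) from a frame-by-frame F0 track. The function name, the fixed frame length, and the convention that unvoiced frames carry F0 = 0 are assumptions made for this example, not details taken from the study. The distance and energy dissipation doses are omitted because they depend on empirical rules that the abstract does not spell out.

```python
def time_and_cycle_dose(f0_track_hz, frame_s):
    """Accumulate phonation time (s) and vibration cycles over voiced frames.

    f0_track_hz: per-frame F0 in Hz; 0 or None marks an unvoiced frame
                 (an assumed convention for this sketch).
    frame_s:     duration of each analysis frame in seconds (assumed constant).
    """
    time_dose = 0.0   # total voiced time in seconds
    cycle_dose = 0.0  # total number of vibration cycles
    for f0 in f0_track_hz:
        if f0:  # skip unvoiced frames
            time_dose += frame_s
            cycle_dose += f0 * frame_s  # cycles in this frame = F0 * duration
    return time_dose, cycle_dose


# Hypothetical track: three voiced and two unvoiced frames of 0.5 s each.
track = [200.0, 0.0, 100.0, 0.0, 300.0]
t_dose, c_dose = time_and_cycle_dose(track, 0.5)
print(t_dose, c_dose)
```

Because the cycle count weights each voiced frame by its F0, two recordings with the same phonation time can differ in accumulated cycles, which is in line with the abstract's point that phonation time alone is insufficient.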
Neural FoxP2 and FoxP1 expression in the budgerigar, an avian species with adult vocal learning.
Hara, Erina; Perez, Jemima M; Whitney, Osceola; Chen, Qianqian; White, Stephanie A; Wright, Timothy F
2015-04-15
Vocal learning underlies acquisition of both language in humans and vocal signals in some avian taxa. These bird groups and humans exhibit convergent developmental phases and associated brain pathways for vocal communication. The transcription factor FoxP2 plays critical roles in vocal learning in humans and songbirds. Another member of the forkhead box gene family, FoxP1 also shows high expression in brain areas involved in vocal learning and production. Here, we investigate FoxP2 and FoxP1 mRNA and protein in adult male budgerigars (Melopsittacus undulatus), a parrot species that exhibits vocal learning as both juveniles and adults. To examine these molecules in adult vocal learners, we compared their expression patterns in the budgerigar striatal nucleus involved in vocal learning, magnocellular nucleus of the medial striatum (MMSt), across birds with different vocal states, such as vocalizing to a female (directed), vocalizing alone (undirected), and non-vocalizing. We found that both FoxP2 mRNA and protein expressions were consistently lower in MMSt than in the adjacent striatum regardless of the vocal states, whereas previous work has shown that songbirds exhibit down-regulation in the homologous region, Area X, only after singing alone. In contrast, FoxP1 levels were high in MMSt compared to the adjacent striatum in all groups. Taken together these results strengthen the general hypothesis that FoxP2 and FoxP1 have specialized expression in vocal nuclei across a range of taxa, and suggest that the adult vocal plasticity seen in budgerigars may be a product of persistent down-regulation of FoxP2 in MMSt. Copyright © 2015 Elsevier B.V. All rights reserved.
Neural FoxP2 and FoxP1 expression in the budgerigar, an avian species with adult vocal learning
Hara, Erina; Perez, Jemima M.; Whitney, Osceola; Chen, Qianqian; White, Stephanie A.; Wright, Timothy F.
2015-01-01
Vocal learning underlies acquisition of both language in humans and vocal signals in some avian taxa. These bird groups and humans exhibit convergent developmental phases and associated brain pathways for vocal communication. The transcription factor FoxP2 plays critical roles in vocal learning in humans and songbirds. Another member of the forkhead box gene family, FoxP1 also shows high expression in brain areas involved in vocal learning and production. Here, we investigate FoxP2 and FoxP1 mRNA and protein in adult male budgerigars (Melopsittacus undulatus), a parrot species that exhibits vocal learning as both juveniles and adults. To examine these molecules in adult vocal learners, we compared their expression patterns in the budgerigar striatal nucleus involved in vocal learning, magnocellular nucleus of the medial striatum (MMSt), across birds with different vocal states, such as vocalizing to a female (directed), vocalizing alone (undirected), and non-vocalizing. We found that both FoxP2 mRNA and protein expressions were consistently lower in MMSt than in the adjacent striatum regardless of the vocal states, whereas previous work has shown that songbirds exhibit downregulation in the homologous region, Area X, only after singing alone. In contrast, FoxP1 levels were high in MMSt compared to the adjacent striatum in all groups. Taken together these results strengthen the general hypothesis that FoxP2 and FoxP1 have specialized expression in vocal nuclei across a range of taxa, and suggest that the adult vocal plasticity seen in budgerigars may be a product of persistent down-regulation of FoxP2 in MMSt. PMID:25601574
Vocal dose measures: quantifying accumulated vibration exposure in vocal fold tissues.
Titze, Ingo R; Svec, Jan G; Popolo, Peter S
2003-08-01
To measure the exposure to self-induced tissue vibration in speech, three vocal doses were defined and described: distance dose, which accumulates the distance that tissue particles of the vocal folds travel in an oscillatory trajectory; energy dissipation dose, which accumulates the total amount of heat dissipated over a unit volume of vocal fold tissues; and time dose, which accumulates the total phonation time. These doses were compared to a previously used vocal dose measure, the vocal loading index, which accumulates the number of vibration cycles of the vocal folds. Empirical rules for viscosity and vocal fold deformation were used to calculate all the doses from the fundamental frequency (F0) and sound pressure level (SPL) values of speech. Six participants were asked to read in normal, monotone, and exaggerated speech and the doses associated with these vocalizations were calculated. The results showed that large F0 and SPL variations in speech affected the dose measures, suggesting that accumulation of phonation time alone is insufficient. The vibration exposure of the vocal folds in normal speech was related to the industrial limits for hand-transmitted vibration, in which the safe distance dose was derived to be about 500 m. This limit was found rather low for vocalization; it was related to a comparable time dose of about 17 min of continuous vocalization, or about 35 min of continuous reading with normal breathing and unvoiced segments. The voicing pauses in normal speech and dialogue effectively prolong the safe time dose. The derived safety limits for vocalization will likely require refinement based on a more detailed knowledge of the differences in hand and vocal fold tissue morphology and their response to vibrational stress, and on the effect of recovery of the vocal fold tissue during voicing pauses.
Sielska-Badurek, Ewelina; Osuch-Wójcikiewicz, Ewa; Sobol, Maria; Kazanecka, Ewa; Niemczyk, Kazimierz
2017-01-01
This study investigated vocal function knowledge and vocal tract sensorimotor self-awareness and the impact of functional voice rehabilitation on vocal function knowledge and self-awareness. This is a prospective, randomized study. Twenty singers (study group [SG]) completed a questionnaire before and after functional voice rehabilitation. Twenty additional singers, representing the control group, also completed the questionnaire without functional voice rehabilitation at a 3-month interval. The questionnaire consisted of three parts. The first part evaluated the singers' attitude to the anatomical and physiological knowledge of the vocal tract and their self-esteem of the knowledge level. The second part assessed the theoretical knowledge of the singers' vocal tract physiology. The third part of the questionnaire assessed singers' sensorimotor self-awareness of the vocal tract. The results showed that most singers indicated that knowledge of the vocal tract's anatomy and physiology is useful (59% SG, 67% control group). However, 75% of all participants defined their knowledge of the vocal tract's anatomy and physiology as weak or inadequate. In the SG, vocal function knowledge at the first assessment was 45%. After rehabilitation, the level increased to 67.7%. Vocal tract sensorimotor self-awareness initially was 38.9% in SG but rose to 66.7%. Findings of the study suggest that classical singers lack knowledge about the physiology of the vocal mechanism, especially the breathing patterns. In addition, they have low sensorimotor self-awareness of their vocal tract. The results suggest that singers would benefit from receiving services from phoniatrists and speech-language pathologists during their voice training. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Scheerer, N E; Jacobson, D S; Jones, J A
2016-02-09
Auditory feedback plays an important role in the acquisition of fluent speech; however, this role may change once speech is acquired and individuals no longer experience persistent developmental changes to the brain and vocal tract. For this reason, we investigated whether the role of auditory feedback in sensorimotor learning differs across children and adult speakers. Participants produced vocalizations while they heard their vocal pitch predictably or unpredictably shifted downward one semitone. The participants' vocal pitches were measured at the beginning of each vocalization, before auditory feedback was available, to assess the extent to which the deviant auditory feedback modified subsequent speech motor commands. Sensorimotor learning was observed in both children and adults, with participants' initial vocal pitch increasing following trials where they were exposed to predictable, but not unpredictable, frequency-altered feedback. Participants' vocal pitch was also measured across each vocalization, to index the extent to which the deviant auditory feedback was used to modify ongoing vocalizations. While both children and adults were found to increase their vocal pitch following predictable and unpredictable changes to their auditory feedback, adults produced larger compensatory responses. The results of the current study demonstrate that both children and adults rapidly integrate information derived from their auditory feedback to modify subsequent speech motor commands. However, these results also demonstrate that children and adults differ in their ability to use auditory feedback to generate compensatory vocal responses during ongoing vocalization. Since vocal variability also differed across the children and adult groups, these results also suggest that compensatory vocal responses to frequency-altered feedback manipulations initiated at vocalization onset may be modulated by vocal variability. Copyright © 2015 IBRO. Published by Elsevier Ltd. 
Audio-vocal interaction in single neurons of the monkey ventrolateral prefrontal cortex.
Hage, Steffen R; Nieder, Andreas
2015-05-06
Complex audio-vocal integration systems depend on a strong interconnection between the auditory and the vocal motor system. To gain cognitive control over audio-vocal interaction during vocal motor control, the PFC needs to be involved. Neurons in the ventrolateral PFC (VLPFC) have been shown to separately encode the sensory perceptions and motor production of vocalizations. It is unknown, however, whether single neurons in the PFC reflect audio-vocal interactions. We therefore recorded single-unit activity in the VLPFC of rhesus monkeys (Macaca mulatta) while they produced vocalizations on command or passively listened to monkey calls. We found that 12% of randomly selected neurons in VLPFC modulated their discharge rate in response to acoustic stimulation with species-specific calls. Almost three-fourths of these auditory neurons showed an additional modulation of their discharge rates either before and/or during the monkeys' motor production of vocalization. Based on these audio-vocal interactions, the VLPFC might be well positioned to combine higher order auditory processing with cognitive control of the vocal motor output. Such audio-vocal integration processes in the VLPFC might constitute a precursor for the evolution of complex learned audio-vocal integration systems, ultimately giving rise to human speech.
Multiple Coordination Patterns in Infant and Adult Vocalizations
Abney, Drew H.; Warlaumont, Anne S.; Oller, D. Kimbrough; Wallot, Sebastian; Kello, Christopher T.
2017-01-01
The study of vocal coordination between infants and adults has led to important insights into the development of social, cognitive, emotional and linguistic abilities. We used an automatic system to identify vocalizations produced by infants and adults over the course of the day for fifteen infants studied longitudinally during the first two years of life. We measured three different types of vocal coordination: coincidence-based, rate-based, and cluster-based. Coincidence-based and rate-based coordination are established measures in the developmental literature. Cluster-based coordination is new and measures the strength of matching in the degree to which vocalization events occur in hierarchically nested clusters. We investigated whether various coordination patterns differ as a function of vocalization type, whether different coordination patterns provide unique information about the dynamics of vocal interaction, and how the various coordination patterns each relate to infant age. All vocal coordination patterns displayed greater coordination for infant speech-related vocalizations, adults adapted the hierarchical clustering of their vocalizations to match that of infants, and each of the three coordination patterns had unique associations with infant age. Altogether, our results indicate that vocal coordination between infants and adults is multifaceted, suggesting a complex relationship between vocal coordination and the development of vocal communication. PMID:29375276
Behroozmand, Roozbeh; Karvelis, Laura; Liu, Hanjun; Larson, Charles R.
2009-01-01
Objective: The present study investigated whether self-vocalization enhances auditory neural responsiveness to voice pitch feedback perturbation and how this vocalization-induced neural modulation can be affected by the extent of the feedback deviation. Method: Event related potentials (ERPs) were recorded in 15 subjects in response to +100, +200 and +500 cents pitch-shifted voice auditory feedback during active vocalization and passive listening to the playback of the self-produced vocalizations. Result: The amplitude of the evoked P1 (latency: 73.51 ms) and P2 (latency: 199.55 ms) ERP components in response to feedback perturbation were significantly larger during vocalization than listening. The difference between P2 peak amplitudes during vocalization vs. listening was shown to be significantly larger for +100 than +500 cents stimulus. Conclusion: Results indicate that the human auditory cortex is more responsive to voice F0 feedback perturbations during vocalization than passive listening. Greater vocalization-induced enhancement of the auditory responsiveness to smaller feedback perturbations may imply that the audio-vocal system detects and corrects for errors in vocal production that closely match the expected vocal output. Significance: Findings of this study support previous suggestions regarding the enhanced auditory sensitivity to feedback alterations during self-vocalization, which may serve the purpose of feedback-based monitoring of one’s voice. PMID:19520602
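For readers unfamiliar with the cents scale used for the pitch shifts in the abstract above, the following Python sketch converts a shift in cents into a frequency, using the standard definition that 100 cents is one equal-tempered semitone and 1200 cents is one octave. The function name and the 220 Hz baseline are hypothetical values chosen for illustration; they are not taken from the study.

```python
def shift_by_cents(f0_hz, cents):
    """Return the frequency obtained by shifting f0_hz by the given number of cents.

    Uses the standard relation f_shifted = f0 * 2 ** (cents / 1200).
    """
    return f0_hz * 2.0 ** (cents / 1200.0)


baseline_hz = 220.0  # hypothetical speaker F0, for illustration only
for cents in (100, 200, 500):  # the three shift magnitudes named in the abstract
    print(cents, round(shift_by_cents(baseline_hz, cents), 2))
```

On this scale a +100 cents perturbation raises the heard pitch by one semitone, so the three stimuli correspond to shifts of one, two, and five semitones.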
First insights into the vocal repertoire of infant and juvenile Southern white rhinoceros.
Linn, Sabrina N; Boeer, Michael; Scheumann, Marina
2018-01-01
Describing vocal repertoires represents an essential step towards gaining an overview of the complexity of acoustic communication in a given species. The analysis of infant vocalisations is essential for understanding the development and usage of species-specific vocalisations, but is often underrepresented, especially in species with long inter-birth intervals such as the white rhinoceros. Thus, this study aimed for the first time to characterise the infant and juvenile vocal repertoire of the Southern white rhinoceros and to relate these findings to the adult vocal repertoire. The behaviour of seven mother-reared white rhinoceros calves (two males, five females) and one hand-reared calf (male), ranging from one month to four years, was simultaneously audio and video-taped at three zoos. Normally reared infants and juveniles uttered four discriminable call types (Whine, Snort, Threat, and Pant) that were produced in different behavioural contexts. All call types were also uttered by the hand-reared calf. Call rates of Whines, but not of the other call types, decreased with age. These findings provide the first evidence that infant and juvenile rhinoceros utter specific call types in distinct contexts, even if they grow up with limited social interaction with conspecifics. By comparing our findings with the current literature on vocalisations of adult white rhinoceros and other solitary rhinoceros species, we discuss to what extent differences in the social lifestyle across species affect acoustic communication in mammals.
Acoustic signals of baby black caimans.
Vergne, Amélie L; Aubin, Thierry; Taylor, Peter; Mathevon, Nicolas
2011-12-01
Despite the importance of crocodilian vocalizations for understanding the evolution of sound communication in Archosauria, information concerning the vocal world of crocodilians is limited because few experimental investigations have been carried out. By studying black caimans Melanosuchus niger in their natural habitat, here we supply the experimental evidence that juvenile crocodilians can use a graded sound system in order to elicit adapted behavioral responses from their mother and siblings. By analyzing the acoustic structure of calls emitted in two different situations ('undisturbed context', during which spontaneous calls of juvenile caimans were recorded without perturbing the group, and a simulated 'predator attack', during which calls were recorded while shaking juveniles) and by testing their biological relevance through playback experiments, we reveal the existence of two functionally different types of juvenile calls that produce a different response from the mother and other siblings. Young black caimans can thus modulate the structure of their vocalizations along an acoustic continuum as a function of the emission context. Playback experiments show that both mother and juveniles discriminate between these 'distress' and 'contact' calls. Acoustic communication is thus an important component mediating relationships within family groups in caimans as it is in birds, their archosaurian relatives. Although probably limited, the vocal repertoire of young crocodilians is capable of transmitting the information necessary for allowing siblings and mother to modulate their behavior. Copyright © 2011 Elsevier GmbH. All rights reserved.
Gilboa-Schechtman, Eva; Galili, Lior; Sahar, Yair; Amir, Ofer
2014-01-01
Social Anxiety (SA) has been shown to be associated with compensatory deficits in pro-social behavior following exclusion and with failure to capitalize on social success. We assessed the subjective and expressive responses of high (n = 48) and low (n = 56) socially anxious individuals to exclusion, acceptance, and popularity induced by a participation in an online ball-tossing game. Before the manipulation, participants read aloud neutral and command utterances. Following the manipulation, participants rated their mood and cognitions and re-read the utterances. Acoustic properties (fundamental frequency–mF0, vocal intensity) of these utterances were analyzed. We found greater differences in self-esteem between high and low socially anxious individuals following the exclusion condition, as compared to the acceptance condition. Among low socially anxious individuals, exclusion promoted increased vocal confidence, as indicated by decreased mF0 and increased vocal intensity in uttering commands; High socially anxious individuals exhibited an opposite reaction, responding to exclusion by decreased vocal confidence. Following popularity, high SA was associated with decreased enhancement in mood and self-esteem in women but not in men. Consistent with evolutionary and interpersonal accounts of SA, we highlight the importance of examining the effects of SA and gender on events indicating unambiguous and unanimous social acceptance. Examining reactivity to changes in belongingness may have important implications for understanding the core mechanisms of SA. PMID:24672463
Kaufman, Allison B; Colbert-White, Erin N; Burgess, Curt
2013-09-01
Previous research has described the significant role that social interaction plays in both the acquisition and use of speech by parrots. The current study analyzed the speech of one home-raised African Grey parrot (Psittacus erithacus erithacus) across three different social contexts: owner interacting with parrot in the same room, owner and parrot interacting out of view in adjacent rooms, and parrot home alone. The purpose was to determine the extent to which the subject's speech reflected an understanding of the contextual substitutability (e.g., the word street can be substituted in context for the word road) of the vocalizations that comprised the units in her repertoire (i.e., global co-occurrence of repertoire units; Burgess in Behav Res Methods Instrum Comput 30:188-198, 1998; Lund and Burgess in Behav Res Methods Instrum Comput 28:203-208, 1996). This was accomplished via the human language model hyperspace analog to language (HAL). HAL is contextually driven and bootstraps language "rules" from input without human intervention. Because HAL does not require human tutelage, it provided an objective measure to empirically examine the parrot's vocalizations. Results indicated that the subject's vocalization patterns did contain global co-occurrence. The presence of this quality in this nonhuman's speech may be strongly indicative of higher-order cognitive skills.
McGettigan, Carolyn; Eisner, Frank; Agnew, Zarinah K; Manly, Tom; Wisbey, Duncan; Scott, Sophie K
2014-01-01
Historically, the study of human identity perception has focused on faces, but the voice is also central to our expressions and experiences of identity (P. Belin, Fecteau, & Bedard, 2004). Our voices are highly flexible and dynamic; talkers speak differently depending on their health, emotional state, and the social setting, as well as extrinsic factors such as background noise. However, to date, there have been no studies of the neural correlates of identity modulation in speech production. In the current fMRI experiment, we measured the neural activity supporting controlled voice change in adult participants performing spoken impressions. We reveal that deliberate modulation of vocal identity recruits the left anterior insula and inferior frontal gyrus, supporting the planning of novel articulations. Bilateral sites in posterior superior temporal/inferior parietal cortex and a region in right mid/anterior superior temporal sulcus showed greater responses during the emulation of specific vocal identities than for impressions of generic accents. Using functional connectivity analyses, we describe roles for these three sites in their interactions with the brain regions supporting speech planning and production. Our findings mark a significant step toward understanding the neural control of vocal identity, with wider implications for the cognitive control of voluntary motor acts. PMID:23691984
The influence of pitch and loudness changes on the acoustics of vocal tremor.
Dromey, Christopher; Warrick, Paul; Irish, Jonathan
2002-10-01
The effect of tremor on phonation is to modulate an otherwise steady sound source in its amplitude, fundamental frequency, or both. The severity of untreated vocal tremor has been reported to change under certain conditions that may be related to muscle tension. In order to better understand the phenomenon of vocal tremor, its acoustic properties were examined as individuals volitionally altered their pitch and loudness. These voice conditions were anticipated to alter the tension of the intrinsic laryngeal muscles. The voices of 10 individuals with a diagnosis of vocal tremor were recorded before participating in a longitudinal treatment study. They produced vowels at low and high pitch and loudness levels as well as in a comfortable voice condition. Acoustic analyses quantified the amplitude and frequency modulations of the speakers' voices across the various conditions. Individual speakers varied in the way the pitch and loudness changes affected their tremor, but the following statistically significant effects for the speakers as a group were observed: Higher pitch phonation was associated with a more rapid rate for both amplitude and frequency modulations. Amplitude modulation became faster for louder phonation. Low-pitched phonation led to decreases in the extent of amplitude tremor. Varying pitch led to dramatic changes in the phase relationship between amplitude and frequency modulation in some of the speakers, whereas this effect was not apparent in other speakers.
Limiting parental feedback disrupts vocal development in marmoset monkeys
Gultekin, Yasemin B.; Hage, Steffen R.
2017-01-01
Vocalizations of human infants undergo dramatic changes across the first year by becoming increasingly mature and speech-like. Human vocal development is partially dependent on learning by imitation through social feedback between infants and caregivers. Recent studies revealed similar developmental processes being influenced by parental feedback in marmoset monkeys for apparently innate vocalizations. Marmosets produce infant-specific vocalizations that disappear after the first postnatal months. However, it is yet unclear whether parental feedback is an obligate requirement for proper vocal development. Using quantitative measures to compare call parameters and vocal sequence structure we show that, in contrast to normally raised marmosets, marmosets that were separated from parents after the third postnatal month still produced infant-specific vocal behaviour at subadult stages. These findings suggest a significant role of social feedback on primate vocal development until the subadult stages and further show that marmoset monkeys are a compelling model system for early human vocal development. PMID:28090084
Fukushima, Makoto; Saunders, Richard C; Fujii, Naotaka; Averbeck, Bruno B; Mishkin, Mortimer
2014-01-01
Vocal production is an example of controlled motor behavior with high temporal precision. Previous studies have decoded auditory evoked cortical activity while monkeys listened to vocalization sounds. On the other hand, there have been few attempts at decoding motor cortical activity during vocal production. Here we recorded cortical activity during vocal production in the macaque with a chronically implanted electrocorticographic (ECoG) electrode array. The array detected robust activity in motor cortex during vocal production. We used a nonlinear dynamical model of the vocal organ to reduce the dimensionality of 'Coo' calls produced by the monkey. We then used linear regression to evaluate the information in motor cortical activity for this reduced representation of calls. This simple linear model accounted for circa 65% of the variance in the reduced sound representations, supporting the feasibility of using the dynamical model of the vocal organ for decoding motor cortical activity during vocal production.
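The phrase "accounted for circa 65% of the variance" in the abstract above refers to a variance-explained measure for a linear regression. The Python sketch below shows, on purely synthetic data, how such a figure can be computed with ordinary least squares. The array shapes, noise level, and variable names are assumptions for illustration; the sketch reports in-sample fit only and is not a reconstruction of the authors' analysis, which may have used different features and validation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-ins (not real data): 200 time points, 8 "channels" of
# cortical activity, and 2 parameters of a reduced call representation.
n_samples, n_channels, n_params = 200, 8, 2
X = rng.normal(size=(n_samples, n_channels))
true_w = rng.normal(size=(n_channels, n_params))
Y = X @ true_w + rng.normal(scale=1.0, size=(n_samples, n_params))

# Ordinary least squares with an intercept column appended to the predictors.
X1 = np.hstack([X, np.ones((n_samples, 1))])
W, *_ = np.linalg.lstsq(X1, Y, rcond=None)
Y_hat = X1 @ W

# Fraction of variance explained (R^2), pooled over the output dimensions.
ss_res = np.sum((Y - Y_hat) ** 2)
ss_tot = np.sum((Y - Y.mean(axis=0)) ** 2)
r2 = 1.0 - ss_res / ss_tot
print(f"variance explained: {r2:.2f}")
```

In practice a decoding result would normally be reported on held-out data, so an in-sample value like the one printed here should be read as an upper bound on what the same model achieves out of sample.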
Female mice ultrasonically interact with males during courtship displays
Neunuebel, Joshua P; Taylor, Adam L; Arthur, Ben J; Egnor, SE Roian
2015-01-01
During courtship males attract females with elaborate behaviors. In mice, these displays include ultrasonic vocalizations. Ultrasonic courtship vocalizations were previously attributed to the courting male, despite evidence that both sexes produce virtually indistinguishable vocalizations. Because of this similarity, and the difficulty of assigning vocalizations to individuals, the vocal contribution of each individual during courtship is unknown. To address this question, we developed a microphone array system to localize vocalizations from socially interacting, individual adult mice. With this system, we show that female mice vocally interact with males during courtship. Males and females jointly increased their vocalization rates during chases. Furthermore, a female's participation in these vocal interactions may function as a signal that indicates a state of increased receptivity. Our results reveal a novel form of vocal communication during mouse courtship, and lay the groundwork for a mechanistic dissection of communication during social behavior. DOI: http://dx.doi.org/10.7554/eLife.06203.001 PMID:26020291
[Temperament of children with vocal fold nodules].
Wei, Youhua; Wang, Zhinan; Xu, Zhongqiang; Chen, Ping; Hao, Lili
2009-11-01
This study examined the temperament of children with vocal fold nodules. The temperament dimensions and temperament types of 42 children with vocal fold nodules were compared with those of 46 vocally normal children, using the Chinese Children's Temperament Problem Screening System (CCTPSs). The children with vocal fold nodules differed significantly from the comparison group in the temperament dimensions of adaptability, intensity of reaction, mood, and persistence, and in temperament type. Difficult and slow-to-warm-up temperament types were more common among children with vocal fold nodules than among vocally normal children.
Thomas, Ashish; Suyesh, Robin; Biju, S. D.; Bee, Mark A.
2014-01-01
Quantitative descriptions of animal vocalizations can inform an understanding of their evolutionary functions, the mechanisms for their production and perception, and their potential utility in taxonomy, population monitoring, and conservation. The goal of this study was to provide the first acoustical and statistical analysis of the advertisement calls of Nasikabatrachus sahyadrensis. Commonly known as the Indian purple frog, N. sahyadrensis is an endangered species endemic to the Western Ghats of India. As the only known species in its family (Nasikabatrachidae), it has ancient evolutionary ties to frogs restricted to the Seychelles archipelago (Sooglossidae). The role of vocalizations in the behavior of this unique species poses interesting questions, as the animal is fossorial and potentially earless and it breeds explosively above the soil for only about two weeks a year. In this study, we quantified 19 acoustic properties of 208 calls recorded from 10 males. Vocalizations were organized into distinct call groups typically composed of two to six short (59 ms), pulsatile calls, each consisting of about five to seven pulses produced at a rate of about 106 pulses/s. The frequency content of the call consisted of a single dominant peak between 1200–1300 Hz and there was no frequency modulation. The patterns of variation within and among individuals were typical of those seen in other frogs. Few of the properties we measured were related to temperature, body size, or condition, though there was little variation in temperature. Field observations and recordings of captive individuals indicated that males engaged in both antiphonal calling and call overlap with nearby calling neighbors. We discuss our findings in relation to previous work on vocal behavior in other fossorial frogs and in sooglossid frogs. PMID:24516517
Spuz, Catherine A; Borszcz, George S
2012-04-01
The amygdala contributes to generation of affective behaviors to threats. The prototypical threat to an individual is exposure to a noxious stimulus and the amygdaloid central nucleus (CeA) receives nociceptive input that is mediated by glutamatergic neurotransmission. The present study evaluated the contribution of glutamate receptors in CeA to generation of the affective response to acute pain in rats. Vocalizations that occur following a brief noxious tail shock (vocalization afterdischarges) are a validated rodent model of pain affect, and were preferentially suppressed by bilateral injection into CeA of the NMDA receptor antagonist D-2-amino-5-phosphonovalerate (AP5, 1 μg, 2 μg, or 4 μg) or the non-NMDA receptor antagonist 6-Cyano-7-nitroquinoxaline-2,3-dione disodium (CNQX, 0.25 μg, 0.5 μg, 1 μg, or 2 μg). Vocalizations that occur during tail shock were suppressed to a lesser degree, whereas spinal motor reflexes (tail flick and hind limb movements) were unaffected by injection of AP5 or CNQX into CeA. Unilateral administration of AP5 or CNQX into CeA of either hemisphere also selectively elevated vocalization thresholds. Bilateral administration of AP5 or CNQX produced greater increases in vocalization thresholds than the same doses of antagonists administered unilaterally into either hemisphere, indicating synergistic hemispheric interactions. The amygdala contributes to production of emotional responses to environmental threats. Blocking glutamate neurotransmission within the central nucleus of the amygdala suppressed rats' emotional response to acute painful stimulation. Understanding the neurobiology underlying emotional responses to pain will provide insights into new treatments for pain and its associated affective disorders. Copyright © 2012 American Pain Society. Published by Elsevier Inc. All rights reserved.
Elucidating spatially explicit behavioral landscapes in the Willow Flycatcher
Bakian, Amanda V.; Sullivan, Kimberly A.; Paxton, Eben H.
2012-01-01
Animal resource selection is a complex, hierarchical decision-making process, yet resource selection studies often focus on the presence and absence of an animal rather than the animal's behavior at resource use locations. In this study, we investigate foraging and vocalization resource selection in a population of Willow Flycatchers, Empidonax traillii adastus, using Bayesian spatial generalized linear models. These models produce “behavioral landscapes” in which space use and resource selection are linked through behavior. Radio telemetry locations were collected from 35 adult Willow Flycatchers (n = 14 males, n = 13 females, and n = 8 unknown sex) over the 2003 and 2004 breeding seasons at Fish Creek, Utah. Results from the 2-stage modeling approach showed that habitat type, perch position, and distance from the arithmetic mean of the home range (in males) or nest site (in females) were important factors influencing foraging and vocalization resource selection. Parameter estimates from the individual-level models indicated high intraspecific variation in the use of the various habitat types and perch heights for foraging and vocalization. On the population level, Willow Flycatchers selected riparian habitat over other habitat types for vocalizing but used multiple habitat types for foraging, including mountain shrub, young riparian, and upland forest. Mapping of observed and predicted foraging and vocalization resource selection indicated that the two behaviors often occurred in disparate areas of the home range. This suggests that multiple core areas may exist in the home ranges of individual flycatchers, and demonstrates that the behavioral landscape modeling approach can be applied to identify spatially and behaviorally distinct core areas. The behavioral landscape approach is applicable to a wide range of animal taxa and can be used to improve our understanding of the spatial context of behavior and resource selection.
Reward and vocal production: song-associated place preference in songbirds.
Riters, Lauren V; Stevenson, Sharon A
2012-05-15
Vocal production is crucial for successful social interactions in multiple species. Reward can strongly influence behavior; however, the extent to which reward systems influence vocal behavior is unknown. In songbirds, singing occurs in different contexts. It can be spontaneous and undirected (e.g., song produced alone or as part of a large flock) or directed towards a conspecific (e.g., song used to attract a mate or influence a competitor). In this study, we developed a conditioned place preference paradigm to measure reward associated with different types of singing behavior in two songbird species. Both male zebra finches and European starlings developed a preference for a chamber associated with production of undirected song, suggesting that the production of undirected song is tightly coupled to intrinsic reward. In contrast, neither starlings nor zebra finches developed a place preference in association with directed song; however, male starlings singing directed song that failed to attract a female developed a place aversion. Unsuccessful contact calling behavior was also associated with a place aversion. These findings suggest that directed vocal behavior is not tightly linked to intrinsic reward but may be externally reinforced by social interactions. Data across two species thus support the hypothesis that the production of undirected but not directed song is tightly coupled to intrinsic reward. This study is the first to identify song-associated reward and suggests that reward associated with vocal production differs depending upon the context in which communication occurs. The findings have implications for understanding what motivates animals to engage in social behaviors and ways in which distinct reward mechanisms function to direct socially appropriate behaviors. Copyright © 2012 Elsevier Inc. All rights reserved.
Auditory and audio-vocal responses of single neurons in the monkey ventral premotor cortex.
Hage, Steffen R
2018-03-20
Monkey vocalization is a complex behavioral pattern, which is flexibly used in audio-vocal communication. A recently proposed dual neural network model suggests that cognitive control might be involved in this behavior, originating from a frontal cortical network in the prefrontal cortex and mediated via projections from the rostral portion of the ventral premotor cortex (PMvr) and motor cortex to the primary vocal motor network in the brainstem. For the rapid adjustment of vocal output to external acoustic events, strong interconnections between vocal motor and auditory sites are needed, which are present at cortical and subcortical levels. However, the role of the PMvr in audio-vocal integration processes remains unclear. In the present study, single neurons in the PMvr were recorded in rhesus monkeys (Macaca mulatta) while volitionally producing vocalizations in a visual detection task or passively listening to monkey vocalizations. Ten percent of randomly selected neurons in the PMvr modulated their discharge rate in response to acoustic stimulation with species-specific calls. More than four-fifths of these auditory neurons showed an additional modulation of their discharge rates either before and/or during the monkeys' motor production of the vocalization. Based on these audio-vocal interactions, the PMvr might be well positioned to mediate higher order auditory processing with cognitive control of the vocal motor output to the primary vocal motor network. Such audio-vocal integration processes in the premotor cortex might constitute a precursor for the evolution of complex learned audio-vocal integration systems, ultimately giving rise to human speech. Copyright © 2018 Elsevier B.V. All rights reserved.
Iwaki, Shinobu; Maeda, Tatsuyoshi; Saito, Miki; Otsuki, Naoki; Takahashi, Miki; Wakui, Emi; Shinomiya, Hirotaka; Morimoto, Koichi; Inoue, Hiroyuki; Masuoka, Hiroo; Miyauchi, Akira; Nibu, Ken-Ichi
2017-03-01
Quality of voice after immediate recurrent laryngeal nerve (RLN) reconstruction in thyroid cancers has not been thoroughly studied. Thirteen patients with fixed vocal cords (fixed vocal cord group) and 8 patients with intact or impaired mobile vocal cords (mobile vocal cord group) who had immediate RLN reconstruction simultaneously with total thyroidectomy, and patients who had arytenoid adduction and thyroplasty for vocal cord paralysis caused by previous surgery (arytenoid adduction thyroplasty group) were enrolled in this study. Preoperative phonation efficiency index was significantly lower (p = .008) in the fixed vocal cord group than in the mobile vocal cord group. One year after surgery, all voice parameters of the patients in the fixed vocal cord group had improved, compared with their preoperative data. The fixed vocal cord group had attained satisfactory voice qualities equivalent to those of the mobile vocal cord group in terms of various voice parameters. The present results support the idea that immediate RLN reconstruction at the time of surgery for thyroid cancers may spare the need for subsequent arytenoid adduction thyroplasty even in the patients with preoperatively fixed vocal cords. © 2016 Wiley Periodicals, Inc. Head Neck 39: 427-431, 2017.
NASA Astrophysics Data System (ADS)
Švancara, P.; Horáček, J.; Švec, J. G.
The study presents a three-dimensional (3D) finite element (FE) model of the flow-induced self-oscillation of the human vocal folds in interaction with the acoustics of simplified vocal tract models. The 3D vocal tract models of the acoustic spaces shaped for simulation of phonation of the Czech vowels [a:], [i:] and [u:] were created by converting data from magnetic resonance images (MRI). To model the fluid-structure interaction, an explicit coupling scheme with separate solvers for the fluid and structure domains was used. The FE model comprises vocal fold pretension before the start of phonation, large deformations of the vocal fold tissue, vocal fold collisions, fluid-structure interaction, morphing of the fluid mesh according to the vocal fold motion (Arbitrary Lagrangian-Eulerian approach), unsteady viscous compressible airflow described by the Navier-Stokes equations, and airflow separation. The developed FE model makes it possible to study the relationship between flow-induced vibrations of the vocal folds and acoustic wave propagation in the vocal tract, and can also be used to simulate, for example, pathological changes in the vocal fold tissue and their influence on voice production.
The effect of voice amplification on occupational vocal dose in elementary school teachers.
Gaskill, Christopher S; O'Brien, Shenendoah G; Tinter, Sara R
2012-09-01
Two elementary school teachers, one with and one without a history of vocal complaints, wore a vocal dosimeter all day at school for a 3-week period. In the second week, each teacher wore a portable voice amplifier. Each teacher showed a reduction in vocal intensity during the week of amplification, with a larger effect for the teacher with vocal difficulties. This teacher also showed a decrease in hourly vocal fold distance dose as measured by the dosimeter despite incurring longer phonation times. Fundamental frequency and vocal fold cycle dose did not appear to be affected by the use of amplification during the teaching day. Both teachers showed evidence of a possible moderate effect of adjusting vocal intensity in the week after amplification, possibly as a means to recalibrate their perceived vocal loudness. This study demonstrates the usefulness of both vocal dosimetry and amplification in monitoring and modifying vocal dose in an occupational setting and reinforces previous data suggesting the effectiveness of amplification in reducing the vocal load in schoolteachers. Implications of the data for future research regarding prevention and treatment of occupational voice disorders are discussed. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Harold, Meredith Poore; Barlow, Steven M.
2012-01-01
The vocalizations and jaw kinematics of 30 infants aged 6–8 months were recorded using a Motion Analysis System and audiovisual technologies. This study represents the first attempt to determine the effect of play environment on infants’ rate of vocalization and jaw movement. Four play conditions were compared: watching videos, social contingent reinforcement and vocal modeling with an adult, playing alone with small toys, and playing alone with large toys. The fewest vocalizations and spontaneous movement were observed when infants were watching videos or interacting with an adult. Infants vocalized most when playing with large toys. The small toys, which naturally elicited gross motor movement (e.g., waving, banging, shaking), educed fewer vocalizations. This study was also the first to quantify the kinematics of vocalized and non-vocalized jaw movements of 6–8 month-old infants. Jaw kinematics did not differentiate infants who produced canonical syllables from those who did not. All infants produced many jaw movements without vocalization. However, during vocalization, infants were unlikely to move their jaw. This contradicts current theories that infant protophonic vocalizations are jaw dominant. Results of the current study can inform socio-linguistic and kinematic theories of canonical babbling. PMID:23261792
Kuo, Chung-Feng Jeffrey; Chu, Yueng-Hsiang; Wang, Po-Chun; Lai, Chun-Yu; Chu, Wen-Lin; Leu, Yi-Shing; Wang, Hsing-Won
2013-12-01
The human larynx is an important organ for voice production and respiration. The vocal cords approximate for voice production and open for breathing. The videolaryngoscope is widely used for vocal cord examination. At present, physicians usually diagnose vocal cord diseases by manually selecting the image in which the vocal cords are open to the largest extent (abduction), thus maximally exposing the vocal cord lesion. On the other hand, the severity of diseases such as vocal palsy and atrophic vocal cord depends largely on how far the vocal cords close at their smallest extent (adduction). Therefore, diseases can be assessed from the image of the vocal cords opening to the largest extent, and the seriousness of a breathy voice is closely correlated with the gap between the vocal cords when they close to the smallest extent. The aim of the study was to design an automatic vocal cord image selection system to improve on the conventional manual selection by physicians and enhance diagnostic efficiency. In addition, because the examination produces unwanted blurred images caused by human factors, as well as non-vocal-cord images, texture analysis was added to measure image entropy and establish a screening and elimination system that enhances the accuracy of selecting the image of the vocal cords closing to the smallest extent. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
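The entropy-based screening mentioned above can be illustrated with a minimal sketch that computes the Shannon entropy of an image's gray-level histogram and keeps only frames above a threshold. The abstract does not give the exact texture measure or cut-off used, so the 8-bit grayscale input, the threshold value, and the function names `image_entropy` and `keep_frames` are assumptions for illustration only.

```python
import numpy as np


def image_entropy(gray, levels=256):
    """Shannon entropy (in bits) of the gray-level histogram of an 8-bit image."""
    hist = np.bincount(gray.ravel(), minlength=levels).astype(float)
    p = hist / hist.sum()
    p = p[p > 0]  # empty bins contribute nothing to the sum
    return float(-np.sum(p * np.log2(p)))


def keep_frames(frames, min_entropy):
    """Indices of frames whose histogram entropy reaches the threshold."""
    return [i for i, frame in enumerate(frames) if image_entropy(frame) >= min_entropy]


# Two toy frames: a featureless one and one with two equally common gray levels.
flat = np.full((8, 8), 128, dtype=np.uint8)
two_level = np.zeros((8, 8), dtype=np.uint8)
two_level[:, 4:] = 255

selected = keep_frames([flat, two_level], min_entropy=0.5)  # hypothetical threshold
print(image_entropy(flat), image_entropy(two_level), selected)
```

Histogram entropy ignores spatial arrangement, so in practice it would likely be combined with other texture or sharpness measures; the sketch only shows the entropy computation itself.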
Responses of primate frontal cortex neurons during natural vocal communication.
Miller, Cory T; Thomas, A Wren; Nummela, Samuel U; de la Mothe, Lisa A
2015-08-01
The role of primate frontal cortex in vocal communication and its significance in language evolution have a controversial history. While evidence indicates that vocalization processing occurs in ventrolateral prefrontal cortex neurons, vocal-motor activity has been conjectured to be primarily subcortical and suggestive of a distinctly different neural architecture from humans. Direct evidence of neural activity during natural vocal communication is limited, as previous studies were performed in chair-restrained animals. Here we recorded the activity of single neurons across multiple regions of prefrontal and premotor cortex while freely moving marmosets engaged in a natural vocal behavior known as antiphonal calling. Our aim was to test whether neurons in marmoset frontal cortex exhibited responses during vocal-signal processing and/or vocal-motor production in the context of active, natural communication. We observed motor-related changes in single neuron activity during vocal production, but relatively weak sensory responses for vocalization processing during this natural behavior. Vocal-motor responses occurred both prior to and during call production and were typically coupled to the timing of each vocalization pulse. Despite the relatively weak sensory responses a population classifier was able to distinguish between neural activity that occurred during presentations of vocalization stimuli that elicited an antiphonal response and those that did not. These findings are suggestive of the role that nonhuman primate frontal cortex neurons play in natural communication and provide an important foundation for more explicit tests of the functional contributions of these neocortical areas during vocal behaviors. Copyright © 2015 the American Physiological Society.
Responses of primate frontal cortex neurons during natural vocal communication
Thomas, A. Wren; Nummela, Samuel U.; de la Mothe, Lisa A.
2015-01-01
The role of primate frontal cortex in vocal communication and its significance in language evolution have a controversial history. While evidence indicates that vocalization processing occurs in ventrolateral prefrontal cortex neurons, vocal-motor activity has been conjectured to be primarily subcortical and suggestive of a distinctly different neural architecture from humans. Direct evidence of neural activity during natural vocal communication is limited, as previous studies were performed in chair-restrained animals. Here we recorded the activity of single neurons across multiple regions of prefrontal and premotor cortex while freely moving marmosets engaged in a natural vocal behavior known as antiphonal calling. Our aim was to test whether neurons in marmoset frontal cortex exhibited responses during vocal-signal processing and/or vocal-motor production in the context of active, natural communication. We observed motor-related changes in single neuron activity during vocal production, but relatively weak sensory responses for vocalization processing during this natural behavior. Vocal-motor responses occurred both prior to and during call production and were typically coupled to the timing of each vocalization pulse. Despite the relatively weak sensory responses a population classifier was able to distinguish between neural activity that occurred during presentations of vocalization stimuli that elicited an antiphonal response and those that did not. These findings are suggestive of the role that nonhuman primate frontal cortex neurons play in natural communication and provide an important foundation for more explicit tests of the functional contributions of these neocortical areas during vocal behaviors. PMID:26084912
How small could a pup sound? The physical bases of signaling body size in harbor seals
Gross, Stephanie; Garcia, Maxime; Rubio-Garcia, Ana; de Boer, Bart
2017-01-01
Vocal communication is a crucial aspect of animal behavior. The mechanism which most mammals use to vocalize relies on three anatomical components. First, air overpressure is generated inside the lower vocal tract. Second, as the airstream goes through the glottis, sound is produced via vocal fold vibration. Third, this sound is further filtered by the geometry and length of the upper vocal tract. Evidence from mammalian anatomy and bioacoustics suggests that some of these three components may covary with an animal’s body size. The framework provided by acoustic allometry suggests that, because vocal tract length (VTL) is more strongly constrained by the growth of the body than vocal fold length (VFL), VTL generates more reliable acoustic cues to an animal’s size. This hypothesis is often tested acoustically but rarely anatomically, especially in pinnipeds. Here, we test the anatomical bases of the acoustic allometry hypothesis in harbor seal pups Phoca vitulina. We dissected and measured vocal tract, vocal folds, and other anatomical features of 15 harbor seals post-mortem. We found that, while VTL correlates with body size, VFL does not. This suggests that, while body growth puts anatomical constraints on how vocalizations are filtered by harbor seals’ vocal tract, no such constraints appear to exist on vocal folds, at least during puppyhood. It is particularly interesting to find anatomical constraints on harbor seals’ vocal tracts, the same anatomical region partially enabling pups to produce individually distinctive vocalizations. PMID:29492005
Roy, Sabyasachi; Zhao, Lingyun; Wang, Xiaoqin
2016-11-30
Although evidence from human studies has long indicated the crucial role of the frontal cortex in speech production, it has remained uncertain whether the frontal cortex in nonhuman primates plays a similar role in vocal communication. Previous studies of prefrontal and premotor cortices of macaque monkeys have found neural signals associated with cue- and reward-conditioned vocal production, but not with self-initiated or spontaneous vocalizations (Coudé et al., 2011; Hage and Nieder, 2013), which casts doubt on the role of the frontal cortex of the Old World monkeys in vocal communication. A recent study of marmoset frontal cortex observed modulated neural activities associated with self-initiated vocal production (Miller et al., 2015), but it did not delineate whether these neural activities were specifically attributed to vocal production or if they may result from other nonvocal motor activity such as orofacial motor movement. In the present study, we attempted to resolve these issues and examined single neuron activities in premotor cortex during natural vocal exchanges in the common marmoset (Callithrix jacchus), a highly vocal New World primate. Neural activation and suppression were observed both before and during self-initiated vocal production. Furthermore, by comparing neural activities between self-initiated vocal production and nonvocal orofacial motor movement, we identified a subpopulation of neurons in marmoset premotor cortex that was activated or suppressed by vocal production, but not by orofacial movement. These findings provide clear evidence of the premotor cortex's involvement in self-initiated vocal production in natural vocal behaviors of a New World primate. Human frontal cortex plays a crucial role in speech production. However, it has remained unclear whether the frontal cortex of nonhuman primates is involved in the production of self-initiated vocalizations during natural vocal communication. 
Using a wireless multichannel neural recording technique, we observed in the premotor cortex neural activation and suppression both before and during self-initiated vocalizations when marmosets, a highly vocal New World primate species, engaged in vocal exchanges with conspecifics. A novel finding of the present study is the discovery of a subpopulation of premotor cortex neurons that was activated by vocal production, but not by orofacial movement. These observations provide clear evidence of the premotor cortex's involvement in vocal production in a New World primate species. Copyright © 2016 the authors.
Phase-Specific Vocalizations of Male Mice at the Initial Encounter during the Courtship Sequence
Matsumoto, Yui K.; Okanoya, Kazuo
2016-01-01
Mice produce ultrasonic vocalizations featuring a variety of syllables. Vocalizations are observed during social interactions. In particular, males produce numerous syllables during courtship. Previous studies have shown that vocalizations change according to sexual behavior, suggesting that males vary their vocalizations depending on the phase of the courtship sequence. To examine this process, we recorded large sets of mouse vocalizations during male–female interactions and acoustically categorized these sounds into 12 vocal types. We found that males emitted predominantly short syllables during the first minute of interaction, more long syllables in the later phases, and mainly harmonic sounds during mounting. These context- and time-dependent changes in vocalization indicate that vocal communication during courtship in mice consists of at least three stages and imply that each vocalization type has a specific role in a phase of the courtship sequence. Our findings suggest that recording for a sufficiently long time and taking the phase of courtship into consideration could provide more insights into the role of vocalization in mouse courtship behavior in future study. PMID:26841117
Iconicity can ground the creation of vocal symbols
Perlman, Marcus; Dale, Rick; Lupyan, Gary
2015-01-01
Studies of gestural communication systems find that they originate from spontaneously created iconic gestures. Yet, we know little about how people create vocal communication systems, and many have suggested that vocalizations do not afford iconicity beyond trivial instances of onomatopoeia. It is unknown whether people can generate vocal communication systems through a process of iconic creation similar to gestural systems. Here, we examine the creation and development of a rudimentary vocal symbol system in a laboratory setting. Pairs of participants generated novel vocalizations for 18 different meanings in an iterative ‘vocal’ charades communication game. The communicators quickly converged on stable vocalizations, and naive listeners could correctly infer their meanings in subsequent playback experiments. People's ability to guess the meanings of these novel vocalizations was predicted by how close the vocalization was to an iconic ‘meaning template’ we derived from the production data. These results strongly suggest that the meaningfulness of these vocalizations derived from iconicity. Our findings illuminate a mechanism by which iconicity can ground the creation of vocal symbols, analogous to the function of iconicity in gestural communication systems. PMID:26361547
Hunter, Eric J.; Titze, Ingo R.
2012-01-01
Objectives: To quantify the recovery of voice following a 2-hour vocal loading exercise (oral reading). Methods: 86 adult participants tracked their voice recovery using short vocal tasks and perceptual ratings after an initial vocal loading exercise and for the following two days. Results: Short-term recovery was apparent with 90% recovery within 4-6 hours and full recovery at 12-18 hours. Recovery was shown to be similar to a dermal wound healing trajectory. Conclusions: The new recovery trajectory highlighted by the vocal loading exercise in the current study is called a vocal recovery trajectory. By comparing vocal fatigue to dermal wound healing, this trajectory is parallel to a chronic wound healing trajectory (as opposed to an acute wound healing trajectory). This parallel suggests that vocal fatigue from the daily use of the voice could be treated as a chronic wound, with the healing and repair mechanisms in a state of constant repair. In addition, there is likely a vocal fatigue threshold at which point the level of tissue damage would shift the chronic healing trajectory to an acute healing trajectory. PMID:19663377
Viscoelastic properties of rabbit vocal folds after augmentation.
Hertegård, Stellan; Dahlqvist, Ake; Laurent, Claude; Borzacchiello, Assunta; Ambrosio, Luigi
2003-03-01
Vocal fold function is closely related to tissue viscoelasticity. Augmentation substances may alter the viscoelastic properties of vocal fold tissues and hence their vibratory capacity. We sought to investigate the viscoelastic properties of rabbit vocal folds in vitro after injections of various augmentation substances. Polytetrafluoroethylene (Teflon), cross-linked collagen (Zyplast), and cross-linked hyaluronan, hylan b gel (Hylaform) were injected into the lamina propria and the thyroarytenoid muscle of rabbit vocal folds. Dynamic viscosity of the injected vocal fold as a function of frequency was measured with a Bohlin parallel-plate rheometer during small-amplitude oscillation. All injected vocal folds showed a decreasing dynamic viscosity with increasing frequency. Vocal fold samples injected with hylan b gel showed the lowest dynamic viscosity, quite close to that of noninjected control samples. Vocal folds injected with polytetrafluoroethylene showed the highest dynamic viscosity, followed by the collagen samples. The data indicated that, among the substances tested, hylan b gel gives the vocal fold the most natural viscoelastic properties in the short term. This is important for restoring or preserving the vibratory capacity of the vocal folds when glottal insufficiency is treated with injections.
On the role of the reticular formation in vocal pattern generation.
Jürgens, Uwe; Hage, Steffen R
2007-09-04
This review is an attempt to localize the brain region responsible for pattern generation of species-specific vocalizations. A catalogue is set up, listing the criteria considered to be essential for a vocal pattern generator. According to this catalogue, a vocal pattern generator should show vocalization-correlated activity, starting before vocal onset and reflecting specific acoustic features of the vocalization. Artificial activation by electrical or glutamatergic stimulation should produce artificially sounding vocalization. Lesioning is expected to have an inhibitory or deteriorating effect on vocalization. Anatomically, a vocal pattern generator can be assumed to have direct or, at least, oligosynaptic connections with all the motoneuron pools involved in phonation. A survey of the literature reveals that the only area meeting all these criteria is a region, reaching from the parvocellular pontine reticular formation just above the superior olive through the lateral reticular formation around the facial nucleus and nucleus ambiguus down to the caudalmost medulla, including the dorsal and ventral reticular nuclei and nucleus retroambiguus. It is proposed that vocal pattern generation takes place within this whole region.
Embodied, Embedded Language Use
Fowler, Carol A.
2011-01-01
Language use has a public face that is as important to study as the private faces under intensive psycholinguistic study. In the domain of phonology, public use of speech must meet an interpersonal “parity” constraint if it is to serve to communicate. That is, spoken language forms must reliably be identified by listeners. To that end, language forms are embodied, at the lowest level of description, as phonetic gestures of the vocal tract that lawfully structure informational media such as air and light. Over time, under the parity constraint, sound inventories emerge over communicative exchanges that have the property of sufficient identifiability. Communicative activities involve more than vocal tract actions. Talkers gesture and use facial expressions and eye gaze to communicate. Listeners embody their language understandings, exhibiting dispositions to behave in ways related to language understanding. Moreover, linguistic interchanges are embedded in the larger context of language use. Talkers recruit the environment in their communicative activities, for example, in using deictic points. Moreover, in using language as a “coordination device,” interlocutors mutually entrain. PMID:21243080
Communication patterns within a group of shelter dogs and implications for their welfare.
Petak, Irena
2013-01-01
Keeping shelter dogs in groups provides them with a more socially and physically enriched environment, but eventually it may cause them stress. Understanding dogs' communication could help shelter staff recognize and prevent undesirable communicative patterns and encourage desirable ones. Therefore, the objective of this study was to determine communication patterns in a group of dogs in a shelter. The observed dogs were engaged in different classes of dyadic and group interactions. Certain dogs were frequently initiators of dyadic interactions, and different dogs were the recipients. The predominant form of dyadic interactions was a neutral one, and aggressive behavior was rarely observed. The tendency of certain dogs to interact continuously may represent a nuisance for less social individuals. All of the dogs participated in 3 defined classes of group interactions. At the group level, the dogs frequently interact vocally or olfactorily. A major welfare problem may be very vocal dogs because their vocalizations are noisy and broadcast far-reaching signals. The frequency of some group interactions was reduced by the amount of time the dogs had in the shelter.
Samlan, Robin A.; Story, Brad H.
2011-01-01
Purpose: To relate vocal fold structure and kinematics to two acoustic measures: cepstral peak prominence (CPP) and the amplitude of the first harmonic relative to the second (H1-H2). Method: A computational, kinematic model of the medial surfaces of the vocal folds was used to specify features of vocal fold structure and vibration in a manner consistent with breathy voice. Four model parameters were altered: degree of vocal fold adduction, surface bulging, vibratory nodal point, and supraglottal constriction. CPP and H1-H2 were measured from simulated glottal area, glottal flow and acoustic waveforms and related to the underlying vocal fold kinematics. Results: CPP decreased with increased separation of the vocal processes, whereas the nodal point location had little effect. H1-H2 increased as a function of separation of the vocal processes in the range of 1–1.5 mm and decreased with separation > 1.5 mm. Conclusions: CPP is generally a function of vocal process separation. H1*-H2* will increase or decrease with vocal process separation based on vocal fold shape, pivot point for the rotational mode, and supraglottal vocal tract shape, limiting its utility as an indicator of breathy voice. Future work will relate the perception of breathiness to vocal fold kinematics and acoustic measures. PMID:21498582
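As an illustration of the H1-H2 measure named in the abstract above, the following Python sketch estimates the amplitudes of the first and second harmonics of a waveform and reports their difference in dB. It is not the authors' analysis code. The function names are hypothetical, the fundamental frequency is assumed to be known in advance, the analysis window is assumed to span a whole number of periods, and no formant correction is applied.

```python
import cmath
import math


def harmonic_amplitude(samples, sample_rate_hz, freq_hz):
    """Estimate the amplitude of one sinusoidal component by projecting
    the signal onto a complex exponential at freq_hz (a single-bin DFT).
    Accurate when the window holds a whole number of periods of freq_hz."""
    n = len(samples)
    acc = sum(
        x * cmath.exp(-2j * math.pi * freq_hz * k / sample_rate_hz)
        for k, x in enumerate(samples)
    )
    return 2.0 * abs(acc) / n


def h1_minus_h2_db(samples, sample_rate_hz, f0_hz):
    """Return the level of the first harmonic relative to the second, in dB."""
    h1 = harmonic_amplitude(samples, sample_rate_hz, f0_hz)
    h2 = harmonic_amplitude(samples, sample_rate_hz, 2.0 * f0_hz)
    return 20.0 * math.log10(h1 / h2)


def two_harmonic_signal(a1, a2, f0_hz, sample_rate_hz, n_samples):
    """Build a synthetic test signal with known harmonic amplitudes
    (illustrative only, not a glottal flow model)."""
    return [
        a1 * math.sin(2.0 * math.pi * f0_hz * k / sample_rate_hz)
        + a2 * math.sin(2.0 * math.pi * 2.0 * f0_hz * k / sample_rate_hz)
        for k in range(n_samples)
    ]


if __name__ == "__main__":
    # 800 samples at 8 kHz = 10 full periods of a 100 Hz fundamental.
    signal = two_harmonic_signal(1.0, 0.5, 100.0, 8000.0, 800)
    print(h1_minus_h2_db(signal, 8000.0, 100.0))
```

A larger H1-H2 value means the first harmonic dominates the second. Real recordings would additionally need fundamental frequency tracking and windowing, which this sketch leaves out.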
Tan, Melin; Pitman, Michael J
2011-07-01
We present a patient with a novel finding of bilateral mucosal bridges, bilateral type III trans-vocal fold sulci vocales, and a vocal fold polyp. Although sulci and mucosal bridges occur in the vocal folds, it is rare to find multiples of these lesions in a single patient, and it is even more uncommon when they occur in conjunction with a vocal fold polyp. To our knowledge, this is the first description of a vocal fold polyp in combination with multiple vocal fold bridges and multiple type III sulci vocales in a single patient. To describe and visually present the diagnosis and treatment of a patient with an intracordal polyp, bilateral mucosal bridges, as well as bilateral type III trans-vocal fold sulci vocales. Presentation of a set of high definition intraoperative photos displaying the extent of the vocal fold lesions and the resection of the intracordal polyp. This patient presented with only 6 months of significant dysphonia. It was felt that the recent change in voice was because of the polyp and not the bridges or sulci vocales. Considering the patient's presentation and the possible morbidity of resection of mucosal bridges and sulci, only the polyp was excised. Postoperatively, the patient's voice returned to his acceptable mild baseline dysphonia, and the benefit has persisted 6 months postoperatively. The combination of bilateral mucosal bridges, bilateral type III sulcus vocalis, and an intracordal polyp in one patient is rare if not novel. Treatment of the polyp alone returned the patient's voice to his lifelong baseline of mild dysphonia. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Mouterde, Solveig C; Elie, Julie E; Mathevon, Nicolas; Theunissen, Frédéric E
2017-03-29
One of the most complex tasks performed by sensory systems is "scene analysis": the interpretation of complex signals as behaviorally relevant objects. The study of this problem, universal to species and sensory modalities, is particularly challenging in audition, where sounds from various sources and localizations, degraded by propagation through the environment, sum to form a single acoustical signal. Here we investigated in a songbird model, the zebra finch, the neural substrate for ranging and identifying a single source. We relied on ecologically and behaviorally relevant stimuli, contact calls, to investigate the neural discrimination of individual vocal signature as well as sound source distance when calls have been degraded through propagation in a natural environment. Performing electrophysiological recordings in anesthetized birds, we found neurons in the auditory forebrain that discriminate individual vocal signatures despite long-range degradation, as well as neurons discriminating propagation distance, with varying degrees of multiplexing between both information types. Moreover, the neural discrimination performance of individual identity was not affected by propagation-induced degradation beyond what was induced by the decreased intensity. For the first time, neurons with distance-invariant identity discrimination properties as well as distance-discriminant neurons are revealed in the avian auditory cortex. Because these neurons were recorded in animals that had prior experience neither with the vocalizers of the stimuli nor with long-range propagation of calls, we suggest that this neural population is part of a general-purpose system for vocalizer discrimination and ranging. SIGNIFICANCE STATEMENT Understanding how the brain makes sense of the multitude of stimuli that it continually receives in natural conditions is a challenge for scientists. 
Here we provide a new understanding of how the auditory system extracts behaviorally relevant information, the vocalizer identity and its distance to the listener, from acoustic signals that have been degraded by long-range propagation in natural conditions. We show, for the first time, that single neurons, in the auditory cortex of zebra finches, are capable of discriminating the individual identity and sound source distance in conspecific communication calls. The discrimination of identity in propagated calls relies on a neural coding that is robust to intensity changes, signals' quality, and decreases in the signal-to-noise ratio. Copyright © 2017 Mouterde et al.
Ng, Nathaniel S R; Wilton, Peter R; Prawiradilaga, Dewi Malia; Tay, Ywee Chieh; Indrawan, Mochamad; Garg, Kritika M; Rheindt, Frank E
2017-09-01
The role of Pleistocene Ice Age in tropical diversification is poorly understood, especially in archipelagos, in which glaciation-induced sea level fluctuations may lead to complicated changes in land distribution. To assess how Pleistocene land bridges may have facilitated gene flow in tropical archipelagos, we investigated patterns of diversification in the rarely-collected rusty-bellied fantail Rhipidura teysmanni (Passeriformes: Rhipiduridae) complex from Wallacea using a combination of bioacoustic traits and whole-genome sequencing methods (dd-RADSeq). We report a biogeographic leapfrog pattern in the vocalizations of these birds, and uncover deep genomic divergence among island populations despite the presence of intermittent land connections between some. We demonstrate how rare instances of genetic introgression have affected the evolution of this species complex, and document the presence of double introgressive mitochondrial sweeps, highlighting the dangers of using only mitochondrial DNA in evolutionary research. By applying different tree inference approaches, we demonstrate how concatenation methods can give inaccurate results when investigating divergence in closely-related taxa. Our study highlights high levels of cryptic avian diversity in poorly-explored Wallacea, elucidates complex patterns of Pleistocene climate-mediated diversification in an elusive montane songbird, and suggests that Pleistocene land bridges may have accounted for limited connectivity among montane Wallacean biota. Copyright © 2017 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Reidenberg, Joy S.; Laitman, Jeffrey T.
2004-05-01
Cetaceans produce sounds at opposite ends of the frequency spectrum. The laryngeal role in odontocete sound production (echolocation, communication) remains unclear. Mysticete infrasonics are presumed to be laryngeal in origin, but production mechanisms are unknown. To address this, we examined postmortem larynges in 6 mysticete species (3 genera) and compared them to our odontocete collection (20 species/15 genera). Results indicate that the rostral portion of the odontocete larynx is elongated, narrow, rigid, and normally positioned intranarially. This portion of the mysticete larynx is comparatively shortened, open, pliable, and in Megaptera may be retracted from its intranarial position. Internally, mysticete vocal folds are thick, paired, and oriented horizontally, compared with the thin, usually unpaired, and vertically oriented odontocete fold. Mysticetes may generate low frequency sounds via pneumatically driven fold vibrations, which then pass to attached laryngeal sac walls, through overlying throat pleats, to water. Rorqual mysticetes may also vibrate paired corniculate flaps while regulating airflow into the nasal region. Infrasonic pulses may pass through adjacent soft palate, skull, or nasal cartilages to water. Laryngeal anatomy in mysticetes and odontocetes appears highly divergent. These morphological differences may correlate to adaptations for producing infrasonic (mysticete) or ultrasonic (odontocete) communication. [Work supported by ONR:N00014-96-1-0764, ONR:N00014-99-1-0815, and AMNHSOF.]
Vocal Tones Influence Young Children’s Responses to Prohibitions
Dahl, Audun; Tran, Amy Q.
2016-01-01
Vocal reactions to child transgressions convey information about the nature of those transgressions. The present research investigated children’s ability to make use of such vocal reactions. Study 1 investigated infants’ compliance with a vocal prohibition telling them to stay away from a toy. Compared to younger infants, older infants showed greater compliance with prohibitions elicited by moral (interpersonal harm) transgressions, but not with prohibitions elicited by pragmatic (inconvenience) transgressions. Study 2 investigated preschoolers’ use of firm-stern vocalizations (associated with moral transgressions) and positive vocalizations (associated with pragmatic transgressions). Most children guessed that the firm-stern vocalizations were uttered in response to a moral transgression and the positive vocalization were uttered in response to a pragmatic transgression. These two studies suggest that children use vocal tones, along with other experiences, to guide their compliance with and interpretation of prohibitions. PMID:27518810
Selection is stronger in early-versus-late stages of divergence in a Neotropical livebearing fish.
Ingley, Spencer J; Johnson, Jerald B
2016-03-01
How selection acts to drive trait evolution at different stages of divergence is of fundamental importance in our understanding of the origins of biodiversity. Yet, most studies have focused on a single point along an evolutionary trajectory. Here, we provide a case study evaluating the strength of divergent selection acting on life-history traits at early-versus-late stages of divergence in Brachyrhaphis fishes. We find that the difference in selection is stronger in the early-diverged population than the late-diverged population, and that trait differences acquired early are maintained over time. © 2016 The Author(s).
Maurer, D; Hess, M; Gross, M
1996-12-01
Theoretic investigations of the "source-filter" model have indicated a pronounced acoustic interaction of glottal source and vocal tract. Empirical investigations of formant pattern variations apart from changes in vowel identity have demonstrated a direct relationship between the fundamental frequency and the patterns. As a consequence of both findings, independence of phonation and articulation may be limited in the speech process. Within the present study, possible interdependence of phonation and phoneme was investigated: vocal fold vibrations and larynx position for vocalizations of different vowels in a healthy man and woman were examined by high-speed light-intensified digital imaging. We found 1) different movements of the vocal folds for vocalizations of different vowel identities within one speaker and at similar fundamental frequency, and 2) constant larynx position within vocalization of one vowel identity, but different positions for vocalizations of different vowel identities. A possible relationship between the vocal fold vibrations and the phoneme is discussed.
Harold, Meredith Poore; Barlow, Steven M
2013-02-01
The vocalizations and jaw kinematics of 30 infants aged 6-8 months were recorded using a Motion Analysis System and audiovisual technologies. This study represents the first attempt to determine the effect of play environment on infants' rate of vocalization and jaw movement. Four play conditions were compared: watching videos, social contingent reinforcement and vocal modeling with an adult, playing alone with small toys, and playing alone with large toys. The fewest vocalizations and spontaneous movement were observed when infants were watching videos or interacting with an adult. Infants vocalized most when playing with large toys. The small toys, which naturally elicited gross motor movement (e.g., waving, banging, shaking), educed fewer vocalizations. This study was also the first to quantify the kinematics of vocalized and non-vocalized jaw movements of 6-8 month-old infants. Jaw kinematics did not differentiate infants who produced canonical syllables from those who did not. All infants produced many jaw movements without vocalization. However, during vocalization, infants were unlikely to move their jaw. This contradicts current theories that infant protophonic vocalizations are jaw-dominant. Results of the current study can inform socio-linguistic and kinematic theories of canonical babbling. Copyright © 2012 Elsevier Inc. All rights reserved.
The Risk of Vocal Fold Atrophy after Serial Corticosteroid Injections of the Vocal Fold.
Shi, Lucy L; Giraldez-Rodriguez, Laureano A; Johns, Michael M
2016-11-01
The aim of this study was to illustrate the risk of vocal fold atrophy in patients who receive serial subepithelial steroid injections for vocal fold scar. This study is a retrospective case report of two patients who underwent a series of weekly subepithelial infusions of 10 mg/mL dexamethasone for benign vocal fold lesion. Shortly after the procedures, both patients developed a weak and breathy voice. The first patient was a 53-year-old man with radiation-induced vocal fold stiffness. Six injections were performed unilaterally, and 1 week later, he developed unilateral vocal fold atrophy with new glottal insufficiency. The second patient was a 67-year-old woman with severe vocal fold inflammation related to laryngitis and calcinosis, Raynaud's phenomenon, esophageal dysmotility, sclerodactyly, and telangiectasia (CREST) syndrome. Five injections were performed bilaterally, and 1 week later, she developed bilateral vocal fold atrophy with a large midline glottal gap during phonation. In both cases, the steroid-induced vocal fold atrophy resolved spontaneously after 4 months. Serial subepithelial steroid infusions of the vocal folds, although safe in the majority of patients, carry the risk of causing temporary vocal fold atrophy when given at short intervals. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Predicting Achievable Fundamental Frequency Ranges in Vocalization Across Species
Titze, Ingo; Riede, Tobias; Mau, Ted
2016-01-01
Vocal folds are used as sound sources in various species, but it is unknown how vocal fold morphologies are optimized for different acoustic objectives. Here we identify two main variables affecting range of vocal fold vibration frequency, namely vocal fold elongation and tissue fiber stress. A simple vibrating string model is used to predict fundamental frequency ranges across species of different vocal fold sizes. While average fundamental frequency is predominantly determined by vocal fold length (larynx size), range of fundamental frequency is facilitated by (1) laryngeal muscles that control elongation and by (2) nonlinearity in tissue fiber tension. One adaptation that would increase fundamental frequency range is greater freedom in joint rotation or gliding of two cartilages (thyroid and cricoid), so that vocal fold length change is maximized. Alternatively, tissue layers can develop to bear a disproportionate fiber tension (i.e., a ligament with high density collagen fibers), increasing the fundamental frequency range and thereby vocal versatility. The range of fundamental frequency across species is thus not simply one-dimensional, but can be conceptualized as the dependent variable in a multi-dimensional morphospace. In humans, this could allow for variations that could be clinically important for voice therapy and vocal fold repair. Alternative solutions could also have importance in vocal training for singing and other highly-skilled vocalizations. PMID:27309543
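The simple vibrating string model mentioned in the abstract above can be illustrated with the ideal-string relation F0 = (1 / (2 L)) * sqrt(stress / density), where L is the vibrating length. The Python sketch below is not the authors' implementation. The function names, the default tissue density of 1040 kg/m^3, and the exponential stress-strain law with its constants are assumptions chosen only to show how elongation and nonlinear fiber stress jointly set the frequency range.

```python
import math


def string_f0(length_m, stress_pa, density_kg_m3=1040.0):
    """Fundamental frequency (Hz) of an ideal string fixed at both ends.
    The default density is an assumed value for soft tissue."""
    if length_m <= 0 or density_kg_m3 <= 0 or stress_pa < 0:
        raise ValueError("length and density must be > 0, stress >= 0")
    return math.sqrt(stress_pa / density_kg_m3) / (2.0 * length_m)


def exponential_stress(strain, a_pa=1000.0, b=10.0):
    """Hypothetical nonlinear stress-strain law; a_pa and b are
    illustrative constants, not measured tissue parameters."""
    return a_pa * (math.exp(b * strain) - 1.0)


def f0_range(rest_length_m, strains, stress_of_strain, density_kg_m3=1040.0):
    """Return (lowest, highest) F0 over the given elongation strains.
    Elongation lengthens the string (lowering F0) while raising the
    stress (raising F0); the stress law decides which effect wins."""
    values = []
    for strain in strains:
        length = rest_length_m * (1.0 + strain)
        values.append(string_f0(length, stress_of_strain(strain), density_kg_m3))
    return min(values), max(values)


if __name__ == "__main__":
    low, high = f0_range(0.01, [0.1, 0.2, 0.3], exponential_stress)
    print(f"F0 range: {low:.1f} Hz to {high:.1f} Hz")
```

With a strongly nonlinear stress law, the rise in stress outpaces the increase in length, so F0 climbs with elongation. This is the qualitative behavior the abstract attributes to tissue layers that bear a disproportionate fiber tension.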
Histopathologic investigations of the unphonated human child vocal fold mucosa.
Sato, Kiminori; Umeno, Hirohito; Nakashima, Tadashi; Nonaka, Satoshi; Harabuchi, Yasuaki
2012-01-01
Vocal fold stellate cells (VFSCs) in the maculae flavae (MFe) located at both ends of the vocal fold mucosa are inferred to be involved in the metabolism of extracellular matrices. MFe are also considered to be an important structure in the growth and development of the human vocal fold mucosa. Tension caused by phonation (vocal fold vibration) is hypothesized to stimulate VFSCs to accelerate production of extracellular matrices. Human child vocal fold mucosae unphonated since birth were investigated histologically. Histologic analysis of human child vocal fold mucosa. Vocal fold mucosae, which have remained unphonated since birth, of two children (7 and 12 years old) with cerebral palsy were investigated by light and electron microscopy and compared with normal subjects. Vocal fold mucosae and MFe were hypoplastic and rudimentary and did not have a vocal ligament, Reinke's space, or the layered structure. The lamina propria appeared as a uniform structure. Some VFSCs in the MFe showed degeneration and not many vesicles were present at the periphery of the cytoplasm. The VFSCs synthesized fewer extracellular matrices, such as fibrous protein and glycosaminoglycan. The VFSCs appeared to have decreased activity. Vocal fold vibration (phonation) after birth is an important factor in the growth and development of the human vocal fold mucosa. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Viscoelasticity of rabbit vocal folds after injection augmentation.
Dahlqvist, Ake; Gärskog, Ola; Laurent, Claude; Hertegård, Stellan; Ambrosio, Luigi; Borzacchiello, Assunta
2004-01-01
Vocal fold function is related to the viscoelasticity of the vocal fold tissue. Augmentation substances used for injection treatment of voice insufficiency may alter the viscoelastic properties of vocal folds and their vibratory capacity. The objective was to compare the mechanical properties (viscoelasticity) of various injectable substances and the viscoelasticity of rabbit vocal folds, 6 months after injection with one of these substances. Animal model. Cross-linked collagen (Zyplast), double cross-linked hyaluronan (hylan B gel), dextranomers in hyaluronan (DiHA), and polytetrafluoroethylene (Teflon) were injected into rabbit vocal folds. Six months after the injection, the animals were killed and the right- and left-side vocal folds were removed. Dynamic viscosity of the injected substances and the vocal folds was measured with a Bohlin parallel-plate rheometer during small-amplitude oscillation. All injected vocal folds showed a decreasing dynamic viscosity with increasing frequency. Hylan B gel and DiHA showed the lowest dynamic viscosity values, and vocal folds injected with these substances also showed the lowest dynamic viscosity (similar to noninjected control samples). Teflon (and vocal folds injected with Teflon) showed the highest dynamic viscosity values, followed by the collagen samples. Substances with low viscoelasticity alter the mechanical properties of the vocal fold to a lesser degree than substances with a high viscoelasticity. The data indicated that hylan B gel and DiHA render the most natural viscoelastic properties to the vocal folds. These substances seem to be appropriate for preserving or restoring the vibratory capacity of the vocal folds when glottal insufficiency is treated with augmentative injections.
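In small-amplitude oscillatory shear, dynamic viscosity is conventionally obtained from the loss modulus G'' as eta' = G'' / omega, with omega = 2 * pi * f. The Python sketch below shows that conversion and a check for the frequency dependence described in the abstract above. It is a minimal illustration, not the authors' procedure: the function names are hypothetical and the loss-modulus values are made-up numbers, not data from the study.

```python
import math


def dynamic_viscosity(loss_modulus_pa, frequency_hz):
    """Dynamic viscosity eta' (Pa*s) from loss modulus G'' (Pa) at a
    given oscillation frequency (Hz): eta' = G'' / (2 * pi * f)."""
    if frequency_hz <= 0:
        raise ValueError("frequency must be positive")
    return loss_modulus_pa / (2.0 * math.pi * frequency_hz)


def is_strictly_decreasing(values):
    """True when every value is smaller than the one before it."""
    return all(later < earlier for earlier, later in zip(values, values[1:]))


if __name__ == "__main__":
    # Made-up example sweep: G'' grows more slowly than frequency,
    # so the derived dynamic viscosity falls as frequency rises.
    frequencies_hz = [0.1, 1.0, 10.0]
    loss_moduli_pa = [5.0, 12.0, 30.0]
    viscosities = [
        dynamic_viscosity(g, f) for g, f in zip(loss_moduli_pa, frequencies_hz)
    ]
    print(viscosities, is_strictly_decreasing(viscosities))
```

Because eta' divides by frequency, any material whose loss modulus rises more slowly than linearly with frequency shows a falling dynamic viscosity.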
Neural Representation of a Target Auditory Memory in a Cortico-Basal Ganglia Pathway
Bottjer, Sarah W.
2013-01-01
Vocal learning in songbirds, like speech acquisition in humans, entails a period of sensorimotor integration during which vocalizations are evaluated via auditory feedback and progressively refined to achieve an imitation of memorized vocal sounds. This process requires the brain to compare feedback of current vocal behavior to a memory of target vocal sounds. We report the discovery of two distinct populations of neurons in a cortico-basal ganglia circuit of juvenile songbirds (zebra finches, Taeniopygia guttata) during vocal learning: (1) one in which neurons are selectively tuned to memorized sounds and (2) another in which neurons are selectively tuned to self-produced vocalizations. These results suggest that neurons tuned to learned vocal sounds encode a memory of those target sounds, whereas neurons tuned to self-produced vocalizations encode a representation of current vocal sounds. The presence of neurons tuned to memorized sounds is limited to early stages of sensorimotor integration: after learning, the incidence of neurons encoding memorized vocal sounds was greatly diminished. In contrast to this circuit, neurons known to drive vocal behavior through a parallel cortico-basal ganglia pathway show little selective tuning until late in learning. One interpretation of these data is that representations of current and target vocal sounds in the shell circuit are used to compare ongoing patterns of vocal feedback to memorized sounds, whereas the parallel core circuit has a motor-related role in learning. Such a functional subdivision is similar to mammalian cortico-basal ganglia pathways in which associative-limbic circuits mediate goal-directed responses, whereas sensorimotor circuits support motor aspects of learning. PMID:24005299
Vocal Tremor Analysis with the Vocal Demodulator.
ERIC Educational Resources Information Center
Winholtz, William S.; Ramig, Lorraine Olson
1992-01-01
This paper describes the Vocal Demodulator as a new device for analysis of vocal tremor. The Vocal Demodulator produces amplitude-demodulated and frequency-demodulated outputs and measures the frequency and level of low-frequency tremor components in sustained phonation. The paper describes quantification of the demodulation process, validation…
Wang, Weian; Lu, Rong
2013-06-01
To investigate the effect of laryngoscopic surgery combined with a nasal endoscopic system for the treatment of benign vocal cord lesions. Fifty-two patients admitted to our department with benign vocal cord lesions (including vocal polyps, vocal nodules, and vocal cord cysts) underwent laryngoscopic surgery combined with a nasal endoscopic system. All patients were treated successfully in a single procedure without any significant postoperative complication. Laryngoscopic surgery combined with a nasal endoscopic system is a safe, minimally invasive and simple method for the treatment of benign lesions of the vocal cords.
Dysphonia and vocal fold telangiectasia in hereditary hemorrhagic telangiectasia.
Chang, Joseph; Yung, Katherine C
2014-11-01
This case report is the first documentation of dysphonia and vocal fold telangiectasia as a complication of hereditary hemorrhagic telangiectasia (HHT). Case report of a 40-year-old man with HHT presenting with 2 years of worsening hoarseness. Hoarseness corresponded with a period of anticoagulation. Endoscopy revealed vocal fold scarring, vocal fold telangiectasias, and plica ventricularis, suggestive of previous submucosal vocal fold hemorrhage and subsequent counterproductive compensation with ventricular phonation. Hereditary hemorrhagic telangiectasia may present as dysphonia with vocal fold telangiectasias and place patients at risk of vocal fold hemorrhage. © The Author(s) 2014.
Vocal fry may undermine the success of young women in the labor market.
Anderson, Rindy C; Klofstad, Casey A; Mayew, William J; Venkatachalam, Mohan
2014-01-01
Vocal fry is speech that is low pitched and creaky sounding, and is increasingly common among young American females. Some argue that vocal fry enhances speaker labor market perceptions while others argue that vocal fry is perceived negatively and can damage job prospects. In a large national sample of American adults we find that vocal fry is interpreted negatively. Relative to a normal speaking voice, young adult female voices exhibiting vocal fry are perceived as less competent, less educated, less trustworthy, less attractive, and less hirable. The negative perceptions of vocal fry are stronger for female voices relative to male voices. These results suggest that young American females should avoid using vocal fry speech in order to maximize labor market opportunities.
Universal mechanisms of sound production and control in birds and mammals
Elemans, C.P.H; Rasmussen, J.H.; Herbst, C.T.; Düring, D.N.; Zollinger, S.A.; Brumm, H.; Srivastava, K.; Svane, N.; Ding, M.; Larsen, O.N.; Sober, S.J.; Švec, J.G.
2015-01-01
As animals vocalize, their vocal organ transforms motor commands into vocalizations for social communication. In birds, the physical mechanisms by which vocalizations are produced and controlled remain unresolved because of the extreme difficulty in obtaining in vivo measurements. Here, we introduce an ex vivo preparation of the avian vocal organ that allows simultaneous high-speed imaging, muscle stimulation and kinematic and acoustic analyses to reveal the mechanisms of vocal production in birds across a wide range of taxa. Remarkably, we show that all species tested employ the myoelastic-aerodynamic (MEAD) mechanism, the same mechanism used to produce human speech. Furthermore, we show substantial redundancy in the control of key vocal parameters ex vivo, suggesting that in vivo vocalizations may also not be specified by unique motor commands. We propose that such motor redundancy can aid vocal learning and is common to MEAD sound production across birds and mammals, including humans. PMID:26612008
Universal mechanisms of sound production and control in birds and mammals.
Elemans, C P H; Rasmussen, J H; Herbst, C T; Düring, D N; Zollinger, S A; Brumm, H; Srivastava, K; Svane, N; Ding, M; Larsen, O N; Sober, S J; Švec, J G
2015-11-27
As animals vocalize, their vocal organ transforms motor commands into vocalizations for social communication. In birds, the physical mechanisms by which vocalizations are produced and controlled remain unresolved because of the extreme difficulty in obtaining in vivo measurements. Here, we introduce an ex vivo preparation of the avian vocal organ that allows simultaneous high-speed imaging, muscle stimulation and kinematic and acoustic analyses to reveal the mechanisms of vocal production in birds across a wide range of taxa. Remarkably, we show that all species tested employ the myoelastic-aerodynamic (MEAD) mechanism, the same mechanism used to produce human speech. Furthermore, we show substantial redundancy in the control of key vocal parameters ex vivo, suggesting that in vivo vocalizations may also not be specified by unique motor commands. We propose that such motor redundancy can aid vocal learning and is common to MEAD sound production across birds and mammals, including humans.
Correlation between vocal tract symptoms and modern singing handicap index in church gospel singers.
Pinheiro, Joel; Silverio, Kelly Cristina Alves; Siqueira, Larissa Thaís Donalonso; Ramos, Janine Santos; Brasolotto, Alcione Ghedini; Zambon, Fabiana; Behlau, Mara
2017-08-24
To verify the correlation between vocal tract discomfort symptoms and perceived voice handicaps in gospel singers, analyzing possible differences according to gender. 100 gospel singers volunteered, 50 male and 50 female. All participants answered two questionnaires: Vocal Tract Discomfort (VTD) scale and the Modern Singing Handicap Index (MSHI) that investigates the vocal handicap perceived by singers, linking the results of both instruments (p<0.05). Women presented more perceived handicaps and also more frequent and higher intensity vocal tract discomfort. Furthermore, the more frequent and intense the vocal tract symptoms, the higher the vocal handicap for singing. Female gospel singers present higher frequency and intensity of vocal tract discomfort symptoms, as well as higher voice handicap for singing than male gospel singers. The higher the frequency and intensity of the laryngeal symptoms, the higher the vocal handicap will be.
FE Modelling of the Fluid-Structure-Acoustic Interaction for the Vocal Folds Self-Oscillation
NASA Astrophysics Data System (ADS)
Švancara, Pavel; Horáček, J.; Hrůza, V.
The flow-induced self-oscillation of the human vocal folds in interaction with acoustic processes in a simplified vocal tract model was explored using a three-dimensional (3D) finite element (FE) model. The developed FE model includes vocal fold pretension before phonation, large deformations of the vocal fold tissue, vocal fold contact, fluid-structure interaction, morphing of the fluid mesh according to the vocal fold motion (Arbitrary Lagrangian-Eulerian approach), unsteady viscous compressible airflow described by the Navier-Stokes equations, and airflow separation during glottal closure. An iterative partitioned approach is used for modelling the fluid-structure interaction. The computed results demonstrate that the developed model can be used to simulate vocal fold self-oscillation and the resulting acoustic waves. The model also makes it possible to simulate numerically the influence of some pathological changes in the vocal fold tissue on voice production.
Vocal development in a Waddington landscape
Teramoto, Yayoi; Takahashi, Daniel Y; Holmes, Philip; Ghazanfar, Asif A
2017-01-01
Vocal development is the adaptive coordination of the vocal apparatus, muscles, the nervous system, and social interaction. Here, we use a quantitative framework based on optimal control theory and Waddington’s landscape metaphor to provide an integrated view of this process. With a biomechanical model of the marmoset monkey vocal apparatus and behavioral developmental data, we show that only the combination of the developing vocal tract, vocal apparatus muscles and nervous system can fully account for the patterns of vocal development. Together, these elements influence the shape of the monkeys’ vocal developmental landscape, tilting, rotating or shifting it in different ways. We can thus use this framework to make quantitative predictions regarding how interfering factors or experimental perturbations can change the landscape within a species, or to explain comparative differences in vocal development across species. DOI: http://dx.doi.org/10.7554/eLife.20782.001 PMID:28092262
Domanski, Mark C; Preciado, Diego A
2012-01-01
Phrenic nerve pacing can be used to treat congenital central hypoventilation syndrome (CCHS). We report how the lack of normal vocal cord tone during phrenic paced respiration can result in passive vocal cord collapse and produce obstructive symptoms. We describe a case of passive vocal cord collapse during phrenic nerve paced respiration in a patient with CCHS. As far as we know, this is the first report of this etiology of airway obstruction. The patient, a 7-year-old with CCHS and normal waking vocal cord movement, continued to require nightly continuous positive airway pressure (CPAP) despite successful utilization of phrenic nerve pacers. On direct laryngoscopy, the patient's larynx was observed while the diaphragmatic pacers were sequentially engaged. No abnormal vocal cord stimulation was witnessed during engaging of either phrenic nerve stimulator. However, the lack of normal inspiratory vocal cord abduction during phrenic nerve-paced respiration resulted in vocal cord collapse and partial obstruction due to passive adduction of the vocal cords through the Bernoulli effect. Bilateral phrenic nerve stimulation resulted in more vocal cord collapse than unilateral stimulation. The lack of vocal cord abduction on inspiration presents a limit to phrenic nerve pacers.
Cazau, Dorian; Adam, Olivier; Aubin, Thierry; Laitman, Jeffrey T; Reidenberg, Joy S
2016-10-10
Although mammalian vocalizations are predominantly harmonically structured, they can exhibit an acoustic complexity with nonlinear vocal sounds, including deterministic chaos and frequency jumps. Such sounds are normative events in mammalian vocalizations, and can be directly traceable to the nonlinear nature of vocal-fold dynamics underlying typical mammalian sound production. In this study, we give qualitative descriptions and quantitative analyses of nonlinearities in the song repertoire of humpback whales from the Ste Marie channel (Madagascar) to provide more insight into the potential communication functions and underlying production mechanisms of these features. A low-dimensional biomechanical model of the whale's U-fold (vocal folds homolog) is used to relate specific vocal mechanisms to nonlinear vocal features. Recordings of living humpback whales were searched for occurrences of vocal nonlinearities (instabilities). Temporal distributions of nonlinearities were assessed within sound units, and between different songs. The anatomical production sources of vocal nonlinearities and the communication context of their occurrences in recordings are discussed. Our results show that vocal nonlinearities may be a communication strategy that conveys information about the whale's body size and physical fitness, and thus may be an important component of humpback whale songs.
Schneiderová, Irena; Zouhar, Jan
2014-01-01
Shrews have rich vocal repertoires that include vocalizations within the human audible frequency range and ultrasonic vocalizations. Here, we recorded and analyzed in detail the acoustic structure of a vocalization with unclear functional significance that was spontaneously produced by 15 adult, captive Asian house shrews (Suncus murinus) while they were lying motionless and resting in their nests. This vocalization was usually emitted repeatedly in a long series with regular intervals. It showed some structural variability; however, the shrews most frequently emitted a tonal, low-frequency vocalization with minimal frequency modulation and a low, non-vocal click that was clearly noticeable at its beginning. There was no effect of sex, but the acoustic structure of the analyzed vocalizations differed significantly between individual shrews. The encoded individuality was low, but it cannot be excluded that this individuality would allow discrimination of family members, i.e., a male and female with their young, collectively resting in a common nest. The question remains whether the Asian house shrews indeed perceive the presence of their mates, parents or young resting in a common nest via the resting-associated vocalization and whether they use it to discriminate among their family members. Additional studies are needed to explain the possible functional significance of resting-associated vocalizations emitted by captive Asian house shrews. Our study highlights that the acoustic communication of shrews is a relatively understudied topic, particularly considering that they are highly vocal mammals. PMID:25390304
The Role of Lexical Stress on the Use of Vocal Fry in Young Adult Female Speakers.
Gibson, Todd A
2017-01-01
Vocal fry is a voice register often used by young adult women for sociolinguistic purposes. Some acoustic correlates of lexical stress, however, appear incompatible with the use of vocal fry. The objective of this study was to systematically examine the role of lexical stress in the use of vocal fry by young adult women. This is a semi-randomized controlled laboratory study. Fifty female undergraduate students were recorded repeating one-, two-, three-, and four-syllable nonwords that conformed to English phonotactics. Nonwords were presented in order from shorter to longer lengths, with stimuli randomized within syllable length. Perceptual analyses of recordings were augmented by acoustic analyses to identify each syllable in which vocal fry occurred. Eighty-six percent of participants produced at least one episode of vocal fry. Vocal fry was more likely to occur in unstressed than stressed position, and the likelihood increased as distance from the stressed syllable increased. There was considerable variability in the use of vocal fry. Frequent and infrequent users varied on the degree to which they used vocal fry in single-syllable nonwords. Vocal fry use persists among young adult women even in the absence of syntactic and pragmatic influences. Lexical stress appeared to dramatically reduce the use of vocal fry. Patterns of vocal fry use appeared to be different for frequent and infrequent users of this vocal register. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Samlan, Robin A; Story, Brad H
2011-10-01
To relate vocal fold structure and kinematics to 2 acoustic measures: cepstral peak prominence (CPP) and the amplitude of the first harmonic relative to the second (H1-H2). The authors used a computational, kinematic model of the medial surfaces of the vocal folds to specify features of vocal fold structure and vibration in a manner consistent with breathy voice. Four model parameters were altered: degree of vocal fold adduction, surface bulging, vibratory nodal point, and supraglottal constriction. CPP and H1-H2 were measured from simulated glottal area, glottal flow, and acoustic waveforms and were related to the underlying vocal fold kinematics. CPP decreased with increased separation of the vocal processes, whereas the nodal point location had little effect. H1-H2 increased as a function of separation of the vocal processes in the range of 1.0 mm to 1.5 mm and decreased with separation > 1.5 mm. CPP is generally a function of vocal process separation. H1*-H2* (see paragraph 6 of article text for an explanation of the asterisks) will increase or decrease with vocal process separation on the basis of vocal fold shape, pivot point for the rotational mode, and supraglottal vocal tract shape, limiting its utility as an indicator of breathy voice. Future work will relate the perception of breathiness to vocal fold kinematics and acoustic measures.
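As a rough illustration of the H1-H2 measure named in the abstract above, the sketch below computes the level of the first harmonic relative to the second, in dB, for a steady periodic signal. It assumes the fundamental frequency is already known and that the analysed segment spans a whole number of periods; the function names and the synthetic test signal are hypothetical and are not taken from the study, which measured H1-H2 on simulated glottal area, glottal flow, and acoustic waveforms.

```python
import math


def harmonic_amplitude(signal, sample_rate, freq):
    """Amplitude of the sinusoidal component at `freq` (single-bin DFT).

    Accurate when the segment holds a whole number of periods of `freq`.
    """
    n = len(signal)
    re = sum(s * math.cos(2 * math.pi * freq * i / sample_rate)
             for i, s in enumerate(signal))
    im = sum(s * math.sin(2 * math.pi * freq * i / sample_rate)
             for i, s in enumerate(signal))
    return 2.0 * math.hypot(re, im) / n


def h1_minus_h2_db(signal, sample_rate, f0):
    """Level of the first harmonic relative to the second, in dB."""
    h1 = harmonic_amplitude(signal, sample_rate, f0)
    h2 = harmonic_amplitude(signal, sample_rate, 2 * f0)
    return 20.0 * math.log10(h1 / h2)


# Synthetic signal (illustrative values only): a 200 Hz fundamental with
# amplitude 1.0 and a second harmonic with amplitude 0.5, 20 full periods.
fs, f0, n = 8000, 200, 800
wave = [1.0 * math.sin(2 * math.pi * f0 * i / fs)
        + 0.5 * math.sin(2 * math.pi * 2 * f0 * i / fs)
        for i in range(n)]
h1h2 = h1_minus_h2_db(wave, fs, f0)
print(round(h1h2, 2))
```

A 2:1 amplitude ratio corresponds to about 6 dB, which is what the synthetic case should return; real voice signals would need f0 tracking and windowing first.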
Syllabic Patterns in the Early Vocalizations of Quichua Children
ERIC Educational Resources Information Center
Gildersleeve-Neumann, Christina E.; Davis, Barbara L.; Macneilage, Peter F.
2013-01-01
To understand the interactions between production patterns common to children regardless of language environment and the early appearance of production effects based on perceptual learning from the ambient language requires the study of languages with diverse phonological properties. Few studies have evaluated early phonological acquisition…
Practical Ways To Improve Singing in Early Childhood Classrooms.
ERIC Educational Resources Information Center
Neelly, Linda P.
2002-01-01
Suggests developmentally appropriate ways early childhood teachers can help young children express, interpret, and understand their experiences through song. Discusses the importance of singing for gaining music and literacy skills and encouraging self-expression and creativity. Includes tips related to posture, breathing, vocal exploration, vocal…
Holt, Marla M; Noren, Dawn P; Dunkin, Robin C; Williams, Terrie M
2015-06-01
Many animals produce louder, longer or more repetitious vocalizations to compensate for increases in environmental noise. Biological costs of increased vocal effort in response to noise, including energetic costs, remain empirically undefined in many taxa, particularly in marine mammals that rely on sound for fundamental biological functions in increasingly noisy habitats. For this investigation, we tested the hypothesis that an increase in vocal effort would result in an energetic cost to the signaler by experimentally measuring oxygen consumption during rest and a 2 min vocal period in dolphins that were trained to vary vocal loudness across trials. Vocal effort was quantified as the total acoustic energy of sounds produced. Metabolic rates during the vocal period were, on average, 1.2 and 1.5 times resting metabolic rate (RMR) in dolphin A and B, respectively. As vocal effort increased, we found that there was a significant increase in metabolic rate over RMR during the 2 min following sound production in both dolphins, and in total oxygen consumption (metabolic cost of sound production plus recovery costs) in the dolphin that showed a wider range of vocal effort across trials. Increases in vocal effort, as a consequence of increases in vocal amplitude, repetition rate and/or duration, are consistent with behavioral responses to noise in free-ranging animals. Here, we empirically demonstrate for the first time in a marine mammal, that these vocal modifications can have an energetic impact at the individual level and, importantly, these data provide a mechanistic foundation for evaluating biological consequences of vocal modification in noise-polluted habitats. © 2015. Published by The Company of Biologists Ltd.
Endoscopic laterofixation in bilateral vocal cords paralysis in children.
Lidia, Zawadzka-Glos; Magdalena, Frackiewicz; Mieczyslaw, Chmielik
2010-06-01
Vocal cord paralysis is the second most frequent cause of laryngeal stridor in children. Symptoms of congenital vocal cord paralysis can occur shortly after birth or later. Vocal cord paralysis can be unilateral or bilateral. Symptoms of unilateral paralysis include a hoarse cry or stridor during deep inhalation. In children, unilateral vocal cord paralysis often resolves spontaneously or can be completely compensated. Children with bilateral vocal cord paralysis present mainly with breathing disorders, while phonation is normal. Symptoms vary, ranging from complete airway obstruction to mild symptoms related to poor exercise tolerance. When symptoms are severe, these patients require a tracheotomy. Failure of normal vocal cord function to recover, incomplete compensation, and persistence of symptoms are indications for surgical treatment. The aim of this study is to present the results of treating bilateral vocal cord paralysis in children using endoscopic laterofixation of the vocal cords. In the Pediatric ENT Department between 1998 and 2009, sixty-four children with dyspnoea and/or phonation disorders caused by vocal cord paralysis were treated. In ten cases laterofixation of the vocal cords was performed, in most cases with a good result. In this article the authors present the method of endoscopic laterofixation and the results achieved. Endoscopic laterofixation of the vocal cords in children is a safe and easy method of surgical treatment for bilateral vocal cord paralysis. This method can be used as a first and often as a one-stage treatment of vocal cord paralysis. In some cases this procedure is insufficient and has to be supplemented with other methods. Copyright (c) 2010 Elsevier Ireland Ltd. All rights reserved.
Female Presence and Estrous State Influence Mouse Ultrasonic Courtship Vocalizations
Hanson, Jessica L.; Hurley, Laura M.
2012-01-01
The laboratory mouse is an emerging model for context-dependent vocal signaling and reception. Mouse ultrasonic vocalizations are robustly produced in social contexts. In adults, male vocalization during courtship has become a model of interest for signal-receiver interactions. These vocalizations can be grouped into syllable types that are consistently produced by different subspecies and strains of mice. Vocalizations are unique to individuals, vary across development, and depend on social housing conditions. The behavioral significance of different syllable types, including the contexts in which different vocalizations are made and the responses listeners have to different types of vocalizations, is not well understood. We examined the effect of female presence and estrous state on male vocalizations by exploring the use of syllable types and the parameters of syllables during courtship. We also explored correlations between vocalizations and other behaviors. These experimental manipulations produced four main findings: 1) vocalizations varied among males, 2) the production of USVs and an increase in the use of a specific syllable type were temporally related to mounting behavior, 3) the frequency (kHz), bandwidth, and duration of syllables produced by males were influenced by the estrous phase of female partners, and 4) syllable types changed when females were removed. These findings show that mouse ultrasonic courtship vocalizations are sensitive to changes in female phase and presence, further demonstrating the context-sensitivity of these calls. PMID:22815817
[Varices of the vocal cord: report of 21 cases].
Li, Jin-rang; Sun, Jian-jun
2006-04-01
To study the diagnosis and treatment of varices of the vocal cord. The clinical data of 21 patients with a varix of the vocal cord were analyzed. All patients presented with hoarseness. There were 15 female and 6 male patients, aged 23 to 68 years (median 44 years). The varix was on the right vocal cord in 12 cases and on the left vocal cord in 9 cases. An isolated varix was present in 10 cases, a varix with vocal cord polyps or nodules in 10 cases, and a varix with vocal cord paralysis in 1 case. All patients were diagnosed by laryngovideoscopy. The lesions appeared on the superior surface of the vocal cord. Varices manifested as abnormally dilated capillaries running in the anterior-to-posterior direction in 6 cases, as clusters of capillaries in 3 cases, and as a dot, small sheet, or short line of capillaries in 12 cases. The varices disappeared in 2 of 8 cases with vocal cord varices and polyps after the polyps were removed. The varices of the other patients showed no change after more than 6 months of follow-up, but one patient developed a hemorrhage of the contralateral vocal cord. Varices are most commonly seen in women. Laryngovideoscopy is key to identifying vocal fold varices. Management of patients with a varix includes medical therapy, speech therapy, and occasionally surgical vaporization.
Hertegård, S; Dahlqvist, A; Goodyer, E
2006-07-01
The scarring model resulted in significant damage and elevated viscoelasticity of the lamina propria. Hyaluronan preparations may alter viscoelasticity in scarred rabbit vocal folds. Vocal fold scarring results in stiffness of the lamina propria and severe voice problems. The aims of this study were to examine the degree of scarring achieved in the experiment and to measure the viscoelastic properties after injection of hyaluronan in rabbit vocal folds. Twenty-two vocal folds from 15 New Zealand rabbits were scarred, 8 vocal folds were controls. After 8 weeks 12 of the scarred vocal folds received injections with 2 types of cross-linked hyaluronan products and 10 scarred folds were injected with saline. After 11 more weeks the animals were sacrificed. After dissection, 15 vocal folds were frozen for viscoelastic measurements, whereas 14 vocal folds were prepared and stained. Measurements were made of the lamina propria thickness. Viscoelasticity was measured on intact vocal folds with a linear skin rheometer (LSR) adapted to laryngeal measurements. Measurements on the digitized slides showed a thickened lamina propria in the scarred samples as compared with the normal vocal folds (p<0.05). The viscoelastic analysis showed a tendency to stiffening of the scarred vocal folds as compared with the normal controls (p=0.05). There was large variation in stiffness between the two injected hyaluronan products.
Vocalization Subsystem Responses to a Temporarily Induced Unilateral Vocal Fold Paralysis
ERIC Educational Resources Information Center
Croake, Daniel J.; Andreatta, Richard D.; Stemple, Joseph C.
2018-01-01
Purpose: The purpose of this study is to quantify the interactions of the 3 vocalization subsystems of respiration, phonation, and resonance before, during, and after a perturbation to the larynx (temporarily induced unilateral vocal fold paralysis) in 10 vocally healthy participants. Using dynamic systems theory as a guide, we hypothesized that…
NASA Astrophysics Data System (ADS)
Saidi, Hiba; Erath, Byron D.
2015-11-01
The vocal folds play a major role in human communication by initiating voiced sound production. During voiced speech, the vocal folds are set into sustained vibrations. Synthetic self-oscillating vocal fold models are regularly employed to gain insight into flow-structure interactions governing the phonation process. Commonly, a fixed boundary condition is applied to the lateral, anterior, and posterior sides of the synthetic vocal fold models. However, physiological observations reveal the presence of adipose tissue on the lateral surface between the thyroid cartilage and the vocal folds. The goal of this study is to investigate the influence of including this substrate layer of adipose tissue on the dynamics of phonation. For a more realistic representation of the human vocal folds, synthetic multi-layer vocal fold models have been fabricated and tested while including a soft lateral layer representative of adipose tissue. Phonation parameters have been collected and are compared to those of the standard vocal fold models. Results show that vocal fold kinematics are affected by adding the adipose tissue layer as a new boundary condition.
Yamauchi, Akihito; Yokonishi, Hisayuki; Imagawa, Hiroshi; Sakakibara, Ken-Ichi; Nito, Takaharu; Tayama, Niro
2016-11-01
The goal of this work was to objectively elucidate the vibratory characteristics of vocal fold paralysis (VFP) using high-speed digital imaging (HSDI). HSDI was performed in 29 vocally healthy subjects (12 women and 17 men) and in 107 patients with VFP (40 women and 67 men). Then, the HSDI data were evaluated by visual-perceptual rating, single-line kymography, multiline kymography, laryngotopography, and glottal area waveform analysis. Compared with vocally healthy subjects, patients with VFP showed more frequent incomplete glottal closure, greater asymmetry in amplitude, mucosal wave, frequency, and phase, as well as a larger open quotient, smaller speed index, larger maximal and minimal glottal area, and smaller glottal area difference. Paralyzed vocal folds in VFP showed a smaller mucosal wave than nonparalyzed vocal folds in VFP or intact vocal folds in vocally healthy subjects. HSDI was effective in documenting the characteristics of vocal fold vibrations in patients with VFP and in exploring the vibratory disturbance for estimating the severity of dysphonia. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Singers' interest and knowledge levels of vocal function and dysfunction: survey findings.
Braun-Janzen, Colleen; Zeine, Lina
2009-07-01
A questionnaire investigating the levels of interest in and knowledge of vocal function and dysfunction was completed by 129 singers. Those with professional singing experience indicated significantly greater interest and higher perceived knowledge levels than amateurs in areas of vocal anatomy and physiology, vocal hygiene, and functional vocal pathologies. Greater interest levels, but not higher perceived knowledge levels were reported by professional singers (PSs) in the area of the role of the speech-language pathologist (SLP) and the voice. Professionals answered significantly more knowledge-based questions correctly than amateurs in all areas except the role of the SLP and the voice. However, findings indicated wide variability in knowledge levels of both groups. Singing teachers (STs) within the group significantly outperformed the remainder of the group in areas of vocal anatomy and physiology, vocal hygiene, and functional vocal pathologies. Scores of the choir directors (CDs) within the group were not significantly superior to the remainder of the group except in the area of functional vocal pathologies. Implications for a preventative approach to vocal health are discussed.
Early experience shapes vocal neural coding and perception in songbirds
Woolley, Sarah M. N.
2012-01-01
Songbirds, like humans, are highly accomplished vocal learners. The many parallels between speech and birdsong and conserved features of mammalian and avian auditory systems have led to the emergence of the songbird as a model system for studying the perceptual mechanisms of vocal communication. Laboratory research on songbirds allows the careful control of early life experience and high-resolution analysis of brain function during vocal learning, production and perception. Here, I review what songbird studies have revealed about the role of early experience in the development of vocal behavior, auditory perception and the processing of learned vocalizations by auditory neurons. The findings of these studies suggest general principles for how exposure to vocalizations during development and into adulthood influences the perception of learned vocal signals. PMID:22711657
Interstitial protein alterations in rabbit vocal fold with scar.
Thibeault, Susan L; Bless, Diane M; Gray, Steven D
2003-09-01
Fibrous and interstitial proteins compose the extracellular matrix of the vocal fold lamina propria and account for its biomechanical properties. Vocal fold scarring is characterized by altered biomechanical properties, which create dysphonia. Although alterations of the fibrous proteins have been confirmed in the rabbit vocal fold scar, interstitial proteins, which are known to be important in wound repair, have not been investigated to date. Using a rabbit model, the interstitial proteins decorin, fibromodulin, and fibronectin were examined immunohistologically two months after induction of vocal fold scar by means of forceps biopsy. Significantly decreased decorin and fibromodulin with significantly increased fibronectin characterized scarred vocal fold tissue. The implications of altered interstitial protein levels and their effect on the fibrous proteins will be discussed in relation to the increased vocal fold stiffness and viscosity that characterize vocal fold scar.
The forgotten cause of stridor in the emergency department.
Ng, Tian-Tee
2017-01-01
Paradoxical vocal fold movement disorder is a condition in which the larynx exhibits paradoxical vocal cord closure during respiration, creating partial airway obstruction. The causes of vocal fold movement disorder are multifactorial; patients describe tightness of the throat and difficulty getting air in, have stridor, and do not respond to inhalers. We propose using transnasal laryngoscopy, which will show narrowing of the vocal cords on inspiration, and the Pittsburgh Vocal Cord Dysfunction Index with a cutoff score of ≥4 to distinguish vocal fold movement disorder from asthma and other causes of stridor. Management of paradoxical vocal fold movement disorder involves a combination of pharmacological, psychological, and psychiatric treatment and speech training. Paradoxical vocal fold movement disorder is a very treatable cause of stridor, so long as it is identified and other organic causes are excluded.
Evolutionary Origins for Social Vocalization in a Vertebrate Hindbrain–Spinal Compartment
Bass, Andrew H.; Gilland, Edwin H.; Baker, Robert
2008-01-01
The macroevolutionary events leading to neural innovations for social communication, such as vocalization, are essentially unexplored. Many fish vocalize during female courtship and territorial defense, as do amphibians, birds, and mammals. Here, we map the neural circuitry for vocalization in larval fish and show that the vocal network develops in a segment-like region across the most caudal hindbrain and rostral spinal cord. Taxonomic analysis demonstrates a highly conserved pattern between fish and all major lineages of vocal tetrapods. We propose that the vocal basis for acoustic communication among vertebrates evolved from an ancestrally shared developmental compartment already present in the early fishes. PMID:18635807
Non-song vocalizations of pygmy blue whales in Geographe Bay, Western Australia.
Recalde-Salas, A; Salgado Kent, C P; Parsons, M J G; Marley, S A; McCauley, R D
2014-05-01
Non-song vocalizations of migrating pygmy blue whales (Balaenoptera musculus brevicauda) in Western Australia are described. Simultaneous land-based visual observations and underwater acoustic recordings detected 27 groups in Geographe Bay, WA over 2011 to 2012. Six different vocalizations were recorded that were not repeated in a pattern or in association with song, and thus were identified as non-song vocalizations. Five of these were not previously described for this population. Their acoustic characteristics and context are presented. Given that 56% of groups vocalized, 86% of which produced non-song vocalizations and 14% song units, the inclusion of non-song vocalizations in passive-acoustic monitoring is proposed.
Paul, Nilanjan; Kumar, Suman; Chatterjee, Indranil; Mukherjee, Biswarup
2011-01-01
In-depth studies of laryngeal biomechanics and vocal fold vibratory patterns reveal that a single vibratory cycle can be divided into two major phases, the closed phase and the open phase, the latter being subdivided into opening and closing phases. Studies indicate that the relative time course of abduction and adduction, which in turn depends on the relative relaxing and tensing of the vocal fold cover and body, is the determining factor in the production of a particular vocal register, such as the modal (or chest), falsetto, or glottal fry register. Studies further point to electroglottography (EGG) as particularly suitable for the study of vocal vibratory patterns during register changes. However, to date, there has been limited study of the quantitative parameterization of the EGG waveform in the vocal fry register. Moreover, contradictory findings abound in the literature regarding the effects of gender and vowel type on vocal vibratory patterns, especially during phonation in different registers. The present study endeavors to determine the effects of vowel and gender differences on vocal fold vibratory patterns in different registers and how these are reflected in the standard EGG parameters of Contact Quotient (CQ) and Contact Index (CI), taking into consideration the Indian sociolinguistic context. Electroglottographic recordings of 10 young adults (5 males and 5 females) were taken while the subjects phonated the three vowels /a/, /i/, /u/, each in two vocal registers, modal and vocal fry. The raw EGG signals were normalized using the Derived EGG algorithm, and the CQ and CI values were derived. The data were subjected to statistical analysis using a 3-way ANOVA with gender, vowel and vocal register as the three variables. Post-hoc Dunnett C multiple comparison analyses were also performed. Results reveal that CQ values are significantly higher in vocal fry than in modal phonation for both males and females, indicating a relatively hyperconstricted vocal system during vocal fry. The males have significantly greater CQ values than females in both modal and vocal fry phonation, which indicates that males are predisposed to greater vocal fold constriction. Females demonstrated no significant increase in CI values in the vocal fry state, and in some cases showed a decrease in CI values, which suggests a physiological adjustment of the vocal folds inherently distinct from that in males. No vowel effects were found in any condition. Perturbation values (CQP and CIP) are significantly higher in the vocal fry register than in the modal register, and the increase was greater in females than in males. The findings give strong support to certain hypotheses in the literature regarding the effects of vowel, gender and phonatory register on vocal fold vibratory patterns.
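To make the Contact Quotient concrete, here is a minimal sketch that estimates CQ for one EGG cycle as the fraction of samples lying above a criterion level between the cycle minimum and maximum. This is an assumption for illustration only: the study reports using the Derived EGG algorithm, whereas the sketch substitutes a simple threshold criterion, assumes that larger EGG values mean more vocal fold contact, and uses invented rectangular cycles rather than data from the study.

```python
def contact_quotient(cycle, criterion=0.5):
    """Fraction of one EGG cycle spent above a criterion level.

    `cycle` holds the samples of a single vibratory cycle, with larger
    values assumed to mean more vocal fold contact. `criterion` is the
    threshold as a fraction of the peak-to-peak amplitude (hypothetical
    default of 0.5).
    """
    lo, hi = min(cycle), max(cycle)
    if hi == lo:
        raise ValueError("cycle has no amplitude variation")
    threshold = lo + criterion * (hi - lo)
    in_contact = sum(1 for v in cycle if v > threshold)
    return in_contact / len(cycle)


# Invented rectangular cycles, 100 samples each (not study data):
# a shorter contact phase versus a longer contact phase.
short_contact_cycle = [1.0] * 45 + [0.0] * 55
long_contact_cycle = [1.0] * 70 + [0.0] * 30

cq_short = contact_quotient(short_contact_cycle)
cq_long = contact_quotient(long_contact_cycle)
print(cq_short, cq_long)
```

A higher CQ means the folds stay in contact for a larger share of the cycle, which is the sense in which the abstract reads higher CQ in vocal fry as a more constricted setting.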
Kaneko, Mami; Kishimoto, Yo; Suzuki, Ryo; Kawai, Yoshitaka; Tateya, Ichiro; Hirano, Shigeru
2017-05-01
Professional voice users, such as singers and teachers, are at greater risk of developing vocal fold injury from excessive use of voice; thus, protection of the vocal fold is essential. One of the most important factors that aggravates injury is the production of reactive oxygen species at the wound site. The purpose of the current study was to assess the effect of astaxanthin, a strong antioxidant, on the protection of the vocal fold from injury and inflammation due to vocal loading. This study is an institutional review board-approved human clinical trial. Ten male subjects underwent a 60-minute vocal loading session and received vocal assessments prior to, immediately after, and 30 minutes postvocal loading (AST(-) status). All subjects were then prescribed 24 mg/day of astaxanthin for 28 days, after which they received the same vocal task and assessments (AST(+) status). Phonatory parameters were compared between both groups. Aerodynamic assessment, acoustic analysis, and GRBAS scale (grade, roughness, breathiness, asthenia, and strain) were significantly worse in the AST(-) status immediately after vocal loading, but improved by 30 minutes after loading. In contrast, none of the phonatory parameters in the AST(+) status were statistically worse, even when measured immediately after vocal loading. No allergic responses or adverse effects were observed after administration of astaxanthin. The current results suggest that astaxanthin can protect the vocal fold from injury and inflammation caused by vocal loading possibly through the regulation of oxidative stress. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Ghassemi, Marzyeh; Van Stan, Jarrad H; Mehta, Daryush D; Zañartu, Matías; Cheyne, Harold A; Hillman, Robert E; Guttag, John V
2014-06-01
Voice disorders are medical conditions that often result from vocal abuse/misuse which is referred to generically as vocal hyperfunction. Standard voice assessment approaches cannot accurately determine the actual nature, prevalence, and pathological impact of hyperfunctional vocal behaviors because such behaviors can vary greatly across the course of an individual's typical day and may not be clearly demonstrated during a brief clinical encounter. Thus, it would be clinically valuable to develop noninvasive ambulatory measures that can reliably differentiate vocal hyperfunction from normal patterns of vocal behavior. As an initial step toward this goal we used an accelerometer taped to the neck surface to provide a continuous, noninvasive acceleration signal designed to capture some aspects of vocal behavior related to vocal cord nodules, a common manifestation of vocal hyperfunction. We gathered data from 12 female adult patients diagnosed with vocal fold nodules and 12 control speakers matched for age and occupation. We derived features from weeklong neck-surface acceleration recordings by using distributions of sound pressure level and fundamental frequency over 5-min windows of the acceleration signal and normalized these features so that intersubject comparisons were meaningful. We then used supervised machine learning to show that the two groups exhibit distinct vocal behaviors that can be detected using the acceleration signal. We were able to correctly classify 22 of the 24 subjects, suggesting that in the future measures of the acceleration signal could be used to detect patients with the types of aberrant vocal behaviors that are associated with hyperfunctional voice disorders.
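The windowing step described above can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the function names, the particular summary statistics, and the per-subject z-score normalization are all assumptions, since the abstract states only that distributions of sound pressure level and fundamental frequency were summarized over 5-min windows and then normalized.

```python
import statistics


def window_features(times_s, values, window_s=300.0):
    """Summarize a voiced-frame series (e.g. SPL in dB or F0 in Hz) per window.

    times_s and values are parallel sequences; window_s defaults to 5 minutes.
    The chosen statistics are illustrative assumptions, not the study's features.
    """
    windows = {}
    for t, v in zip(times_s, values):
        windows.setdefault(int(t // window_s), []).append(v)

    features = []
    for idx in sorted(windows):
        frames = windows[idx]
        if len(frames) < 2:
            continue  # too few voiced frames to describe a distribution
        features.append({
            "window": idx,
            "mean": statistics.fmean(frames),
            "stdev": statistics.stdev(frames),
            "min": min(frames),
            "max": max(frames),
        })
    return features


def zscore_by_subject(feature_values):
    """Normalize one subject's feature values to zero mean and unit variance.

    Hypothetical normalization so that values are comparable across subjects.
    """
    mu = statistics.fmean(feature_values)
    sd = statistics.pstdev(feature_values)
    if sd == 0:
        return [0.0 for _ in feature_values]
    return [(x - mu) / sd for x in feature_values]
```

Per-window feature vectors of this kind could then be passed to any supervised classifier; the abstract does not name the one that was used.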
Tourette syndrome: the self under siege.
Leckman, James F; Bloch, Michael H; Scahill, Lawrence; King, Robert A
2006-08-01
Tourette syndrome is a neurodevelopmental disorder characterized by motor and vocal tics--rapid, repetitive, stereotyped movements or vocalizations. Tourette syndrome typically has a prepubertal onset, and boys are more commonly affected than girls. Symptoms usually begin with transient bouts of simple motor tics. By age 10 years, most children are aware of nearly irresistible somatosensory urges that precede the tics. These urges likely reflect a defect in sensorimotor gating because they intrude into the child's conscious awareness and become a source of distraction and distress. A momentary sense of relief typically follows the completion of a tic. Over the course of hours, tics occur in bouts, with a regular intertic interval. Tics increase during periods of emotional excitement and fatigue. Tics can become "complex" in nature and appear to be purposeful. Tics can be willfully suppressed for brief intervals and can be evoked by the mere mention of them. Tics typically diminish during periods of goal-directed behavior, especially those that involve both heightened attention and fine motor or vocal control, as occur in musical and athletic performances. Over the course of months, tics wax and wane. New tics appear, often in response to new sources of somatosensory irritation, such as the appearance of a persistent vocal tic (a cough) following a cold. Over the course of years, tic severity typically peaks between 8 and 12 years of age. By the end of the second decade of life, many individuals are virtually tic free. Less than 20% of cases continue to experience clinically impairing tics as adults. 
Tics rarely occur in isolation, and other coexisting conditions--such as behavioral disinhibition, hypersensitivity to a broad range of sensory stimuli, problems with visual motor integration, procedural learning difficulties, attention-deficit hyperactivity disorder (ADHD), obsessive-compulsive disorder, depression, anxiety, and emotional instability--are often a greater source of impairment than the tics themselves. Emerging behavioral treatments of Tourette syndrome are based in part on an understanding of the moment-to-moment experience of somatosensory urges and motor response. With identification of specific genes of major effect and advances in our understanding of the neural circuitry of sensorimotor gating, habit formation, and procedural memory--together with insights from postmortem brain studies, in vivo brain imaging, and electrophysiologic recordings--we might be on the threshold of a deeper understanding of the phenomenology and natural history of Tourette syndrome.
Signal analysis of the female singing voice: Features for perceptual singer identity
NASA Astrophysics Data System (ADS)
Mellody, Maureen
2001-07-01
Individual singing voices tend to be easy for a listener to identify, particularly when compared to the difficulty of identifying the performer of any other musical instrument. What cues does a listener use to identify a particular singing voice? This work seeks to identify a set of features with which one can synthesize notes with the vocal quality of a particular singer. Such analysis and synthesis influences computer music (in the creation of synthetic sounds with different timbre), vocal pedagogy (as a training tool to help singers understand properties of their own voice as well as different professional-quality voices), and vocal health (to identify improper behavior in vocal production). The problem of singer identification is approached in three phases: signal analysis, the development of low-order representations, and perceptual evaluation. To perform the signal analysis, a high-resolution time-frequency distribution is applied to vowel tokens from sopranos and mezzo-sopranos. From these results, low-order representations are created for each singer's notes, which are used to synthesize sounds with the timbral quality of that singer. Finally, these synthesized sounds, along with original recordings, are evaluated by trained listeners in a variety of perceptual experiments to determine the extent to which the vocal quality of the desired singer is captured. Results from the signal analysis show that amplitude and frequency estimates extracted from the time-frequency signal analysis can be used to re-create each signal with little degradation in quality and no loss of perceptual identity. Low-order representations derived from the signal analysis are used in clustering and classification, which successfully clusters signals with corresponding singer identity.
Finally, perceptual results indicate that trained listeners are, surprisingly, only modestly successful at correctly identifying the singer of a recording, and find the task to be particularly difficult for certain voices and extremely easy for others. Listeners also indicate that the majority of sounds synthesized with the low-order representations sufficiently capture the desired vocal timbre. Again, the task is easy for certain voices and much more difficult when evaluating other singers, consistent with the results from the original recordings.
Children's Recognition of Emotions from Vocal Cues
ERIC Educational Resources Information Center
Sauter, Disa A.; Panattoni, Charlotte; Happe, Francesca
2013-01-01
Emotional cues contain important information about the intentions and feelings of others. Despite a wealth of research into children's understanding of facial signals of emotions, little research has investigated the developmental trajectory of interpreting affective cues in the voice. In this study, 48 children ranging between 5 and 10 years were…
Environmental drivers of anuran calling phenology in a seasonal neotropical ecosystem
Christopher M. Schalk; Dan Saenz
2015-01-01
Temporal variation represents an important component in understanding the structure of ecological communities and species coexistence. We examined calling phenology of an assemblage of anurans in the Gran Chaco ecoregion of Bolivia by deploying automated recording devices to document nocturnally vocalizing amphibians nightly at seven...
The Influence of Interactive Context on Prelinguistic Vocalizations and Maternal Responses
ERIC Educational Resources Information Center
Gros-Louis, Julie; West, Meredith J.; King, Andrew P.
2016-01-01
Many studies have documented influences of maternal responsiveness on cognitive and language development. Given the bidirectionality of interactions in caregiver-infant dyads, it is important to understand how infant behavior elicits variable responses. Prior studies have shown that mothers respond differentially to features of prelinguistic…
Improvisation: An Essential Element of Musical Proficiency.
ERIC Educational Resources Information Center
Dobbins, Bill
1980-01-01
The author discusses the importance of improvisation, suggesting that improvisation be introduced in the earliest stages of education and be taught through an approach that integrates ear training, sight-reading, instrumental and vocal techniques and theory into a unified and complete understanding of music as a language. (Author/KC)
Qualia: A Prescription for Developing a Quality Health Threat Assessment
2008-12-01
neurophysiology, so she knows the physical reaction of the retina when light spectrums enter the eye. Her training includes an understanding of the vocal chords...in a gradual increase in acceptance, termed the slope of enlightenment. With continued emphasis and adjustment, the project concept will continue
From Band Rooms to Choral Risers.
ERIC Educational Resources Information Center
Washington, William Lee
1999-01-01
Offers suggestions for instrumentalists who have become choral instructors explaining that their first goal should be to learn how to identify and understand an ideal vocal sound. Finds that the hardest aspect of choral instruction is developing a teaching approach. Offers five steps to assist instrumentalists in molding their own approach. (CMK)
Information-Processing Modules and Their Relative Modality Specificity
ERIC Educational Resources Information Center
Anderson, John R.; Qin, Yulin; Jung, Kwan-Jin; Carter, Cameron S.
2007-01-01
This research uses fMRI to understand the role of eight cortical regions in a relatively complex information-processing task. Modality of input (visual versus auditory) and modality of output (manual versus vocal) are manipulated. Two perceptual regions (auditory cortex and fusiform gyrus) only reflected perceptual encoding. Two motor regions were…
Impact of call center work in subjective voice symptoms and complaints--an analytic study.
Rechenberg, Leila; Goulart, Bárbara Niegia Garcia de; Roithmann, Renato
2011-12-01
To estimate the prevalence of vocal symptoms, occupational risk factors, associated symptoms and their impact on the professional activity of telemarketers. Cross-sectional analytical study with 124 telemarketers and 109 administrative workers (control group) selected from a random sample stratified by gender. The subjects answered an anonymous self-administered questionnaire involving issues related to the presence of vocal symptoms, potential risk factors for dysphonia, and the impact of vocal symptoms on professional activity. The presence of one or more voice symptoms that occurred daily or weekly was considered positive for the presence of vocal symptoms. Vocal symptoms were found in 33% of telemarketers and in 21% of the control group, indicating an association between vocal symptoms and the activity of the telemarketer. When adjusted for confounders, this association remained in the direction of increased risk. In telemarketers, the sensation of dry air, ambient noise, and lack of vocal rest were the complaints most frequently reported by those presenting vocal symptoms. Almost 70% of telemarketers with vocal symptoms reported that these symptoms interfere with their professional activity. The rate of absenteeism due to vocal symptoms in this group was 29%. Vocal symptoms are more common in telemarketers than in their peer controls, and significantly affect their job performance.
Wingstrand, Vibe Lindeblad; Jensen, David H.; Bork, Kristian; Sebbesen, Lars; Balle, Jesper; Fischer-Nielsen, Anne; von Buchwald, Christian
2016-01-01
Objectives: Therapy with mesenchymal stem cells exhibits potential for the development of novel interventions for many diseases and injuries. The use of mesenchymal stem cells in regenerative therapy for vocal fold scarring exhibited promising results to reduce stiffness and enhance the biomechanical properties of injured vocal folds. This study evaluated the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring. Data Sources: PubMed, Embase, the Cochrane Library and Google Scholar were searched. Methods: Controlled studies that assessed the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring were included. Primary outcomes were viscoelastic properties and mucosal wave amplitude. Results: Seven preclinical animal studies (n = 152 single vocal folds) were eligible for inclusion. Evaluation of viscoelastic parameters revealed a decreased dynamic viscosity (η’) and elastic modulus (G’), i.e., decreased resistance and stiffness, in scarred vocal folds treated with mesenchymal stem cells compared to non-treated scarred vocal folds. Mucosal wave amplitude was increased in scarred vocal folds treated with mesenchymal stem cells vs. non-treated scarred vocal folds. Conclusion: The results from these studies suggest an increased regenerative effect of therapy with mesenchymal stem cells for scarred vocal folds and are encouraging for further clinical studies. PMID:27631373
Wingstrand, Vibe Lindeblad; Grønhøj Larsen, Christian; Jensen, David H; Bork, Kristian; Sebbesen, Lars; Balle, Jesper; Fischer-Nielsen, Anne; von Buchwald, Christian
2016-01-01
Therapy with mesenchymal stem cells exhibits potential for the development of novel interventions for many diseases and injuries. The use of mesenchymal stem cells in regenerative therapy for vocal fold scarring exhibited promising results to reduce stiffness and enhance the biomechanical properties of injured vocal folds. This study evaluated the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring. PubMed, Embase, the Cochrane Library and Google Scholar were searched. Controlled studies that assessed the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring were included. Primary outcomes were viscoelastic properties and mucosal wave amplitude. Seven preclinical animal studies (n = 152 single vocal folds) were eligible for inclusion. Evaluation of viscoelastic parameters revealed a decreased dynamic viscosity (η') and elastic modulus (G'), i.e., decreased resistance and stiffness, in scarred vocal folds treated with mesenchymal stem cells compared to non-treated scarred vocal folds. Mucosal wave amplitude was increased in scarred vocal folds treated with mesenchymal stem cells vs. non-treated scarred vocal folds. The results from these studies suggest an increased regenerative effect of therapy with mesenchymal stem cells for scarred vocal folds and are encouraging for further clinical studies.
Modulation of voice related to tremor and vibrato
NASA Astrophysics Data System (ADS)
Lester, Rosemary Anne
Modulation of voice is a result of physiologic oscillation within one or more components of the vocal system including the breathing apparatus (i.e., pressure supply), the larynx (i.e. sound source), and the vocal tract (i.e., sound filter). These oscillations may be caused by pathological tremor associated with neurological disorders like essential tremor or by volitional production of vibrato in singers. Because the acoustical characteristics of voice modulation specific to each component of the vocal system and the effect of these characteristics on perception are not well-understood, it is difficult to assess individuals with vocal tremor and to determine the most effective interventions for reducing the perceptual severity of the disorder. The purpose of the present studies was to determine how the acoustical characteristics associated with laryngeal-based vocal tremor affect the perception of the magnitude of voice modulation, and to determine if adjustments could be made to the voice source and vocal tract filter to alter the acoustic output and reduce the perception of modulation. This research was carried out using both a computational model of speech production and trained singers producing vibrato to simulate laryngeal-based vocal tremor with different voice source characteristics (i.e., vocal fold length and degree of vocal fold adduction) and different vocal tract filter characteristics (i.e., vowel shapes). It was expected that, by making adjustments to the voice source and vocal tract filter that reduce the amplitude of the higher harmonics, the perception of magnitude of voice modulation would be reduced. The results of this study revealed that listeners' perception of the magnitude of modulation of voice was affected by the degree of vocal fold adduction and the vocal tract shape with the computational model, but only by the vocal quality (corresponding to the degree of vocal fold adduction) with the female singer. 
Based on regression analyses, listeners' judgments were predicted by modulation information in both low and high frequency bands. The findings from these studies indicate that production of a breathy vocal quality might be a useful compensatory strategy for reducing the perceptual severity of modulation of voice for individuals with tremor affecting the larynx.
Psychosocial Intervention for Young Children With Chronic Tics
2018-06-18
Tourette's Syndrome; Tourette's Disorder; Tourette's Disease; Tourette Disorder; Tourette Disease; Tic Disorder, Combined Vocal and Multiple Motor; Multiple Motor and Vocal Tic Disorder, Combined; Gilles de La Tourette's Disease; Gilles de la Tourette Syndrome; Gilles De La Tourette's Syndrome; Combined Vocal and Multiple Motor Tic Disorder; Combined Multiple Motor and Vocal Tic Disorder; Chronic Motor and Vocal Tic Disorder
A Rat Excised Larynx Model of Vocal Fold Scar
ERIC Educational Resources Information Center
Welham, Nathan V.; Montequin, Douglas W.; Tateya, Ichiro; Tateya, Tomoko; Choi, Seong Hee; Bless, Diane M.
2009-01-01
Purpose: To develop and evaluate a rat excised larynx model for the measurement of acoustic, aerodynamic, and vocal fold vibratory changes resulting from vocal fold scar. Method: Twenty-four 4-month-old male Sprague-Dawley rats were assigned to 1 of 4 experimental groups: chronic vocal fold scar, chronic vocal fold scar treated with 100-ng basic…
First insights into the vocal repertoire of infant and juvenile Southern white rhinoceros
Boeer, Michael; Scheumann, Marina
2018-01-01
Describing vocal repertoires represents an essential step towards gaining an overview about the complexity of acoustic communication in a given species. The analysis of infant vocalisations is essential for understanding the development and usage of species-specific vocalisations, but is often underrepresented, especially in species with long inter-birth intervals such as the white rhinoceros. Thus, this study aimed for the first time to characterise the infant and juvenile vocal repertoire of the Southern white rhinoceros and to relate these findings to the adult vocal repertoire. The behaviour of seven mother-reared white rhinoceros calves (two males, five females) and one hand-reared calf (male), ranging from one month to four years, was simultaneously audio and video-taped at three zoos. Normally reared infants and juveniles uttered four discriminable call types (Whine, Snort, Threat, and Pant) that were produced in different behavioural contexts. All call types were also uttered by the hand-reared calf. Call rates of Whines, but not of the other call types, decreased with age. These findings provide the first evidence that infant and juvenile rhinoceros utter specific call types in distinct contexts, even if they grow up with limited social interaction with conspecifics. By comparing our findings with the current literature on vocalisations of adult white rhinoceros and other solitary rhinoceros species, we discuss to which extent differences in the social lifestyle across species affect acoustic communication in mammals. PMID:29513670
NASA Astrophysics Data System (ADS)
Rupitsch, Stefan J.; Ilg, Jürgen; Sutor, Alexander; Lerch, Reinhard; Döllinger, Michael
2011-08-01
In order to obtain a deeper understanding of the human phonation process and the mechanisms generating sound, realistic setups containing artificial vocal folds are built. Usually, these vocal folds consist of viscoelastic materials (e.g., polyurethane mixtures). Reliable simulation-based studies on the setups require the mechanical properties of the utilized viscoelastic materials. The aim of this work is the identification of mechanical material parameters (Young's modulus, Poisson's ratio, and loss factor) for those materials. Therefore, we suggest a low-cost measurement setup, the so-called vibration transmission analyzer (VTA), which enables analysis of the transfer behavior of viscoelastic materials for propagating mechanical waves. With the aid of a mathematical inverse method, the material parameters are adjusted so that the simulation results coincide with the measurement results for the transfer behavior. Contrary to other works, we determine frequency-dependent functions for the mechanical properties characterizing the viscoelastic material in the frequency range of human speech (100-250 Hz). The results for three different materials clearly show that the Poisson's ratio is close to 0.5 and that the Young's modulus increases with higher frequencies. For a frequency of 250 Hz, the Young's modulus of the investigated viscoelastic materials is approximately 80% higher than for the static case (0 Hz). We verify the identified mechanical properties with experiments on fabricated vocal fold models. Only small deviations between measurements and simulations occur.
Quality and Readability of English-Language Internet Information for Voice Disorders.
Dueppen, Abigail J; Bellon-Harn, Monica L; Radhakrishnan, Nandhakumar; Manchaiah, Vinaya
2017-12-15
The purpose of this study is to evaluate the readability and quality of English-language Internet information related to vocal hygiene, vocal health, and prevention of voice disorders. This study extends recent work because it evaluates readability, content quality, and website origin across broader search criteria than previous studies evaluating online voice material. Eighty-five websites were aggregated using five different country-specific search engines. Websites were then analyzed using quality and readability assessments. The entire web page was evaluated; however, no information or links beyond the first page was reviewed. Statistical calculations were employed to examine website ratings, differences between website origin and quality and readability scores, and correlations between readability instruments. Websites exhibited acceptable quality as measured by the DISCERN. However, only one website obtained the Health On the Net certification. Significant differences in quality were found among website origin, with government websites receiving higher quality ratings. Approximate educational levels required to comprehend information on the websites ranged from 8 to 9 years of education. Significant differences were found between website origin and readability measures with higher levels of education required to understand information on websites of nonprofit organizations. Current vocal hygiene, vocal health, and prevention of voice disorders websites were found to exhibit acceptable levels of quality and readability. However, highly rated Internet information related to voice care should be made more accessible to voice clients through Health On the Net certification. Published by Elsevier Inc.
Eliciting and Analyzing Male Mouse Ultrasonic Vocalization (USV) Songs
Chabout, Jonathan; Jones-Macopson, Joshua; Jarvis, Erich D.
2017-01-01
Mice produce ultrasonic vocalizations (USVs) in a variety of social contexts throughout development and adulthood. These USVs are used for mother-pup retrieval [1], juvenile interactions [2], opposite and same sex interactions [3-5], and territorial interactions [6]. For decades, the USVs have been used by investigators as proxies to study neuropsychiatric and developmental or behavioral disorders [7-9], and more recently to understand mechanisms and evolution of vocal communication among vertebrates [10]. Within the sexual interactions, adult male mice produce USV songs, which have some features similar to courtship songs of songbirds [11]. The use of such multisyllabic repertoires can increase potential flexibility and the information they carry, as they can be varied in how elements are organized and recombined, namely syntax. In this protocol, a reliable method to elicit USV songs from male mice in various social contexts, such as exposure to fresh female urine, anesthetized animals, and estrus females, is described. This includes conditions to induce a large number of syllables from the mice. We reduce recording of ambient noises with inexpensive sound chambers, and present a quantification method to automatically detect, classify and analyze the USVs. The latter includes evaluation of call-rate, vocal repertoire, acoustic parameters, and syntax. Various approaches and insight on using playbacks to study an animal's preference for specific song types are described. These methods were used to describe acoustic and syntax changes across different contexts in male mice, and song preferences in female mice. PMID:28518074
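One common way to quantify how syllables are organized in a sequence is a first-order transition table. The sketch below is illustrative only: it assumes syllables have already been detected and labeled, the single-letter labels are hypothetical, and the abstract does not state that the protocol computes syntax or call rate in exactly this way.

```python
from collections import Counter, defaultdict


def transition_probabilities(sequence):
    """Estimate P(next syllable type | current syllable type) from one sequence."""
    counts = defaultdict(Counter)
    for current, following in zip(sequence, sequence[1:]):
        counts[current][following] += 1
    return {
        current: {nxt: n / sum(nexts.values()) for nxt, n in nexts.items()}
        for current, nexts in counts.items()
    }


def call_rate(n_syllables, duration_s):
    """Syllables per minute over a recording of duration_s seconds."""
    return n_syllables / (duration_s / 60.0)


# Hypothetical labels: "s" = simple, "d" = downward jump, "u" = upward jump.
song = ["s", "s", "d", "s", "u"]
probs = transition_probabilities(song)
```

Comparing such tables across contexts (for example, urine exposure versus a live female) is one way to express the "syntax changes" mentioned in the abstract.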
LANGUAGE DEVELOPMENT. The developmental dynamics of marmoset monkey vocal production.
Takahashi, D Y; Fenley, A R; Teramoto, Y; Narayanan, D Z; Borjon, J I; Holmes, P; Ghazanfar, A A
2015-08-14
Human vocal development occurs through two parallel interactive processes that transform infant cries into more mature vocalizations, such as cooing sounds and babbling. First, natural categories of sounds change as the vocal apparatus matures. Second, parental vocal feedback sensitizes infants to certain features of those sounds, and the sounds are modified accordingly. Paradoxically, our closest living ancestors, nonhuman primates, are thought to undergo few or no production-related acoustic changes during development, and any such changes are thought to be impervious to social feedback. Using early and dense sampling, quantitative tracking of acoustic changes, and biomechanical modeling, we showed that vocalizations in infant marmoset monkeys undergo dramatic changes that cannot be solely attributed to simple consequences of growth. Using parental interaction experiments, we found that contingent parental feedback influences the rate of vocal development. These findings overturn decades-old ideas about primate vocalizations and show that marmoset monkeys are a compelling model system for early vocal development in humans. Copyright © 2015, American Association for the Advancement of Science.
Carroll, Thomas; Nix, John; Hunter, Eric; Titze, Ingo; Abaza, Mona
2016-01-01
Objectives: To evaluate vocal fatigue by using objective and subjective measurements of dose recorded by the National Center for Voice and Speech (NCVS) Dosimeter™ (Dosimeter). Study Design and Setting: Seven subjects completed a two-week study period. The Dosimeter recorded vocal load, soft phonation tasks and subjective soft voice ratings. Three vocal doses (time, distance, and cycle) were measured in classical singers' larynges during an intensive practice period. Results: Spikes in vocal load are reflected as harsher subjective ratings on the same day as well as 24–72 hours later. When at least 48 hours of vocal rest occurred before a vocal load, improved subjective evaluations were seen after the load. Conclusions: The NCVS Dosimeter appears to be an effective tool for data collection on prolonged use of the voice. Significance: This is the first multi-day study comparing objective and subjective data on vocal fatigue in a group of professional singers. PMID:17011424
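The three doses named above can be illustrated with a short sketch. It uses the definitions commonly given in the voice dosimetry literature (time dose as accumulated voicing time, cycle dose as accumulated vibration cycles, distance dose as accumulated vocal fold path length at roughly four amplitudes per cycle); the abstract itself does not spell these out. The frame layout and the per-frame amplitude input are assumptions, since a dosimeter estimates amplitude indirectly rather than measuring it.

```python
def vocal_doses(frames, frame_s):
    """Accumulate time, cycle, and distance doses over voiced frames.

    frames: iterable of (voiced, f0_hz, amplitude_m) tuples, one per frame
            (hypothetical layout; amplitude_m is vocal fold vibration amplitude).
    frame_s: frame duration in seconds.
    Returns (time dose in s, cycle dose in cycles, distance dose in m).
    """
    time_dose_s = 0.0
    cycle_dose = 0.0
    distance_dose_m = 0.0
    for voiced, f0_hz, amplitude_m in frames:
        if not voiced:
            continue  # unvoiced frames contribute nothing to any dose
        time_dose_s += frame_s
        cycle_dose += f0_hz * frame_s
        # Tissue travels about four amplitudes per cycle (out and back, twice).
        distance_dose_m += 4.0 * amplitude_m * f0_hz * frame_s
    return time_dose_s, cycle_dose, distance_dose_m
```

Daily totals of these quantities are the kind of "vocal load" that could be compared against the subjective ratings collected on the same day and in the following days.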
Goldstein, Michael H.; Schwade, Jennifer A.; Bornstein, Marc H.
2014-01-01
The early noncry vocalizations of infants are salient social signals. Caregivers spontaneously respond to 30-50% of these sounds, and their responsiveness to infants' prelinguistic noncry vocalizations facilitates the development of phonology and speech. Have infants learned that their vocalizations influence the behavior of social partners? If infants have learned the contingency between their vocalizing and the social responses of others, they should show an extinction burst when the contingency is removed, increasing their rate of noncry vocalizing then decreasing. Thirty-eight 5-month-olds were tested in the still-face paradigm, during which they engaged in a 2-min still-face interaction with an unfamiliar adult. When the adult assumed a still face, infants showed an extinction burst. This pattern of infant vocalizations suggests that 5-month-olds have learned the social efficacy of their vocalizations on caregivers' behavior. Furthermore, the magnitude of 5-month infants' extinction bursts predicted their language comprehension at 13 months. PMID:19489893
Vocal tract length and acoustics of vocalization in the domestic dog (Canis familiaris).
Riede, T; Fitch, T
1999-10-01
The physical nature of the vocal tract results in the production of formants during vocalisation. In some animals (including humans), receivers can derive information (such as body size) about sender characteristics on the basis of formant characteristics. Domestication and selective breeding have resulted in a high variability in head size and shape in the dog (Canis familiaris), suggesting that there might be large differences in the vocal tract length, which could cause formant behaviour to affect interbreed communication. Lateral radiographs were made of dogs from several breeds ranging in size from a Yorkshire terrier (2.5 kg) to a German shepherd (50 kg) and were used to measure vocal tract length. In addition, we recorded an acoustic signal (growling) from some dogs. Significant correlations were found between vocal tract length, body mass and formant dispersion, suggesting that formant dispersion can deliver information about the body size of the vocalizer. Because of the low correlation between vocal tract length and the first formant, we predict a non-uniform vocal tract shape.
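The link between formant dispersion and vocal tract length can be made concrete with the uniform-tube approximation. The sketch below assumes a tube closed at the glottis and open at the lips, for which the i-th resonance is (2i - 1)c/(4L) and adjacent resonances are spaced c/(2L) apart. The speed of sound of 350 m/s and the function names are assumptions for illustration, not values taken from the study.

```python
def formant_dispersion(formants_hz):
    """Average spacing between adjacent formants, in Hz."""
    f = sorted(formants_hz)
    if len(f) < 2:
        raise ValueError("need at least two formants")
    # The sum of adjacent differences telescopes to (last - first).
    return (f[-1] - f[0]) / (len(f) - 1)


def uniform_tube_formants(length_m, n, c=350.0):
    """Resonances of a uniform tube closed at one end: (2i - 1) * c / (4 * L)."""
    return [(2 * i - 1) * c / (4.0 * length_m) for i in range(1, n + 1)]


def tube_length_from_dispersion(dispersion_hz, c=350.0):
    """Invert the uniform-tube spacing c / (2 * L) to estimate length in m."""
    return c / (2.0 * dispersion_hz)
```

In a uniform tube the first formant alone would also scale inversely with length, which is why a weak correlation between vocal tract length and the first formant points toward a non-uniform tract shape.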
Elias-Costa, Agustin J; Montesinos, Rachel; Grant, Taran; Faivovich, Julián
2017-11-01
Anuran vocal sacs are elastic chambers that recycle exhaled air during vocalizations and are present in males of most species of frogs. Most knowledge of the diversity of vocal sacs relates to external morphology; detailed information on internal anatomy is available for few groups of frogs. Frogs of the family Hylodidae, which is endemic to the Atlantic Forest of Brazil and adjacent Argentina and Paraguay, have three patterns of vocal sac morphology: single, subgular; paired, lateral; and absent. The submandibular musculature and structure of the vocal sac mucosa (the internal wall of the vocal sac) of exemplar species of this family and relatives were studied. In contrast to previous accounts, we found that all species of Crossodactylus and Hylodes possess paired, lateral vocal sacs, with the internal mucosa of each sac being separate from the contralateral one. Unlike all other frogs for which data are available, the mucosa of the vocal sacs in these genera is not supported externally by the mm. intermandibularis and interhyoideus. Rather, the vocal sac mucosa projects through the musculature and is free in the submandibular lymphatic sac. The presence of paired, lateral vocal sacs, the internal separation of the sac mucosae, and their projection through the m. interhyoideus are synapomorphies of the family. Furthermore, the specific configuration of the m. interhyoideus allows asymmetric inflation of paired vocal sacs, a feature only reported in species of these diurnal, stream-dwelling frogs. © 2017 Wiley Periodicals, Inc.
Oral breathing challenge in participants with vocal attrition.
Sivasankar, Mahalakshmi; Fisher, Kimberly V
2003-12-01
Vocal folds undergo osmotic challenge by mouth breathing during singing, exercising, and loud speaking. Just 15 min of obligatory oral breathing, to dry the vocal folds, increases phonation threshold pressure (Pth) and expiratory vocal effort in healthy speakers (M. Sivasankar & K. Fisher, 2002). We questioned whether oral breathing is more detrimental to phonation in healthy participants with a history of temporary vocal attrition. The effects of a 15-min oral or nasal breathing challenge on Pth and perceived expiratory vocal effort were compared for participants reporting symptoms of vocal attrition (N = 18, ages 19-38 years) and normal controls (N = 20, ages 19-33 years). Postchallenge minus prechallenge differences in Pth (ΔPth) and effort (ΔEffort) revealed that oral breathing, but not nasal breathing, increased Pth (p < .001) and effort (p < .001) at low, comfortable, and high pitch. ΔPth was significantly greater in participants with vocal attrition than in normal controls (p < .001). Nasal breathing reduced Pth for all controls but not for all participants reporting vocal attrition. ΔPth was significantly and linearly correlated with ΔEffort (vocal attrition group: r = .81, p < .001; control group: r = .84, p < .001). We speculate that the greater increases in Pth in participants reporting vocal attrition may result from delayed or inadequate compensatory response to superficial laryngeal dehydration. Obligatory oral breathing may place voice users at risk for exacerbating vocal attrition. That sol layer depletion by obligatory oral breathing increased Pth and vocal effort provides support for the role of superficial hydration in maintaining ease of phonation.
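The change scores and their linear correlation described above can be illustrated with a short sketch. It is not the authors' analysis code: the function names are made up for this example, the numbers are hypothetical, and only the textbook definitions of a difference score and the Pearson correlation coefficient are used.

```python
import math


def change_scores(pre, post):
    """Postchallenge minus prechallenge value for each participant."""
    if len(pre) != len(post):
        raise ValueError("pre and post must have the same length")
    return [after - before for before, after in zip(pre, post)]


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two lists of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical values (pressure in cm H2O, effort on an arbitrary scale);
# these are not data from the study.
pth_pre = [3.0, 3.5, 4.0, 3.2]
pth_post = [3.5, 4.5, 4.25, 4.2]
effort_pre = [10.0, 12.0, 15.0, 11.0]
effort_post = [14.0, 20.0, 17.0, 18.0]

delta_pth = change_scores(pth_pre, pth_post)
delta_effort = change_scores(effort_pre, effort_post)
print(delta_pth, delta_effort, pearson_r(delta_pth, delta_effort))
```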
Vocal coordination and vocal imitation: a role for mirror neurons?
Newman, John D
2014-04-01
Some birds and mammals have vocal communication systems in which coordination between individuals is important. Examples would include duetting or antiphonal calling in some birds and mammals, rapid exchanges of the same vocalization, and vocal exchanges between paired individuals and other nearby pairs. Mirror neurons may play a role in such systems but become functional only after experience.
Ohta, Noriyuki; Mori, Takahiko
2007-11-01
Vocal cord paralysis is one of the frequently encountered complications after aortic surgery. However, reports of vocal cord paralysis after aortic surgery have been limited. In a retrospective cohort study of vocal cord paralysis after aortic surgery at a general hospital, we sought factors related to its development after aortic surgery to the descending thoracic aorta via left posterolateral thoracotomy. We reviewed data for a total of 69 patients who, between 1989 and 1995, underwent aortic surgery to the descending thoracic aorta. We assessed factors associated with the development of vocal cord paralysis and postoperative complications. Postoperative vocal cord paralysis appeared in 19 patients. Multiple logistic regression analysis revealed two risk factors for vocal cord paralysis: chronic dilatation of the aorta at the left subclavian artery (odds ratio = 8.67) and anastomosis proximal to the left subclavian artery (odds ratio = 17.7). The duration of mechanical ventilation was significantly prolonged for patients with vocal cord paralysis. Certain surgical factors associated with left subclavian artery increase the risk of vocal cord paralysis after surgery on the descending thoracic aorta. Vocal cord paralysis after aortic surgery did not increase aspiration pneumonia but was associated with pulmonary complications.
Tyack, Peter L
2008-08-01
The classic evidence for vocal production learning involves imitation of novel, often anthropogenic sounds. Among mammals, this has been reported for dolphins, elephants, harbor seals, and humans. A broader taxonomic distribution has been reported for vocal convergence, where the acoustic properties of calls from different individuals converge when they are housed together in captivity or form social bonds in the wild. Vocal convergence has been demonstrated for animals as diverse as songbirds, parakeets, hummingbirds, bats, elephants, cetaceans, and primates. For most species, call convergence is thought to reflect a group-distinctive identifier, with shared calls reflecting and strengthening social bonds. A ubiquitous function for vocal production learning that is starting to receive attention involves modifying signals to improve communication in a noisy channel. Pooling data on vocal imitation, vocal convergence, and compensation for noise suggests a wider taxonomic distribution of vocal production learning among mammals than has been generally appreciated. The wide taxonomic distribution of this evidence for vocal production learning suggests that perhaps more of the neural underpinnings for vocal production learning are in place in mammals than is usually recognized. (c) 2008 APA, all rights reserved
Assessing and Treating Vocal Stereotypy in Children with Autism
Ahearn, William H; Clark, Kathy M; MacDonald, Rebecca P.F; Chung, Bo In
2007-01-01
Previous research implies that stereotypic behavior tends to be maintained by the sensory consequences produced by engaging in the response. Few investigations, however, have focused on vocal stereotypy. The current study examined the noncommunicative vocalizations of 4 children with an autism spectrum disorder. First, functional analyses were conducted in an attempt to identify the function of each child's behavior. For each of the participants, it was found that vocal stereotypy was likely not maintained by the social consequences. Following assessment, response interruption and redirection (RIRD) was implemented in an ABAB design to determine whether vocal stereotypy could be successfully redirected. RIRD involved a teacher issuing a series of vocal demands the child readily complied with during regular academic programming. Vocal demands were presented contingent on the occurrence of vocal stereotypy and were continuously presented until the child complied with three consecutively issued demands without emitting vocal stereotypy. For each child, RIRD produced levels of vocal stereotypy substantially lower than those observed in baseline. For 3 of the children, an increase in appropriate communication was also observed. The children's teachers were trained to implement RIRD. Brief follow-up probes and anecdotal information implied that the treatment had a positive impact in the natural environment. PMID:17624067
McCathren, R B; Yoder, P J; Warren, S F
1999-08-01
This study tested the relationship between prelinguistic vocalization and expressive vocabulary 1 year later in young children with mild to moderate developmental delays. Three vocalization variables were tested: rate of all vocalization, rate of vocalizations with consonants, and rate of vocalizations used interactively. The 58 toddlers in the study were 17-34 months old, not sensory impaired, and had Bayley Mental Development Indices (Bayley, 1969; Bayley, 1993) from 35-85. In addition, the children had fewer than 3 words in their expressive vocabularies and during classroom observation each showed at least one instance of intentional prelinguistic communication before testing. Selected sections of the Communication and Symbolic Behavior Scales procedures (CSBS; Wetherby & Prizant, 1993) were administered at the beginning and at the end of the study. The vocal measures were obtained in the initial CSBS session. One measure of expressive vocabulary was obtained in the CSBS session at the end of the study. In addition, expressive vocabulary was measured in a nonstructured play session at the end of the study. We predicted that rate of vocalization, rate of vocalizations with consonants, and rate of vocalizations used interactively would all be positively related to later expressive vocabulary. The results confirmed the predictions.
Comprehensive Outcome Researches of Intralesional Steroid Injection on Benign Vocal Fold Lesions.
Wang, Chi-Te; Lai, Mei-Shu; Hsiao, Tzu-Yu
2015-09-01
This study investigated multidimensional treatment outcomes, including prognostic factors and side effects of vocal fold steroid injection (VFSI). We recruited 126 consecutive patients, including patients with 49 nodules, 47 polyps, and 30 mucus retention cysts. All the patients received VFSI under local anesthesia in an office setting. Treatment outcomes were evaluated 1 and 2 months after the procedure, including endoscopic evaluation, perceptual voice quality (GRB scores), acoustic analysis, and 10-item Voice Handicap Index (VHI-10). More than 80% of the patients reported subjective improvements after VFSI. Objective measurements revealed significant improvements from baseline in most of the outcome parameters (P<0.05). Higher occupational vocal demands and fibrotic vocal nodules were significantly associated with poorer clinical responses as measured by the VHI-10 and GRB scores, respectively. For vocal polyps, dysphonia for more than 12 months was significantly associated with higher postoperative VHI-10 scores, whereas patients with laryngopharyngeal reflux (LPR) showed significantly poorer postoperative voice quality as measured by GRB scores. Side effects after VFSI included hematoma (27%), triamcinolone deposits (4%), and vocal atrophy (1%), which resolved spontaneously within 1-2 months. Presentation with vocal fold ectasias/varicosities and higher vocal demands were significantly correlated with postoperative vocal hematoma. This study demonstrated significant improvements after VFSI in vocal nodules, polyps, and cysts. Occupational vocal demand and subtypes of vocal nodules are closely related to the treatment outcomes after VFSI, whereas symptom duration and LPR were significant prognostic factors for VFSI treatment outcomes in vocal polyps. Side effects after receiving VFSI were mostly self-limited without sequelae, whereas the incidence rates might vary with the injection approach and the timing of postoperative follow-up. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Riede, Tobias; Li, Zhiheng; Tokuda, Isao T; Farmer, Colleen G
2015-04-01
Sauropsid vocalization is mediated by the syrinx in birds and the larynx in extant reptiles; but whereas avian vocal production has received much attention, the vocal mechanism of basal reptilians is poorly understood. The American alligator (Alligator mississippiensis) displays a large vocal repertoire during mating and in parent-offspring interactions. Although vocal outputs of these behaviors have received some attention, the underlying mechanism of sound production remains speculative. Here, we investigate the laryngeal anatomy of juvenile and adult animals by macroscopic and histological methods. Observations of the cartilaginous framework and associated muscles largely corroborate earlier findings, but one muscle, the cricoarytenoideus, exhibits a heretofore unknown extrinsic insertion that has important implications for effective regulation of vocal fold length and tension. Histological investigation of the larynx revealed a layered vocal fold morphology. The thick lamina propria consists of non-homogenous extracellular matrix containing collagen fibers that are tightly packed below the epithelium but loosely organized deep inside the vocal fold. We found few elastic fibers but comparatively high proportions of hyaluronan. Similar organizational complexity is also seen in mammalian vocal folds and the labia of the avian syrinx: convergent morphologies that suggest analogous mechanisms for sound production. In tensile tests, alligator vocal folds demonstrated a linear stress-strain behavior in the low strain region and nonlinear stress responses at strains larger than 15%, which is similar to mammalian vocal fold tissue. We have integrated morphological and physiological data in a two-mass vocal fold model, providing a systematic description of the possible acoustic space that could be available to an alligator larynx. Mapping actual call production onto possible acoustic space validates the model's predictions. © 2015. Published by The Company of Biologists Ltd.
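The tensile behavior described in the alligator study (linear at low strain, nonlinear above roughly 15% strain) can be pictured with a piecewise stress-strain function. The sketch below is only an illustration: the exponential form of the high-strain branch, the modulus, and the growth constant are assumptions chosen so that stress and slope are continuous at the transition, not values or equations reported by the authors.

```python
import math

TRANSITION_STRAIN = 0.15   # strain above which the abstract reports nonlinearity
LINEAR_MODULUS_KPA = 10.0  # hypothetical low-strain modulus
GROWTH_CONSTANT = 20.0     # hypothetical exponential stiffening constant


def stress_kpa(strain,
               e0=TRANSITION_STRAIN,
               modulus=LINEAR_MODULUS_KPA,
               growth=GROWTH_CONSTANT):
    """Piecewise stress: linear up to e0, exponential stiffening beyond it.

    The exponential branch is built so that both the stress and its slope
    match the linear branch at strain == e0.
    """
    if strain < 0:
        raise ValueError("strain must be non-negative in this sketch")
    if strain <= e0:
        return modulus * strain
    return modulus * e0 + (modulus / growth) * (math.exp(growth * (strain - e0)) - 1.0)


for strain in (0.05, 0.10, 0.15, 0.20, 0.30):
    print(f"strain {strain:.2f} -> stress {stress_kpa(strain):.2f} kPa")
```

A curve of this general shape is one way to supply tissue stiffness to a lumped-element model such as the two-mass model mentioned in the abstract, although the parameters would have to come from measured data.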
Precise Motor Control Enables Rapid Flexibility in Vocal Behavior of Marmoset Monkeys.
Pomberger, Thomas; Risueno-Segovia, Cristina; Löschner, Julia; Hage, Steffen R
2018-03-05
Investigating the evolution of human speech is difficult and controversial because human speech surpasses nonhuman primate vocal communication in scope and flexibility [1-3]. Monkey vocalizations have been assumed to be largely innate, highly affective, and stereotyped for over 50 years [4, 5]. Recently, this perception has dramatically changed. Current studies have revealed distinct learning mechanisms during vocal development [6-8] and vocal flexibility, allowing monkeys to cognitively control when [9, 10], where [11], and what to vocalize [10, 12, 13]. However, specific call features (e.g., duration, frequency) remain surprisingly robust and stable in adult monkeys, resulting in rather stereotyped and discrete call patterns [14]. Additionally, monkeys seem to be unable to modulate their acoustic call structure under reinforced conditions beyond natural constraints [15, 16]. Behavioral experiments have shown that monkeys can stop sequences of calls immediately after acoustic perturbation but cannot interrupt ongoing vocalizations, suggesting that calls consist of single impartible pulses [17, 18]. Using acoustic perturbation triggered by the vocal behavior itself and quantitative measures of resulting vocal adjustments, we show that marmoset monkeys are capable of producing calls with durations beyond the natural boundaries of their repertoire by interrupting ongoing vocalizations rapidly after perturbation onset. Our results indicate that marmosets are capable of interrupting vocalizations only at periodic time points throughout calls, further supported by the occurrence of periodically segmented phees. These ideas overturn decades-old concepts on primate vocal pattern generation, indicating that vocalizations do not consist of one discrete call pattern but are built of many sequentially uttered units, like human speech. Copyright © 2018 The Author(s). Published by Elsevier Ltd. All rights reserved.
Horita, Haruhito; Kobayashi, Masahiko; Liu, Wan-chun; Oka, Kotaro; Jarvis, Erich D.; Wada, Kazuhiro
2012-01-01
Mechanisms for the evolution of convergent behavioral traits are largely unknown. Vocal learning is one such trait that evolved multiple times and is necessary in humans for the acquisition of spoken language. Among birds, vocal learning is evolved in songbirds, parrots, and hummingbirds. Each time similar forebrain song nuclei specialized for vocal learning and production have evolved. This finding led to the hypothesis that the behavioral and neuroanatomical convergences for vocal learning could be associated with molecular convergence. We previously found that the neural activity-induced gene dual specificity phosphatase 1 (dusp1) was up-regulated in non-vocal circuits, specifically in sensory-input neurons of the thalamus and telencephalon; however, dusp1 was not up-regulated in higher order sensory neurons or motor circuits. Here we show that song motor nuclei are an exception to this pattern. The song nuclei of species from all known vocal learning avian lineages showed motor-driven up-regulation of dusp1 expression induced by singing. There was no detectable motor-driven dusp1 expression throughout the rest of the forebrain after non-vocal motor performance. This pattern contrasts with expression of the commonly studied activity-induced gene egr1, which shows motor-driven expression in song nuclei induced by singing, but also motor-driven expression in adjacent brain regions after non-vocal motor behaviors. In the vocal non-learning avian species, we found no detectable vocalizing-driven dusp1 expression in the forebrain. These findings suggest that independent evolutions of neural systems for vocal learning were accompanied by selection for specialized motor-driven expression of the dusp1 gene in those circuits. This specialized expression of dusp1 could potentially lead to differential regulation of dusp1-modulated molecular cascades in vocal learning circuits. PMID:22876306
The Distribution and Severity of Tremor in Speech Structures of Persons with Vocal Tremor.
Hemmerich, Abby L; Finnegan, Eileen M; Hoffman, Henry T
2017-05-01
Vocal tremor may be associated with cyclic oscillations in the pulmonary, laryngeal, velopharyngeal, or oral regions. This study aimed to correlate the overall severity of vocal tremor with the distribution and severity of tremor in structures involved. Endoscopic and clinical examinations were completed on 20 adults with vocal tremor and two age-matched controls during sustained phonation. Two judges rated the severity of vocal tremor and the severity of tremor affecting each of 13 structures. Participants with mild vocal tremor typically presented with tremor in three laryngeal structures, moderate vocal tremor in five structures (laryngeal and another region), and severe vocal tremor in eight structures affecting all regions. The severity of tremor was lowest (mean = 1.2 out of 3) in persons with mild vocal tremor and greater in persons with moderate (mean = 1.5) and severe vocal tremor (mean = 1.4). Laryngeal structures were most frequently (95%) and severely (1.7 out of 3) affected, followed by velopharynx (40% occurrence, 1.3 severity), pulmonary (40% occurrence, 1.1 severity), and oral (40% occurrence, 1.0 severity) regions. Regression analyses indicated tremor severity of the supraglottic structures, and vertical laryngeal movement contributed most to vocal tremor severity during sustained phonation (r = 0.77, F = 16.17, P < 0.0001). A strong positive correlation (r = 0.72) was found between the Tremor Index and the severity of the vocal tremor during sustained phonation. It is useful to obtain a wide endoscopic view of the larynx to visualize tremor, which is rarely isolated to the true vocal folds alone. Published by Elsevier Inc.
Vocal cysts: clinical, endoscopic, and surgical aspects.
Martins, Regina Helena Garcia; Santana, Marcela Ferreira; Tavares, Elaine Lara Mendes
2011-01-01
Vocal cysts are benign laryngeal lesions, which affect children and adults. They can be classified as epidermic or mucous-retention cysts. The objective was to study the clinical, endoscopic, and surgical aspects of vocal cysts. We reviewed the medical charts of 72 patients with vocal cysts, considering age, gender, occupation, time of vocal symptoms, nasosinusal and gastroesophageal symptoms, vocal abuse, smoking, alcoholism, associated lesions, treatment, and histological details. Of the 72 cases, 46 were adults (36 females and 10 males) and 26 were children (eight girls and 18 boys). As far as occupation is concerned, there was a higher incidence of students and teachers. All the patients had symptoms of chronic hoarseness. Nasosinusal (27.77%) and gastroesophageal (32%) symptoms were not relevant. Vocal abuse was reported by 45.83%, smoking by 18%, and alcoholism by 8.4% of the patients. Unilateral cysts were seen in 93% of the cases; 22 patients had associated lesions, such as bridge, sulcus vocalis, and microweb. Surgical treatment was performed in 46 cases. Histological analysis of the epidermic cysts revealed a cavity with caseous content, covered by stratified squamous epithelium, often keratinized. Mucous cysts presented mucous content, and the walls were coated by a cylindrical ciliated epithelium. Vocal cysts are benign vocal fold lesions that affect children and adults, being often associated with vocal overuse, which frequently affects people who use their voices professionally. Vocal symptoms are chronic in course, often since childhood, and the treatment of choice is surgical removal. A careful examination of the vocal folds is necessary during surgery, because other laryngeal lesions may be associated with vocal cysts. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Exploring the anatomical encoding of voice with a mathematical model of the vocal system.
Assaneo, M Florencia; Sitt, Jacobo; Varoquaux, Gael; Sigman, Mariano; Cohen, Laurent; Trevisan, Marcos A
2016-11-01
The faculty of language depends on the interplay between the production and perception of speech sounds. A relevant open question is whether the dimensions that organize voice perception in the brain are acoustical or depend on properties of the vocal system that produced it. One of the main empirical difficulties in answering this question is to generate sounds that vary along a continuum according to the anatomical properties of the vocal apparatus that produced them. Here we use a mathematical model that offers the unique possibility of synthesizing vocal sounds by controlling a small set of anatomically based parameters. In a first stage the quality of the synthetic voice was evaluated. Using specific time traces for sub-glottal pressure and tension of the vocal folds, the synthetic voices generated perceptual responses that are indistinguishable from those of real speech. The synthesizer was then used to investigate how the auditory cortex responds to the perception of voice depending on the anatomy of the vocal apparatus. Our fMRI results show that sounds are perceived as human vocalizations when produced by a vocal system that follows a simple relationship between the size of the vocal folds and the vocal tract. We found that these anatomical parameters encode the perceptual vocal identity (male, female, child) and show that the brain areas that respond to human speech also encode vocal identity. On the basis of these results, we propose that this low-dimensional model of the vocal system is capable of generating realistic voices and represents a novel tool to explore voice perception with precise control of the anatomical variables that generate speech. Furthermore, the model provides an explanation of how auditory cortices encode voices in terms of the anatomical parameters of the vocal system. Copyright © 2016 Elsevier Inc. All rights reserved.
Arriaga, Gustavo; Zhou, Eric P.; Jarvis, Erich D.
2012-01-01
Humans and song-learning birds communicate acoustically using learned vocalizations. The characteristic features of this social communication behavior include vocal control by forebrain motor areas, a direct cortical projection to brainstem vocal motor neurons, and dependence on auditory feedback to develop and maintain learned vocalizations. These features have so far not been found in closely related primate and avian species that do not learn vocalizations. Male mice produce courtship ultrasonic vocalizations with acoustic features similar to songs of song-learning birds. However, it is assumed that mice lack a forebrain system for vocal modification and that their ultrasonic vocalizations are innate. Here we investigated the mouse song system and discovered that it includes a motor cortex region active during singing, that projects directly to brainstem vocal motor neurons and is necessary for keeping song more stereotyped and on pitch. We also discovered that male mice depend on auditory feedback to maintain some ultrasonic song features, and that sub-strains with differences in their songs can match each other's pitch when cross-housed under competitive social conditions. We conclude that male mice have some limited vocal modification abilities with at least some neuroanatomical features thought to be unique to humans and song-learning birds. To explain our findings, we propose a continuum hypothesis of vocal learning. PMID:23071596
Dopaminergic Contributions to Vocal Learning
Hoffmann, Lukas A.; Saravanan, Varun; Wood, Alynda N.; He, Li
2016-01-01
Although the brain relies on auditory information to calibrate vocal behavior, the neural substrates of vocal learning remain unclear. Here we demonstrate that lesions of the dopaminergic inputs to a basal ganglia nucleus in a songbird species (Bengalese finches, Lonchura striata var. domestica) greatly reduced the magnitude of vocal learning driven by disruptive auditory feedback in a negative reinforcement task. These lesions produced no measurable effects on the quality of vocal performance or the amount of song produced. Our results suggest that dopaminergic inputs to the basal ganglia selectively mediate reinforcement-driven vocal plasticity. In contrast, dopaminergic lesions produced no measurable effects on the birds' ability to restore song acoustics to baseline following the cessation of reinforcement training, suggesting that different forms of vocal plasticity may use different neural mechanisms. SIGNIFICANCE STATEMENT During skill learning, the brain relies on sensory feedback to improve motor performance. However, the neural basis of sensorimotor learning is poorly understood. Here, we investigate the role of the neurotransmitter dopamine in regulating vocal learning in the Bengalese finch, a songbird with an extremely precise singing behavior that can nevertheless be reshaped dramatically by auditory feedback. Our findings show that reduction of dopamine inputs to a region of the songbird basal ganglia greatly impairs vocal learning but has no detectable effect on vocal performance. These results suggest a specific role for dopamine in regulating vocal plasticity. PMID:26888928
Conversational Entrainment of Vocal Fry in Young Adult Female American English Speakers.
Borrie, Stephanie A; Delfino, Christine R
2017-07-01
Conversational entrainment, the natural tendency for people to modify their behaviors to more closely match their communication partner, is examined as one possible mechanism modulating the prevalence of vocal fry in the speech of young American women engaged in spoken dialogue. Twenty young adult female American English speakers engaged in two spoken dialogue tasks: one with a young adult female American English conversational partner who exhibited substantial vocal fry and one with a young adult female American English conversational partner who exhibited quantifiably less vocal fry. Dialogues were analyzed for proportion of vocal fry, by speaker, and two measures of communicative success (efficiency and enjoyment). Participants employed significantly more vocal fry when conversing with the partner who exhibited substantial vocal fry than when conversing with the partner who exhibited quantifiably less vocal fry. Further, greater similarity between communication partners in their use of vocal fry tracked with higher scores of communicative efficiency and communicative enjoyment. Conversational entrainment offers a mechanistic framework that may be used to explain, to some degree, the frequency with which vocal fry is employed by young American women engaged in spoken dialogue. Further, young American women who modulated their vocal patterns during dialogue to match those of their conversational partner gained more efficiency and enjoyment from their interactions, demonstrating the cognitive and social benefits of entrainment. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Mouse vocal communication system: are ultrasounds learned or innate?
Arriaga, Gustavo; Jarvis, Erich D.
2013-01-01
Mouse ultrasonic vocalizations (USVs) are often used as behavioral readouts of internal states, to measure effects of social and pharmacological manipulations, and for behavioral phenotyping of mouse models for neuropsychiatric and neurodegenerative disorders. However, little is known about the neurobiological mechanisms of rodent USV production. Here we discuss the available data to assess whether male mouse song behavior and the supporting brain circuits resemble those of known vocal non-learning or vocal learning species. Recent neurobiology studies have demonstrated that the mouse USV brain system includes motor cortex and striatal regions, and that the vocal motor cortex sends a direct sparse projection to the brainstem vocal motor nucleus ambiguus, a projection thought to be unique to humans among mammals. Recent behavioral studies have reported opposing conclusions on mouse vocal plasticity, including vocal ontogeny changes in USVs over early development that might not be explained by innate maturation processes, evidence for and against a role for auditory feedback in developing and maintaining normal mouse USVs, and evidence for and against limited vocal imitation of song pitch. To reconcile these findings, we suggest that the trait of vocal learning may not be dichotomous but may instead encompass a broad set of behavioral and neural traits, which we call the continuum hypothesis, and that mice possess some of the traits associated with a capacity for limited vocal learning. PMID:23295209
Assessing Vocal Development in Infants and Toddlers
Nathani, Suneeti; Ertmer, David J.
2012-01-01
The purpose of this study was to examine changes in prelinguistic vocal productions during the first 20 months of life. Vocalizations were classified into 23 mutually exclusive and exhaustive types, and grouped into five ascending levels using the Stark Assessment of Early Vocal Development-Revised (SAEVD-R). Data from 30 typically developing infants, aged 0–20 months, show that older infants attained higher developmental levels on the SAEVD-R than younger infants. Infants 0–2, 3–5, and 6–8 months of age primarily produced vocalizations from Levels 1 (Reflexive), 2 (Control of Phonation), and 3 (Expansion). Infants 9–20 months of age also produced vocalizations from Level 4 (Basic Canonical Syllables). Only infants from 16–20 months of age produced Level 5 (Advanced Forms) vocalizations in significant quantities. The outcomes indicate that the SAEVD-R is a valuable instrument for evaluating prelinguistic vocal development. PMID:16728333
Phonetogram changes for trained singers over a nine-month period of vocal training.
LeBorgne, Wendy DeLeo; Weinrich, Barbara D
2002-03-01
Professional vocalists encounter demands requiring voluntary control of phonation, while utilizing a considerable range of frequency and intensity. These quantifiable acoustic events can be measured and represented in a phonetogram. Previous research has compared the phonetograms of trained and untrained voices and found significant differences between these groups. This study was designed to assess the effects of vocal training for singers over a period of nine months. Phonetogram contour changes were examined, with the primary focus on expansion of frequency range and/or intensity control. Twenty-one first-year, master's level, vocal music students, who were engaged in an intensive vocal performance curriculum, participated in this study. Following nine months of vocal training, significant differences were revealed in the subjects' mean frequency range and minimum vocal intensity across frequency levels. There was no significant difference for the mean maximum vocal intensity across frequency levels following vocal training.
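The phonetogram measures named above (frequency range and minimum and maximum intensity across frequency levels) can be summarized with a few lines of arithmetic. The sketch below is an illustration under stated assumptions, not the procedure used in the study: the frequency range is expressed in semitones via the standard relation 12 · log2(f_max / f_min), the contour values are hypothetical, and the function and key names are made up for this example.

```python
import math


def semitone_range(f_min_hz, f_max_hz):
    """Frequency range in semitones between the lowest and highest tone."""
    if f_min_hz <= 0 or f_max_hz < f_min_hz:
        raise ValueError("expected 0 < f_min_hz <= f_max_hz")
    return 12.0 * math.log2(f_max_hz / f_min_hz)


def summarize_phonetogram(contour):
    """Summarize a phonetogram given as {frequency_hz: (min_db, max_db)}."""
    if not contour:
        raise ValueError("contour must not be empty")
    freqs = sorted(contour)
    n = len(freqs)
    return {
        "range_semitones": semitone_range(freqs[0], freqs[-1]),
        "mean_min_db": sum(contour[f][0] for f in freqs) / n,
        "mean_max_db": sum(contour[f][1] for f in freqs) / n,
    }


# Hypothetical contours before and after training (dB SPL); not study data.
before = {110.0: (55.0, 85.0), 220.0: (50.0, 100.0), 440.0: (60.0, 105.0)}
after = {98.0: (52.0, 85.0), 220.0: (47.0, 100.0), 523.0: (57.0, 105.0)}

print(summarize_phonetogram(before))
print(summarize_phonetogram(after))
```

Comparing the two summaries mirrors the kind of pre/post contrast the abstract reports: a wider semitone range and a lower mean minimum intensity, with the mean maximum intensity unchanged in this made-up example.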
Fournet, Michelle E; Szabo, Andy; Mellinger, David K
2015-01-01
On low-latitude breeding grounds, humpback whales produce complex and highly stereotyped songs as well as a range of non-song sounds associated with breeding behaviors. While on their Southeast Alaskan foraging grounds, humpback whales produce a range of previously unclassified non-song vocalizations. This study investigates the vocal repertoire of Southeast Alaskan humpback whales from a sample of 299 non-song vocalizations collected over a 3-month period on foraging grounds in Frederick Sound, Southeast Alaska. Three classification systems were used, including aural spectrogram analysis, statistical cluster analysis, and discriminant function analysis, to describe and classify vocalizations. A hierarchical acoustic structure was identified; vocalizations were classified into 16 individual call types nested within four vocal classes. The combined classification method shows promise for identifying variability in call stereotypy between vocal groupings and is recommended for future classification of broad vocal repertoires.
Identification of a motor to auditory pathway important for vocal learning
Roberts, Todd F.; Hisey, Erin; Tanaka, Masashi; Kearney, Matthew; Chattree, Gaurav; Yang, Cindy F.; Shah, Nirao M.; Mooney, Richard
2017-01-01
Summary: Learning to vocalize depends on the ability to adaptively modify the temporal and spectral features of vocal elements. Neurons that convey motor-related signals to the auditory system are theorized to facilitate vocal learning, but the identity and function of such neurons remain unknown. Here we identify a previously unknown neuron type in the songbird brain that transmits vocal motor signals to the auditory cortex. Genetically ablating these neurons in juveniles disrupted their ability to imitate features of an adult tutor’s song. Ablating these neurons in adults had little effect on previously learned songs, but interfered with their ability to adaptively modify the duration of vocal elements and largely prevented the degradation of song’s temporal features normally caused by deafening. These findings identify a motor to auditory circuit essential to vocal imitation and to the adaptive modification of vocal timing. PMID:28504672
ERIC Educational Resources Information Center
Hackworth, Rhonda S.
2007-01-01
This study examined the effects of vocal hygiene and behavior modification instruction on self-reported behaviors of music teachers. Subjects (N = 76) reported daily behaviors for eight weeks: water consumption, warm-up, talking over music/noise, vocal rest, nonverbal commands, and vocal problems. Subjects were in experimental group 1 or 2, or the…
Choi, Jeong-Seok; Kim, Nahn Ju; Klemuk, Sarah; Jang, Yun Ho; Park, In Suh; Ahn, Kyung Hyun; Lim, Jae-Yol; Kim, Young-Mo
2012-09-01
To compare the rheological characteristics of structurally different hyaluronic acid (HA)-based biomaterials that are presently used for phonosurgery and to investigate their influence on the viscoelastic properties of vocal folds after implantation in an in vivo rabbit model. In vitro and in vivo rheometric investigation. Experimental laboratory, Inha and Seoul National Universities. Viscoelastic shear properties of 3 HA-based biomaterials (Rofilan, Restylane, and Reviderm) were measured with a strain-controlled rheometer. These biomaterials were injected into the deep layers of rabbit vocal folds, and viscoelastic moduli of the injected vocal folds were determined 2 months after the injection. The vocal fold specimens were observed using a light microscope and a transmission electron microscope. All HA-based biomaterials showed similar levels of shear viscosity, which were slightly higher than that of human vocal folds reported in previous studies. Compared with noninjected control vocal folds, there were no significant differences in the magnitudes of both elastic shear modulus (G') and viscous modulus (G") of injected vocal folds among all of the materials. Light microscopic images showed that all materials were observed in the deep layers of vocal folds and electron scanning images revealed that injected HA particles were homogeneously distributed in regions of collagenous fibers. HA-based biomaterials could preserve the viscoelastic properties of the vocal folds, when they were injected into vocal folds in an in vivo rabbit model. However, further studies on the influence of the biomaterials on the viscoelasticity of human vocal folds in ECM surroundings are still needed.
Voice Use Among Music Theory Teachers: A Voice Dosimetry and Self-Assessment Study.
Schiller, Isabel S; Morsomme, Dominique; Remacle, Angélique
2017-07-25
This study aimed (1) to investigate music theory teachers' professional and extra-professional vocal loading and background noise exposure, (2) to determine the correlation between vocal loading and background noise, and (3) to determine the correlation between vocal loading and self-evaluation data. Using voice dosimetry, 13 music theory teachers were monitored for one workweek. The parameters analyzed were voice sound pressure level (SPL), fundamental frequency (F0), phonation time, vocal loading index (VLI), and noise SPL. Spearman correlation was used to correlate vocal loading parameters (voice SPL, F0, and phonation time) and noise SPL. Each day, the subjects self-assessed their voice using visual analog scales. VLI and self-evaluation data were correlated using Spearman correlation. Vocal loading parameters and noise SPL were significantly higher in the professional than in the extra-professional environment. Voice SPL, phonation time, and female subjects' F0 correlated positively with noise SPL. VLI correlated with self-assessed voice quality, vocal fatigue, and amount of singing and speaking voice produced. Teaching music theory is a profession with high vocal demands. More background noise is associated with increased vocal loading and may indirectly increase the risk for voice disorders. Correlations between VLI and self-assessments suggest that these teachers are well aware of their vocal demands and feel their effect on voice quality and vocal fatigue. Visual analog scales seem to represent a useful tool for subjective vocal loading assessment and associated symptoms in these professional voice users. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
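The Spearman correlation named in the abstract above can be illustrated with a minimal sketch. This is not the authors' analysis code: the function names are hypothetical and the dosimetry values are invented purely for illustration. The approach is to rank both variables (averaging ranks for ties) and take the Pearson correlation of the ranks.

```python
def average_ranks(values):
    """Return 1-based ranks, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)


# Hypothetical daily values (not from the study): noise SPL and voice SPL in dB.
noise_spl = [55.0, 60.0, 62.0, 70.0, 71.0]
voice_spl = [68.0, 70.0, 73.0, 77.0, 80.0]
print(spearman_rho(noise_spl, voice_spl))
```

Because only ranks enter the calculation, any strictly increasing relationship between the two measures yields a coefficient of 1, regardless of whether it is linear.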
The Interaction of Surface Hydration and Vocal Loading on Voice Measures.
Fujiki, Robert Brinton; Chapleau, Abigail; Sundarrajan, Anusha; McKenna, Victoria; Sivasankar, M Preeti
2017-03-01
Vocal loading tasks provide insight regarding the mechanisms underlying healthy laryngeal function. Determining the manner in which the larynx can most efficiently be loaded is a complex task. The goal of this study was to determine if vocal loading could be achieved in 30 minutes by altering phonatory mode. Owing to the fact that surface hydration facilitates efficient vocal fold oscillation, the effects of environmental humidity on vocal loading were also examined. This study also investigated whether the detrimental effects of vocal loading could be attenuated by increasing environmental humidity. Sixteen vocally healthy adults (8 men, 8 women) completed a 30-minute vocal loading task in low and moderate humidity. The order of humidities was counterbalanced across subjects. The vocal loading task consisted of reading with elevated pitch and pressed vocal quality and low pitch and pressed and/or raspy vocal quality in the presence of 65 dB ambient, multi-talker babble noise. Significant effects were observed for (1) cepstral peak prominence on soft sustained phonation at 10th and 80th pitches, (2) perceived phonatory effort, and (3) perceived tiredness ratings. No loading effects were observed for cepstral peak prominence on the rainbow passage, although fundamental frequency on the rainbow passage increased post loading. No main effect was observed for humidity. Vocal loading was achieved following a 30-minute vocal loading task involving altering laryngeal vibratory mode in combination with increased volume. Also, moderate environmental humidity did not significantly attenuate the negative effects of loading. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Convergent Differential Regulation of Parvalbumin in the Brains of Vocal Learners
Hara, Erina; Rivas, Miriam V.; Ward, James M.; Okanoya, Kazuo; Jarvis, Erich D.
2012-01-01
Spoken language and learned song are complex communication behaviors found in only a few species, including humans and three groups of distantly related birds – songbirds, parrots, and hummingbirds. Despite their large phylogenetic distances, these vocal learners show convergent behaviors and associated brain pathways for vocal communication. However, it is not clear whether this behavioral and anatomical convergence is associated with molecular convergence. Here we used oligo microarrays to screen for genes differentially regulated in brain nuclei necessary for producing learned vocalizations relative to adjacent brain areas that control other behaviors in avian vocal learners versus vocal non-learners. A top candidate gene in our screen was a calcium-binding protein, parvalbumin (PV). In situ hybridization verification revealed that PV was expressed significantly higher throughout the song motor pathway, including brainstem vocal motor neurons relative to the surrounding brain regions of all distantly related avian vocal learners. This differential expression was specific to PV and vocal learners, as it was not found in avian vocal non-learners nor for control genes in learners and non-learners. Similar to the vocal learning birds, higher PV up-regulation was found in the brainstem tongue motor neurons used for speech production in humans relative to a non-human primate, macaques. These results suggest repeated convergent evolution of differential PV up-regulation in the brains of vocal learners separated by more than 65–300 million years from a common ancestor and that the specialized behaviors of learned song and speech may require extra calcium buffering and signaling. PMID:22238614
Neural coding of syntactic structure in learned vocalizations in the songbird.
Fujimoto, Hisataka; Hasegawa, Taku; Watanabe, Dai
2011-07-06
Although vocal signals including human languages are composed of a finite number of acoustic elements, complex and diverse vocal patterns can be created from combinations of these elements, linked together by syntactic rules. To enable such syntactic vocal behaviors, neural systems must extract the sequence patterns from auditory information and establish syntactic rules to generate motor commands for vocal organs. However, the neural basis of syntactic processing of learned vocal signals remains largely unknown. Here we report that the basal ganglia projecting premotor neurons (HVC(X) neurons) in Bengalese finches represent syntactic rules that generate variable song sequences. When vocalizing an alternative transition segment between song elements called syllables, sparse burst spikes of HVC(X) neurons code the identity of a specific syllable type or a specific transition direction among the alternative trajectories. When vocalizing a variable repetition sequence of the same syllable, HVC(X) neurons not only signal the initiation and termination of the repetition sequence but also indicate the progress and state-of-completeness of the repetition. These different types of syntactic information are frequently integrated within the activity of single HVC(X) neurons, suggesting that syntactic attributes of the individual neurons are not programmed as a basic cellular subtype in advance but acquired in the course of vocal learning and maturation. Furthermore, some auditory-vocal mirroring type HVC(X) neurons display transition selectivity in the auditory phase, much as they do in the vocal phase, suggesting that these songbirds may extract syntactic rules from auditory experience and apply them to form their own vocal behaviors.
Gender differences affecting vocal health of women in vocally demanding careers
Hunter, Eric J.; Smith, Marshall E.; Tanner, Kristine
2012-01-01
Studies suggest that occupational voice users have a greater incidence of vocal issues than the general population. Women have been found to experience vocal health problems more frequently than men, regardless of their occupation. Traditionally, it has been assumed that differences in the laryngeal system are the cause of this disproportion. Nevertheless, it is valuable to identify other potential gender distinctions which may make women more vulnerable to voice disorders. A search of the literature was conducted for gender-specific characteristics which might impact the vocal health of women. This search can be used by healthcare practitioners to help female patients avoid serious vocal health injuries, as well as to better treat women who already suffer from such vocal health issues. PMID:21722077
Evolution of Courtship Songs in Xenopus : Vocal Pattern Generation and Sound Production.
Leininger, Elizabeth C; Kelley, Darcy B
2015-01-01
The extant species of African clawed frogs (Xenopus and Silurana) provide an opportunity to link the evolution of vocal characters to changes in the responsible cellular and molecular mechanisms. In this review, we integrate several robust lines of research: evolutionary trajectories of Xenopus vocalizations, cellular and circuit-level mechanisms of vocalization in selected Xenopus model species, and Xenopus evolutionary history and speciation mechanisms. Integrating recent findings allows us to generate and test specific hypotheses about the evolution of Xenopus vocal circuits. We propose that reduced vocal sex differences in some Xenopus species result from species-specific losses of sexually differentiated neural and neuromuscular features. Modification of sex-hormone-regulated developmental mechanisms is a strong candidate mechanism for reduced vocal sex differences.
Lee, Shao-Hsuan; Hsiao, Tzu-Yu; Lee, Guo-She
2015-06-01
Sustained vocalizations of vowels [a], [i], and syllable [mə] were collected in twenty normal-hearing individuals. During vocalization, five conditions of audio-vocal feedback were introduced separately to the speakers: no masking, wearing supra-aural headphones only, speech-noise masking, high-pass noise masking, and broad-band-noise masking. Power spectral analysis of vocal fundamental frequency (F0) was used to evaluate the modulations of F0, and linear predictive coding was used to acquire the first two formants. The results showed that while the formant frequencies were not significantly shifted, low-frequency modulations (<3 Hz) of F0 significantly increased with reduced audio-vocal feedback across speech sounds and were significantly correlated with auditory awareness of speakers' own voices. For sustained speech production, the motor speech controls on F0 may depend on a feedback mechanism while articulation should rely more on a feedforward mechanism. Power spectral analysis of F0 might be applied to evaluate audio-vocal control for various hearing and neurological disorders in the future. Copyright © 2015 Elsevier B.V. All rights reserved.
Chimpanzees modify recruitment screams as a function of audience composition
Slocombe, Katie E.; Zuberbühler, Klaus
2007-01-01
Wild chimpanzees produce acoustically distinct scream vocalizations depending on their social role during agonistic interactions with other group members. Here, we show that victims during such agonistic interactions alter the acoustic structure of their screams depending on the severity of aggression experienced, providing nearby listeners with important cues about the nature of the attack. However, we also found that victims of severe attacks produced screams that significantly exaggerated the true level of aggression experienced, but they did so only if there was at least one listener in the audience who matched or surpassed the aggressor in rank. Our results are consistent with the more general hypothesis that chimpanzees possess sophisticated understanding of third-party relationships, so-called triadic awareness, and that this knowledge influences their vocal production. PMID:17942683
Yu, Chengzhu; Hansen, John H L
2017-03-01
Human physiology has evolved to accommodate environmental conditions, including temperature, pressure, and air chemistry unique to Earth. However, the environment in space varies significantly compared to that on Earth and, therefore, variability is expected in astronauts' speech production mechanism. In this study, the variations of astronaut voice characteristics during the NASA Apollo 11 mission are analyzed. Specifically, acoustical features such as fundamental frequency and phoneme formant structure that are closely related to the speech production system are studied. For a further understanding of astronauts' vocal tract spectrum variation in space, a maximum likelihood frequency warping based analysis is proposed to detect the vocal tract spectrum displacement during space conditions. The results from fundamental frequency, formant structure, as well as vocal spectrum displacement indicate that astronauts change their speech production mechanism when in space. Moreover, the experimental results for astronaut voice identification tasks indicate that current speaker recognition solutions are highly vulnerable to astronaut voice production variations in space conditions. Future recommendations from this study suggest that successful applications of speaker recognition during extended space missions require robust speaker modeling techniques that could effectively adapt to voice production variation caused by diverse space conditions.
Garland, Ellen C; Rendell, Luke; Lilley, Matthew S; Poole, M Michael; Allen, Jenny; Noad, Michael J
2017-07-01
Identifying and quantifying variation in vocalizations is fundamental to advancing our understanding of processes such as speciation, sexual selection, and cultural evolution. The song of the humpback whale (Megaptera novaeangliae) presents an extreme example of complexity and cultural evolution. It is a long, hierarchically structured vocal display that undergoes constant evolutionary change. Obtaining robust metrics to quantify song variation at multiple scales (from a sound through to population variation across the seascape) is a substantial challenge. Here, the authors present a method to quantify song similarity at multiple levels within the hierarchy. To incorporate the complexity of these multiple levels, the calculation of similarity is weighted by measurements of sound units (lower levels within the display) to bridge the gap in information between upper and lower levels. Results demonstrate that the inclusion of weighting provides a more realistic and robust representation of song similarity at multiple levels within the display. This method permits robust quantification of cultural patterns and processes that will also contribute to the conservation management of endangered humpback whale populations, and is applicable to any hierarchically structured signal sequence.
Halim, Zahid; Abbas, Ghulam
2015-01-01
Sign language provides hearing- and speech-impaired individuals with an interface to communicate with other members of society. Unfortunately, sign language is not understood by most people. To address this, a device based on image processing and pattern recognition can provide a vital aid for detecting and translating sign language into a vocal language. This work presents a system that detects and interprets sign language gestures with a custom-built software tool and then translates each gesture into a vocal language. To recognize a particular gesture, the system employs a Dynamic Time Warping (DTW) algorithm, and an off-the-shelf software tool is employed for vocal language generation. Microsoft® Kinect is the primary tool used to capture the video stream of a user. The proposed method is capable of successfully detecting gestures stored in the dictionary with an accuracy of 91%. The proposed system has the ability to define and add custom-made gestures. Based on an experiment in which 10 individuals with impairments used the system to communicate with 5 people with no disability, 87% agreed that the system was useful.
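As a rough illustration of how Dynamic Time Warping can match a captured gesture against a stored dictionary, the following is a minimal sketch of the textbook algorithm on one-dimensional sequences. It is not the authors' implementation: the function names, the absolute-difference cost, and the toy gesture traces are all assumptions, and a real Kinect pipeline would compare multi-dimensional joint trajectories.

```python
def dtw_distance(a, b):
    """Cumulative DTW alignment cost between two 1-D numeric sequences."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] is the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # local cost (assumed metric)
            cost[i][j] = d + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def classify(query, dictionary):
    """Return the name of the stored template with the smallest DTW cost."""
    return min(dictionary, key=lambda name: dtw_distance(query, dictionary[name]))


# Hypothetical templates: one coordinate of a hand position over time.
templates = {
    "wave": [0, 1, 0, 1, 0],
    "push": [0, 1, 2, 3, 4],
}
# The same "push" motion performed more slowly at the start and end.
print(classify([0, 0, 1, 2, 3, 4, 4], templates))
```

The warping step is what lets the slower performance still match its template exactly, since repeated samples can be aligned to a single template sample at no extra cost.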
Speech and swallowing disorders in Parkinson disease.
Sapir, Shimon; Ramig, Lorraine; Fox, Cynthia
2008-06-01
To review recent research and clinical studies pertaining to the nature, diagnosis, and treatment of speech and swallowing disorders in Parkinson disease. Although some studies indicate improvement in voice and speech with dopamine therapy and deep brain stimulation of the subthalamic nucleus, others show minimal or adverse effects. Repetitive transcranial magnetic stimulation of the mouth motor cortex and injection of collagen in the vocal folds have preliminary data supporting improvement in phonation in people with Parkinson disease. Treatments focusing on vocal loudness, specifically LSVT LOUD (Lee Silverman Voice Treatment), have been effective for the treatment of speech disorders in Parkinson disease. Changes in brain activity due to LSVT LOUD provide preliminary evidence for neural plasticity. Computer-based technology makes the Lee Silverman Voice Treatment available to a large number of users. A rat model for studying neuropharmacologic effects on vocalization in Parkinson disease has been developed. New diagnostic methods of speech and swallowing are also available as the result of recent studies. Speech rehabilitation with the LSVT LOUD is highly efficacious and scientifically tested. There is a need for more studies to improve understanding, diagnosis, prevention, and treatment of speech and swallowing disorders in Parkinson disease.
Understanding speaker attitudes from prosody by adults with Parkinson's disease.
Monetta, Laura; Cheang, Henry S; Pell, Marc D
2008-09-01
The ability to interpret vocal (prosodic) cues during social interactions can be disrupted by Parkinson's disease, with notable effects on how emotions are understood from speech. This study investigated whether PD patients who have emotional prosody deficits exhibit further difficulties decoding the attitude of a speaker from prosody. Vocally inflected but semantically nonsensical 'pseudo-utterances' were presented to listener groups with and without PD in two separate rating tasks. Task 1 required participants to rate how confident a speaker sounded from their voice and Task 2 required listeners to rate how polite the speaker sounded for a comparable set of pseudo-utterances. The results showed that PD patients were significantly less able than healthy control (HC) participants to use prosodic cues to differentiate intended levels of speaker confidence in speech, although the patients could accurately detect the polite/impolite attitude of the speaker from prosody in most cases. Our data suggest that many PD patients fail to use vocal cues to effectively infer a speaker's emotions as well as certain attitudes in speech such as confidence, consistent with the idea that the basal ganglia play a role in the meaningful processing of prosodic sequences in spoken language (Pell & Leonard, 2003).
Precursors of Dancing and Singing to Music in Three- to Four-Months-Old Infants
Fujii, Shinya; Watanabe, Hama; Oohashi, Hiroki; Hirashima, Masaya; Nozaki, Daichi; Taga, Gentaro
2014-01-01
Dancing and singing to music involve auditory-motor coordination and have been essential to our human culture since ancient times. Although scholars have been trying to understand the evolutionary and developmental origin of music, early human developmental manifestations of auditory-motor interactions in music have not been fully investigated. Here we report limb movements and vocalizations in three- to four-months-old infants while they listened to music and were in silence. In the group analysis, we found no significant increase in the amount of movement or in the relative power spectrum density around the musical tempo in the music condition compared to the silent condition. Intriguingly, however, there were two infants who demonstrated striking increases in the rhythmic movements via kicking or arm-waving around the musical tempo during listening to music. Monte-Carlo statistics with phase-randomized surrogate data revealed that the limb movements of these individuals were significantly synchronized to the musical beat. Moreover, we found a clear increase in the formant variability of vocalizations in the group during music perception. These results suggest that infants at this age are already primed with their bodies to interact with music via limb movements and vocalizations. PMID:24837135
Perceptual elements in brain mechanisms of acoustic communication in humans and nonhuman primates.
Reser, David H; Rosa, Marcello
2014-12-01
Ackermann et al. outline a model for elaboration of subcortical motor outputs as a driving force for the development of the apparently unique behaviour of language in humans. They emphasize circuits in the striatum and midbrain, and acknowledge, but do not explore, the importance of the auditory perceptual pathway for evolution of verbal communication. We suggest that understanding the evolution of language will also require understanding of vocalization perception, especially in the auditory cortex.
An Investigation of Extinction-Induced Vocalizations
ERIC Educational Resources Information Center
Valentino, Amber L.; Shillingsburg, M. Alice; Call, Nathan A.; Burton, Britney; Bowen, Crystal N.
2011-01-01
Children with autism have significant communication delays. Although some children develop vocalizations through shaping and differential reinforcement, others rarely exhibit vocalizations, and alternative methods are targeted in intervention. However, vocal language often remains a goal for caregivers and clinicians. Thus, strategies to increase…
Vocal fold paralysis secondary to phonotrauma.
Klein, Travis A L; Gaziano, Joy E; Ridley, Marion B
2014-01-01
A unique case of acute onset vocal fold paralysis secondary to phonotrauma is presented. The cause was forceful vocalization by a drill instructor on a firearm range. Imaging studies revealed extensive intralaryngeal and retropharyngeal hemorrhage. Laryngoscopy showed a complete left vocal fold paralysis. Relative voice rest was recommended, and the patient regained normal vocal fold mobility and function after approximately 12 weeks. Copyright © 2014 The Voice Foundation. All rights reserved.
ERIC Educational Resources Information Center
Meerschman, Iris; Van Lierde, Kristiane; Peeters, Karen; Meersman, Eline; Claeys, Sofie; D'haeseleer, Evelien
2017-01-01
Purpose: The purpose of this study was to determine the short-term effect of 2 semi-occluded vocal tract training programs, "resonant voice training using nasal consonants" versus "straw phonation," on the vocal quality of vocally healthy future occupational voice users. Method: A multigroup pretest--posttest randomized control…
Clinical practice: vocal nodules in dysphonic children.
Martins, Regina Helena Garcia; Branco, Anete; Tavares, Elaine Lara Mendes; Gramuglia, Andrea Cristina Jóia
2013-09-01
Common among children, vocal symptoms are a cause of concern for parents who seek elucidation of their diagnosis and treatment. Vocal nodules are the major cause of dysphonias in children and are related to vocal abuse. We conducted a literature review considering clinical, physiopathological, epidemiological, and histological aspects of vocal nodules, as well as diagnostic methods, highlighting the main studies addressing this issue. The controversial points of treatments were also discussed.
Mechanical properties of the vocal fold. Stress-strain studies.
Haji, T; Mori, K; Omori, K; Isshiki, N
1992-01-01
The viscoelasticity of the vocal and ventricular folds was experimentally assessed by analyzing the stress-strain relationships obtained using a newly developed measuring system. The degree of stiffness of the mid-membranous portion of the vocal fold was less than that near the anterior commissure or the vocal process. The ventricular fold was much less stiff and significantly more viscous than the vocal fold. At the membranous portion of the vocal fold, the degree of stiffness was less and that of viscosity greater at 2 mm above and below the free margin than at the free margin itself.
Fuentes-López, Eduardo; Fuente, Adrian; Contreras, Karem V
2017-12-18
The aim of this study is to determine possible associations between vocal hygiene habits and self-reported vocal symptoms in telemarketers. A cross-sectional study that included 79 operators from call centres in Chile was carried out. Their vocal hygiene habits and self-reported symptoms were investigated using a validated and reliable questionnaire created for the purposes of this study. Forty-five percent of telemarketers reported having one or more vocal symptoms. Among them, 16.46% reported that their voices tense up when talking and 10.13% needed to clear their throat to make their voices clearer. Five percent mentioned that they always talk without taking a break and 40.51% reported using their voices in noisy environments. The number of working hours per day and inadequate vocal hygiene habits were associated with the presence of self-reported symptoms. Additionally, an interaction between the use of the voice in noisy environments and not taking breaks during the day was observed. Finally, the frequency of inadequate vocal hygiene habits was associated with the number of symptoms reported. Using the voice in noisy environments and talking without taking breaks were both associated with the presence of specific vocal symptoms. This study provides some evidence about the interaction between these two inadequate vocal hygiene habits that potentiates vocal symptoms.
Julias, Margaret; Riede, Tobias; Cook, Douglas
2014-01-01
Objectives: Collagen fiber content and orientation affect the viscoelastic properties of the vocal folds, determining oscillation characteristics during speech and other vocalization. The investigation and reconstruction of the collagen network in vocal folds remains a challenge, because the collagen network requires at least micron-scale resolution. In this study, we used polarized light microscopy to investigate the distribution and alignment of collagen fibers within the vocal folds. Methods: Data were collected in sections of human and rhesus monkey (Macaca mulatta) vocal folds cut at 3 different angles and stained with picrosirius red. Results: Statistically significant differences were found between different section angles, implying that more than one section angle is required to capture the network’s complexity. In the human vocal folds, the collagen fiber distribution continuously varied across the lamina propria (medial to lateral). Distinct differences in birefringence distribution were observed between the species. For the human vocal folds, high birefringence was observed near the thyroarytenoid muscle and near the epithelium. However, in the rhesus monkey vocal folds, high birefringence was observed near the epithelium, and lower birefringence was seen near the thyroarytenoid muscle. Conclusions: The differences between the collagen networks in human and rhesus monkey vocal folds provide a morphological basis for differences in viscoelastic properties between species. PMID:23534129
Imaizumi, Mitsuyoshi; Li-Jessen, Nicole Y K; Sato, Yuka; Yang, David T; Thibeault, Susan L
2017-04-01
One prospective treatment option for vocal fold scarring is regeneration with an engineered scaffold containing induced pluripotent stem cells (iPS). In the present study, we investigated the feasibility of utilizing an injectable hyaluronic acid (HA) scaffold encapsulated with human-iPS cell (hiPS) for regeneration of vocal folds. Thirty athymic nude rats underwent unilateral vocal fold injury. Contralateral vocal folds served as uninjured controls. Hyaluronic acid hydrogel scaffold, HA hydrogel scaffold containing hiPS, and HA hydrogel scaffold containing hiPS with epidermal growth factor (EGF) were injected in both vocal folds immediately after surgery. One and 2 weeks after injection, larynges were excised for histology, immunohistochemistry, and fluorescence in situ hybridization (FISH). Presence of HA hydrogel was confirmed in vocal folds 1 and 2 weeks post injection. The FISH analysis confirmed the presence and viability of hiPS in the injected vocal folds. Histological results demonstrated that vocal folds injected with HA hydrogel scaffold containing EGF demonstrated less fibrosis than those with HA hydrogel only. Human-iPS survived in injured rat vocal folds. The HA hydrogel with hiPS and EGF ameliorated the fibrotic response. Additional work is necessary to optimize hiPS differentiation and further confirm the safety of hiPS for clinical applications.
Makiyama, Kiyoshi; Yoshihashi, Hidetaka; Mogitate, Manabu; Kida, Akinori
2005-04-01
To determine the role of the adjustment of expiratory effort in the control of vocal intensity. An intensity-loading test was performed by using the airway interruption method. Three groups of subjects were used: a control group thought to resemble normal vocal fold closure, a group of patients with Reinke's edema thought to represent increased mass at the level of the vocal folds, and a group with vocal fold paralysis that was thought to represent a group with lack of adequate vocal fold closure. In the control group, expiratory lung pressure and airway resistance slightly increased. In the patients with Reinke's edema, expiratory lung pressure, and airway resistance significantly increased. In this group, the voice intensity was controlled by laryngeal adjustment, but a greater expiratory effort was needed because of a greater increase in glottal resistance. In the patients with vocal cord paralysis, airway resistance did not increase even with a high-intensity voice. Vocal intensity was controlled by expiratory effort. If there is sufficient ability for laryngeal adjustment, vocal intensity is controlled primarily by laryngeal adjustment and by expiratory adjustment in response to increased glottal resistance. However, vocal intensity is controlled by expiratory effort when laryngeal adjustment ability is poor.
Mothers' tone of voice depends on the nature of infants' transgressions.
Dahl, Audun; Sherlock, Briana R; Campos, Joseph J; Theunissen, Frédéric E
2014-08-01
Emotional vocal signals are important ways of communicating norms to young infants. The second year is a period of increase in various forms of child transgressions, but also a period when infants have limited linguistic abilities. Two studies investigated the hypothesis that mothers respond with different vocal emotional tones to 3 types of child transgressions: moral (harming others), prudential (harming oneself), and pragmatic (creating inconvenience, e.g., by spilling) transgressions. We used a combination of naturalistic observation (Study 1) and experimental manipulation (Study 2) to record, code, and analyze maternal vocal responses to child transgressions. Both studies showed that mothers were more likely to use intense, angry vocalizations in response to moral transgressions, fearful vocalizations in response to prudential transgressions, comforting vocalizations in response to pragmatic and prudential transgressions, and (in Study 2) playful vocalizations in response to pragmatic transgressions. Study 1 showed that this differential use of vocal tone is used systematically in everyday life. Study 2 allowed us to standardize the context of the maternal intervention and perform additional acoustical analyses. A combination of principal component analysis and linear discriminant analysis applied to pitch and intensity data provided quantitative measures of the differences in vocal responses. These differentiated vocal responses are likely contributors to children's acquisition of norms from early in life.
Discriminating Simulated Vocal Tremor Source Using Amplitude Modulation Spectra
Carbonell, Kathy M.; Lester, Rosemary A.; Story, Brad H.; Lotto, Andrew J.
2014-01-01
Objectives/Hypothesis: Sources of vocal tremor are difficult to categorize perceptually and acoustically. This paper describes a preliminary attempt to discriminate vocal tremor sources through the use of spectral measures of the amplitude envelope. The hypothesis is that different vocal tremor sources are associated with distinct patterns of acoustic amplitude modulations. Study Design: Statistical categorization methods (discriminant function analysis) were used to discriminate signals from simulated vocal tremor with different sources using only acoustic measures derived from the amplitude envelopes. Methods: Simulations of vocal tremor were created by modulating parameters of a vocal fold model corresponding to oscillations of respiratory driving pressure (respiratory tremor), degree of vocal fold adduction (adductory tremor) and fundamental frequency of vocal fold vibration (F0 tremor). The acoustic measures were based on spectral analyses of the amplitude envelope computed across the entire signal and within select frequency bands. Results: The signals could be categorized (with accuracy well above chance) in terms of the simulated tremor source using only measures of the amplitude envelope spectrum even when multiple sources of tremor were included. Conclusions: These results supply initial support for an amplitude-envelope based approach to identify the source of vocal tremor and provide further evidence for the rich information about talker characteristics present in the temporal structure of the amplitude envelope. PMID:25532813
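The amplitude-envelope spectrum mentioned in the abstract above can be illustrated with a minimal sketch. This is not the authors' pipeline: the test signal is a plain amplitude-modulated sinusoid rather than output from a vocal fold model, the envelope follower is a simple rectify-and-average step, and every name and parameter value (`synth_tremor`, `amplitude_envelope`, `modulation_spectrum`, the 5 Hz tremor rate, the 20 ms window) is a hypothetical choice made for illustration only.

```python
import math


def synth_tremor(fs=2000, dur=2.0, carrier_hz=200.0, tremor_hz=5.0, depth=0.3):
    """Hypothetical stand-in signal: a sinusoid whose amplitude wobbles at tremor_hz."""
    n = int(fs * dur)
    return [
        (1.0 + depth * math.sin(2 * math.pi * tremor_hz * i / fs))
        * math.sin(2 * math.pi * carrier_hz * i / fs)
        for i in range(n)
    ]


def amplitude_envelope(signal, fs, window_s=0.02):
    """Crude envelope follower: full-wave rectify, then take a trailing moving average."""
    n = max(1, int(round(window_s * fs)))
    rect = [abs(x) for x in signal]
    env, acc = [], 0.0
    for i, v in enumerate(rect):
        acc += v
        if i >= n:
            acc -= rect[i - n]          # drop the sample that left the window
        env.append(acc / min(i + 1, n))  # partial window at the very start
    return env


def modulation_spectrum(env, fs, freqs_hz):
    """Magnitude of the mean-removed envelope at each modulation frequency (direct DFT)."""
    mean = sum(env) / len(env)
    centered = [e - mean for e in env]
    spectrum = {}
    for f in freqs_hz:
        re = sum(c * math.cos(2 * math.pi * f * i / fs) for i, c in enumerate(centered))
        im = sum(c * math.sin(2 * math.pi * f * i / fs) for i, c in enumerate(centered))
        spectrum[f] = 2.0 * math.hypot(re, im) / len(centered)
    return spectrum


fs = 2000
sig = synth_tremor(fs=fs)
env = amplitude_envelope(sig, fs)
spec = modulation_spectrum(env, fs, freqs_hz=range(1, 13))
peak_hz = max(spec, key=spec.get)  # modulation frequency with the largest magnitude
print(peak_hz)
```

In this toy case the largest modulation component sits at the simulated tremor rate. A classifier such as the discriminant function analysis named in the abstract would take several such envelope-spectrum measures as input features; that step is not shown here.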
Perception of affective and linguistic prosody: an ALE meta-analysis of neuroimaging studies
Brown, Steven
2014-01-01
Prosody refers to the melodic and rhythmic aspects of speech. Two forms of prosody are typically distinguished: ‘affective prosody’ refers to the expression of emotion in speech, whereas ‘linguistic prosody’ relates to the intonation of sentences, including the specification of focus within sentences and stress within polysyllabic words. While these two processes are united by their use of vocal pitch modulation, they are functionally distinct. In order to examine the localization and lateralization of speech prosody in the brain, we performed two voxel-based meta-analyses of neuroimaging studies of the perception of affective and linguistic prosody. There was substantial sharing of brain activations between analyses, particularly in right-hemisphere auditory areas. However, a major point of divergence was observed in the inferior frontal gyrus: affective prosody was more likely to activate Brodmann area 47, while linguistic prosody was more likely to activate the ventral part of area 44. PMID:23934416
Gillis, Julia M.; Laman, Timothy G.
2017-01-01
The distinctive and divergent courtship phenotypes of the birds-of-paradise make them an important group for gaining insights into sexually selected phenotypic evolution. The genus Astrapia includes five long-tailed species that inhabit New Guinea’s montane forests. The visual and acoustic components of courtship among Astrapia species are very poorly known. In this study, we use audiovisual data from a natural history collection of animal behavior to fill gaps in knowledge about the visual and acoustic components of Astrapia courtship. We report seven distinct male behaviors and two female specific behaviors along with distinct vocalizations and wing-produced sonations for all five species. These results provide the most complete assessment of courtship in the genus Astrapia to date and provide a valuable baseline for future research, including comparative and evolutionary studies among these and other bird-of-paradise species. PMID:29134145
Harbor Seal (Phoca vitulina) Reproductive Advertisement Behavior and the Effects of Vessel Noise
NASA Astrophysics Data System (ADS)
Matthews, Leanna P.
Harbor seals (Phoca vitulina) are a widely distributed pinniped species that mate underwater. Similar to other aquatically mating pinnipeds, male harbor seals produce vocalizations during the breeding season that function in male-male interactions and possibly as an attractant for females. I investigated multiple aspects of these reproductive advertisement displays in a population of harbor seals in Glacier Bay National Park and Preserve, Alaska. First, I looked at vocal production as a function of environmental variables, including season, daylight, and tidal state. Vocalizations were highly seasonal and detection of these vocalizations peaked in June and July, which correspond with the estimated time of breeding. Vocalizations also varied with light, with the lowest probability of detection during the day and the highest probability of detection at night. The high probability of detection corresponded to when females are known to forage. These results are similar to the vocal behavior of previously studied populations. However, unlike previously studied populations, the detection of harbor seal breeding vocalizations did not vary with tidal state. This is likely due to the location of the hydrophone, as it was not near the haul out and depth was therefore not significantly influenced by changes in tidal height. I also investigated the source levels and call parameters of vocalizations, as well as call rate and territoriality. The average source level of harbor seal breeding vocalizations was 144 dB re 1 μPa at 1 m and measurements ranged from 129 to 149 dB re 1 μPa. Analysis of call parameters indicated that vocalizations of harbor seals in Glacier Bay were similar in duration to other populations, but were much lower in frequency. During the breeding season, there were two discrete calling areas that likely represent two individual males; the average call rate in these display areas was approximately 1 call per minute.
The harbor seal breeding season also overlaps with peak tourism in Glacier Bay, and the majority of tourists visit the park on a motorized vessel. Because of this overlap, I investigated the impacts of vessel noise on the vocal behavior of individual males. In the presence of vessel noise, male harbor seals increase the amplitude of their vocalizations, decrease the duration, and increase the minimum frequency. These vocal shifts are similar to studies of noise impacts on other species across taxa, but it is unknown how this could impact the reproductive success of male harbor seals. Finally, I looked at the role of female preference for male vocalizations. Using playbacks of male vocalizations to captive female harbor seals, I found that females have a higher response to vocalizations that correspond to dominant males. Females were less responsive to subordinate male vocalizations, which had a shorter duration and a higher frequency. Given that male harbor seals decrease the duration and increase the frequency of vocalizations in the presence of noise, it is possible that these vocalizations become less attractive in noise.
Guo, Zhiqiang; Wu, Xiuqin; Li, Weifeng; Jones, Jeffery A; Yan, Nan; Sheft, Stanley; Liu, Peng; Liu, Hanjun
2017-10-25
Although working memory (WM) is considered as an emergent property of the speech perception and production systems, the role of WM in sensorimotor integration during speech processing is largely unknown. We conducted two event-related potential experiments with female and male young adults to investigate the contribution of WM to the neurobehavioural processing of altered auditory feedback during vocal production. A delayed match-to-sample task that required participants to indicate whether the pitch feedback perturbations they heard during vocalizations in test and sample sequences matched, elicited significantly larger vocal compensations, larger N1 responses in the left middle and superior temporal gyrus, and smaller P2 responses in the left middle and superior temporal gyrus, inferior parietal lobule, somatosensory cortex, right inferior frontal gyrus, and insula compared with a control task that did not require memory retention of the sequence of pitch perturbations. On the other hand, participants who underwent extensive auditory WM training produced suppressed vocal compensations that were correlated with improved auditory WM capacity, and enhanced P2 responses in the left middle frontal gyrus, inferior parietal lobule, right inferior frontal gyrus, and insula that were predicted by pretraining auditory WM capacity. These findings indicate that WM can enhance the perception of voice auditory feedback errors while inhibiting compensatory vocal behavior to prevent voice control from being excessively influenced by auditory feedback. This study provides the first evidence that auditory-motor integration for voice control can be modulated by top-down influences arising from WM, rather than modulated exclusively by bottom-up and automatic processes. SIGNIFICANCE STATEMENT One outstanding question that remains unsolved in speech motor control is how the mismatch between predicted and actual voice auditory feedback is detected and corrected. 
The present study provides two lines of converging evidence, for the first time, that working memory can not only enhance the perception of vocal feedback errors but also exert inhibitory control over vocal motor behavior. These findings represent a major advance in our understanding of the top-down modulatory mechanisms that support the detection and correction of prediction-feedback mismatches during sensorimotor control of speech production driven by working memory. Rather than being an exclusively bottom-up and automatic process, auditory-motor integration for voice control can be modulated by top-down influences arising from working memory. Copyright © 2017 the authors.
The Good, the Bad, and the Ugly: Agonistic Behaviour in Juvenile Crocodilians
Brien, Matthew L.; Lang, Jeffrey W.; Webb, Grahame J.; Stevenson, Colin; Christian, Keith A.
2013-01-01
We examined agonistic behaviour in seven species of hatchling and juvenile crocodilians held in small groups (N = 4) under similar laboratory conditions. Agonistic interactions occurred in all seven species, typically involved two individuals, were short in duration (5–15 seconds), and occurred between 1600–2200 h in open water. The nature and extent of agonistic interactions, the behaviours displayed, and the level of conspecific tolerance varied among species. Discrete postures, non-contact and contact movements are described. Three of these were species-specific: push downs by C. johnstoni; inflated tail sweeping by C. novaeguineae; and, side head striking combined with tail wagging by C. porosus. The two long-snouted species (C. johnstoni and G. gangeticus) avoided contact involving the head and often raised the head up out of the way during agonistic interactions. Several behaviours not associated with aggression are also described, including snout rubbing, raising the head up high while at rest, and the use of vocalizations. The two most aggressive species (C. porosus, C. novaeguineae) appeared to form dominance hierarchies, whereas the less aggressive species did not. Interspecific differences in agonistic behaviour may reflect evolutionary divergence associated with morphology, ecology, general life history and responses to interspecific conflict in areas where multiple species have co-existed. Understanding species-specific traits in agonistic behaviour and social tolerance has implications for the controlled raising of different species of hatchlings for conservation, management or production purposes. PMID:24349018
The Teacher's Voice: Vocal Training in Teacher Education
ERIC Educational Resources Information Center
Bele, Irene Velsvik
2008-01-01
The voice is a basic tool in human communication and an important factor in a positive self-understanding and identity, both for the teacher's sense of profession and for the pupils' ability to express themselves orally; two perspectives of great importance in the Norwegian National Curriculum. Voice disorders are common among teachers world-wide…
Collagen Content Limits Optical Coherence Tomography Image Depth in Porcine Vocal Fold Tissue.
Garcia, Jordan A; Benboujja, Fouzi; Beaudette, Kathy; Rogers, Derek; Maurer, Rie; Boudoux, Caroline; Hartnick, Christopher J
2016-11-01
Vocal fold scarring, a condition defined by increased collagen content, is challenging to treat without a method of noninvasively assessing vocal fold structure in vivo. The goal of this study was to observe the effects of vocal fold collagen content on optical coherence tomography imaging to develop a quantifiable marker of disease. Excised specimen study. Massachusetts Eye and Ear Infirmary. Porcine vocal folds were injected with collagenase to remove collagen from the lamina propria. Optical coherence tomography imaging was performed preinjection and at 0, 45, 90, and 180 minutes postinjection. Mean pixel intensity (or image brightness) was extracted from images of collagenase- and control-treated hemilarynges. Texture analysis of the lamina propria at each injection site was performed to extract image contrast. Two-factor repeated measure analysis of variance and t tests were used to determine statistical significance. Picrosirius red staining was performed to confirm collagenase activity. Mean pixel intensity was higher at injection sites of collagenase-treated vocal folds than at those of control vocal folds (P < .0001). Fold change in image contrast was significantly greater in collagenase-treated vocal folds than in control vocal folds (P = .002). Picrosirius red staining in control specimens revealed collagen fibrils most prominent in the subepithelium and above the thyroarytenoid muscle. Specimens treated with collagenase exhibited a loss of these structures. Collagen removal from vocal fold tissue increases image brightness of underlying structures. This inverse relationship may be useful in treating vocal fold scarring in patients. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
Vocal Hyperfunction in Parents of Children With Attention Deficit Hyperactivity Disorder.
Teresa, Garcia-Real; Díaz-Román, Tomás M
2016-05-01
The objective of this study was to evaluate the presence of habits and symptoms of vocal hyperfunction in the parents of children with attention deficit hyperactivity disorder (ADHD). Parents of 24 children with ADHD and 30 children of a control group completed a specific questionnaire to detect the hyperfunctional use of the voice (excessive talking, excessive loudness, talking too fast, and shouting), hoarseness, vocal fatigue, mental and physical fatigue, and the degree of parental concern for the vocal health of their child. Parents of children with ADHD spoke more often, faster, and stronger than the parents of the control group; in addition, they also used a louder volume than they usually used when they spoke to their children. The parents manifested more vocal, mental, and physical fatigue than the parents of the control group. There was a significant correlation between the "concern" for the vocal health of their children with respect to vocal symptoms of the children, the habits of vocal hyperfunctioning, and the symptoms suffered by the parents. These results suggest that the parents of children with ADHD change their vocal attitude when communicating with their children. Most likely, the increased concern of parents with ADHD children and their respective level of stress lead to hyperfunctional vocal usage. This subsequently leads to symptoms of vocal, physical, and mental fatigue at the end of the day. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Is laughter a better vocal change detector than a growl?
Pinheiro, Ana P; Barros, Carla; Vasconcelos, Margarida; Obermeier, Christian; Kotz, Sonja A
2017-07-01
The capacity to predict what should happen next and to minimize any discrepancy between an expected and an actual sensory input (prediction error) is a central aspect of perception. Particularly in vocal communication, the effective prediction of an auditory input that informs the listener about the emotionality of a speaker is critical. What is currently unknown is how the perceived valence of an emotional vocalization affects the capacity to predict and detect a change in the auditory input. This question was probed in a combined event-related potential (ERP) and time-frequency analysis approach. Specifically, we examined the brain response to standards (Repetition Positivity) and to deviants (Mismatch Negativity - MMN), as well as the anticipatory response to the vocal sounds (pre-stimulus beta oscillatory power). Short neutral, happy (laughter), and angry (growls) vocalizations were presented both as standard and deviant stimuli in a passive oddball listening task while participants watched a silent movie and were instructed to ignore the vocalizations. MMN amplitude was increased for happy compared to neutral and angry vocalizations. The Repetition Positivity was enhanced for happy standard vocalizations. Induced pre-stimulus upper beta power was increased for happy vocalizations, and predicted the modulation of the standard Repetition Positivity. These findings indicate enhanced sensory prediction for positive vocalizations such as laughter. Together, the results suggest that positive vocalizations are more effective predictors in social communication than angry and neutral ones, possibly due to their high social significance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Lin, Ya; Yamashita, Masaru; Zhang, Jingxian; Ling, Changying; Welham, Nathan V
2009-10-01
Disruption of the vocal fold extracellular matrix (ECM) can induce a profound and refractory dysphonia. Pulsed dye laser (PDL) irradiation has shown early promise as a treatment modality for disordered ECM in patients with chronic vocal fold scar; however, there are limited data addressing the mechanism by which this laser energy might induce cellular and extracellular changes in vocal fold tissues. In this study, we examined the inflammatory and ECM modulating effects of PDL irradiation on normal vocal fold tissues and cultured vocal fold fibroblasts (VFFs). We evaluated the effects of 585 nm PDL irradiation on inflammatory cytokine and collagen/collagenase gene transcription in normal rat vocal folds in vivo (3-168 hours following delivery of approximately 39.46 J/cm(2) fluence) and VFFs in vitro (3-72 hours following delivery of 4.82 or 9.64 J/cm(2) fluence). We also examined morphological vocal fold tissue changes 3 hours, 1 week, and 1 month post-irradiation. PDL irradiation altered inflammatory cytokine and procollagen/collagenase expression at the transcript level, both in vitro and in vivo. Additionally, PDL irradiation induced an inflammatory repair process in vivo that was completed by 1 month with preservation of normal tissue morphology. PDL irradiation can modulate ECM turnover in phenotypically normal vocal folds. Additional work is required to determine if these findings extend to disordered ECM, such as is seen in vocal fold scar. Lasers Surg. Med. 41:585-594, 2009. (c) 2009 Wiley-Liss, Inc.
Nocturnal "humming" vocalizations: adding a piece to the puzzle of giraffe vocal communication.
Baotic, Anton; Sicks, Florian; Stoeger, Angela S
2015-09-09
Recent research reveals that giraffes (Giraffa camelopardalis sp.) exhibit a socially structured, fission-fusion system. In other species possessing this kind of society, information exchange is important and vocal communication is usually well developed. But is this true for giraffes? Giraffes are known to produce sounds, but there is no evidence that they use vocalizations for communication. Reports on giraffe vocalizations are mainly anecdotal and the missing acoustic descriptions make it difficult to establish a call nomenclature. Despite inconclusive evidence to date, it is widely assumed that giraffes produce infrasonic vocalizations similar to elephants. In order to initiate a more detailed investigation of the vocal communication in giraffes, we collected data of captive individuals during day and night. We particularly focussed on detecting tonal, infrasonic or sustained vocalizations. We collected over 947 h of audio material in three European zoos and quantified the spectral and temporal components of acoustic signals to obtain an accurate set of acoustic parameters. Besides the known bursts, snorts and grunts, we detected harmonic, sustained and frequency-modulated "humming" vocalizations during night recordings. None of the recorded vocalizations were within the infrasonic range. These results show that giraffes do produce vocalizations, which, based on their acoustic structure, might have the potential to function as communicative signals to convey information about the physical and motivational attributes of the caller. The data further reveal that the assumption of infrasonic communication in giraffes needs to be considered with caution and requires further investigations in future studies.
Vocal activity of lesser galagos (Galago spp.) at zoos.
Schneiderová, Irena; Zouhar, Jan; Štefanská, Lucie; Bolfíková, Barbora Černá; Lhota, Stanislav; Brandl, Pavel
2016-01-01
Almost nothing is known about the natural vocal behavior of lesser galagos living in zoos. This is perhaps because they are usually kept in nocturnal exhibits separated from the visitors by a transparent and acoustically insulating glass barrier. The aim of the present study was therefore to fill this gap in knowledge of the vocal behavior of lesser galagos from zoos. This knowledge might be beneficial because the vocalizations of these small primates can be used for species determination. We performed a 10-day-long acoustic monitoring of vocal activity in each of seven various groups of Galago senegalensis and G. moholi living at four zoos. We quantitatively evaluated the occurrence of four loud vocalization types present in both species, including the most species-specific advertisement call. We found that qualitative as well as quantitative differences exist in the vocal behavior of the studied groups. We confirmed that the observed vocalization types can be collected from lesser galagos living at zoos, and the success can be increased by selecting larger and more diverse groups. We found two distinct patterns of diel vocal activity in the most vocally active groups. G. senegalensis groups were most vocally active at the beginning and at the end of their activity period, whereas one G. moholi group showed an opposite pattern. The latter is surprising, as it is generally accepted that lesser galagos emit advertisement calls especially at dawn and dusk, i.e., at the beginning and at the end of their diel activity. © 2016 Wiley Periodicals, Inc.
[Causes of vocal cord dyskinesia and its original factors after endotracheal intubation].
Sun, Anke; Zhang, Tiezheng; Liu, Wenyuan; Tang, Weiwei; Guo, Xiaohong
2012-03-01
To research the causes of postintubation vocal cord dyskinesia and its contributing factors. The causes of vocal cord dyskinesia were confirmed by laryngoscope, three-dimensional spiral CT, stroboscope, and the analysis of therapy. The factors relevant to the causes of vocal cord dyskinesia were analysed based on the following elements: (1) the anatomic or pathological condition of patients or the technical skills of anesthetists; (2) emaciated or obese body and neck; (3) the age of patients; (4) the duration of endotracheal tube retention; (5) the types of operations; (6) anesthesia procedure. Among 135 patients, 128 cases (94.81%) manifested arytenoid dislocation and 7 cases (5.19%) vocal cord paralysis. The study showed that the vocal cord dyskinesia associated with anatomic or pathological condition of patients and technical skills of anesthetists (with intubation difficulty) accounted for 76.30%. The patients with a relatively emaciated body or neck accounted for 90.62% of cases without intubation difficulty. Age had no significant analytical relationship with vocal cord dyskinesia. Prolonged intubation (endotracheal tube retention over 12 hours) accounted for only 17.64%. The incidence of vocal cord dyskinesia was nearly 0.5% in patients who underwent cardio-thoracic surgery, accounting for 59.26% of all the patients. There are two major causes of vocal cord dyskinesia: arytenoid dislocation and vocal cord paralysis, and the rate of vocal cord dyskinesia could be reduced by the improvement of technical skill of anesthetists and/or sufficient attention to the intubation condition of patients.
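The two case percentages reported in the abstract above follow directly from the stated counts (128 and 7 of 135 patients). A short arithmetic check, with hypothetical variable names, recomputes them and confirms that the two groups add up to the full cohort:

```python
total_patients = 135
arytenoid_dislocation = 128
vocal_cord_paralysis = 7

# Percentages rounded to two decimals, as in the abstract.
pct_dislocation = round(100 * arytenoid_dislocation / total_patients, 2)
pct_paralysis = round(100 * vocal_cord_paralysis / total_patients, 2)

print(pct_dislocation, pct_paralysis)
```

The other percentages in the abstract (76.30%, 90.62%, 17.64%, 59.26%) cannot be recomputed this way because their underlying counts are not given.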
Experimental analysis of the characteristics of artificial vocal folds.
Misun, Vojtech; Svancara, Pavel; Vasek, Martin
2011-05-01
Specialized literature presents a number of models describing the function of the vocal folds. In most of those models, an emphasis is placed on the air flowing through the glottis and, further, on the effect of the parameters of the air alone (its mass, speed, and so forth). The article focuses on the constructional definition of artificial vocal folds and their experimental analysis. The analysis is conducted for voiced source voice phonation and for the changing mean value of the subglottal pressure. The article further deals with the analysis of the pressure of the airflow through the vocal folds, which is cut (separated) into individual pulses by the vibrating vocal folds. The analysis results show that air pulse characteristics are relevant to voice generation, as they are produced by the flowing air and vibrating vocal folds. A number of artificial vocal folds have been constructed to date, and the aforementioned view of their phonation is confirmed by their analysis. The experiments have confirmed that man is able to consciously affect only two parameters of the source voice, that is, its fundamental frequency and voice intensity. The main forces acting on the vocal folds during phonation are as follows: subglottal air pressure and elastic and inertia forces of the vocal folds' structure. The correctness of the function of the artificial vocal folds is documented by the experimental verification of the spectra of several types of artificial vocal folds. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Lewandowski, Brian; Vyssotski, Alexei; Hahnloser, Richard H R; Schmidt, Marc
2013-06-01
Communication between auditory and vocal motor nuclei is essential for vocal learning. In songbirds, the nucleus interfacialis of the nidopallium (NIf) is part of a sensorimotor loop, along with auditory nucleus avalanche (Av) and song system nucleus HVC, that links the auditory and song systems. Most of the auditory information comes through this sensorimotor loop, with the projection from NIf to HVC representing the largest single source of auditory information to the song system. In addition to providing the majority of HVC's auditory input, NIf is also the primary driver of spontaneous activity and premotor-like bursting during sleep in HVC. Like HVC and RA, two nuclei critical for song learning and production, NIf exhibits behavioral-state dependent auditory responses and strong motor bursts that precede song output. NIf also exhibits extended periods of fast gamma oscillations following vocal production. Based on the converging evidence from studies of physiology and functional connectivity it would be reasonable to expect NIf to play an important role in the learning, maintenance, and production of song. Surprisingly, however, lesions of NIf in adult zebra finches have no effect on song production or maintenance. Only the plastic song produced by juvenile zebra finches during the sensorimotor phase of song learning is affected by NIf lesions. In this review, we carefully examine what is known about NIf at the anatomical, physiological, and behavioral levels. We reexamine conclusions drawn from previous studies in the light of our current understanding of the song system, and establish what can be said with certainty about NIf's involvement in song learning, maintenance, and production. Finally, we review recent theories of song learning integrating possible roles for NIf within these frameworks and suggest possible parallels between NIf and sensorimotor areas that form part of the neural circuitry for speech processing in humans. 
Copyright © 2013 Elsevier Ltd. All rights reserved.
Lewandowski, Brian; Vyssotski, Alexei; Hahnloser, Richard H.R.; Schmidt, Marc
2015-01-01
Communication between auditory and vocal motor nuclei is essential for vocal learning. In songbirds, the nucleus interfacialis of the nidopallium (NIf) is part of a sensorimotor loop, along with auditory nucleus avalanche (Av) and song system nucleus HVC, that links the auditory and song systems. Most of the auditory information comes through this sensorimotor loop, with the projection from NIf to HVC representing the largest single source of auditory information to the song system. In addition to providing the majority of HVC’s auditory input, NIf is also the primary driver of spontaneous activity and premotor-like bursting during sleep in HVC. Like HVC and RA, two nuclei critical for song learning and production, NIf exhibits behavioral-state dependent auditory responses and strong motor bursts that precede song output. NIf also exhibits extended periods of fast gamma oscillations following vocal production. Based on the converging evidence from studies of physiology and functional connectivity it would be reasonable to expect NIf to play an important role in the learning, maintenance, and production of song. Surprisingly, however, lesions of NIf in adult zebra finches have no effect on song production or maintenance. Only the plastic song produced by juvenile zebra finches during the sensorimotor phase of song learning is affected by NIf lesions. In this review, we carefully examine what is known about NIf at the anatomical, physiological, and behavioral levels. We reexamine conclusions drawn from previous studies in the light of our current understanding of the song system, and establish what can be said with certainty about NIf’s involvement in song learning, maintenance, and production. Finally, we review recent theories of song learning integrating possible roles for NIf within these frameworks and suggest possible parallels between NIf and sensorimotor areas that form part of the neural circuitry for speech processing in humans. PMID:23603062
Determining the Effect of Natural Selection on Linked Neutral Divergence across Species
Phung, Tanya N.; Lohmueller, Kirk E.
2016-01-01
A major goal in evolutionary biology is to understand how natural selection has shaped patterns of genetic variation across genomes. Studies in a variety of species have shown that neutral genetic diversity (intra-species differences) has been reduced at sites linked to those under direct selection. However, the effect of linked selection on neutral sequence divergence (inter-species differences) remains ambiguous. While empirical studies have reported correlations between divergence and recombination, which is interpreted as evidence for natural selection reducing linked neutral divergence, theory argues otherwise, especially for species that have diverged long ago. Here we address these outstanding issues by examining whether natural selection can affect divergence between both closely and distantly related species. We show that neutral divergence between closely related species (e.g. human-primate) is negatively correlated with functional content and positively correlated with human recombination rate. We also find that neutral divergence between distantly related species (e.g. human-rodent) is negatively correlated with functional content and positively correlated with estimates of background selection from primates. These patterns persist after accounting for the confounding factors of hypermutable CpG sites, GC content, and biased gene conversion. Coalescent models indicate that even when the contribution of ancestral polymorphism to divergence is small, background selection in the ancestral population can still explain a large proportion of the variance in divergence across the genome, generating the observed correlations. Our findings reveal that, contrary to previous intuition, natural selection can indirectly affect linked neutral divergence between both closely and distantly related species. 
Though we cannot formally exclude the possibility that the direct effects of purifying selection drive some of these patterns, such a scenario would be possible only if more of the genome is under purifying selection than currently believed. Our work has implications for understanding the evolution of genomes and interpreting patterns of genetic variation. PMID:27508305
Determining the Effect of Natural Selection on Linked Neutral Divergence across Species.
Phung, Tanya N; Huber, Christian D; Lohmueller, Kirk E
2016-08-01
A major goal in evolutionary biology is to understand how natural selection has shaped patterns of genetic variation across genomes. Studies in a variety of species have shown that neutral genetic diversity (intra-species differences) has been reduced at sites linked to those under direct selection. However, the effect of linked selection on neutral sequence divergence (inter-species differences) remains ambiguous. While empirical studies have reported correlations between divergence and recombination, which is interpreted as evidence for natural selection reducing linked neutral divergence, theory argues otherwise, especially for species that have diverged long ago. Here we address these outstanding issues by examining whether natural selection can affect divergence between both closely and distantly related species. We show that neutral divergence between closely related species (e.g. human-primate) is negatively correlated with functional content and positively correlated with human recombination rate. We also find that neutral divergence between distantly related species (e.g. human-rodent) is negatively correlated with functional content and positively correlated with estimates of background selection from primates. These patterns persist after accounting for the confounding factors of hypermutable CpG sites, GC content, and biased gene conversion. Coalescent models indicate that even when the contribution of ancestral polymorphism to divergence is small, background selection in the ancestral population can still explain a large proportion of the variance in divergence across the genome, generating the observed correlations. Our findings reveal that, contrary to previous intuition, natural selection can indirectly affect linked neutral divergence between both closely and distantly related species. 
Though we cannot formally exclude the possibility that the direct effects of purifying selection drive some of these patterns, such a scenario would be possible only if more of the genome is under purifying selection than currently believed. Our work has implications for understanding the evolution of genomes and interpreting patterns of genetic variation.
Computational model for vocal tract dynamics in a suboscine bird.
Assaneo, M F; Trevisan, M A
2010-09-01
In a recent work, active use of the vocal tract has been reported for singing oscines. The reconfiguration of the vocal tract during song serves to match its resonances to the syringeal fundamental frequency, demonstrating a precise coordination of the two main pieces of the avian vocal system for songbirds characterized by tonal songs. In this work we investigated the Great Kiskadee (Pitangus sulfuratus), a suboscine bird whose calls display a rich harmonic content. Using a recently developed mathematical model for the syrinx and a mobile vocal tract, we set up a computational model that provides a plausible reconstruction of the vocal tract movement using a few spectral features taken from the utterances. Moreover, synthetic calls were generated using the articulated vocal tract that accounts for all the acoustical features observed experimentally.
Further evaluation of response interruption and redirection as treatment for stereotypy.
Ahrens, Erin N; Lerman, Dorothea C; Kodak, Tiffany; Worsdell, April S; Keegan, Courtney
2011-01-01
The effects of 2 forms of response interruption and redirection (RIRD), motor RIRD and vocal RIRD, were examined with 4 boys with autism to evaluate further the effects of this intervention and its potential underlying mechanisms. In Experiment 1, the effects of motor RIRD and vocal RIRD on vocal stereotypy and appropriate vocalizations were compared for 2 participants. In Experiment 2, the effects of both RIRD procedures on both vocal and motor stereotypy and appropriate vocalizations were compared with 2 additional participants. Results suggested that RIRD was effective regardless of the procedural variation or topography of stereotypy and that vocal RIRD functioned as a punisher. This mechanism was further explored with 1 participant by manipulating the schedule of RIRD in Experiment 3. Results were consistent with the punishment interpretation.
FURTHER EVALUATION OF RESPONSE INTERRUPTION AND REDIRECTION AS TREATMENT FOR STEREOTYPY
Ahrens, Erin N; Lerman, Dorothea C; Kodak, Tiffany; Worsdell, April S; Keegan, Courtney
2011-01-01
The effects of 2 forms of response interruption and redirection (RIRD)—motor RIRD and vocal RIRD—were examined with 4 boys with autism to evaluate further the effects of this intervention and its potential underlying mechanisms. In Experiment 1, the effects of motor RIRD and vocal RIRD on vocal stereotypy and appropriate vocalizations were compared for 2 participants. In Experiment 2, the effects of both RIRD procedures on both vocal and motor stereotypy and appropriate vocalizations were compared with 2 additional participants. Results suggested that RIRD was effective regardless of the procedural variation or topography of stereotypy and that vocal RIRD functioned as a punisher. This mechanism was further explored with 1 participant by manipulating the schedule of RIRD in Experiment 3. Results were consistent with the punishment interpretation. PMID:21541130
Vocal cord paralysis after aortic arch surgery: predictors and clinical outcome.
Ohta, Noriyuki; Kuratani, Toru; Hagihira, Satoshi; Kazumi, Ken-Ichiro; Kaneko, Mitsunori; Mori, Takahiko
2006-04-01
This study is a retrospective cohort study of data on vocal cord paralysis after aortic arch surgery collected over 14 years at a general hospital. We investigated factors in the development of vocal cord paralysis after aortic arch surgery and the effect of vocal cord paralysis on clinical course and outcome. We reviewed data for 182 patients who underwent aortic arch surgery for aortic arch aneurysm and aortic dissection between 1989 and 2003, of whom 58 patients had proximal aortic repair, 62 had distal arch repair, and 62 had total arch repair. We assessed factors associated with the development of vocal cord paralysis and examined in detail the clinical outcome of patients with vocal cord paralysis. Postoperative vocal cord paralysis occurred in 40 patients. Multiple logistic regression analysis revealed the following risk factors with odds ratios (OR) for vocal cord paralysis: extension of procedures into distal arch (OR, 17.0), chronic dilatation of the aorta at the left subclavian artery (OR, 9.14), and total arch repair (OR, 4.24). Adoption of open-style stent-grafts reduced the incidence of vocal cord paralysis (OR, 0.031). The postoperative occurrence of vocal cord paralysis itself emerged as an independent predictor of pulmonary complications (OR, 4.12) and led to a longer duration of hospital stay. The risk of vocal cord paralysis after aortic arch surgery depends on surgical factors, such as aneurysmal involvement of the distal arch, or the application of newer, less invasive surgical procedures. Vocal cord paralysis after aortic arch surgery itself, under aggressive postoperative respiratory management, did not increase aspiration pneumonia but was associated with postoperative complications leading to higher hospital mortality and prolonged hospitalization.
Automated Assessment of Child Vocalization Development Using LENA
ERIC Educational Resources Information Center
Richards, Jeffrey A.; Xu, Dongxin; Gilkerson, Jill; Yapanel, Umit; Gray, Sharmistha; Paul, Terrance
2017-01-01
Purpose: To produce a novel, efficient measure of children's expressive vocal development on the basis of automatic vocalization assessment (AVA), child vocalizations were automatically identified and extracted from audio recordings using Language Environment Analysis (LENA) System technology. Method: Assessment was based on full-day audio…
Vocal education for the professional voice user and singer.
Murry, T; Rosen, C A
2000-10-01
Providing education on voice-related anatomy, physiology, and vocal hygiene information is the responsibility of every voice care professional. This article discusses the importance of a vocal education program for singers and professional voice users. An outline of a vocal education lecture is provided.
Contextual influences on children's use of vocal affect cues during referential interpretation.
Berman, Jared M J; Graham, Susan A; Chambers, Craig G
2013-01-01
In three experiments, we investigated 5-year-olds' sensitivity to speaker vocal affect during referential interpretation in cases where the indeterminacy is or is not resolved by speech information. In Experiment 1, analyses of eye gaze patterns and pointing behaviours indicated that 5-year-olds used vocal affect cues at the point where an ambiguous description was encountered. In Experiments 2 and 3, we used unambiguous situations to investigate how the referential context influences the ability to use affect cues earlier in the utterance. Here, we found a differential use of speaker vocal affect whereby 5-year-olds' referential hypotheses were influenced by negative vocal affect cues in advance of the noun, but not by positive affect cues. Together, our findings reveal how 5-year-olds use a speaker's vocal affect to identify potential referents in different contextual situations and also suggest that children may be more attuned to negative vocal affect than positive vocal affect, particularly early in an utterance.
Automated Assessment of Child Vocalization Development Using LENA.
Richards, Jeffrey A; Xu, Dongxin; Gilkerson, Jill; Yapanel, Umit; Gray, Sharmistha; Paul, Terrance
2017-07-12
To produce a novel, efficient measure of children's expressive vocal development on the basis of automatic vocalization assessment (AVA), child vocalizations were automatically identified and extracted from audio recordings using Language Environment Analysis (LENA) System technology. Assessment was based on full-day audio recordings collected in a child's unrestricted, natural language environment. AVA estimates were derived using automatic speech recognition modeling techniques to categorize and quantify the sounds in child vocalizations (e.g., protophones and phonemes). These were expressed as phone and biphone frequencies, reduced to principal components, and inputted to age-based multiple linear regression models to predict independently collected criterion-expressive language scores. From these models, we generated vocal development AVA estimates as age-standardized scores and development age estimates. AVA estimates demonstrated strong statistical reliability and validity when compared with standard criterion expressive language assessments. Automated analysis of child vocalizations extracted from full-day recordings in natural settings offers a novel and efficient means to assess children's expressive vocal development. More research remains to identify specific mechanisms of operation.
Zhang, Zhaoyan
2015-01-01
Maintaining a small glottal opening across a large range of voice conditions is critical to normal voice production. This study investigated the effectiveness of vocal fold approximation and stiffening in regulating glottal opening and airflow during phonation, using a three-dimensional numerical model of phonation. The results showed that with increasing subglottal pressure the vocal folds were gradually pushed open, leading to increased mean glottal opening and flow rate. A small glottal opening and a mean glottal flow rate typical of human phonation can be maintained against increasing subglottal pressure by proportionally increasing the degree of vocal fold approximation for low to medium subglottal pressures and vocal fold stiffening at high subglottal pressures. Although sound intensity was primarily determined by the subglottal pressure, the results suggest that, to maintain small glottal opening as the sound intensity increases, one has to simultaneously tighten vocal fold approximation and/or stiffen the vocal folds, resulting in increased glottal resistance, vocal efficiency, and fundamental frequency. PMID:25698022
Contingent imitation increases verbal interaction in children with autism spectrum disorders.
Ishizuka, Yuka; Yamamoto, Jun-Ichi
2016-11-01
Several studies have suggested that contingent adult imitation increases nonverbal communication, such as attention and proximity to adults, in children with autism spectrum disorders. However, few studies have shown the effect of contingent imitation on verbal communication. This study examined whether children with autism were able to promote verbal interaction such as vocal imitation, vocalization, and vocal turn-taking via contingent imitation. We used an alternating treatment design composed of the conditions of contingent imitation and control for six children with autism (aged 33-63 months). In the contingent imitation condition, adults imitated children's vocalizations immediately. In the control condition, adults did not imitate but gave a vocal response immediately. Results showed that in the contingent imitation condition, all children increased the number of vocal imitations and vocal turn-takings compared with the control condition. The number of vocalizations increased in both conditions for all children. Overall, it is suggested that all children promote verbal interaction via contingent imitation. © The Author(s) 2016.
Influence of the ventricular folds on a voice source with specified vocal fold motion
McGowan, Richard S.; Howe, Michael S.
2010-01-01
The unsteady drag on the vocal folds is the major source of sound during voiced speech. The drag force is caused by vortex shedding from the vocal folds. The influence of the ventricular folds (i.e., the “false” vocal folds that protrude into the vocal tract a short distance downstream of the glottis) on the drag and the voice source are examined in this paper by means of a theoretical model involving vortex sheets in a two-dimensional geometry. The effect of the ventricular folds on the output acoustic pressure is found to be small when the movement of the vocal folds is prescribed. It is argued that the effect remains small when fluid-structure interactions account for vocal fold movement. These conclusions can be justified mathematically when the characteristic time scale for change in the velocity of the glottal jet is large compared to the time it takes for a vortex disturbance to be convected through the vocal fold and ventricular fold region. PMID:20329852
Pasch, Bret; Abbasi, Mustafa Z; Wilson, Macey; Zhao, Daniel; Searle, Jeremy B; Webster, Michael S; Rice, Aaron N
2016-04-01
Nutritional stress can have lasting impacts on the development of traits involved in vocal production. Cross-fostering experiments are often used to examine the propensity for vocal learning in a variety of taxa, but few studies assess the influence of malnourishment that can occur as a byproduct of this technique. In this study, we reciprocally cross-fostered sister taxa of voluble grasshopper mice (genus Onychomys) to explore their propensity for vocal learning. Vocalizations of Onychomys leucogaster did not differ between control and cross-fostered animals, but cross-fostered Onychomys arenicola produced vocalizations that were higher in frequency in a direction away from tutors. These same animals exhibited a transient reduction in body mass early in development, indicative of malnutrition. Our findings simultaneously refute vocal learning and support the developmental stress hypothesis to highlight the importance of early ontogeny on the production of vocalizations later in life. Copyright © 2016 Elsevier Inc. All rights reserved.
Further evaluation of methods to identify matched stimulation.
Rapp, John T
2007-01-01
The effects of preferred stimulation on the vocal stereotypy of 2 individuals were evaluated in two experiments. The results of Experiment 1 showed that (a) the vocal stereotypy of both participants persisted in the absence of social consequences, (b) 1 participant manipulated toys that did and did not produce auditory stimulation, but only sound-producing toys decreased his vocal stereotypy, and (c) only noncontingent music decreased vocal stereotypy for the other participant, but stereotypy paradoxically increased when toys were presented with music. Using a three-component multiple schedule, the results of Experiment 2 showed that the vocal stereotypy of both participants remained below preintervention levels following the removal of auditory stimulation and that 1 participant's vocal stereotypy increased following the removal of contingent reprimands. These patterns suggest that auditory stimulation functioned as an abolishing operation for vocal stereotypy and reprimands functioned as an establishing operation for vocal stereotypy. Together, the two experiments provide a method for identifying alternative stimulation that may substitute for automatically reinforced behavior.
Shear properties of vocal fold mucosal tissues and their effect on vocal fold oscillation
NASA Astrophysics Data System (ADS)
Chan, Roger Wai Kai
Viscoelastic shear properties of vocal fold mucosal tissues and phonosurgical biomaterials were measured with a parallel-plate rotational rheometer. Elastic, viscous and damping properties were quantified as a function of frequency (0.01 Hz to 15 Hz) for human vocal fold mucosal tissues (N = 15), implantable biomaterials commonly used in the treatment of vocal fold paralysis (Teflon, gelatin, and collagen) (the non-mucosal group), and biomaterials currently or potentially useful in the treatment of vocal fold mucosal defects (adipose tissue or fat, hyaluronic acid, and fibronectin) (the mucosal group). It was found that intersubject differences as large as an order of magnitude were often observed for the shear properties of vocal fold mucosal tissues, part of which may be age- and gender-related. Shear properties of the non-mucosal group biomaterials were often much higher than those of the mucosal group biomaterials, which were relatively close to the shear properties of mucosal tissues. Viscoelastic and rheological modeling showed that shear properties of human vocal fold mucosa may be described by a quasi-linear viscoelastic theory and a statistical network theory, based upon which extrapolations to audio frequencies were possible. A theory of small-amplitude vocal fold oscillation was revisited to describe the effects of tissue shear properties on vocal fold oscillation and phonation threshold pressure, a measure of the 'ease' of phonation and an objective indication of vocal function. It was found that phonation threshold pressure is directly related to the viscous shear modulus or the 'effective damping modulus', a concept proposed to quantify the effective amount of damping in vocal fold oscillation. The mucosal group biomaterials were incorporated into the artificial vocal fold mucosa of a physical model in order to empirically assess their effects on phonation threshold pressure. 
Results showed that higher threshold pressures were consistently observed for higher concentrations of hyaluronic acid and for hyaluronic acid mixed with fibronectin, in correlation with their differences in viscous shear modulus and effective damping modulus. Implications for phonosurgery were discussed in terms of the choice of optimal biomaterials for the surgical management of vocal fold mucosal defects and lamina propria deficiencies.
Differential short-term memorisation for vocal and instrumental rhythms.
Klyn, Niall A M; Will, Udo; Cheong, Yong-Jeon; Allen, Erin T
2016-07-01
This study explores differential processing of vocal and instrumental rhythms in short-term memory with three decision (same/different judgments) experiments and one reproduction experiment. In the first experiment, memory performance declined for delayed versus immediate recall, with accuracy for the two rhythms being affected differently: Musicians performed better than non-musicians on clapstick but not on vocal rhythms, and musicians were better on vocal rhythms in the same than in the different condition. Results for the second experiment showed that concurrent sub-vocal articulation and finger-tapping differentially affected the two rhythms and same/different decisions, but produced no evidence for articulatory loop involvement in delayed decision tasks. In a third experiment, which tested rhythm reproduction, concurrent sub-vocal articulation decreased memory performance, with a stronger deleterious effect on the reproduction of vocal than of clapstick rhythms. This suggests that the articulatory loop may only be involved in delayed reproduction, not in decision tasks. The fourth experiment tested whether differences between filled and empty rhythms (continuous vs. discontinuous sounds) can explain the different memorisation of vocal and clapstick rhythms. Though significant differences were found for empty and filled instrumental rhythms, the differences between vocal and clapstick can only be explained by considering additional voice-specific features.
Effects of speech style, room acoustics, and vocal fatigue on vocal effort
Bottalico, Pasquale; Graetzer, Simone; Hunter, Eric J.
2016-01-01
Vocal effort is a physiological measure that accounts for changes in voice production as vocal loading increases. It has been quantified in terms of sound pressure level (SPL). This study investigates how vocal effort is affected by speaking style, room acoustics, and short-term vocal fatigue. Twenty subjects were recorded while reading a text at normal and loud volumes in anechoic, semi-reverberant, and reverberant rooms in the presence of classroom babble noise. The acoustics in each environment were modified by creating a strong first reflection in the talker position. After each task, the subjects answered questions addressing their perception of the vocal effort, comfort, control, and clarity of their own voice. Variation in SPL for each subject was measured per task. It was found that SPL and self-reported effort increased in the loud style and decreased when the reflective panels were present and when reverberation time increased. Self-reported comfort and control decreased in the loud style, while self-reported clarity increased when panels were present. The lowest magnitude of vocal fatigue was experienced in the semi-reverberant room. The results indicate that early reflections may be used to reduce vocal effort without modifying reverberation time. PMID:27250179
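The SPL quantities mentioned in the abstract above can be illustrated with a minimal Python sketch. It assumes the standard 20 µPa reference pressure for airborne sound and, as a further assumption not stated in the abstract, expresses per-task "variation" as each task's SPL minus the subject's arithmetic mean across tasks. The function names and the example values are hypothetical.

```python
import math

P_REF_PA = 20e-6  # standard reference pressure for SPL in air (20 micropascals)


def spl_db(p_rms_pa):
    """Sound pressure level in dB re 20 µPa for an RMS pressure given in pascals."""
    if p_rms_pa <= 0:
        raise ValueError("RMS pressure must be positive")
    return 20.0 * math.log10(p_rms_pa / P_REF_PA)


def spl_variation(task_spl_db):
    """Each task's SPL minus the subject's mean SPL across tasks.

    Assumption: the mean is a plain arithmetic mean of dB values; the
    abstract does not say how the variation was actually computed.
    """
    mean_db = sum(task_spl_db.values()) / len(task_spl_db)
    return {task: level - mean_db for task, level in task_spl_db.items()}


# Hypothetical values for one subject, not data from the study.
example = spl_variation({"normal": 60.0, "loud": 70.0})
```

Expressing each task relative to the subject's own mean removes between-subject differences in baseline loudness, which is one plausible reason for reporting variation rather than absolute SPL.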
The vocal load of Reform Jewish cantors in the USA.
Hapner, Edie; Gilman, Marina
2012-03-01
Jewish cantors comprise a subset of vocal professionals that is not well understood by vocal health professionals. This study aimed to document the vocal demands, vocal training, reported incidence of voice problems, and treatment-seeking behavior of Reform Jewish cantors. The study used a prospective observational design to anonymously query Reform Jewish cantors using a 35-item multiple-choice survey distributed online. Demographic information, medical history, vocal music training, cantorial duties, history of voice problems, and treatment-seeking behavior were addressed. Results indicated that many of the commonly associated risk factors for developing voice disorders were present in this population, including high vocal demands, reduced vocal downtime, allergies, and acid reflux. Greater than 65% of the respondents reported having had a voice problem that interfered with their ability to perform their duties at some time during their careers. Reform Jewish cantors are a population of occupational voice users who may be currently unidentified and underserved by vocal health professionals. The results of the survey suggest that Reform Jewish cantors are occupational voice users and are at high risk for developing voice disorders. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Data analysis of response interruption and redirection as a treatment for vocal stereotypy.
Wunderlich, Kara L; Vollmer, Timothy R
2015-12-01
Vocal stereotypy, or repetitive, noncontextual vocalizations, is a problematic form of behavior exhibited by many individuals with autism spectrum disorder (ASD). Recent research has evaluated the efficacy of response interruption and redirection (RIRD) in the reduction of vocal stereotypy. Research has indicated that RIRD often results in reductions in the level of vocal stereotypy; however, many previous studies have only presented data on vocal stereotypy that occurred outside RIRD implementation. The current study replicated the procedures of previous studies that have evaluated the efficacy of RIRD and compared 2 data-presentation methods: inclusion of only data collected outside RIRD implementation and inclusion of all vocal stereotypy data from the entirety of each session. Subjects were 7 children who had been diagnosed with ASD. Results indicated that RIRD appeared to be effective when we evaluated the level of vocal stereotypy outside RIRD implementation, but either no reductions or more modest reductions in the level of vocal stereotypy during the entirety of sessions were obtained for all subjects. Results suggest that data-analysis methods used in previous research may overestimate the efficacy of RIRD. © Society for the Experimental Analysis of Behavior.
Characterizing the graded structure of false killer whale (Pseudorca crassidens) vocalizations.
Murray, S O; Mercado, E; Roitblat, H L
1998-09-01
The vocalizations from two captive false killer whales (Pseudorca crassidens) were analyzed. The structure of the vocalizations was best modeled as lying along a continuum with trains of discrete, exponentially damped sinusoidal pulses at one end and continuous sinusoidal signals at the other end. Pulse trains were graded as a function of the interval between pulses where the minimum interval between pulses could be zero milliseconds. The transition from a pulse train with no inter-pulse interval to a whistle could be modeled by gradations in the degree of damping. There were many examples of vocalizations that were gradually modulated from pulse trains to whistles. There were also vocalizations that showed rapid shifts in signal type, for example, switching immediately from a whistle to a pulse train. These data have implications when considering both the possible function(s) of the vocalizations and the potential sound production mechanism(s). A short-time duty cycle measure was developed to characterize the graded structure of the vocalizations. A random sample of 500 vocalizations was characterized by combining the duty cycle measure with peak frequency measurements. The analysis method proved to be an effective metric for describing the graded structure of false killer whale vocalizations.
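The continuum described above, from damped pulse trains to continuous whistles, can be illustrated with a small Python sketch. This is not the authors' measure: the abstract does not define its duty cycle, so the sketch assumes a simple frame-based version (the fraction of short frames whose RMS exceeds a fixed fraction of the loudest frame). All function names, parameters, and default values are hypothetical.

```python
import math


def damped_pulse_train(n_pulses, interval_s, decay_per_s,
                       freq_hz=5000.0, pulse_s=0.002, fs=48000):
    """Concatenate exponentially damped sinusoidal pulses separated by silent gaps."""
    pulse_n = int(round(pulse_s * fs))
    gap_n = int(round(interval_s * fs))
    signal = []
    for _ in range(n_pulses):
        for i in range(pulse_n):
            t = i / fs
            signal.append(math.exp(-decay_per_s * t) *
                          math.sin(2.0 * math.pi * freq_hz * t))
        signal.extend([0.0] * gap_n)
    return signal


def duty_cycle(signal, frame_n=48, threshold_ratio=0.1):
    """Fraction of full frames whose RMS is at least threshold_ratio times the largest frame RMS."""
    rms = []
    for start in range(0, len(signal) - frame_n + 1, frame_n):
        frame = signal[start:start + frame_n]
        rms.append(math.sqrt(sum(x * x for x in frame) / frame_n))
    if not rms or max(rms) == 0.0:
        return 0.0
    limit = threshold_ratio * max(rms)
    return sum(1 for r in rms if r >= limit) / len(rms)


# No damping and no inter-pulse interval: a whistle-like continuous tone.
whistle_like = duty_cycle(damped_pulse_train(20, 0.0, 0.0))
# Strong damping with 8 ms gaps: a click-like pulse train.
click_like = duty_cycle(damped_pulse_train(20, 0.008, 3000.0))
```

Under these assumptions the whistle-like signal is active in nearly every frame and the click-like train in only a small fraction of frames; intermediate damping and interval values fall between the two, which is the graded behaviour the abstract describes.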
Vocal Fold Epithelial Barrier in Health and Injury: A Research Review
Levendoski, Elizabeth Erickson; Leydon, Ciara; Thibeault, Susan L.
2015-01-01
Purpose: Vocal fold epithelium is composed of layers of individual epithelial cells joined by junctional complexes constituting a unique interface with the external environment. This barrier provides structural stability to the vocal folds and protects underlying connective tissue from injury while being nearly continuously exposed to potentially hazardous insults including environmental or systemic-based irritants such as pollutants and reflux, surgical procedures, and vibratory trauma. Small disruptions in the epithelial barrier may have a large impact on susceptibility to injury and overall vocal health. The purpose of this article is to provide a broad-based review of our current knowledge of the vocal fold epithelial barrier. Methods: A comprehensive review of the literature was conducted. Details of the structure of the vocal fold epithelial barrier are presented and evaluated in the context of function in injury and pathology. The importance of the epithelial-associated vocal fold mucus barrier is also introduced. Results/Conclusions: Information presented in this review is valuable for clinicians and researchers as it highlights the importance of this understudied portion of the vocal folds to overall vocal health and disease. Prevention and treatment of injury to the epithelial barrier is a significant area awaiting further investigation. PMID:24686981
Low-frequency vocalizations in the Florida manatee (Trichechus manatus latirostris)
NASA Astrophysics Data System (ADS)
Frisch, Katherine; Frisch, Stefan
2003-10-01
Vocalizations produced by Florida manatees (Trichechus manatus latirostris) have been characterized as being of relatively high frequency, with fundamental tones ranging from 2500-5000 Hz. These sounds have been variously described as squeaks, squeals, and chirps. Vocalizations below 500 Hz have not been previously reported. Two captive-born Florida manatees were recorded at Mote Marine Laboratory in Sarasota, Florida. The analysis of these vocalizations provides evidence of a new category of low-frequency sounds produced by manatees. These sounds are often heard in conjunction with higher-frequency vocalizations. The low-frequency vocalizations are relatively brief and of low amplitude. These vocalizations are perceived as a series of impulses rather than a low-frequency periodic tone. Knowledge of these low-frequency vocalizations could be useful to those developing future management strategies. Interest has recently increased in the development of acoustic detection and deterrence devices to reduce the number of manatee watercraft interactions. The design of appropriate devices must take into account the apparent ability of manatees to perceive and produce sounds of both high and low frequency. It is also important to consider the possibility that acoustic deterrence devices may disrupt the potentially communicative frequencies of manatee vocalizations.
Determination of West Indian manatee vocalization levels and rate
NASA Astrophysics Data System (ADS)
Phillips, Richard; Niezrecki, Christopher; Beusse, Diedrich
2004-05-01
The West Indian manatee (Trichechus manatus latirostris) has become endangered partly because of a growing number of collisions with boats. A system to warn boaters of the presence of manatees, based upon the vocalizations of manatees, could potentially reduce these boat collisions. The feasibility of this warning system would depend mainly upon two factors: the rate at which manatees vocalize and the distance in which the manatees can be detected. The research presented in this paper verifies that the average vocalization rate of the West Indian manatee is approximately one to two times per 5-min period. Several different manatee vocalization recordings were broadcast to the manatees and their response was observed. It was found that during the broadcast periods, the vocalization rates for the manatees increased substantially when compared with the average vocalization rates during nonbroadcast periods. An array of four hydrophones was used while recording the manatees. This allowed for position estimation techniques to be used to determine the location of the vocalizing manatee. Knowing the position of the manatee, the source level was determined and it was found that the mean source level of the manatee vocalizations is approximately 112 dB (re 1 μPa) @ 1 m.
Determination of West Indian manatee vocalization levels and rate
NASA Astrophysics Data System (ADS)
Phillips, Richard; Niezrecki, Christopher; Beusse, Diedrich O.
2004-01-01
The West Indian manatee (Trichechus manatus latirostris) has become endangered partly because of a growing number of collisions with boats. A system to warn boaters of the presence of manatees, based upon the vocalizations of manatees, could potentially reduce these boat collisions. The feasibility of this warning system would depend mainly upon two factors: the rate at which manatees vocalize and the distance in which the manatees can be detected. The research presented in this paper verifies that the average vocalization rate of the West Indian manatee is approximately one to two times per 5-min period. Several different manatee vocalization recordings were broadcast to the manatees and their response was observed. It was found that during the broadcast periods, the vocalization rates for the manatees increased substantially when compared with the average vocalization rates during nonbroadcast periods. An array of four hydrophones was used while recording the manatees. This allowed for position estimation techniques to be used to determine the location of the vocalizing manatee. Knowing the position of the manatee, the source level was determined and it was found that the mean source level of the manatee vocalizations is approximately 112 dB (re 1 μPa) @ 1 m.
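The abstract says the source level was determined once the position of the vocalizing animal was known, but it does not give the propagation model. As a hedged illustration only, the sketch below back-projects a received level to 1 m assuming simple spherical spreading with no absorption; the function name and the example received level and range are hypothetical, not values from the study.

```python
import math


def source_level_db(received_level_db, range_m, ref_range_m=1.0):
    """Back-project a received level to the reference range.

    Assumes spherical spreading (20*log10 of range) and ignores
    absorption and boundary effects; this is an illustrative
    assumption, not the propagation model used in the study.
    """
    if range_m <= 0 or ref_range_m <= 0:
        raise ValueError("ranges must be positive")
    return received_level_db + 20.0 * math.log10(range_m / ref_range_m)


# Hypothetical example: 92 dB re 1 uPa received at 10 m from the source.
print(source_level_db(92.0, 10.0))
```

In shallow water, spreading is often closer to cylindrical than spherical, so a real estimate would depend on the site and on the model the authors actually used.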
Schneider, Berit; Zumtobel, Michaela; Prettenhofer, Walter; Aichstill, Birgitta; Jocher, Werner
2010-03-01
Only limited data on normal vocal constitution and vocal capabilities in school-aged children are available. To take better care of children's voices, it might be helpful to know voice ranges and limits of not only vocally trained but also vocally untrained children. The goal of this study was the evaluation of singing voice capabilities of vocally healthy children with different social and vocal/musical backgrounds using voice range profile measurements (VRP). VRP percentiles that reflect constitutional aspects were suggested. In this cross-sectional study, 186 children (aged between seven and 10 years), attending five schools, were included. VRP measurements were performed under field conditions. Interviews and questionnaires regarding vocal strain and vocal training were applied; the answers were used for classification of singing activity and vocal training (KLASAK). All children reached a mean singing voice range of at least two octaves. By using the answers of interviews and questionnaires, the children could be classified according to vocal strain and vocal training. The groups showed no significant differences regarding VRP measurements. In the following step, percentiles were calculated. Twenty-five percent of all children (P25) reached a minimum voice range of almost two octaves, namely, 22 semitones (ST) from 220 to 784 Hz with soft and loud singing. Half of the children (P50) had a voice range of 24 ST (2 octaves) with soft singing and a larger voice range of 26 ST with loud singing. The measurements of the third quartile (P75) revealed that 25% of children have an even larger voice range than 29 ST (from 196 Hz/g to 1047 Hz/c3) and can sing at most frequencies louder than 90 dB. P90 demonstrated that 10% of the children can sing even lower or higher than the frequency range between 196 Hz/g and 1319 Hz/e3 analyzed.
The voice range seems not to be constrained by social but by voice/musical background: children of vocally/musically encouraged schools had wider voice ranges. This underlines the necessity of regular singing lessons already in primary schools. The percentile VRP introduced might help to evaluate the vocal constitution and vocal capabilities of a child. Copyright (c) 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
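The voice ranges quoted in this abstract are given both as frequency limits and as semitone counts. A short sketch of the standard equal-temperament conversion (12 semitones per doubling of frequency) shows how the two relate; the function name is chosen here for illustration.

```python
import math


def semitones_between(f_low_hz, f_high_hz):
    """Interval between two frequencies in equal-tempered semitones."""
    if f_low_hz <= 0 or f_high_hz <= 0:
        raise ValueError("frequencies must be positive")
    return 12.0 * math.log2(f_high_hz / f_low_hz)


# Frequency limits quoted in the abstract, rounded to whole semitones.
for low, high in [(220, 784), (196, 1047), (196, 1319)]:
    print(low, high, round(semitones_between(low, high)))
```

With these limits, 220 to 784 Hz spans about 22 semitones, 196 to 1047 Hz about 29 semitones, and 196 to 1319 Hz about 33 semitones.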
Orestes, Michael I.; Neubauer, Juergen; Sofer, Elazar; Salinas, Jonathon; Chhetri, Dinesh K.
2015-01-01
Objectives/Hypothesis: Medialization thyroplasty (MT) is commonly used to treat glottic insufficiency. In this study, we investigated the phonatory effects of MT implant medialization depth and medial surface shape. Methods: Recurrent laryngeal nerve (RLN) and vagal paralysis were simulated in an in vivo canine. A type 1 MT was performed using a silicone elastomer implant with variable medialization depths and medial surface shapes: rectangular, V-shaped, divergent, and convergent. The effects on phonation onset flow/pressure relationships and acoustics were measured. Results: Increasing depth of medialization led to improvements in fundamental frequency (F0) range and normalization of the slope of pressure/flow relationship toward baseline activation conditions. The effects of implant medial shape also depended on depth of medialization. Outcome measures were similar among the implants at smaller medialization depths. With large medialization depths and vagal paralysis conditions, the divergent implant maintained pressure/flow relationship closer to baseline. The vagal paralysis conditions also demonstrated decreased fundamental frequency range and worse flow/pressure relationship compared to RLN paralysis. Conclusions: The depth and medial shape of a medialization laryngoplasty (ML) implant significantly affect both the F0 range and aerodynamic power required for phonation. These effects become more notable with increasing depth of medialization. The study also illustrates that ML is less effective in vagal paralysis compared to RLN paralysis. PMID:25046146
Morphological adaptation influences the evolution of a mating signal.
Ballentine, Barbara
2006-09-01
Theory predicts that forces of natural selection can reduce the intensity of sexually selected traits. In this study, I investigate how morphological adaptation to feeding ecology influences a mating signal. In birds, changes in feeding ecology can cause rapid divergence in bill morphology. Because bills are also important for song production, feeding ecology may influence song divergence. During song, birds can rapidly change vocal tract resonance using bill movement, yet are constrained in rate and magnitude of bill movements resulting in a trade-off between trill rate and frequency bandwidth. Male swamp sparrows vary in their ability to produce rapid, broad-band trills and females prefer more physically demanding songs. Populations of swamp sparrows adapted to the feeding ecology of tidal marshes have larger bills than inland populations. Larger bills should increase the constraints of producing rapid, broad-band trills allowing for a test of how changes in feeding ecology affect a feature of song used in mate choice. I found significant differences in acoustic features of song consistent with the hypothesis that coastal males are less able to meet the physical demands of song production because of the constraints of having larger bills. As possible compensation for decreases in song performance, coastal populations exhibit an increase in song complexity. These changes support the current model of how motor constraints influence song production and suggest a mechanism by which feeding ecology can influence signal evolution.
Human-like brain hemispheric dominance in birdsong learning.
Moorman, Sanne; Gobes, Sharon M H; Kuijpers, Maaike; Kerkhofs, Amber; Zandbergen, Matthijs A; Bolhuis, Johan J
2012-07-31
Unlike nonhuman primates, songbirds learn to vocalize very much like human infants acquire spoken language. In humans, Broca's area in the frontal lobe and Wernicke's area in the temporal lobe are crucially involved in speech production and perception, respectively. Songbirds have analogous brain regions that show a similar neural dissociation between vocal production and auditory perception and memory. In both humans and songbirds, there is evidence for lateralization of neural responsiveness in these brain regions. Human infants already show left-sided dominance in their brain activation when exposed to speech. Moreover, a memory-specific left-sided dominance in Wernicke's area for speech perception has been demonstrated in 2.5-mo-old babies. It is possible that auditory-vocal learning is associated with hemispheric dominance and that this association arose in songbirds and humans through convergent evolution. Therefore, we investigated whether there is similar song memory-related lateralization in the songbird brain. We exposed male zebra finches to tutor or unfamiliar song. We found left-sided dominance of neuronal activation in a Broca-like brain region (HVC, a letter-based name) of juvenile and adult zebra finch males, independent of the song stimulus presented. In addition, juvenile males showed left-sided dominance for tutor song but not for unfamiliar song in a Wernicke-like brain region (the caudomedial nidopallium). Thus, left-sided dominance in the caudomedial nidopallium was specific for the song-learning phase and was memory-related. These findings demonstrate a remarkable neural parallel between birdsong and human spoken language, and they have important consequences for our understanding of the evolution of auditory-vocal learning and its neural mechanisms.
Development of echolocation calls and neural selectivity for echolocation calls in the pallid bat.
Razak, Khaleel A; Fuzessery, Zoltan M
2015-10-01
Studies of birdsongs and neural selectivity for songs have provided important insights into principles of concurrent behavioral and auditory system development. Relatively little is known about mammalian auditory system development in terms of vocalizations or other behaviorally relevant sounds. This review suggests echolocating bats are suitable mammalian model systems to understand development of auditory behaviors. The simplicity of echolocation calls with known behavioral relevance and strong neural selectivity provides a platform to address how natural experience shapes cortical receptive field (RF) mechanisms. We summarize recent studies in the pallid bat that followed development of echolocation calls and cortical processing of such calls. We also discuss similar studies in the mustached bat for comparison. These studies suggest: (1) there are different developmental sensitive periods for different acoustic features of the same vocalization. The underlying basis is the capacity for some components of the RF to be modified independent of others. Some RF computations and maps involved in call processing are present even before the cochlea is mature and well before use of echolocation in flight. Others develop over a much longer time course. (2) Normal experience is required not just for refinement, but also for maintenance, of response properties that develop in an experience independent manner. (3) Experience utilizes millisecond range changes in timing of inhibitory and excitatory RF components as substrates to shape vocalization selectivity. We suggest that bat species and call diversity provide a unique opportunity to address developmental constraints in the evolution of neural mechanisms of vocalization processing. © 2014 Wiley Periodicals, Inc.
Development of echolocation calls and neural selectivity for echolocation calls in the pallid bat
Razak, Khaleel A.; Fuzessery, Zoltan M.
2014-01-01
Studies of birdsongs and neural selectivity for songs have provided important insights into principles of concurrent behavioral and auditory system development. Relatively little is known about mammalian auditory system development in terms of vocalizations, or other behaviorally relevant sounds. This review suggests echolocating bats are suitable mammalian model systems to understand development of auditory behaviors. The simplicity of echolocation calls with known behavioral relevance and strong neural selectivity provides a platform to address how natural experience shapes cortical receptive field (RF) mechanisms. We summarize recent studies in the pallid bat that followed development of echolocation calls and cortical processing of such calls. We also discuss similar studies in the mustached bat for comparison. These studies suggest: (1) there are different developmental sensitive periods for different acoustic features of the same vocalization. The underlying basis is the capacity for some components of the RF to be modified independent of others. Some RF computations and maps involved in call processing are present even before the cochlea is mature and well before use of echolocation in flight. Others develop over a much longer time course. (2) Normal experience is required not just for refinement, but also for maintenance, of response properties that develop in an experience independent manner. (3) Experience utilizes millisecond range changes in timing of inhibitory and excitatory RF components as substrates to shape vocalization selectivity. We suggest that bat species and call diversity provide a unique opportunity to address developmental constraints in the evolution of neural mechanisms of vocalization processing. PMID:25142131
Masapollo, Matthew; Polka, Linda; Ménard, Lucie
2016-03-01
To learn to produce speech, infants must effectively monitor and assess their own speech output. Yet very little is known about how infants perceive speech produced by an infant, which has higher voice pitch and formant frequencies compared to adult or child speech. Here, we tested whether pre-babbling infants (at 4-6 months) prefer listening to vowel sounds with infant vocal properties over vowel sounds with adult vocal properties. A listening preference favoring infant vowels may derive from their higher voice pitch, which has been shown to attract infant attention in infant-directed speech (IDS). In addition, infants' nascent articulatory abilities may induce a bias favoring infant speech given that 4- to 6-month-olds are beginning to produce vowel sounds. We created infant and adult /i/ ('ee') vowels using a production-based synthesizer that simulates the act of speaking in talkers at different ages and then tested infants across four experiments using a sequential preferential listening task. The findings provide the first evidence that infants preferentially attend to vowel sounds with infant voice pitch and/or formants over vowel sounds with no infant-like vocal properties, supporting the view that infants' production abilities influence how they process infant speech. The findings with respect to voice pitch also reveal parallels between IDS and infant speech, raising new questions about the role of this speech register in infant development. Research exploring the underpinnings and impact of this perceptual bias can expand our understanding of infant language development. © 2015 John Wiley & Sons Ltd.
Computational Models of Laryngeal Aerodynamics: Potentials and Numerical Costs.
Sadeghi, Hossein; Kniesburges, Stefan; Kaltenbacher, Manfred; Schützenberger, Anne; Döllinger, Michael
2018-02-07
Human phonation is based on the interaction between tracheal airflow and laryngeal dynamics. This fluid-structure interaction is based on the energy exchange between airflow and vocal folds. Major challenges in analyzing the phonatory process in-vivo are the small dimensions and the poor accessibility of the region of interest. For improved analysis of the phonatory process, numerical simulations of the airflow and the vocal fold dynamics have been suggested. Even though most of the models reproduced the phonatory process fairly well, development of comprehensive larynx models is still a subject of research. In the context of clinical application, physiological accuracy and computational model efficiency are of great interest. In this study, a simple numerical larynx model is introduced that incorporates the laryngeal fluid flow. It is based on a synthetic experimental model with silicone vocal folds. The degree of realism was successively increased in separate computational models and each model was simulated for 10 oscillation cycles. Results show that relevant features of the laryngeal flow field, such as glottal jet deflection, develop even when applying rather simple static models with oscillating flow rates. Including further phonatory components such as vocal fold motion, mucosal wave propagation, and ventricular folds, the simulations show phonatory key features like intraglottal flow separation and increased flow rate in presence of ventricular folds. The simulation time on 100 CPU cores ranged between 25 and 290 hours, currently restricting clinical application of these models. Nevertheless, results show high potential of numerical simulations for better understanding of phonatory process. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Wireless multi-channel single unit recording in freely moving and vocalizing primates
Roy, Sabyasachi; Wang, Xiaoqin
2011-01-01
The ability to record well-isolated action potentials from individual neurons in naturally behaving animals is crucial for understanding neural mechanisms underlying natural behaviors. Traditional neurophysiology techniques, however, require the animal to be restrained, which often restricts natural behavior. An example is the common marmoset (Callithrix jacchus), a highly vocal New World primate species, used in our laboratory to study the neural correlates of vocal production and sensory feedback. When restrained by traditional neurophysiological techniques, marmoset vocal behavior is severely inhibited. Tethered recording systems, while proven effective in rodents, pose limitations in arboreal animals such as the marmoset that typically roam in a three-dimensional environment. To overcome these obstacles, we have developed a wireless neural recording technique that is capable of collecting single-unit data from chronically implanted multi-electrodes in freely moving marmosets. A lightweight, low-power, and low-noise wireless transmitter (headstage) is attached to a multi-electrode array placed in the premotor cortex of the marmoset. The wireless headstage is capable of transmitting 15 channels of neural data with signal-to-noise ratio (SNR) comparable to a tethered system. To minimize radio-frequency (RF) and electro-magnetic interference (EMI), the experiments were conducted within a custom designed RF/EMI and acoustically shielded chamber. The individual electrodes of the multi-electrode array were periodically advanced to densely sample the cortical layers. We recorded single-unit data over a period of several months from the frontal cortex of two marmosets. These recordings demonstrate the feasibility of using our wireless recording method to study single neuron activity in freely roaming primates. PMID:21933683
Forlano, Paul M; Sisneros, Joseph A
2016-01-01
The plainfin midshipman fish (Porichthys notatus) is a well-studied model to understand the neural and endocrine mechanisms underlying vocal-acoustic communication across vertebrates. It is well established that steroid hormones such as estrogen drive seasonal peripheral auditory plasticity in female Porichthys in order to better encode the male's advertisement call. However, little is known of the neural substrates that underlie the motivation and coordinated behavioral response to auditory social signals. Catecholamines, which include dopamine and noradrenaline, are good candidates for this function, as they are thought to modulate the salience of and reinforce appropriate behavior to socially relevant stimuli. This chapter summarizes our recent studies which aimed to characterize catecholamine innervation in the central and peripheral auditory system of Porichthys as well as test the hypotheses that innervation of the auditory system is seasonally plastic and catecholaminergic neurons are activated in response to conspecific vocalizations. Of particular significance is the discovery of direct dopaminergic innervation of the saccule, the main hearing end organ, by neurons in the diencephalon, which also robustly innervate the cholinergic auditory efferent nucleus in the hindbrain. Seasonal changes in dopamine innervation in both these areas appear dependent on reproductive state in females and may ultimately function to modulate the sensitivity of the peripheral auditory system as an adaptation to the seasonally changing soundscape. Diencephalic dopaminergic neurons are indeed active in response to exposure to midshipman vocalizations and are in a perfect position to integrate the detection and appropriate motor response to conspecific acoustic signals for successful reproduction.
Vocal fold motion outcome based on excellent prognosis with laryngeal electromyography.
Smith, Libby J; Rosen, Clark A; Munin, Michael C
2016-10-01
As laryngeal electromyography (LEMG) becomes more refined, accurate predictions of vocal fold motion recovery are possible. Focus has been on outcomes for patients with poor prognosis for vocal fold motion recovery. Limited information is available regarding the expected rate of purposeful vocal fold motion recovery when there is good to normal motor recruitment, no signs of denervation, and no signs of synkinetic activity with LEMG, termed excellent prognosis. The objective of this study is to determine the rate of vocal fold motion recovery with excellent prognosis findings on LEMG after acute recurrent laryngeal nerve injury. Study design: retrospective review. Patients undergoing a standardized LEMG protocol, consisting of qualitative (evaluation of motor recruitment, motor unit configuration, detection of fibrillations, presence of synkinesis) and quantitative (turns analysis) measurements were evaluated for purposeful vocal-fold motion recovery, calculated after at least 6 months since onset of injury. Twenty-three patients who underwent LEMG for acute vocal fold paralysis met the inclusion criteria of excellent prognosis. Eighteen patients (78.3%) recovered vocal fold motion, as determined by flexible laryngoscopy. Nearly 80% of patients determined to have excellent prognosis for vocal fold motion recovery experienced return of vocal fold motion. This information will help clinicians not only counsel their patients on expectations but will also help guide treatment. Level of evidence: 4. Laryngoscope, 126:2310-2314, 2016. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Vocal Learning via Social Reinforcement by Infant Marmoset Monkeys.
Takahashi, Daniel Y; Liao, Diana A; Ghazanfar, Asif A
2017-06-19
For over half a century now, primate vocalizations have been thought to undergo little or no experience-dependent acoustic changes during development [1]. If any changes are apparent, then they are routinely (and quite reasonably) attributed to the passive consequences of growth. Indeed, previous experiments on squirrel monkeys and macaque monkeys showed that social isolation [2, 3], deafness [2], cross-fostering [4] and parental absence [5] have little or no effect on vocal development. Here, we explicitly test in marmoset monkeys-a very vocal and cooperatively breeding species [6]-whether the transformation of immature into mature contact calls by infants is influenced by contingent parental vocal feedback. Using a closed-loop design, we experimentally provided more versus less contingent vocal feedback to twin infant marmoset monkeys over their first 2 months of life, the interval during which their contact calls transform from noisy, immature calls to tonal adult-like "phee" calls [7, 8]. Infants who received more contingent feedback had a faster rate of vocal development, producing mature-sounding contact calls earlier than the other twin. The differential rate of vocal development was not linked to genetics, perinatal experience, or body growth; nor did the amount of contingency influence the overall rate of spontaneous vocal production. Thus, we provide the first experimental evidence for production-related vocal learning during the development of a nonhuman primate. Copyright © 2017 Elsevier Ltd. All rights reserved.
The importance of hyaluronic acid in vocal fold biomechanics.
Chan, R W; Gray, S D; Titze, I R
2001-06-01
This study examined the influence of hyaluronic acid (HA) on the biomechanical properties of the human vocal fold cover (the superficial layer of the lamina propria). Vocal fold tissues were freshly excised from 5 adult male cadavers and were treated with bovine testicular hyaluronidase to selectively remove HA from the lamina propria extracellular matrix (ECM). Linear viscoelastic shear properties (elastic shear modulus and dynamic viscosity) of the tissue samples before and after enzymatic treatment were quantified as a function of frequency (0.01 to 15 Hz) by a parallel-plate rotational rheometer at 37 degrees C. On removing HA from the vocal fold ECM, the elastic shear modulus (G' ) or stiffness of the vocal fold cover decreased by an average of around 35%, while the dynamic viscosity (eta') increased by 70% at higher frequencies (>1 Hz). The results suggested that HA plays an important role in determining the biomechanical properties of the vocal fold cover. As a highly hydrated glycosaminoglycan in the vocal fold ECM, it likely contributes to the maintenance of an optimal tissue viscosity that may facilitate phonation, and an optimal tissue stiffness that may be important for vocal fundamental frequency control. HA has been proposed as a potential bioimplant for the surgical repair of vocal fold ECM defects (eg, vocal fold scarring and sulcus vocalis). Our results suggested that such clinical use may be potentially optimal for voice production from a biomechanical perspective.
A robotic voice simulator and the interactive training for hearing-impaired people.
Sawada, Hideyuki; Kitani, Mitsuki; Hayashi, Yasumori
2008-01-01
A talking and singing robot which adaptively learns the vocalization skill by means of an auditory feedback learning algorithm is being developed. The robot consists of motor-controlled vocal organs such as vocal cords, a vocal tract and a nasal cavity to generate a natural voice imitating a human vocalization. In this study, the robot is applied to the training system of speech articulation for the hearing-impaired, because the robot is able to reproduce their vocalization and to teach them how it is to be improved to generate clear speech. The paper briefly introduces the mechanical construction of the robot and how it autonomously acquires the vocalization skill in the auditory feedback learning by listening to human speech. Then the training system is described, together with the evaluation of the speech training by auditory impaired people.
Ultrasonic Vocalizations Emitted by Flying Squirrels
Murrant, Meghan N.; Bowman, Jeff; Garroway, Colin J.; Prinzen, Brian; Mayberry, Heather; Faure, Paul A.
2013-01-01
Anecdotal reports of ultrasound use by flying squirrels have existed for decades, yet there has been little detailed analysis of their vocalizations. Here we demonstrate that two species of flying squirrel emit ultrasonic vocalizations. We recorded vocalizations from northern (Glaucomys sabrinus) and southern (G. volans) flying squirrels calling in both the laboratory and at a field site in central Ontario, Canada. We demonstrate that flying squirrels produce ultrasonic emissions through recorded bursts of broadband noise and time-frequency structured frequency modulated (FM) vocalizations, some of which were purely ultrasonic. Squirrels emitted three types of ultrasonic calls in laboratory recordings and one type in the field. The variety of signals that were recorded suggest that flying squirrels may use ultrasonic vocalizations to transfer information. Thus, vocalizations may be an important, although still poorly understood, aspect of flying squirrel social biology. PMID:24009728
A Computational Study of Vocal Fold Dehydration During Phonation.
Wu, Liang; Zhang, Zhaoyan
2017-12-01
While vocal fold dehydration is often considered an important factor contributing to vocal fatigue, it still remains unclear whether vocal fold vibration alone is able to induce severe dehydration that has a noticeable effect on phonation and perceived vocal effort. A three-dimensional model was developed to investigate vocal fold systemic dehydration and surface dehydration during phonation. Based on the linear poroelastic theory, the model considered water resupply from blood vessels through the lateral boundary, water movement within the vocal folds, water exchange between the vocal folds and the surface liquid layer through the epithelium, and surface fluid accumulation and discharge to the glottal airway. Parametric studies were conducted to investigate water loss within the vocal folds and from the surface after a 5-min sustained phonation under different permeability and vibration conditions. The results showed that the dehydration generally increased with increasing vibration amplitude, increasing epithelial permeability, and reduced water resupply. With adequate water resupply, a large-amplitude vibration can induce an overall systemic dehydration as high as 3%. The distribution of water loss within the vocal folds was non-uniform, and a local dehydration higher than 5% was observed even under conditions of a low overall systemic dehydration (<1%). Such a high level of water loss may severely affect tissue properties, muscular functions, and phonation characteristics. In contrast, water loss of the surface liquid layer was generally an order of magnitude higher than water loss inside the vocal folds, indicating that the surface dehydration level is likely not a good indicator of the systemic dehydration.
Hertegård, S; Cedervall, J; Svensson, B; Forsberg, K; Maurer, F H J; Vidovska, D; Olivius, P; Ahrlund-Richter, L; Le Blanc, K
2006-07-01
The aim of this study was to analyze the short-term viscoelastic and histologic properties of scarred rabbit vocal folds after injection of human mesenchymal stem cells (MSC) as well as the degree of MSC survival. Because MSCs are antiinflammatory and regenerate mesenchymal tissues, can MSC injection reduce vocal fold scarring after injury? Twelve vocal folds from 10 New Zealand rabbits were scarred by a localized resection and injected with human MSC or saline. Eight vocal folds were left as controls. After 4 weeks, 10 larynges were stained for histology and evaluation of the lamina propria thickness. Collagen type I content was analyzed from six rabbits. MSC survival was analyzed by fluorescent in situ hybridization staining from three rabbits. Viscoelasticity for 10 vocal folds was analyzed in a parallel-plate rheometer. The rheometry on fresh-frozen samples showed decreased dynamic viscosity and lower elastic modulus (P<.01) in the scarred samples injected with MSC as compared with the untreated scarred group. Normal controls had lower dynamic viscosity and elastic modulus as compared with the scarred untreated and treated vocal folds (P<.01). Histologic analysis showed a higher content of collagen type I in the scarred samples as compared with the normal vocal folds and with the scarred folds treated with MSC. MSCs remained in all samples analyzed. The treated scarred vocal folds showed persistent MSC. Injection of scarred rabbit vocal folds with MSC rendered improved viscoelastic parameters and fewer signs of scarring, expressed as collagen content, in comparison with the untreated scarred vocal folds.
Cooperative vocal control in marmoset monkeys via vocal feedback
Choi, Jung Yoon; Takahashi, Daniel Y.
2015-01-01
Humans adjust speech amplitude as a function of distance from a listener; we do so in a manner that would compensate for such distance. This ability is presumed to be the product of high-level sociocognitive skills. Nonhuman primates are thought to lack such socially related flexibility in vocal production. Using predictions from a simple arousal-based model whereby vocal feedback from a conspecific modulates the drive to produce a vocalization, we tested whether another primate exhibits this type of cooperative vocal control. We conducted a playback experiment with marmoset monkeys and simulated “far-away” and “nearby” conspecifics using contact calls that differed in sound intensity. We found that marmoset monkeys increased the amplitude of their contact calls and produced such calls with shorter response latencies toward more distant conspecifics. The same was not true in response to changing levels of background noise. To account for how simulated conspecific distance can change both the amplitude and timing of vocal responses, we developed a model that incorporates dynamic interactions between the auditory system and limbic “drive” systems. Overall, our data show that, like humans, marmoset monkeys cooperatively control the acoustics of their vocalizations according to changes in listener distance, increasing the likelihood that a conspecific will hear their call. However, we propose that such cooperative vocal control is a system property that does not necessitate any particularly advanced sociocognitive skill. At least in marmosets, this vocal control can be parsimoniously explained by the regulation of arousal states across two interacting individuals via vocal feedback. PMID:25925323
The Effectiveness of Low-Level Light Therapy in Attenuating Vocal Fatigue.
Kagan, Loraine Sydney; Heaton, James T
2017-05-01
Low-level light therapy (LLLT) is effective in reducing inflammation, promoting wound healing, and preventing tissue damage, but has not yet been studied in the treatment of voice disorders. The objective of this study was to investigate the possible effectiveness of LLLT in attenuating symptoms of vocal fatigue created by a vocal loading task as measured by acoustic, aerodynamic, and self-reported vocal effort. In a randomized, prospective study, 16 vocally healthy adults divided into four groups underwent a 1-hour vocal loading procedure, followed by infrared wavelength LLLT (828 nm), red wavelength LLLT (628 nm), heat, or no heat-light (control) treatment targeting the laryngeal region of the ventral neck surface. Phonation threshold pressure (PTP), relative fundamental frequency (RFF), and the inability to produce soft voice (IPSV) self-perceptual rating scale were recorded (1) at baseline, (2) immediately after vocal loading, (3) after treatment, and (4) 1 hour after treatment. Vocal loading significantly increased PTP and IPSV and decreased onset and offset RFFs, consistent with a shift toward vocal dysfunction. Red light significantly normalized the combination of PTP, IPSV, and RFF measures compared to other conditions. RFF is sensitive to a vocal loading task in conjunction with PTP and IPSV, and red LLLT may have a normalizing effect on objective and subjective measures of vocal fatigue. The results of this study lay the groundwork and rationale for future research to optimize LLLT wavelength combinations and overall dose. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Conserved mechanisms of vocalization coding in mammalian and songbird auditory midbrain.
Woolley, Sarah M N; Portfors, Christine V
2013-11-01
The ubiquity of social vocalizations among animals provides the opportunity to identify conserved mechanisms of auditory processing that subserve communication. Identifying auditory coding properties that are shared across vocal communicators will provide insight into how human auditory processing leads to speech perception. Here, we compare auditory response properties and neural coding of social vocalizations in auditory midbrain neurons of mammalian and avian vocal communicators. The auditory midbrain is a nexus of auditory processing because it receives and integrates information from multiple parallel pathways and provides the ascending auditory input to the thalamus. The auditory midbrain is also the first region in the ascending auditory system where neurons show complex tuning properties that are correlated with the acoustics of social vocalizations. Single unit studies in mice, bats and zebra finches reveal shared principles of auditory coding including tonotopy, excitatory and inhibitory interactions that shape responses to vocal signals, nonlinear response properties that are important for auditory coding of social vocalizations and modulation tuning. Additionally, single neuron responses in the mouse and songbird midbrain are reliable, selective for specific syllables, and rely on spike timing for neural discrimination of distinct vocalizations. We propose that future research on auditory coding of vocalizations in mouse and songbird midbrain neurons adopt similar experimental and analytical approaches so that conserved principles of vocalization coding may be distinguished from those that are specialized for each species. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives". Copyright © 2013 Elsevier B.V. All rights reserved.
Determining the etiology of mild vocal fold hypomobility.
Heman-Ackah, Yolanda D; Batory, Mark
2003-12-01
The prevalence of mild vocal fold hypomobility is unknown. In a study by Heman-Ackah et al, vocal fold hypomobility in a population of singing teachers was found to be associated more frequently with vocal complaints than was the presence of vocal fold masses. The etiology of mild vocal fold hypomobility has not been previously explored. In the present study, a retrospective chart review was performed of 134 patients who presented to a tertiary laryngology referral center over a 6-month period for evaluation of vocal complaints. Of the 134 patients, 61 (46%) were found to have mild vocal referring otolaryngologist. Imaging studies and laboratory tests to evaluate for structural, metabolic, and infectious causes of the decreased mobility had been ordered. Forty-nine patients completed the work-up. Of these, 41 out of 49 (84%) were found to have imaging or laboratory findings that could explain the hypomobility. Thyroid abnormalities were found to be associated with vocal fold hypomobility in 21 out of 49 (43%) of those with a complete evaluation. Other causes of vocal fold hypomobility included idiopathic (8 of 49, 16%), viral neuritis (5 of 49, 10%), central nervous system abnormality (4 of 49, 8%), neural tumor (3 of 49, 6%), joint dysfunction (3 of 49, 6%), iatrogenic nerve injury (2 of 49, 4%), myopathy (2 of 49, 4%), and noniatrogenic traumatic nerve injury (1 of 49, 2%). This study shows that unilateral vocal fold hypomobility often is associated with a physiologic process, and a complete investigation to determine the etiology is warranted in all cases.
Young, Katherine S; Parsons, Christine E; LeBeau, Richard T; Tabak, Benjamin A; Sewart, Amy R; Stein, Alan; Kringelbach, Morten L; Craske, Michelle G
2017-08-01
Emotional expressions are an essential element of human interactions. Recent work has increasingly recognized that emotional vocalizations can color and shape interactions between individuals. Here we present data on the psychometric properties of a recently developed database of authentic nonlinguistic emotional vocalizations from human adults and infants (the Oxford Vocal 'OxVoc' Sounds Database; Parsons, Young, Craske, Stein, & Kringelbach, 2014). In a large sample (n = 562), we demonstrate that adults can reliably categorize these sounds (as 'positive,' 'negative,' or 'sounds with no emotion'), and rate valence in these sounds consistently over time. In an extended sample (n = 945, including the initial n = 562), we also investigated a number of individual difference factors in relation to valence ratings of these vocalizations. Results demonstrated small but significant effects of (a) symptoms of depression and anxiety with more negative ratings of adult neutral vocalizations (R² = .011 and R² = .008, respectively) and (b) gender differences in perceived valence such that female listeners rated adult neutral vocalizations more positively and infant cry vocalizations more negatively than male listeners (R² = .021, R² = .010, respectively). Of note, we did not find evidence of negativity bias among other affective vocalizations or gender differences in perceived valence of adult laughter, adult cries, infant laughter, or infant neutral vocalizations. Together, these findings largely converge with factors previously shown to impact processing of emotional facial expressions, suggesting a modality-independent impact of depression, anxiety, and listener gender, particularly among vocalizations with more ambiguous valence. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Vocal Function in Introverts and Extraverts during a Psychological Stress Reactivity Protocol
ERIC Educational Resources Information Center
Dietrich, Maria; Verdolini Abbott, Katherine
2012-01-01
Purpose: To examine the proposal that introversion predictably influences extralaryngeal and vocal behavior in vocally healthy individuals compared with individuals with extraversion and whether differences are of a nature that may support a risk hypothesis for primary muscle tension dysphonia. Method: Fifty-four vocally healthy female adults…
The Loud, Clear, and Transporting Voice of Oral Interpretation.
ERIC Educational Resources Information Center
Vartabedian, Robert A.
This essay examines the art of oral interpretation from a "vocal" perspective--that is, it focuses on the crucial nature of vocal dimensions in oral interpretation. Moreover, the essay argues for an interpreter's hierarchy of vocal needs (modeled after Abraham Maslow's 1970 theory). The interpreter's hierarchy of vocal needs involves…
Oral Breathing Challenge in Participants with Vocal Attrition
ERIC Educational Resources Information Center
Sivasankar, Mahalakshmi; Fisher, Kimberly V.
2003-01-01
Vocal folds undergo osmotic challenge by mouth breathing during singing, exercising, and loud speaking. Just 15 min of obligatory oral breathing, to dry the vocal folds, increases phonation threshold pressure (P[subscript th]) and expiratory vocal effort in healthy speakers (M. Sivasankar & K. Fisher, 2002). We questioned whether oral breathing is…
Diagnostic and therapeutic pitfalls in benign vocal fold diseases
Bohlender, Jörg
2013-01-01
More than half of patients presenting with hoarseness show benign vocal fold changes. The clinician should be familiar with the anatomy, physiology and functional aspects of voice disorders and also the modern diagnostic and therapeutic possibilities in order to ensure an optimal and patient specific management. This review article focuses on the diagnostic and therapeutic limitations and difficulties of treatment of benign vocal fold tumors, the management and prevention of scarred vocal folds and the issue of unilateral vocal fold paresis. PMID:24403969
Sarcoidosis Presenting as Bilateral Vocal Fold Immobility.
Hintze, Justin M; Gnagi, Sharon H; Lott, David G
2018-05-01
Bilateral true vocal fold paralysis is rarely attributable to inflammatory diseases. Sarcoidosis is a rare but important etiology of bilateral true vocal fold paralysis by compressive lymphadenopathy, granulomatous infiltration, and neural involvement. We describe the first reported case of sarcoidosis presenting as bilateral vocal fold immobility caused by direct fixation by granulomatous infiltration severe enough to necessitate tracheostomy insertion. In addition, we discuss the presentation, the pathophysiology, and the treatment of this disease with a review of the literature of previously reported cases of sarcoidosis-related vocal fold immobility. Sarcoidosis should therefore be an important consideration for the otolaryngologist's differential diagnosis of true vocal fold immobility. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
What happens during vocal warm-up?
Elliot, N; Sundberg, J; Gramming, P
1995-03-01
Most singers prefer to warm up their voices before performing. Although the subjective effect is often considerable, the underlying physiological effects are largely unknown. Because warm-up tends to increase blood flow in muscles, it seems likely that vocal warm-up might induce decreased viscosity in the vocal folds. According to the theory of vocal-fold vibration, such a decrease should lead to a lower phonation threshold pressure. In this investigation the effect of vocal warm-up on the phonation threshold pressure was examined in a group of male and female singers. The effect varied considerably between subjects, presumably because the vocal-fold viscosity was not a dominating factor for the phonation-threshold pressure.
Vocal cord dysfunction in children.
Noyes, Blakeslee E; Kemp, James S
2007-06-01
Vocal cord dysfunction is characterised by paradoxical vocal cord adduction that occurs during inspiration, resulting in symptoms of dyspnoea, wheeze, chest or throat tightness and cough. Although the condition is well described in children and adults, confusion with asthma often triggers the use of an aggressive treatment regimen directed against asthma. The laryngoscopic demonstration of vocal cord adduction during inspiration has been considered the gold standard for the diagnosis of vocal cord dysfunction, but historical factors and pulmonary function findings may provide adequate clues to the correct diagnosis. Speech therapy, and in some cases psychological counselling, is often beneficial in this disorder. The natural course and prognosis of vocal cord dysfunction are still not well described in adults or children.
Everyday bat vocalizations contain information about emitter, addressee, context, and behavior
Prat, Yosef; Taub, Mor; Yovel, Yossi
2016-01-01
Animal vocal communication is often diverse and structured. Yet, the information concealed in animal vocalizations remains elusive. Several studies have shown that animal calls convey information about their emitter and the context. Often, these studies focus on specific types of calls, as it is rarely possible to probe an entire vocal repertoire at once. In this study, we continuously monitored Egyptian fruit bats for months, recording audio and video around-the-clock. We analyzed almost 15,000 vocalizations, which accompanied the everyday interactions of the bats, and were all directed toward specific individuals, rather than broadcast. We found that bat vocalizations carry ample information about the identity of the emitter, the context of the call, the behavioral response to the call, and even the call’s addressee. Our results underline the importance of studying the mundane, pairwise, directed, vocal interactions of animals. PMID:28005079