Behavior-Linked FoxP2 Regulation Enables Zebra Finch Vocal Learning
Heston, Jonathan B.
2015-01-01
Mutations in the FOXP2 transcription factor cause an inherited speech and language disorder, but how FoxP2 contributes to learning of these vocal communication signals remains unclear. FoxP2 is enriched in corticostriatal circuits of both human and songbird brains. Experimental knockdown of this enrichment in song control neurons of the zebra finch basal ganglia impairs tutor song imitation, indicating that adequate FoxP2 levels are necessary for normal vocal learning. In unmanipulated birds, vocal practice acutely downregulates FoxP2, leading to increased vocal variability and dynamic regulation of FoxP2 target genes. To determine whether this behavioral regulation is important for song learning, here, we used viral-driven overexpression of FoxP2 to counteract its downregulation. This manipulation disrupted the acute effects of song practice on vocal variability and caused inaccurate song imitation. Together, these findings indicate that dynamic behavior-linked regulation of FoxP2, rather than absolute levels, is critical for vocal learning. PMID:25698728
Behavior-linked FoxP2 regulation enables zebra finch vocal learning.
Heston, Jonathan B; White, Stephanie A
2015-02-18
Mutations in the FOXP2 transcription factor cause an inherited speech and language disorder, but how FoxP2 contributes to learning of these vocal communication signals remains unclear. FoxP2 is enriched in corticostriatal circuits of both human and songbird brains. Experimental knockdown of this enrichment in song control neurons of the zebra finch basal ganglia impairs tutor song imitation, indicating that adequate FoxP2 levels are necessary for normal vocal learning. In unmanipulated birds, vocal practice acutely downregulates FoxP2, leading to increased vocal variability and dynamic regulation of FoxP2 target genes. To determine whether this behavioral regulation is important for song learning, here, we used viral-driven overexpression of FoxP2 to counteract its downregulation. This manipulation disrupted the acute effects of song practice on vocal variability and caused inaccurate song imitation. Together, these findings indicate that dynamic behavior-linked regulation of FoxP2, rather than absolute levels, is critical for vocal learning. Copyright © 2015 the authors 0270-6474/15/352885-10$15.00/0.
Hunter, Eric J.; Titze, Ingo R.
2012-01-01
Objectives To quantify the recovery of voice following a 2-hour vocal loading exercise (oral reading). Methods 86 adult participants tracked their voice recovery using short vocal tasks and perceptual ratings after an initial vocal loading exercise and for the following two days. Results Short-term recovery was apparent with 90% recovery within 4-6 hours and full recovery at 12-18 hours. Recovery was shown to be similar to a dermal wound healing trajectory. Conclusions The new recovery trajectory highlighted by the vocal loading exercise in the current study is called a vocal recovery trajectory. By comparing vocal fatigue to dermal wound healing, this trajectory is parallel to a chronic wound healing trajectory (as opposed to an acute wound healing trajectory). This parallel suggests that vocal fatigue from the daily use of the voice could be treated as a chronic wound, with the healing and repair mechanisms in a state of constant repair. In addition, there is likely a vocal fatigue threshold at which point the level of tissue damage would shift the chronic healing trajectory to an acute healing trajectory. PMID:19663377
Vocal fold paralysis secondary to phonotrauma.
Klein, Travis A L; Gaziano, Joy E; Ridley, Marion B
2014-01-01
A unique case of acute onset vocal fold paralysis secondary to phonotrauma is presented. The cause was forceful vocalization by a drill instructor on a firearm range. Imaging studies revealed extensive intralaryngeal and retropharyngeal hemorrhage. Laryngoscopy showed a complete left vocal fold paralysis. Relative voice rest was recommended, and the patient regained normal vocal fold mobility and function after approximately 12 weeks. Copyright © 2014 The Voice Foundation. All rights reserved.
Kishimoto, Ayami Ohno; Kishimoto, Yo; Young, David L; Zhang, Jinjin; Rowland, Ian J; Welham, Nathan V
2016-11-01
Subepithelial changes to the vocal fold mucosa, such as fibrosis, are difficult to identify using visual assessment of the tissue surface. Moreover, without suspicion of neoplasm, mucosal biopsy is not a viable clinical option, as it carries its own risk of iatrogenic injury and scar formation. Given these challenges, we assessed the ability of high- (4.7 T) and ultrahigh-field (9.4 T) magnetic resonance imaging to resolve key vocal fold subepithelial tissue structures in the rat, an important and widely used preclinical model in vocal fold biology. We conducted serial in vivo and ex vivo imaging, evaluated an array of acquisition sequences and contrast agents, and successfully resolved key anatomic features of naïve, acutely injured, and chronically scarred vocal fold mucosae on the ex vivo scans. Naïve lamina propria was hyperintense on T1-weighted imaging with gadobenate dimeglumine contrast enhancement, whereas chronic scar was characterized by reduced lamina propria T1 signal intensity and mucosal volume. Acutely injured mucosa was hypointense on T2-weighted imaging; lesion volume steadily increased, peaked at 5 days post-injury, and then decreased - consistent with the physiology of acute, followed by subacute, hemorrhage and associated changes in the magnetic state of hemoglobin and its degradation products. Intravenous administration of superparamagnetic iron oxide conferred no T2 contrast enhancement during the acute injury period. These findings confirm that magnetic resonance imaging can resolve anatomic substructures within naïve vocal fold mucosa, qualitative and quantitative features of acute injury, and the presence of chronic scar. © 2016. Published by The Company of Biologists Ltd.
Kishimoto, Ayami Ohno; Kishimoto, Yo; Young, David L.; Zhang, Jinjin
2016-01-01
ABSTRACT Subepithelial changes to the vocal fold mucosa, such as fibrosis, are difficult to identify using visual assessment of the tissue surface. Moreover, without suspicion of neoplasm, mucosal biopsy is not a viable clinical option, as it carries its own risk of iatrogenic injury and scar formation. Given these challenges, we assessed the ability of high- (4.7 T) and ultrahigh-field (9.4 T) magnetic resonance imaging to resolve key vocal fold subepithelial tissue structures in the rat, an important and widely used preclinical model in vocal fold biology. We conducted serial in vivo and ex vivo imaging, evaluated an array of acquisition sequences and contrast agents, and successfully resolved key anatomic features of naïve, acutely injured, and chronically scarred vocal fold mucosae on the ex vivo scans. Naïve lamina propria was hyperintense on T1-weighted imaging with gadobenate dimeglumine contrast enhancement, whereas chronic scar was characterized by reduced lamina propria T1 signal intensity and mucosal volume. Acutely injured mucosa was hypointense on T2-weighted imaging; lesion volume steadily increased, peaked at 5 days post-injury, and then decreased – consistent with the physiology of acute, followed by subacute, hemorrhage and associated changes in the magnetic state of hemoglobin and its degradation products. Intravenous administration of superparamagnetic iron oxide conferred no T2 contrast enhancement during the acute injury period. These findings confirm that magnetic resonance imaging can resolve anatomic substructures within naïve vocal fold mucosa, qualitative and quantitative features of acute injury, and the presence of chronic scar. PMID:27638667
Adipose stromal cells improve healing of vocal fold scar: Morphological and functional evidences.
de Bonnecaze, Guillaume; Chaput, Benoit; Woisard, Virginie; Uro-Coste, Emmanuelle; Swider, Pascal; Vergez, Sebastien; Serrano, Elie; Casteilla, Louis; Planat-Benard, Valerie
2016-08-01
Adipose derived stromal cells (ASCs) are abundant and easy to prepare. Such cells may be useful for treating severe vocal disturbance caused by acute vocal fold scars. Prospective animal experiments with controls. Twenty New-Zealand white rabbits were used in the present study. We evaluated vocal fold healing, with or without injection of autologous ASCs, after acute scarring. A defined lesion was created and the ASCs were immediately injected. Vocal fold regeneration was evaluated histomorphometrically and via viscoelastic analysis using an electrodynamic shaker. Six weeks after ASC injection, vocal folds exhibited significantly less inflammation than control folds (P < 0.005). In addition, hypertrophy of the lamina propria and fibrosis were significantly reduced upon ASC injection (P < 0.02). The decrease in viscoelastic parameters was less important in the ASC injected group compared to the noninjected group (P = 0.08). Injection of autologous ASCs improved vocal fold healing in our preclinical model. Further studies are needed, but this method may be useful in humans. NA. Laryngoscope, 126:E278-E285, 2016. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Vocal exercise may attenuate acute vocal fold inflammation
Abbott, Katherine Verdolini; Li, Nicole Y.K.; Branski, Ryan C.; Rosen, Clark A.; Grillo, Elizabeth; Steinhauer, Kimberly; Hebda, Patricia A.
2012-01-01
Objectives/Hypotheses The objective was to assess the utility of selected “resonant voice” exercises for the reduction of acute vocal fold inflammation. The hypothesis was that relatively large-amplitude, low-impact exercises associated with resonant voice would reduce inflammation more than spontaneous speech and possibly more than voice rest. Study Design The study design was prospective, randomized, double-blind. Methods Nine vocally healthy adults underwent a 1-hr vocal loading procedure, followed by randomization to (a) a spontaneous speech condition, (b) a vocal rest condition, or (c) a resonant voice exercise condition. Treatments were monitored in clinic for 4 hr, and continued extra-clinically until the next morning. At baseline, immediately following loading, after the 4-hr in-clinic treatment, and 24 hr post baseline, secretions were suctioned from the vocal folds bilaterally and submitted to enzyme-linked immunosorbent assay (ELISA) to estimate concentrations of key markers of tissue injury and inflammation: IL-1β, IL-6, IL-8, TNF-α, MMP-8, and IL-10. Results Complete data sets were obtained for 3 markers -- IL-1β, IL-6, and MMP-8 -- for one subject in each treatment condition. For those markers, results were poorest at 24-hr follow-up in the spontaneous speech condition, sharply improved in the voice rest condition, and best in the resonant voice condition. Average results for all markers, for all responsive subjects with normal baseline mediator concentrations, revealed an almost identical pattern. Conclusions Some forms of tissue mobilization may be useful to attenuate acute vocal fold inflammation. PMID:23177745
Acute vocal fold hemorrhage caught on video during office exam.
Carroll, Thomas L; Smith, Libby J
2009-03-01
This article presents a unique video of a laryngeal exam during which a vocal fold hemorrhage occurs. This patient had likely been suffering from intermittent vocal fold hemorrhages for the last decade due to a persistent vascular lesion and an underlying chronic cough.
Vocal fold motion outcome based on excellent prognosis with laryngeal electromyography.
Smith, Libby J; Rosen, Clark A; Munin, Michael C
2016-10-01
As laryngeal electromyography (LEMG) becomes more refined, accurate predictions of vocal fold motion recovery are possible. Focus has been on outcomes for patients with poor prognosis for vocal fold motion recovery. Limited information is available regarding the expected rate of purposeful vocal fold motion recovery when there is good to normal motor recruitment, no signs of denervation, and no signs of synkinetic activity with LEMG, termed excellent prognosis. The objective of this study is to determine the rate of vocal fold motion recovery with excellent prognosis findings on LEMG after acute recurrent laryngeal nerve injury. Retrospective review. Patients undergoing a standardized LEMG protocol, consisting of qualitative (evaluation of motor recruitment, motor unit configuration, detection of fibrillations, presence of synkinesis) and quantitative (turns analysis) measurements were evaluated for purposeful vocal-fold motion recovery, calculated after at least 6 months since onset of injury. Twenty-three patients who underwent LEMG for acute vocal fold paralysis met the inclusion criteria of excellent prognosis. Eighteen patients (78.3%) recovered vocal fold motion, as determined by flexible laryngoscopy. Nearly 80% of patients determined to have excellent prognosis for vocal fold motion recovery experienced return of vocal fold motion. This information will help clinicians not only counsel their patients on expectations but will also help guide treatment. 4. Laryngoscope, 126:2310-2314, 2016. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Matthews, Douglas B; Chesler, Elissa J; Cook, Melloni N.
2008-01-01
Footshock response is used to study biological functions in mammals. However, the genetics underlying variability in footshock sensitivity are not well understood. In the current studies, a panel of B6.A consomic mouse strains, two B6.D2 congenic mouse strains and the progenitor strains were screened for footshock sensitivity as measured by audible vocalization. It was found that A/J (A) mice and C57BL/6J (B6) mice with an A Chromosome 1 (Chr 1) were less sensitive to footshock compared to B6 animals. Furthermore, the offspring of Chr 1 consomic mice crossed with B6 mice had vocalization levels that were intermediate to A/J andmore » B6 animals. A F2 mapping panel revealed two significant QTLs for footshock vocalization centered around D1Mit490 and D1Mit206 on Chr 1. The role of these Chr 1 loci in footshock sensitivity was confirmed in B6.D2 congenic mice. These data identify genetic regions involved in footshock sensitivity and establish additional mouse resources for use in investigating complex behaviors.« less
Piristine, Hande C; Choetso, Tenzin; Gobes, Sharon M H
2016-11-01
Sensory feedback is essential for acquiring and maintaining complex motor behaviors, including birdsong. In zebra finches, auditory feedback reaches the song control circuits primarily through the nucleus interfacialis nidopalii (Nif), which provides excitatory input to HVC (proper name)-a premotor region essential for the production of learned vocalizations. Despite being one of the major inputs to the song control pathway, the role of Nif in generating vocalizations is not well understood. To address this, we transiently inactivated Nif in late juvenile zebra finches. Upon Nif inactivation (in both hemispheres or on one side only), birds went from singing stereotyped zebra finch song to uttering highly variable and unstructured vocalizations resembling sub-song, an early juvenile song form driven by a basal ganglia circuit. Simultaneously inactivating Nif and LMAN (lateral magnocellular nucleus of the anterior nidopallium), the output nucleus of a basal ganglia circuit, inhibited song production altogether. These results suggest that Nif is required for generating the premotor drive for song. Permanent Nif lesions, in contrast, have only transient effects on vocal production, with song recovering within a day. The sensorimotor nucleus Nif thus produces a premotor drive to the motor pathway that is acutely required for generating learned vocalizations, but once permanently removed, the song system can compensate for its absence. © 2016 Wiley Periodicals, Inc. Develop Neurobiol 76: 1213-1225, 2016. © 2016 Wiley Periodicals, Inc.
Graboyes, Evan M; Bradley, Joseph P; Meyers, Bryan F; Nussenbaum, Brian
2011-11-01
The primary objective of this study was to evaluate the effectiveness and safety of injection laryngoplasty using a temporary injectable agent in the acute setting for patients with unilateral vocal cord paralysis following thoracic surgical procedures. Retrospective consecutive case series in an academic institution. Inclusion criteria included patients acutely treated with injection laryngoplasty from January 1, 2006, to March 31, 2010, for a unilateral vocal cord paralysis that occurred after a thoracic surgical procedure (N = 20). All patients were injected with Radiesse Voice Gel using microlaryngoscopy technique. The mean time to vocal cord injection from the time of thoracic surgery was 4.5 days. There was one operative-related complication of intraoperative bile reflux that caused a pneumonitis. Ninety percent of patients were recommended for strict nothing by mouth prior to injection. Of these, 94% were allowed an oral diet following injection, and 67% tolerated a regular diet. None of the patients required subsequent procedures for aspiration or dysphagia, and 25% required further intervention after discharge for persistent dysphonia. Patients with a known nerve transection had a higher rate of dysphonia requiring further surgical procedures than those who did not have a known nerve transection. Acute treatment of thoracic surgery-related unilateral vocal cord paralysis with injection laryngoplasty appears safe and effective at preventing postoperative aspiration pneumonia and improves swallowing function to allow resumption of an oral diet. A single injection is often the only required treatment. Copyright © 2011 The American Laryngological, Rhinological, and Otological Society, Inc.
Homeostasis of Hyaluronic Acid in Normal and Scarred Vocal Folds
Tateya, Ichiro; Tateya, Tomoko; Watanuki, Makoto; Bless, Diane M.
2015-01-01
Summary Objectives/Hypothesis Vocal fold scarring is one of the most challenging laryngeal disorders to treat. Hyaluronic acid (HA) is the main component of lamina propria, and it plays an important role in proper vocal fold vibration and is also thought to be important in fetal wound healing without scarring. Although several animal models of vocal fold scarring have been reported, little is known about the way in which HA is maintained in vocal folds. The purpose of this study was to clarify the homeostasis of HA by examining the expression of hyaluronan synthase (Has) and hyaluronidase (Hyal), which produce and digest HA, respectively. Study Design Experimental prospective animal study. Methods Vocal fold stripping was performed on 38 Sprague-Dawley rats. Vocal fold tissue was collected at five time points (3 days–2 months). Expression of HA was examined by immunohistochemistry, and messenger RNA (mRNA) expression of Has and Hyal was examined by real-time polymerase chain reaction and in-situ hybridization. Results In scarred vocal folds, expression of Has1 and Has2 increased at day 3 together with expression of HA and returned to normal at 2 weeks. At 2 months, Has3 and Hyal3 mRNA showed higher expressions than normal. Conclusions Expression patterns of Has and Hyal genes differed between normal, acute-scarred, and chronic-scarred vocal folds, indicating the distinct roles of each enzyme in maintaining HA. Continuous upregulation of Has genes in the acute phase may be necessary to achieve scarless healing of vocal folds. PMID:25499520
Irritant vocal cord dysfunction at first misdiagnosed as reactive airway dysfunction syndrome.
Galdi, Eugenia; Perfetti, Luca; Pagella, Fabio; Bertino, Giulia; Ferrari, Massimo; Moscato, Gianna
2005-06-01
This report describes a case of vocal cord dysfunction at first misdiagnosed as reactive airway dysfunction syndrome (RADS). A woman developed recurrent episodes of cough, dyspnea, and wheezing unresponsive to asthma therapy after irritant exposure to glutaraldehyde. Direct laryngoscopy was performed immediately after the induction of symptoms. Laryngoscopy showed a paradoxical adduction of the vocal cord on inspiration. Vocal cord dysfunction was diagnosed. A case of vocal cord dysfunction occurred after exposure to glutaraldhyde in a person with a history highly suggestive of RADS. Vocal cord dysfunction should always be considered in the differential diagnosis of patients with acute respiratory symptoms after exposure to irritants and with asthma-like symptoms that fail to respond to conventional asthma therapy.
ERIC Educational Resources Information Center
Coppoolse, Jiska M. S.; Van Kooten, T. G.; Heris, Hossein K.; Mongeau, Luc; Li, Nicole Y. K.; Thibeault, Susan L.; Pitaro, Jacob; Akinpelu, Olubunm; Daniel, Sam J.
2014-01-01
Purpose: The objective of this study was to investigate local injection with a hierarchically microstructured hyaluronic acid-gelatin (HA-Ge) hydrogel for the treatment of acute vocal fold injury using a rat model. Method: Vocal fold stripping was performed unilaterally in 108 Sprague-Dawley rats. A volume of 25 µl saline (placebo controls),…
Predicting Achievable Fundamental Frequency Ranges in Vocalization Across Species
Titze, Ingo; Riede, Tobias; Mau, Ted
2016-01-01
Vocal folds are used as sound sources in various species, but it is unknown how vocal fold morphologies are optimized for different acoustic objectives. Here we identify two main variables affecting range of vocal fold vibration frequency, namely vocal fold elongation and tissue fiber stress. A simple vibrating string model is used to predict fundamental frequency ranges across species of different vocal fold sizes. While average fundamental frequency is predominantly determined by vocal fold length (larynx size), range of fundamental frequency is facilitated by (1) laryngeal muscles that control elongation and by (2) nonlinearity in tissue fiber tension. One adaptation that would increase fundamental frequency range is greater freedom in joint rotation or gliding of two cartilages (thyroid and cricoid), so that vocal fold length change is maximized. Alternatively, tissue layers can develop to bear a disproportionate fiber tension (i.e., a ligament with high density collagen fibers), increasing the fundamental frequency range and thereby vocal versatility. The range of fundamental frequency across species is thus not simply one-dimensional, but can be conceptualized as the dependent variable in a multi-dimensional morphospace. In humans, this could allow for variations that could be clinically important for voice therapy and vocal fold repair. Alternative solutions could also have importance in vocal training for singing and other highly-skilled vocalizations. PMID:27309543
Vocal exploration is locally regulated during song learning
Ravbar, Primoz; Parra, Lucas C.; Lipkind, Dina; Tchernichovski, Ofer
2012-01-01
Exploratory variability is essential for sensory-motor learning, but it is not known how and at what time scales it is regulated. We manipulated song learning in zebra finches to experimentally control the requirements for vocal exploration in different parts of their song. We first trained birds to perform a one-syllable song, and once they mastered it we added a new syllable to the song model. Remarkably, when practicing the modified song, birds rapidly alternated between high and low acoustic variability to confine vocal exploration to the newly added syllable. Further, even within syllables, acoustic variability changed independently across song elements that were only milliseconds apart. Analysis of the entire vocal output during learning revealed that the variability of each song element decreased as it approached the target, correlating with momentary local distance from the target and less so with the overall distance. We conclude that vocal error is computed locally in sub-syllabic time scales and that song elements can be learned and crystalized independently. Songbirds have dedicated brain circuitry for vocal babbling in the anterior forebrain pathway (AFP), which generates exploratory song patterns that drive premotor neurons at the song nucleus RA (robust nucleus of the arcopallium). We hypothesize that either AFP adjusts the gain of vocal exploration in fine time scales, or that the sensitivity of RA premotor neurons to AFP/HVC inputs varies across song elements. PMID:22399765
A Bayesian Account of Vocal Adaptation to Pitch-Shifted Auditory Feedback
Hahnloser, Richard H. R.
2017-01-01
Motor systems are highly adaptive. Both birds and humans compensate for synthetically induced shifts in the pitch (fundamental frequency) of auditory feedback stemming from their vocalizations. Pitch-shift compensation is partial in the sense that large shifts lead to smaller relative compensatory adjustments of vocal pitch than small shifts. Also, compensation is larger in subjects with high motor variability. To formulate a mechanistic description of these findings, we adapt a Bayesian model of error relevance. We assume that vocal-auditory feedback loops in the brain cope optimally with known sensory and motor variability. Based on measurements of motor variability, optimal compensatory responses in our model provide accurate fits to published experimental data. Optimal compensation correctly predicts sensory acuity, which has been estimated in psychophysical experiments as just-noticeable pitch differences. Our model extends the utility of Bayesian approaches to adaptive vocal behaviors. PMID:28135267
Goldberg, Jesse H.
2011-01-01
Young songbirds produce vocal “babbling,” and the variability of their songs is thought to underlie a process of trial-and-error vocal learning. It is known that this exploratory variability requires the “cortical” component of a basal ganglia (BG) thalamocortical loop, but less understood is the role of the BG and thalamic components in this behavior. We found that large bilateral lesions to the songbird BG homolog Area X had little or no effect on song variability during vocal babbling. In contrast, lesions to the BG-recipient thalamic nucleus DLM (medial portion of the dorsolateral thalamus) largely abolished normal vocal babbling in young birds and caused a dramatic increase in song stereotypy. These findings support the idea that the motor thalamus plays a key role in the expression of exploratory juvenile behaviors during learning. PMID:21430276
Effects of adventitious acute vocal trauma: Relative fundamental frequency and listener perception
Murray, Elizabeth Heller; Hands, Gabrielle L.; Calabrese, Carolyn R.; Stepp, Cara E.
2015-01-01
Objective High voice users (individuals who demonstrate excessive or loud vocal use) are at risk for developing voice disorders. The objective of this study was to examine, both acoustically and perceptually, vocal changes in healthy speakers following an acute period of high voice use. Methods Members of a university women’s volleyball team (N=12) were recorded a week prior (Pre) and week following (Post) the 10-week spring season; N=6 control speakers were recorded over the same time period for comparison. Speakers read four sentences, which were analyzed for relative fundamental frequency (RFF). Eight naïve listeners participated in an auditory-perceptual visual sort and rate (VSR) task, in which they rated each voice sample’s overall severity and strain. Results No significant differences were found as a function of time point in the VSR ratings for the volleyball group. Onset cycle 1 RFF values were significantly lower (p = 0.04) in the Post recordings of the volleyball participants compared to Pre recordings, but there was no significant difference (p = 0.20) in offset cycle 10 RFF values. Receiver operating characteristic analyses indicated moderate sensitivity and specificity of onset cycle 1 RFF for discrimination between the volleyball and control participants. Changes were not apparent in the control group as a function of time for either, onset cycle 1 RFF, offset cycle 10 FF, or either vocal attribute. Conclusion Onset cycle 1 RFF may be an effective marker for detecting vocal changes over an acute high voice use period of time before perceptual changes are noted. PMID:26028369
Schneiderová, Irena; Zouhar, Jan
2014-01-01
Shrews have rich vocal repertoires that include vocalizations within the human audible frequency range and ultrasonic vocalizations. Here, we recorded and analyzed in detail the acoustic structure of a vocalization with unclear functional significance that was spontaneously produced by 15 adult, captive Asian house shrews (Suncus murinus) while they were lying motionless and resting in their nests. This vocalization was usually emitted repeatedly in a long series with regular intervals. It showed some structural variability; however, the shrews most frequently emitted a tonal, low-frequency vocalization with minimal frequency modulation and a low, non-vocal click that was clearly noticeable at its beginning. There was no effect of sex, but the acoustic structure of the analyzed vocalizations differed significantly between individual shrews. The encoded individuality was low, but it cannot be excluded that this individuality would allow discrimination of family members, i.e., a male and female with their young, collectively resting in a common nest. The question remains whether the Asian house shrews indeed perceive the presence of their mates, parents or young resting in a common nest via the resting-associated vocalization and whether they use it to discriminate among their family members. Additional studies are needed to explain the possible functional significance of resting-associated vocalizations emitted by captive Asian house shrews. Our study highlights that the acoustic communication of shrews is a relatively understudied topic, particularly considering that they are highly vocal mammals. PMID:25390304
Scheerer, N E; Jacobson, D S; Jones, J A
2016-02-09
Auditory feedback plays an important role in the acquisition of fluent speech; however, this role may change once speech is acquired and individuals no longer experience persistent developmental changes to the brain and vocal tract. For this reason, we investigated whether the role of auditory feedback in sensorimotor learning differs across children and adult speakers. Participants produced vocalizations while they heard their vocal pitch predictably or unpredictably shifted downward one semitone. The participants' vocal pitches were measured at the beginning of each vocalization, before auditory feedback was available, to assess the extent to which the deviant auditory feedback modified subsequent speech motor commands. Sensorimotor learning was observed in both children and adults, with participants' initial vocal pitch increasing following trials where they were exposed to predictable, but not unpredictable, frequency-altered feedback. Participants' vocal pitch was also measured across each vocalization, to index the extent to which the deviant auditory feedback was used to modify ongoing vocalizations. While both children and adults were found to increase their vocal pitch following predictable and unpredictable changes to their auditory feedback, adults produced larger compensatory responses. The results of the current study demonstrate that both children and adults rapidly integrate information derived from their auditory feedback to modify subsequent speech motor commands. However, these results also demonstrate that children and adults differ in their ability to use auditory feedback to generate compensatory vocal responses during ongoing vocalization. Since vocal variability also differed across the children and adult groups, these results also suggest that compensatory vocal responses to frequency-altered feedback manipulations initiated at vocalization onset may be modulated by vocal variability. Copyright © 2015 IBRO. Published by Elsevier Ltd. All rights reserved.
Perceptions of Voice Teachers Regarding Students' Vocal Behaviors During Singing and Speaking.
Beeman, Shellie A
2017-01-01
This study examined voice teachers' perceptions of their instruction of healthy singing and speaking voice techniques. An online, researcher-generated questionnaire based on the McClosky technique was administered to college/university voice teachers listed as members in the 2012-2013 College Music Society directory. A majority of participants believed there to be a relationship between the health of the singing voice and the health of the speaking voice. Participants' perception scores were the most positive for variable MBSi, the monitoring of students' vocal behaviors during singing. Perception scores for variable TVB, the teaching of healthy vocal behaviors, and variable MBSp, the monitoring of students' vocal behaviors while speaking, ranked second and third, respectively. Perception scores for variable TVB were primarily associated with participants' familiarity with voice rehabilitation techniques, gender, and familiarity with the McClosky technique. Perception scores for variable MBSi were primarily associated with participants' familiarity with voice rehabilitation techniques, gender, type of student taught, and instruction of a student with a voice disorder. Perception scores for variable MBSp were correlated with the greatest number of characteristics, including participants' familiarity with voice rehabilitation techniques, familiarity with the McClosky technique, type of student taught, years of teaching experience, and instruction of a student with a voice disorder. Voice teachers are purportedly working with injured voices and attempting to include vocal health in their instruction. Although a voice teacher is not obligated to pursue further rehabilitative training, the current study revealed a positive relationship between familiarity with specific rehabilitation techniques and vocal health. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Schwalm, Afton Leigh
California sea lions (Zalophus californianus) are a highly popular and easily recognized marine mammal in zoos, aquariums, circuses, and often seen by ocean visitors. They are highly vocal and gregarious on land. Surprisingly, little research has been performed on the vocalization types, source levels, acoustic properties, and functions of airborne sounds used by California sea lions. This research on airborne vocalizations of California sea lions will advance the understanding of this aspect of California sea lions communication, as well as examine the relationship between health condition and acoustic behavior. Using a PhillipsRTM digital recorder with attached microphone and a calibrated RadioShackRTM sound pressure level meter, acoustical data were recorded opportunistically on California sea lions during rehabilitation at The Marine Mammal Center in Sausalito, CA. Vocalizations were analyzed using frequency, time, and amplitude variables with Raven Pro: Interactive Sound Analysis Software Version 1.4 (The Cornell Lab of Ornithology, Ithaca, NY). Five frequency, three time, and four amplitude variables were analyzed for each vocalization. Differences in frequency, time, and amplitude variables were not significant by sex. The older California sea lion group produced vocalizations that were significantly lower in four frequency variables, significantly longer in two time variables, significantly higher in calibrated maximum and minimum amplitude variables, and significantly lower in frequency at maximum and minimum amplitude compared with pups. Six call types were identified: bark, goat, growl/grumble, bark/grumble, bark/growl, and grumble/moan. The growl/grumble call was higher in dominant beginning, ending, and minimum frequency, as well as in the frequency at maximum amplitude compared with the bark, goat, bark/grumble calls in the first versus last vocalization sample. The goat call was significantly higher in first harmonic interval than any other call type in the all vocalizations sample. The "fate" of a sea lion was categorized as: released, placed at another facility, remained at TMMC, euthanized, or died. To determine if acoustic features could be used to assess the recovery of a pup, the acoustic features of a pup's first recorded vocalization were compared with the frequency, time, and amplitude of the last vocalization recorded (i.e., before it was released or placed at another facility). In addition, all first vocalizations were pooled and all last vocalizations were pooled for acoustic analysis, regardless of their fate. Released pups had shorter duration calls, a greater first harmonic interval, and a higher dominant maximum frequency than either pups that died or pups remaining at TMMC. Released pups had a higher frequency at maximum and minimum amplitude compared to dead and remaining pups. Pups that died had significantly lower dominant ending frequency and a lower dominant minimum frequency than released or remaining pups. These results were supported by other studies on different species of otariids, phocids, and cetaceans. The preliminary analyses presented in this thesis holds promise that with additional data acoustic features of California sea lion airborne vocalizations could indicate sex, age, and possibly health condition or the potential for release.
Factors associated with vocal fold pathologies in teachers.
Souza, Carla Lima de; Carvalho, Fernando Martins; Araújo, Tânia Maria de; Reis, Eduardo José Farias Borges Dos; Lima, Verônica Maria Cadena; Porto, Lauro Antonio
2011-10-01
To analyze factors associated with the prevalence of the medical diagnosis of vocal fold pathologies in teachers. A census-based epidemiological, cross-sectional study was conducted with 4,495 public primary and secondary school teachers in the city of Salvador, Northeastern Brazil, between March and April 2006. The dependent variable was the self-reported medical diagnosis of vocal fold pathologies and the independent variables were sociodemographic characteristics; professional activity; work organization/interpersonal relationships; physical work environment characteristics; frequency of common mental disorders, measured by the Self-Reporting Questionnaire-20 (SRQ-20 >7); and general health conditions. Descriptive statistical, bivariate and multiple logistic regression analysis techniques were used. The prevalence of self-reported medical diagnosis of vocal fold pathologies was 18.9%. In the logistic regression analysis, the variables that remained associated with this medical diagnosis were as follows: being female, having worked as a teacher for more than seven years, excessive voice use, reporting more than five unfavorable physical work environment characteristics and presence of common mental disorders. The presence of self-reported vocal fold pathologies was associated with factors that point out the need of actions that promote teachers' vocal health and changes in their work structure and organization.
On Short-Time Estimation of Vocal Tract Length from Formant Frequencies
Lammert, Adam C.; Narayanan, Shrikanth S.
2015-01-01
Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalization of interspeaker variability and facilitate acoustic comparisons across speakers. A framework for considering estimation methods is developed from the basic principles of vocal tract acoustics, and an estimation method is proposed that follows naturally from this framework. The proposed method is evaluated using acoustic characteristics of simulated vocal tracts ranging from 14 to 19 cm in length, as well as real-time magnetic resonance imaging data with synchronous audio from five speakers whose vocal tracts range from 14.5 to 18.0 cm in length. Evaluations show improvements in accuracy over previously proposed methods, with 0.631 and 1.277 cm root mean square error on simulated and human speech data, respectively. Empirical results show that the effectiveness of the proposed method is based on emphasizing higher formant frequencies, which seem less affected by speech articulation. Theoretical predictions of formant sensitivity reinforce this empirical finding. Moreover, theoretical insights are explained regarding the reason for differences in formant sensitivity. PMID:26177102
A neural circuit mechanism for regulating vocal variability during song learning in zebra finches.
Garst-Orozco, Jonathan; Babadi, Baktash; Ölveczky, Bence P
2014-12-15
Motor skill learning is characterized by improved performance and reduced motor variability. The neural mechanisms that couple skill level and variability, however, are not known. The zebra finch, a songbird, presents a unique opportunity to address this question because production of learned song and induction of vocal variability are instantiated in distinct circuits that converge on a motor cortex analogue controlling vocal output. To probe the interplay between learning and variability, we made intracellular recordings from neurons in this area, characterizing how their inputs from the functionally distinct pathways change throughout song development. We found that inputs that drive stereotyped song-patterns are strengthened and pruned, while inputs that induce variability remain unchanged. A simple network model showed that strengthening and pruning of action-specific connections reduces the sensitivity of motor control circuits to variable input and neural 'noise'. This identifies a simple and general mechanism for learning-related regulation of motor variability.
A rare case of a sharp foreign body on the vocal cord.
Nor Hisyam, C I; Misron, K; Mohamad, I
2017-01-01
A foreign body (FB) in the upper aerodigestive tract is a common clinical problem that presents as as acute emergency. Sharp FB, such as fish bone or chicken bone, commonly lodges in the tonsil, base of tongue, vallecula or pyriform fossa. Dislodgement of a FB into the laryngopharynx is very rare and specifically onto the vocal cord is extremely uncommon. This case report illustrates a rare case of a sharp FB that was dislodged into the airway and stuck on to the right vocal cord, which was removed under local anaesthesia.
M, Irfan; Yaroko, Ali Ango; S M, Najeb; Periasamy, Centilnathan
2013-04-01
A massive goiter may constrict the trachea resulting in shortness of breath. Recurrent laryngeal nerve compression may cause vocal cord paralysis. We highlight a case of a 62- year-old female with a 30 year history of an anterior neck swelling gradually increasing in size. She presented with acute symptoms of upper airway obstruction and voice changes. Emergency thyroidectomy was performed by dividing the middle part of the gland using ultrasonic scissors. The recovery was uneventful and the patient regained normal vocal cord function post operatively.
Neural coding of syntactic structure in learned vocalizations in the songbird.
Fujimoto, Hisataka; Hasegawa, Taku; Watanabe, Dai
2011-07-06
Although vocal signals including human languages are composed of a finite number of acoustic elements, complex and diverse vocal patterns can be created from combinations of these elements, linked together by syntactic rules. To enable such syntactic vocal behaviors, neural systems must extract the sequence patterns from auditory information and establish syntactic rules to generate motor commands for vocal organs. However, the neural basis of syntactic processing of learned vocal signals remains largely unknown. Here we report that the basal ganglia projecting premotor neurons (HVC(X) neurons) in Bengalese finches represent syntactic rules that generate variable song sequences. When vocalizing an alternative transition segment between song elements called syllables, sparse burst spikes of HVC(X) neurons code the identity of a specific syllable type or a specific transition direction among the alternative trajectories. When vocalizing a variable repetition sequence of the same syllable, HVC(X) neurons not only signal the initiation and termination of the repetition sequence but also indicate the progress and state-of-completeness of the repetition. These different types of syntactic information are frequently integrated within the activity of single HVC(X) neurons, suggesting that syntactic attributes of the individual neurons are not programmed as a basic cellular subtype in advance but acquired in the course of vocal learning and maturation. Furthermore, some auditory-vocal mirroring type HVC(X) neurons display transition selectivity in the auditory phase, much as they do in the vocal phase, suggesting that these songbirds may extract syntactic rules from auditory experience and apply them to form their own vocal behaviors.
Brondani, Juliana Tabarelli; Luna, Stelio Pacca Loureiro; Padovani, Carlos Roberto
2011-02-01
To refine and test construct validity and reliability of a composite pain scale for use in assessing acute postoperative pain in cats undergoing ovariohysterectomy. 40 cats that underwent ovariohysterectomy in a previous study. In a previous randomized, double-blind, placebo-controlled study, a composite pain scale was developed to assess postoperative pain in cats that received a placebo or an analgesic (tramadol, vedaprofen, or tramadol-vedaprofen combination). In the present study, the scale was refined via item analysis (distribution frequency and occurrence), a nonparametric ANOVA, and item-to-total score correlation. Construct validity was assessed via factor analysis and known-groups discrimination, and reliability was measured by assessing internal consistency. Respiratory rate and respiratory pattern were rejected after item analysis. Factor analysis resulted in 5 dimensions (F1 [psychomotor change], posture, comfort, activity, mental status, and miscellaneous behaviors; F2 [protection of wound area], reaction to palpation of the surgical wound and palpation of the abdomen and flank; F3 [physiologic variables], systolic arterial blood pressure and appetite; F4 [vocal expression of pain], vocalization; and F5 [heart rate]). Internal consistency was excellent for the overall scale and for F1, F2, and F3; very good for F4; and unacceptable for F5. Except for heart rate, the identified factors and scale total score could be used to detect differences between the analgesic and placebo groups and differences among the analgesic treatments. Results provided initial evidence of construct validity and reliability of a multidimensional composite tool for use in assessing acute postoperative pain in cats undergoing ovariohysterectomy.
Vocal fold hemorrhage: factors predicting recurrence.
Lennon, Christen J; Murry, Thomas; Sulica, Lucian
2014-01-01
Vocal fold hemorrhage is an acute phonotraumatic injury treated with voice rest; recurrence is a generally accepted indication for surgical intervention. This study aims to identify factors predictive of recurrence based on outcomes of a large clinical series. Retrospective cohort. Retrospective review of cases of vocal fold hemorrhage presenting to a university laryngology service. Demographic information was compiled. Videostroboscopic exams were evaluated for hemorrhage extent, presence of varix, mucosal lesion, and/or vocal fold paresis. Vocal fold hemorrhage recurrence was the main outcome measure. Follow-up telephone survey was used to complement clinical data. Forty-seven instances of vocal fold hemorrhage were evaluated (25M:22F; 32 professional voice users). Twelve of the 47 (26%) patients experienced recurrence. Only the presence of varix demonstrated significant association with recurrence (P = 0.0089) on multivariate logistic regression. Vocal fold hemorrhage recurred in approximately 26% of patients. Varix was a predictor of recurrence, with 48% of those with varix experiencing recurrence. Monitoring, behavioral management and/or surgical intervention may be indicated to treat patients with such characteristics. © 2013 The American Laryngological, Rhinological and Otological Society, Inc.
Acute dysphonia secondary to vocal fold hemorrhage after vardenafil use.
Singh, Vikas; Cohen, Seth M; Rousseau, Bernard; Noordzij, J Pieter; Garrett, C Gaelyn; Ossoff, Robert H
2010-06-01
Owing to their vasodilatory effects, the phosphodiesterase-5 inhibitors have become widely used for the treatment of erectile dysfunction. Among the reported adverse events of these agents are epistaxis, variceal bleeding, intracranial hemorrhage, and hemorrhoidal bleeding. We report a case of vocal fold hemorrhage that occurred after vardenafil use in a 31-year-old man who was a professional singer.
Bicarbonate availability for vocal fold epithelial defense to acidic challenge.
Durkes, Abigail; Sivasankar, M Preeti
2014-01-01
Bicarbonate is critical for acid-base tissue homeostasis. In this study we investigated the role of bicarbonate ion transport in vocal fold epithelial defense to acid challenges. Acidic insults to the larynx are common in gastric reflux, carcinogenesis and metastasis, and acute inflammation. Ion transport was measured in viable porcine vocal fold epithelium. First, 18 vocal folds were exposed to either the carbonic anhydrase antagonist acetazolamide or to vehicle. Second, 32 vocal folds were exposed to either a control buffer or a bicarbonate-free buffer on their luminal or basolateral surface or both. Third, 32 vocal folds were challenged with acid in the presence of bicarbonate-free or control buffer. The vocal fold transepithelial resistance was greater than 300 Ω*cm(2), suggesting robust barrier integrity. Ion transport did not change after exposure to acetazolamide (p > 0.05). Exposure to bicarbonate-free buffer did not compromise vocal fold ion transport (p > 0.05). Ion transport increased after acid challenge. This increase approached statistical significance and was the greatest for the control buffer and for the bicarbonate-free buffer applied to the basolateral surface. Bicarbonate secretion may contribute to vocal fold defense against acid challenge. Our data offer a potential novel role for bicarbonate as a therapeutic agent to reduce pH abnormalities in the larynx and prevent associated pathological changes.
Carbone, Vincent J; Sweeney-Kerwin, Emily J; Attanasio, Vivian; Kasper, Tamara
2010-01-01
The purpose of this study was to determine the effect of manual sign mand training combined with prompt delay and vocal prompting on the production of vocal responses in nonvocal children with developmental disabilities. A multiple baseline design across participants verified the effectiveness of this intervention. All participants showed increases in vocal responses following the implementation of the independent variables.
Cohen, Alex S; Dinzeo, Thomas J; Donovan, Neila J; Brown, Caitlin E; Morrison, Sean C
2015-03-30
Vocal expression reflects an integral component of communication that varies considerably within individuals across contexts and is disrupted in a range of neurological and psychiatric disorders. There is reason to suspect that variability in vocal expression reflects, in part, the availability of "on-line" resources (e.g., working memory, attention). Thus, understanding vocal expression is a potentially important biometric index of information processing, not only across but within individuals over time. A first step in this line of research involves establishing a link between vocal expression and information processing systems in healthy adults. The present study employed a dual attention experimental task where participants provided natural speech while simultaneously engaged in a baseline, medium or high nonverbal processing-load task. Objective, automated, and computerized analysis was employed to measure vocal expression in 226 adults. Increased processing load resulted in longer pauses, fewer utterances, greater silence overall and less variability in frequency and intensity levels. These results provide compelling evidence of a link between information processing resources and vocal expression, and provide important information for the development of an automated, inexpensive and uninvasive biometric measure of information processing. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Comparison of English Language Rhythm and Kalhori Kurdish Language Rhythm
ERIC Educational Resources Information Center
Taghva, Nafiseh; Zadeh, Vahideh Abolhasani
2016-01-01
Interval-based method is a method of studying the rhythmic quantitative features of languages. This method use Pairwise Variability Index (PVI) to consider the variability of vocalic duration and inter-vocalic duration of sentences which leads to classification of languages rhythm into stress-timed languages and syllable-timed ones. This study…
Benninger, Michael S; Hanick, Andrea; Hicks, Douglas M
2016-01-01
Bilateral vocal fold paralysis most commonly results from iatrogenic trauma to the recurrent laryngeal nerve during surgical procedures in the anterior neck. Patients may require tracheostomy because of acute or gradual onset of dyspnea and airway compromise. The intralaryngeal injection of Botox has been considered as a possible therapy for these airway symptoms of bilateral vocal fold paralysis. Chronic unopposed activity of intact cricothyroid muscles could potentially result in gradual medialization of the vocal folds in patients with bilateral recurrent laryngeal nerve paralysis. This case series describes three patients who successfully underwent injections of botulinum toxin into the bilateral cricothyroid muscles to offer sustained relief of dyspnea resulting from bilateral vocal fold paralysis. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Bicarbonate Availability for Vocal Fold Epithelial Defense to Acidic Challenge
Durkes, Abigail; Sivasankar, M. Preeti
2014-01-01
Objectives Bicarbonate is critical for acid-base tissue homeostasis. In this study we investigated the role of bicarbonate ion transport in vocal fold epithelial defense to acid challenges. Acidic insults to the larynx are common in gastric reflux, carcinogenesis and metastasis, and acute inflammation. Methods Ion transport was measured in viable, porcine vocal fold epithelium. First, 18 vocal folds were exposed to either the carbonic anhydrase antagonist acetazolamide or to vehicle. Second, 32 vocal folds were exposed to either a control buffer or a bicarbonate-free buffer on their luminal or basolateral surface or both. Third, vocal folds were challenged with acid in the presence of bicarbonate-free or control buffer. Results The vocal fold transepithelial resistance was greater than 300 Ω*cm2, suggesting robust barrier integrity. Ion transport did not change after exposure to acetazolamide (p > 0.05). Exposure to bicarbonate-free buffer did not compromise vocal fold ion transport (p > 0.05). Ion transport increased after acid challenge. This increase approached statistical significance and was the greatest for the control buffer and for the bicarbonate-free buffer applied to the basolateral surface. Conclusions Bicarbonate secretion may contribute to vocal fold defense against acid challenge. Our data offer a potential novel role for bicarbonate as a therapeutic agent to reduce pH abnormalities in the larynx and prevent associated pathological changes. PMID:24574427
Genetic and Environmental Effects on Vocal Symptoms and Their Intercorrelations
ERIC Educational Resources Information Center
Nybacka, Ida; Simberg, Susanna; Santtila, Pekka; Sala, Eeva; Sandnabba, N. Kenneth
2012-01-01
Purpose: Recently, Simberg et al. (2009) found genetic effects on a composite variable consisting of 6 vocal symptom items measuring dysphonia. The purpose of the present study was to determine genetic and environmental effects on the individual vocal symptoms in a population-based sample of Finnish twins. Method: The sample comprised 1,728 twins…
Vocalization Rate and Consonant Production in Toddlers at High and Low Risk for Autism
ERIC Educational Resources Information Center
Chenausky, Karen; Nelson, Charles, III.; Tager-Flusberg, Helen
2017-01-01
Background: Previous work has documented lower vocalization rate and consonant acquisition delays in toddlers with autism spectrum disorder (ASD). We investigated differences in these variables at 12, 18, and 24 months in toddlers at high and low risk for ASD. Method: Vocalization rate and number of different consonants were obtained from speech…
Fournet, Michelle E; Szabo, Andy; Mellinger, David K
2015-01-01
On low-latitude breeding grounds, humpback whales produce complex and highly stereotyped songs as well as a range of non-song sounds associated with breeding behaviors. While on their Southeast Alaskan foraging grounds, humpback whales produce a range of previously unclassified non-song vocalizations. This study investigates the vocal repertoire of Southeast Alaskan humpback whales from a sample of 299 non-song vocalizations collected over a 3-month period on foraging grounds in Frederick Sound, Southeast Alaska. Three classification systems were used, including aural spectrogram analysis, statistical cluster analysis, and discriminant function analysis, to describe and classify vocalizations. A hierarchical acoustic structure was identified; vocalizations were classified into 16 individual call types nested within four vocal classes. The combined classification method shows promise for identifying variability in call stereotypy between vocal groupings and is recommended for future classification of broad vocal repertoires.
A meta-analysis of outcomes of hydration intervention on phonation threshold pressure.
Leydon, Ciara; Wroblewski, Marcin; Eichorn, Naomi; Sivasankar, Mahalakshmi
2010-11-01
Vocal fold hydration is purported to promote optimal biomechanical characteristics of vocal fold mucosa, increase efficiency of vocal fold oscillation, and enhance voice quality. The purpose of this work was to determine the magnitude and consistency of the effect of vocal fold hydration on vocal fold function across published clinical studies. We completed a comprehensive meta-analysis of the effects of superficial and systemic vocal fold hydration on phonation threshold pressure (PTP), a measure of efficiency of voice production. We identified 34 studies that examined the effects of hydration on vocal function. Of these studies, 14 examined the effects of hydration on PTP. Nine of these articles met the criteria for inclusion in this analysis. We observed an average effect size of 0.33, indicating that, overall, hydration treatment demonstrated a tendency to reduce PTP. However, this decrease in phonatory effort did not reach significance at the 95% confidence level. The effects of hydration intervention varied considerably across studies (-0.19 to 3.96). We considered that two factors, pitch level of the task and vocal health of participants, may have contributed to this variability in findings. However, our analysis found that these factors could not account for differences in effect size. To understand the variability in outcomes across studies, the role of factors that may impact the effects of hydration, such as the amount, type, and duration of intervention, must be determined. Only then can we obtain data to guide best clinical practice for protecting and rehabilitating vocal function. Copyright © 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Vocal tract length and acoustics of vocalization in the domestic dog (Canis familiaris).
Riede, T; Fitch, T
1999-10-01
The physical nature of the vocal tract results in the production of formants during vocalisation. In some animals (including humans), receivers can derive information (such as body size) about sender characteristics on the basis of formant characteristics. Domestication and selective breeding have resulted in a high variability in head size and shape in the dog (Canis familiaris), suggesting that there might be large differences in the vocal tract length, which could cause formant behaviour to affect interbreed communication. Lateral radiographs were made of dogs from several breeds ranging in size from a Yorkshire terrier (2.5 kg) to a German shepherd (50 kg) and were used to measure vocal tract length. In addition, we recorded an acoustic signal (growling) from some dogs. Significant correlations were found between vocal tract length, body mass and formant dispersion, suggesting that formant dispersion can deliver information about the body size of the vocalizer. Because of the low correlation between vocal tract length and the first formant, we predict a non-uniform vocal tract shape.
ERIC Educational Resources Information Center
Liao, Mei-Ying; Davidson, Jane W.
2016-01-01
The main purpose of the current study was to examine the effects of gesture and movement training for beginning children's choirs with regard to improving intonation. It was a between-subjects design with one independent variable Training Technique (TT). One dependent variable was measured: intonation in the singing of vocal pattern warm-up…
Effects of Systemic Hydration on Vocal Acoustics of 18- to 35-Year-Old Females
ERIC Educational Resources Information Center
Franca, Maria Claudia; Simpson, Kenneth O.
2012-01-01
The influence of body hydration and vocal acoustics was investigated in this study. Effects of two levels of hydration on objective measures of vocal acoustics were explored. In an attempt to reduce variability in the degree of systemic hydration and to induce a state of systemic dehydration, participants were instructed to refrain from ingestion…
Acoustic changes of the voice as signs of vocal fatigue in radio broadcasters: preliminary findings.
Guzmán, Marco; Malebrán, María Celina; Zavala, Paulina; Saldívar, Patricio; Muñoz, Daniel
2013-01-01
Vocal fatigue is one of the most common voice symptoms. It usually refers to the sensation of vocal tiredness after a long period of speaking or singing. The purpose of this study was to compare the acoustic characteristics of the voice before and after a long period of voice use in a group of radio broadcasters. Eight radio broadcasters with normal voices were assessed. We used cepstrum, energy ratio, noise to harmonic ratio and soft phonation index as acoustic variables to assess the possible pre-post vocal loading changes objectively. There were no statistically significant pre-post differences in any of the acoustic parameters. Although cepstrum at high pitch did not show a significant difference, it obtained the greatest difference among the acoustic variables. The acoustic measurements used in the present study might not be sensitive enough or appropriate for detecting vocal changes after a long period of voice use, whether in reading (as reported in previous research) or speaking tasks. Moreover, a longer period of vocal loading would eventually reveal more evident and consistent acoustic voice changes. Copyright © 2012 Elsevier España, S.L. All rights reserved.
Singers' interest and knowledge levels of vocal function and dysfunction: survey findings.
Braun-Janzen, Colleen; Zeine, Lina
2009-07-01
A questionnaire investigating the levels of interest in and knowledge of vocal function and dysfunction was completed by 129 singers. Those with professional singing experience indicated significantly greater interest and higher perceived knowledge levels than amateurs in areas of vocal anatomy and physiology, vocal hygiene, and functional vocal pathologies. Greater interest levels, but not higher perceived knowledge levels were reported by professional singers (PSs) in the area of the role of the speech-language pathologist (SLP) and the voice. Professionals answered significantly more knowledge-based questions correctly than amateurs in all areas except the role of the SLP and the voice. However, findings indicated wide variability in knowledge levels of both groups. Singing teachers (STs) within the group significantly outperformed the remainder of the group in areas of vocal anatomy and physiology, vocal hygiene, and functional vocal pathologies. Scores of the choir directors (CDs) within the group were not significantly superior to the remainder of the group except in the area of functional vocal pathologies. Implications for a preventative approach to vocal health are discussed.
Psychogenic dysphonia: diversity of clinical and vocal manifestations in a case series.
Martins, Regina Helena Garcia; Tavares, Elaine Lara Mendes; Ranalli, Paula Ferreira; Branco, Anete; Pessin, Adriana Bueno Benito
2014-01-01
Psychogenic dysphonia is a functional disorder with variable clinical manifestations. To assess the clinical and vocal characteristics of patients with psychogenic dysphonia in a case series. The study included 28 adult patients with psychogenic dysphonia, evaluated at a University hospital in the last ten years. Assessed variables included gender, age, occupation, vocal symptoms, vocal characteristics, and videolaryngostroboscopic findings. 28 patients (26 women and 2 men) were assessed. Their occupations included: housekeeper (n=17), teacher (n=4), salesclerk (n=4), nurse (n=1), retired (n=1), and psychologist (n=1). Sudden symptom onset was reported by 16 patients and progressive symptom onset was reported by 12; intermittent evolution was reported by 15; symptom duration longer than three months was reported by 21 patients. Videolaryngostroboscopy showed only functional disorders; no patient had structural lesions or changes in vocal fold mobility. Conversion aphonia, skeletal muscle tension, and intermittent voicing were the most frequent vocal emission manifestation forms. In this case series of patients with psychogenic dysphonia, the most frequent form of clinical presentation was conversion aphonia, followed by musculoskeletal tension and intermittent voicing. The clinical and vocal aspects of 28 patients with psychogenic dysphonia, as well as the particularities of each case, are discussed. Copyright © 2014 Associação Brasileira de Otorrinolaringologia e Cirurgia Cérvico-Facial. Published by Elsevier Editora Ltda. All rights reserved.
[Exudative laryngeal diseases of Reinke's space].
Remacle, M; Lagneau, G; Marbaix, E; Doyen, A; Van den Eeckhaut, J
1992-01-01
Exudative laryngopathies are represented by the nodule, the polyp and the Reinke's oedema. Their origin is found in exudative processes located in the Reinke's space. Evolution towards one or another lesion depends on the etiological factor (vocal abuse, tobacco, alcohol, acute vocal fold trauma) and can happen according three modalities: oedema, fibrosis and angiectasies development. Apparition of fibrosis inside the lesion or too important haemorrhage require microsurgical treatment in addition to the medical treatment and the speech therapy.
Vocal warm-up increases phonation threshold pressure in soprano singers at high pitch.
Motel, Tamara; Fisher, Kimberly V; Leydon, Ciara
2003-06-01
Vocal warm-up is thought to optimize singing performance. We compared effects of short-term, submaximal, vocal warm-up exercise with those of vocal rest on the soprano voice (n = 10, ages 19-21 years). Dependent variables were the minimum subglottic air pressure required for vocal fold oscillation to occur (phonation threshold pressure, Pth), and the maximum and minimum phonation fundamental frequency. Warm-up increased Pth for high pitch phonation (p = 0.033), but not for comfortable (p = 0.297) or low (p = 0.087) pitch phonation. No significant difference in the maximum phonation frequency (p = 0.193) or minimum frequency (p = 0.222) was observed. An elevated Pth at controlled high pitch, but an unchanging maximum and minimum frequency production suggests that short-term vocal exercise may increase the viscosity of the vocal fold and thus serve to stabilize the high voice.
The Impact of Vocal Cool-down Exercises: A Subjective Study of Singers' and Listeners' Perceptions.
Ragan, Kari
2016-11-01
Using subjective measures, this study investigated singers' and listeners' perceptions of changes in voice condition after vocal cool-down exercises. A single-subject crossover was designed to evaluate whether there were discernible differences in either singer or listener perceptions from pre (no vocal cool downs) to post (with cool downs) test. Subjective questionnaires were completed throughout the study. Twenty classically trained female singers documented self-ratings and perceptual judgments through the Evaluation of the Ability to Sing Easily survey, the Singing Voice Handicap Index, and Self-Perceptual Questionnaires after a 60-minute voice load. Recordings were made and assessed by four expert listeners. The assessed data from the Singing Voice Handicap Index, the Evaluation of the Ability to Sing Easily, and Daily Perceptual Questionnaires show 68%, 67%, and 74% of singers reported improvement, respectively. However, because of significant variability in the underlying scores, the amount of improvement was not deemed to be statistically significant. Expert listeners correctly identified the cool-down week 46% of the time. Singers strongly perceived positive impact from the cool-down exercises on both their speaking and singing voices. Even though the objective data were statistically insignificant, the singers' subjective data clearly indicates a perceived sense of vocal well-being after utilizing the vocal cool-down protocol. The variability in the daily life of a singer (eg, stress, menses, reflux, vocal load, and vocal hygiene) makes it difficult to objectively quantify the impact of vocal cool downs. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Singers' phonation threshold pressure and ratings of self-perceived effort on vocal tasks.
McHenry, Monica; Evans, Joseph; Powitzky, Eric
2013-05-01
This study was designed to determine if singers' self-ratings of vocal effort could predict phonation threshold pressure (PTP). It was hypothesized that effort ratings on the more complex task of singing "Happy Birthday" would best predict PTP. A multiple regression analysis was performed with PTP as the predicted variable and self-ratings on four phonatory tasks as the predictor variables. Participants were 48 undergraduate and graduate students majoring in vocal performance. They produced /pi/ syllable trains as softly as possible for the measurement of PTP. They then rated their self-perceived vocal effort while softly producing the following: (1) sustained "ah" (comfortable, midrange pitch); (2) "ah" glide (chest to head voice); (3) Staccato "ah" in head voice (not falsetto); and (4) Happy Birthday in head voice (not falsetto). No ratings of vocal effort predicted PTP. The lack of correlation between PTP and ratings of Happy Birthday remained when separately evaluating graduate versus undergraduate students or males versus females. Informal evaluation of repeated ratings over time suggested the potential for effective self-monitoring. Students' ratings of self-perceived vocal effort were poor predictors of PTP. This may be because of the use of "effortless" imagery during singing instruction or consistent positive feedback regarding vocal performance. It is possible that self-rating could become an effective tool to predict vocal health if task elicitation instructions were more precise, and the student and voice teacher worked collaboratively to improve self-evaluation. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Analysis of 3-D Propagation Effects Due to Environmental Variability
2014-09-30
two directions over a region of the Monterey Bay Canyon. The directions coincided with data collected of humpback whale vocalizations that...FY13, various whale vocalizations were recorded on a directional sensor in Monterey Bay that displayed significant bearing ambiguity. The bathymetry...recorder at the time of the whale vocalizations being examined. 4 Figure 2: 3-D MMPE test case results for a constant slope surface over a
Vocal fold self-disruption after phonotrauma on a lead actor: a case presentation.
Behlau, Mara; Oliveira, Gisele; Pontes, Paulo
2009-11-01
It is well known that phonotraumatic events may produce laryngeal inflammation, vocal fold hemorrhage and different types of mass lesions. This study describes a vocal fold self-disruption that occurred on stage to a lead actor in the role of Richard III. The study design is as case presentation. A 43-year-old actor presented with a sudden voice loss that first occurred on stage after a series of presentations. He also had a cold-like condition that had not been treated. His past medical history included an average of ten cigarettes per day for ten years and a 10-year history of gastritis and stomach ulcer. Perceptual, acoustic, and laryngeal analyses were performed following pharmacological and voice therapy. Perceptual and acoustic analyses showed mild deviations whereas laryngeal visual examination revealed a complete right vocal fold detachment from the anterior commissure to the vocal process, with generalized hyperemia. A mild diffuse Reinke's edema was observed on the left vocal fold. Mild discomfort was present only during the first day of the acute period. Modified vocal rest was recommended and a series of vocal exercises were administered. The patient performed again 4 days later, after following a series of behavioral modification techniques that included casting guidelines during the subsequent 15 days. Healing was exceptional and his voice returned to normal. This unique case with an exceptional recovery emphasizes the etiological aspects of scar formation after phonotrauma. Positive contributing factors may include a good vocal technique and adequate training as well as the protective upregulated genes present in Reinke's edema.
Smirnova, Darya S.; Demina, Tatyana S.; Volodina, Elena V.
2016-01-01
The vocal repertoire of captive cheetahs (Acinonyx jubatus) and the specific role of meow vocalizations in communication of this species attract research interest about two dozen years. Here, we expand this research focus for the contextual use of call types, sex differences and individual differences at short and long terms. During 457 trials of acoustic recordings, we collected calls (n = 8120) and data on their contextual use for 13 adult cheetahs (6 males and 7 females) in four Russian zoos. The cheetah vocal repertoire comprised 7 call types produced in 8 behavioural contexts. Context-specific call types (chirr, growl, howl and hiss) were related to courting behaviour (chirr) or to aggressive behaviour (growl, howl and hiss). Other call types (chirp, purr and meow) were not context-specific. The values of acoustic variables differed between call types. The meow was the most often call type. Discriminant function analysis revealed a high potential of meows to encode individual identity and sex at short terms, however, the vocal individuality was unstable over years. We discuss the contextual use and acoustic variables of call types, the ratios of individual and sex differences in calls and the pathways of vocal ontogeny in the cheetah with relevant data on vocalization of other animals. PMID:27362643
Smirnova, Darya S; Volodin, Ilya A; Demina, Tatyana S; Volodina, Elena V
2016-01-01
The vocal repertoire of captive cheetahs (Acinonyx jubatus) and the specific role of meow vocalizations in communication of this species attract research interest about two dozen years. Here, we expand this research focus for the contextual use of call types, sex differences and individual differences at short and long terms. During 457 trials of acoustic recordings, we collected calls (n = 8120) and data on their contextual use for 13 adult cheetahs (6 males and 7 females) in four Russian zoos. The cheetah vocal repertoire comprised 7 call types produced in 8 behavioural contexts. Context-specific call types (chirr, growl, howl and hiss) were related to courting behaviour (chirr) or to aggressive behaviour (growl, howl and hiss). Other call types (chirp, purr and meow) were not context-specific. The values of acoustic variables differed between call types. The meow was the most often call type. Discriminant function analysis revealed a high potential of meows to encode individual identity and sex at short terms, however, the vocal individuality was unstable over years. We discuss the contextual use and acoustic variables of call types, the ratios of individual and sex differences in calls and the pathways of vocal ontogeny in the cheetah with relevant data on vocalization of other animals.
Dissociable Effects on Birdsong of Androgen Signaling in Cortex-Like Brain Regions of Canaries
2017-01-01
The neural basis of how learned vocalizations change during development and in adulthood represents a major challenge facing cognitive neuroscience. This plasticity in the degree to which learned vocalizations can change in both humans and songbirds is linked to the actions of sex steroid hormones during ontogeny but also in adulthood in the context of seasonal changes in birdsong. We investigated the role of steroid hormone signaling in the brain on distinct features of birdsong using adult male canaries (Serinus canaria), which show extensive seasonal vocal plasticity as adults. Specifically, we bilaterally implanted the potent androgen receptor antagonist flutamide in two key brain regions that control birdsong. We show that androgen signaling in the motor cortical-like brain region, the robust nucleus of the arcopallium (RA), controls syllable and trill bandwidth stereotypy, while not significantly affecting higher order features of song such syllable-type usage (i.e., how many times each syllable type is used) or syllable sequences. In contrast, androgen signaling in the premotor cortical-like brain region, HVC (proper name), controls song variability by increasing the variability of syllable-type usage and syllable sequences, while having no effect on syllable or trill bandwidth stereotypy. Other aspects of song, such as the duration of trills and the number of syllables per song, were also differentially affected by androgen signaling in HVC versus RA. These results implicate androgens in regulating distinct features of complex motor output in a precise and nonredundant manner. SIGNIFICANCE STATEMENT Vocal plasticity is linked to the actions of sex steroid hormones, but the precise mechanisms are unclear. We investigated this question in adult male canaries (Serinus canaria), which show extensive vocal plasticity throughout their life. We show that androgens in two cortex-like vocal control brain regions regulate distinct aspects of vocal plasticity. For example, in HVC (proper name), androgens regulate variability in syntax but not phonology, whereas androgens in the robust nucleus of the arcopallium (RA) regulate variability in phonology but not syntax. Temporal aspects of song were also differentially affected by androgen signaling in HVC versus RA. Thus, androgen signaling may reduce vocal plasticity by acting in a nonredundant and precise manner in the brain. PMID:28821656
McCathren, R B; Yoder, P J; Warren, S F
1999-08-01
This study tested the relationship between prelinguistic vocalization and expressive vocabulary 1 year later in young children with mild to moderate developmental delays. Three vocalization variables were tested: rate of all vocalization, rate of vocalizations with consonants, and rate of vocalizations used interactively. The 58 toddlers in the study were 17-34 months old, not sensory impaired, and had Bayley Mental Development Indices (Bayley, 1969; Bayley, 1993) from 35-85. In addition, the children had fewer than 3 words in their expressive vocabularies and during classroom observation each showed at least one instance of intentional prelinguistic communication before testing. Selected sections of the Communication and Symbolic Behavior Scales procedures (CSBS; Wetherby & Prizant, 1993) were administered at the beginning and at the end of the study. The vocal measures were obtained in the initial CSBS session. One measure of expressive vocabulary was obtained in the CSBS session at the end of the study. In addition, expressive vocabulary was measured in a nonstructured play session at the end of the study. We predicted that rate of vocalization, rate of vocalizations with consonants, and rate of vocalizations used interactively would all be positively related to later expressive vocabulary. The results confirmed the predictions.
Effects of long-acting beta adrenergic agonists on vocal fold ion transport.
Sivasankar, Mahalakshmi; Blazer-Yost, Bonnie
2009-03-01
Inhaled medications prescribed for the hypersensitive airway typically combine corticosteroids and long-acting beta2 adrenergic agonists (LABAs). The phonatory side effects of these combination treatments are widely recognized. However, there is limited understanding of the physiological changes induced by these medications that underlie the phonatory side effects. The objective of this study was to investigate the distinct effects of corticosteroids and LABAs on vocal fold mucosal physiology. Understanding the physiological changes to the vocal folds after corticosteroid and LABA treatments is necessary to prevent the prevalent vocal decrement associated with these medications. Experimental in vitro design with treatment and control groups. Native porcine vocal fold mucosae (N = 38) were exposed to corticosteroid or LABA treatments. Ion transport was measured continuously at baseline and after treatment. To quantify the nature of ion transport, vocal folds were also treated with chloride and sodium channel inhibitors. Corticosteroid treatment did not alter ion transport. Conversely, exposure to LABAs significantly increased ion transport. This increase in ion transport was transient, observed immediately after treatment in all tissue and associated with increased chloride secretion. The distinct effects of corticosteroids and LABAs on vocal fold physiology have not been examined to date. This study demonstrates that short-term treatment with LABAs, but not corticosteroids, significantly increases ion transport. These findings suggest that one underlying physiological mechanism for phonatory changes associated with inhaled treatments may be related to acute alterations in vocal fold ion transport and surface hydration.
Doellinger, Michael; Lohscheller, Joerg; McWhorter, Andrew; Kunduk, Melda
2009-03-01
We investigate the potential of high-speed digital imaging technique (HSI) and the phonovibrogram (PVG) analysis in normal vocal fold dynamics by studying the effects of continuous voice use (vocal loading) during the workday. One healthy subject was recorded at sustained phonation 13 times within 2 consecutive days in the morning before and in the afternoon after vocal loading, respectively. Vocal fold dynamics were extracted and visualized by PVGs. The characteristic PVG patterns were extracted representing vocal fold vibration types. The parameter values were then analyzed by statistics regarding vocal load, left-right PVG asymmetries, anterior-posterior PVG asymmetries, and opening-closing differences. For the first time, the direct impact of vocal load could be determined by analyzing vocal fold dynamics. For same vocal loading conditions, equal dynamical behavior of the vocal folds were confirmed. Comparison of recordings performed in the morning with the recordings after work revealed significant changes in vibration behavior, indicating impact of occurring vocal load. Left-right asymmetries in vocal fold dynamics were found confirming earlier assumptions. Different dynamics between opening and closing procedure as well as for anterior and posterior parts were found. Constant voice usage stresses the vocal folds even in healthy subjects and can be detected by applying the PVG technique. Furthermore, left-right PVG asymmetries do occur in healthy voice to a certain extent. HSI in combination with PVG analysis seems to be a promising tool for investigation of vocal fold fatigue and pathologies resulting in small forms of dynamical changes.
Biochemical basis of vocal fold mobilization after microflap surgery in a rabbit model.
Mitchell, Joshua R; Kojima, Tsuyoshi; Wu, Hongmei; Garrett, C Gaelyn; Rousseau, Bernard
2014-02-01
To investigate phonation-related extracellular matrix (ECM) changes in the vocal fold lamina propria after microflap surgery using an in vivo rabbit phonation model. Prospective animal study. Twenty-four New Zealand White rabbits were used in this study. Quantitative polymerase chain reaction and immunohistochemistry were used to investigate alterations in vocal fold ECM proinflammatory and profibrotic gene, and protein expression from a control group of animals receiving a microflap without phonation and a separate group of animals receiving experimentally induced phonation on postmicroflap days 0, 3, and 7. IHC demonstrated the highest concentration of CD45 in vocal folds on postoperative day 0. Staining for CD45 was absent by postoperative day 7, with no differences in CD45 staining between groups. Fibronectin gene expression increased significantly on postoperative day 3 in the control and experimentally induced phonation groups, with maximal staining of fibronectin around the microflap incision on postoperative day 7. No alterations in cyclooxygenase-2, interleukin-1β, and transforming growth factor-β1 gene expression were observed between groups. Results of the present study revealed an acute inflammatory response in the vocal fold at the time of microflap (day 0) and up to 3 days post-microflap. By post-operative day 3, staining of CD45 positive cells decreased, with essentially no evidence of inflammation by post-operative day 7. With the end of the acute inflammatory response occurring around day 3, these data may provide support for mobilizing tissue after inflammation has subsided and the process of active tissue remodeling has ensued (days 3-7). N/A. © 2013 The American Laryngological, Rhinological and Otological Society, Inc.
Spuz, Catherine A; Borszcz, George S
2012-04-01
The amygdala contributes to generation of affective behaviors to threats. The prototypical threat to an individual is exposure to a noxious stimulus and the amygdaloid central nucleus (CeA) receives nociceptive input that is mediated by glutamatergic neurotransmission. The present study evaluated the contribution of glutamate receptors in CeA to generation of the affective response to acute pain in rats. Vocalizations that occur following a brief noxious tail shock (vocalization afterdischarges) are a validated rodent model of pain affect, and were preferentially suppressed by bilateral injection into CeA of the NMDA receptor antagonist D-2-amino-5-phosphonovalerate (AP5, 1 μg, 2 μg, or 4 μg) or the non-NMDA receptor antagonist 6-Cyano-7-nitroquinoxaline-2,3-dione disodium (CNQX, .25 μg, .5 μg, 1 μg, or 2 μg). Vocalizations that occur during tail shock were suppressed to a lesser degree, whereas spinal motor reflexes (tail flick and hind limb movements) were unaffected by injection of AP5 or CNQX into CeA. Unilateral administration of AP5 or CNQX into CeA of either hemisphere also selectively elevated vocalization thresholds. Bilateral administration of AP5 or CNQX produced greater increases in vocalization thresholds than the same doses of antagonists administered unilaterality into either hemisphere indicating synergistic hemispheric interactions. The amygdala contributes to production of emotional responses to environmental threats. Blocking glutamate neurotransmission within the central nucleus of the amygdala suppressed rats' emotional response to acute painful stimulation. Understanding the neurobiology underlying emotional responses to pain will provide insights into new treatments for pain and its associated affective disorders. Copyright © 2012 American Pain Society. Published by Elsevier Inc. All rights reserved.
Coppoolse, Jiska M. S.; Van Kooten, T. G.; Heris, Hossein K.; Mongeau, Luc; Li, Nicole Y. K.; Thibeault, Susan L.; Pitaro, Jacob; Akinpelu, Olubunmi; Daniel, Sam J.
2016-01-01
Purpose The objective of this study was to investigate local injection with a hierarchically microstructured hyaluronic acid–gelatin (HA-Ge) hydrogel for the treatment of acute vocal fold injury using a rat model. Method Vocal fold stripping was performed unilaterally in 108 Sprague-Dawley rats. A volume of 25 ml saline (placebo controls), HA-bulk, or HA-Ge hydrogel was injected into the lamina propria (LP) 5 days after surgery. The vocal folds were harvested at 3, 14, and 28 days after injection and analyzed using hematoxylin and eosin staining and immunohistochemistry staining for macrophages, myofibroblasts, elastin, collagen type I, and collagen type III. Results The macrophage count was statistically significantly lower in the HA-Ge group than in the saline group (p < .05) at Day 28. Results suggested that the HA-Ge injection did not induce inflammatory or rejection response. Myofibroblast counts and elastin were statistically insignificant across treatment groups at all time points. Increased elastin deposition was qualitatively observed in both HA groups from Day 3 to Day 28, and not in the saline group. Significantly more elastin was observed in the HA-bulk group than in the uninjured group at Day 28. Significantly more collagen type I was observed in the HA-bulk and HA-Ge groups than in the saline group (p < .05) at Day 28. The collagen type I concentration in the HA-Ge and saline groups was found to be comparable to that in the uninjured controls at Day 28. The concentration of collagen type III in all treatment groups was similar to that in uninjured controls at Day 28. Conclusion Local HA-Ge and HA-bulk injections for acute injured vocal folds were biocompatible and did not induce adverse response. PMID:24687141
Esch, Barbara E; Carr, James E; Michael, Jack
2005-01-01
Many children with autism do not imitate adult vocalizations, an important skill in learning to talk. Pairing adult vocalizations with preferred stimuli has been shown to increase free-operant vocalizations but effects are temporary; thus, direct reinforcement may be necessary to establish durable vocal behaviors. In Experiment 1, directly reinforced echoic responses did not increase following stimulus-stimulus pairings in three children with autism. Similarly, pairings did not increase free-operant vocalizations in Experiment 2, a replication of Miguel et al. (2002). Experiment 3 demonstrated that shaping increased vowel frequency for one participant. Results suggest that variables are yet to be delineated that influence effectiveness of a stimulus-stimulus pairing procedure on vocalization frequency and acquisition of a verbal operant following such pairings. PMID:22477313
Lopes, Leonardo Wanderley; de Oliveira Florencio, Vanessa; Silva, Priscila Oliveira Costa; da Nóbrega E Ugulino, Ana Celiane; Almeida, Anna Alice
2018-01-04
We aimed to correlate the Vocal Tract Discomfort Scale (VTDS) with the Voice Symptom Scale (VoiSS) for evaluation of patients with dysphonia. In addition, we aimed to compare vocal tract discomfort symptoms in patients with and without self-reported voice problem. This is a descriptive, cross-sectional, and retrospective study. We analyzed 143 women and 62 men with voice disorders, as confirmed by endoscopic larynx examination. All patients completed the VTDS and VoiSS at vocal evaluation. Descriptive statistics and the Spearman correlation test were applied to all variables. The degree of covariance of variables was noted. The Mann-Whitney U test was used to compare the average number of discomfort symptoms among patients with and without self-reported voice problems. A weak to moderate positive correlation was observed between the average number, frequency, and intensity of comfort symptom and the total score, physical domain score, and limitation domain score of the VoiSS. The vocal tract discomfort symptoms and the emotional domain score of the VoiSS were weakly correlated. Patients with self-reported voice problems had a higher number, frequency, and intensity of vocal tract discomfort symptoms. There is correlation between the VTDS and VoiSS scales, with greater references to vocal tract discomfort symptom in patients with self-reported voice problems. Therefore, the discomfort symptoms seem to influence the perception of the impact of a voice problem. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Kim, Keun Ho; Ku, Boncho; Kang, Namsik; Kim, Young-Su; Jang, Jun-Su; Kim, Jong Yeol
2012-01-01
The voice has been used to classify the four constitution types, and to recognize a subject's health condition by extracting meaningful physical quantities, in traditional Korean medicine. In this paper, we propose a method of selecting the reliable variables from various voice features, such as frequency derivative features, frequency band ratios, and intensity, from vowels and a sentence. Further, we suggest a process to extract independent variables by eliminating explanatory variables and reducing their correlation and remove outlying data to enable reliable discriminant analysis. Moreover, the suitable division of data for analysis, according to the gender and age of subjects, is discussed. Finally, the vocal features are applied to a discriminant analysis to classify each constitution type. This method of voice classification can be widely used in the u-Healthcare system of personalized medicine and for improving diagnostic accuracy. PMID:22529874
Willey, Amanda R; Spear, Linda P
2013-04-01
Adolescents drink largely in social situations, likely in an attempt to facilitate social interactions. This study sought to examine alterations in the incentive salience of a social stimulus following repeated stress exposure and acute ethanol administration in adolescent and adult male Sprague-Dawley rats. Subjects were either exposed to 5days of restraint stress, chronic variable stress (CVS), which consisted of a different stressor every day, or non-stressed. On test day, the animals were injected with 0, 0.25, 0.5, or 0.75g/kg ethanol and placed in a social approach test in which they could see, hear, and smell a social conspecific, but could not physically interact with it. All the animals showed an interest in the social stimulus, with adolescents engaging in more social investigation than adults. Restraint stressed adults showed ethanol-induced increases in social investigation, while ethanol effects were not seen in any other group. An ethanol-associated increase in 50kHz ultrasonic vocalization (USV) production was only evident in restraint stressed adolescents following 0.75g/kg ethanol. 50kHz USVs were not correlated with time spent investigating the social stimulus in any test condition. These results show that age differences in the facilitatory effects of ethanol on incentive salience of social stimuli are moderated by stress, with the facilitation of social approach by ethanol only evident in restraint stressed adults. Copyright © 2013 Elsevier Inc. All rights reserved.
The Role of Lexical Stress on the Use of Vocal Fry in Young Adult Female Speakers.
Gibson, Todd A
2017-01-01
Vocal fry is a voice register often used by young adult women for sociolinguistic purposes. Some acoustic correlates of lexical stress, however, appear incompatible with the use of vocal fry. The objective of this study was to systematically examine the role of lexical stress in the use of vocal fry by young adult women. This is a semi-randomized controlled laboratory study. Fifty female undergraduate students were recorded repeating one-, two-, three-, and four-syllable nonwords that conformed to English phonotactics. Nonwords were presented in order from shorter to longer lengths, with stimuli randomized within syllable length. Perceptual analyses of recordings were augmented by acoustic analyses to identify each syllable in which vocal fry occurred. Eighty-six percent of participants produced at least one episode of vocal fry. Vocal fry was more likely to occur in unstressed than stressed position, and the likelihood increased as distance from the stressed syllable increased. There was considerable variability in the use of vocal fry. Frequent and infrequent users varied on the degree to which they used vocal fry in single-syllable nonwords. Vocal fry use persists among young adult women even in the absence of syntactic and pragmatic influences. Lexical stress appeared to dramatically reduce the use of vocal fry. Patterns of vocal fry use appeared to be different for frequent and infrequent users of this vocal register. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Hayase, Shin; Wada, Kazuhiro
2018-06-23
Learned vocalization, including birdsong and human speech, is acquired through self-motivated vocal practice during the sensitive period of vocal learning. The zebra finch (Taeniopygia guttata) develops a song characterized by vocal variability and crystalizes a defined song pattern as adulthood. However, it remains unknown how vocal variability is regulated with diurnal singing during the sensorimotor learning period. Here, we investigated the expression of activity-dependent neuroplasticity-related gene Arc during the early plastic song phase to examine its potential association with vocal plasticity. We first confirmed that multiple acoustic features of syllables in the plastic song were dramatically and simultaneously modulated during the first 3 hours of singing in a day and the altered features were maintained until sleep. Concurrently, Arc was intensely induced during morning singing and a subsequent attenuation during afternoon singing in the robust nucleus of the arcopallium (RA) and the interfacial nucleus of the nidopallium (NIf). The singing-driven Arc expression was not altered by circadian rhythm, but rather reduced during the day as juveniles produced more songs. Song stabilization accelerated by testosterone administration in juveniles was accompanied with attenuation of Arc induction in RA and NIf. In contrast, although early-deafened birds produced highly unstable song even at adulthood, singing-driven Arc expression was not different between intact and early-deafened adults. These results suggest a potential functional link between Arc expression in RA and NIf and vocal plasticity during the sensorimotor phase of song learning. Nonetheless, Arc expression did not reflect the quality of bird's own song or auditory feedback. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Lien, Yu-An S; Michener, Carolyn M; Eadie, Tanya L; Stepp, Cara E
2015-06-01
The acoustic measure relative fundamental frequency (RFF) was investigated as a potential objective measure to track variations in vocal effort within and across individuals. Twelve speakers with healthy voices created purposeful modulations in their vocal effort during speech tasks. RFF and an aerodynamic measure of vocal effort, the ratio of sound pressure level to subglottal pressure level, were estimated from the aerodynamic and acoustic signals. Twelve listeners also judged the speech samples for vocal effort using the visual sort and rate method. Relationships between RFF and both the aerodynamic and perceptual measures of vocal effort were weak across speakers (R2 = .06-.26). Within speakers, relationships were variable but much stronger on average (R2 = .45-.56). RFF showed stronger relationships between both the aerodynamic and perceptual measures of vocal effort when examined within individuals versus across individuals. Future work is necessary to establish these relationships in individuals with voice disorders across the therapeutic process.
Michener, Carolyn M.; Eadie, Tanya L.; Stepp, Cara E.
2015-01-01
Purpose The acoustic measure relative fundamental frequency (RFF) was investigated as a potential objective measure to track variations in vocal effort within and across individuals. Method Twelve speakers with healthy voices created purposeful modulations in their vocal effort during speech tasks. RFF and an aerodynamic measure of vocal effort, the ratio of sound pressure level to subglottal pressure level, were estimated from the aerodynamic and acoustic signals. Twelve listeners also judged the speech samples for vocal effort using the visual sort and rate method. Results Relationships between RFF and both the aerodynamic and perceptual measures of vocal effort were weak across speakers (R2 = .06–.26). Within speakers, relationships were variable but much stronger on average (R2 = .45–.56). Conclusions RFF showed stronger relationships between both the aerodynamic and perceptual measures of vocal effort when examined within individuals versus across individuals. Future work is necessary to establish these relationships in individuals with voice disorders across the therapeutic process. PMID:25675090
Working Conditions and Workplace Barriers to Vocal Health in Primary School Teachers.
Munier, Caitriona; Farrell, Rory
2016-01-01
The purpose of this study was to identify the working conditions and workplace barriers to vocal health in primary school teachers. The relationship between working conditions and voice is analyzed. This is a survey study in 42 randomized schools from a restricted geographical area. An 85-item questionnaire was administered to 550 primary school teachers in 42 schools in Dublin. It was designed to obtain information on demographics, vocal use patterns, vocal health, work organization, working conditions, and teacher's perceptions of the conditions in teaching that might cause a voice problem. The relationship between voice and overstretched work demands, and voice and class size, was examined. A chi-squared test was run to test the null hypothesis that the variables overstretched work demands and voice and class size and voice are independent. Subjects were given the opportunity to give their opinion on their working conditions and on the availability of advice and support within the workplace. A final question sought their opinion on what should be included in a voice care program. A 55% response rate was obtained (n = 304). It was found with 96.52% confidence that the variables overstretched work demands and voice are related. Likewise, it was found that the variables class size and voice are related with 99.97% confidence. There are workplace barriers to vocal health. The working conditions of primary school teachers need to be fully adapted to promote vocal health. Changes by education and health policy makers are needed to achieve this goal. There is a need for future research which focuses on the working conditions of teachers. Copyright © 2016. Published by Elsevier Inc.
Self-masking: Listening during vocalization. Normal hearing.
Borg, Erik; Bergkvist, Christina; Gustafsson, Dan
2009-06-01
What underlying mechanisms are involved in the ability to talk and listen simultaneously and what role does self-masking play under conditions of hearing impairment? The purpose of the present series of studies is to describe a technique for assessment of masked thresholds during vocalization, to describe normative data for males and females, and to focus on hearing impairment. The masking effect of vocalized [a:] on narrow-band noise pulses (250-8000 Hz) was studied using the maximum vocalization method. An amplitude-modulated series of sound pulses, which sounded like a steam engine, was masked until the criterion of halving the perceived pulse rate was reached. For masking of continuous reading, a just-follow-conversation criterion was applied. Intra-session test-retest reproducibility and inter-session variability were calculated. The results showed that female voices were more efficient in masking high frequency noise bursts than male voices and more efficient in masking both a male and a female test reading. The male had to vocalize 4 dBA louder than the female to produce the same masking effect on the test reading. It is concluded that the method is relatively simple to apply and has small intra-session and fair inter-session variability. Interesting gender differences were observed.
NASA Astrophysics Data System (ADS)
Irino, Toshio; Patterson, Roy
2005-04-01
We hear vowels produced by men, women, and children as approximately the same although there is considerable variability in glottal pulse rate and vocal tract length. At the same time, we can identify the speaker group. Recent experiments show that it is possible to identify vowels even when the glottal pulse rate and vocal tract length are condensed or expanded beyond the range of natural vocalization. This suggests that the auditory system has an automatic process to segregate information about shape and size of the vocal tract. Recently we proposed that the auditory system uses some form of Stabilized, Wavelet-Mellin Transform (SWMT) to analyze scale information in bio-acoustic sounds as a general framework for auditory processing from cochlea to cortex. This talk explains the theoretical background of the model and how the vocal information is normalized in the representation. [Work supported by GASR(B)(2) No. 15300061, JSPS.
Trygonis, Vasilis; Gerstein, Edmund; Moir, Jim; McCulloch, Stephen
2013-12-01
Passive acoustic surveys were conducted to assess the vocal behavior of North Atlantic right whales (Eubalaena glacialis) in the designated critical calving habitat along the shallow coastal waters of southeastern United States. Underwater vocalizations were recorded using autonomous buoys deployed in close proximity to surface active groups (SAGs). Nine main vocalization types were identified with manual inspection of spectrograms, and standard acoustic descriptors were extracted. Classification trees were used to examine the distinguishing characteristics of calls and quantify their variability within the SAG vocal repertoire. The results show that descriptors of frequency, bandwidth, and spectral disorder are the most important parameters for partitioning the SAG repertoire, contrary to duration-related measures. The reported source levels and vocalization statistics provide sound production data vital to inform regional passive acoustic monitoring and conservation for this endangered species.
Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques.
Fitch, W T
1997-08-01
Body weight, length, and vocal tract length were measured for 23 rhesus macaques (Macaca mulatta) of various sizes using radiographs and computer graphic techniques. linear predictive coding analysis of tape-recorded threat vocalizations were used to determine vocal tract resonance frequencies ("formants") for the same animals. A new acoustic variable is proposed, "formant dispersion," which should theoretically depend upon vocal tract length. Formant dispersion is the averaged difference between successive formant frequencies, and was found to be closely tied to both vocal tract length and body size. Despite the common claim that voice fundamental frequency (F0) provides an acoustic indication of body size, repeated investigations have failed to support such a relationship in many vertebrate species including humans. Formant dispersion, unlike voice pitch, is proposed to be a reliable predictor of body size in macaques, and probably many other species.
Analysis of human scream and its impact on text-independent speaker verification.
Hansen, John H L; Nandwana, Mahesh Kumar; Shokouhi, Navid
2017-04-01
Scream is defined as sustained, high-energy vocalizations that lack phonological structure. Lack of phonological structure is how scream is identified from other forms of loud vocalization, such as "yell." This study investigates the acoustic aspects of screams and addresses those that are known to prevent standard speaker identification systems from recognizing the identity of screaming speakers. It is well established that speaker variability due to changes in vocal effort and Lombard effect contribute to degraded performance in automatic speech systems (i.e., speech recognition, speaker identification, diarization, etc.). However, previous research in the general area of speaker variability has concentrated on human speech production, whereas less is known about non-speech vocalizations. The UT-NonSpeech corpus is developed here to investigate speaker verification from scream samples. This study considers a detailed analysis in terms of fundamental frequency, spectral peak shift, frame energy distribution, and spectral tilt. It is shown that traditional speaker recognition based on the Gaussian mixture models-universal background model framework is unreliable when evaluated with screams.
Repairing the vibratory vocal fold.
Long, Jennifer L
2018-01-01
A vibratory vocal fold replacement would introduce a new treatment paradigm for structural vocal fold diseases such as scarring and lamina propria loss. This work implants a tissue-engineered replacement for vocal fold lamina propria and epithelium in rabbits and compares histology and function to injured controls and orthotopic transplants. Hypotheses were that the cell-based implant would engraft and control the wound response, reducing fibrosis and restoring vibration. Translational research. Rabbit adipose-derived mesenchymal stem cells (ASC) were embedded within a three-dimensional fibrin gel, forming the cell-based outer vocal fold replacement (COVR). Sixteen rabbits underwent unilateral resection of vocal fold epithelium and lamina propria, as well as reconstruction with one of three treatments: fibrin glue alone with healing by secondary intention, replantation of autologous resected vocal fold cover, or COVR implantation. After 4 weeks, larynges were examined histologically and with phonation. Fifteen rabbits survived. All tissues incorporated well after implantation. After 1 month, both graft types improved histology and vibration relative to injured controls. Extracellular matrix (ECM) of the replanted mucosa was disrupted, and ECM of the COVR implants remained immature. Immune reaction was evident when male cells were implanted into female rabbits. Best histologic and short-term vibratory outcomes were achieved with COVR implants containing male cells implanted into male rabbits. Vocal fold cover replacement with a stem cell-based tissue-engineered construct is feasible and beneficial in acute rabbit implantation. Wound-modifying behavior of the COVR implant is judged to be an important factor in preventing fibrosis. NA. Laryngoscope, 128:153-159, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
Inpatient injection laryngoplasty for vocal fold immobility: When is it really necessary?
Zuniga, Steven; Ebersole, Barbara; Jamal, Nausheen
To compare pulmonary and swallow outcomes of injection laryngoplasty when performed in the acute versus subacute setting in head & neck and thoracic cancer patients presenting with new onset unilateral vocal fold immobility. Case series with chart review at an academic cancer center over a 2year period. Based on swallow evaluation, patients diagnosed with vocal fold immobility were grouped into an unsafe swallow group, injected as inpatients, and a safe swallow group, for whom injection laryngoplasty was delayed to the outpatient setting or not performed. Rates of pneumonia, diet recommendations, and swallow outcomes were compared between groups. 24 patients with new-onset vocal fold immobility were evaluated. 7 underwent injection in the inpatient setting, 12 in the outpatient setting, and 5 did not undergo injection. There was no perceived difference in speech and swallow outcomes between the inpatient and outpatient injection groups. Injection laryngoplasty shows promise as an effective intervention for reducing aspiration risk and improving diet normalcy in patients with dysphagia as a result of unilateral vocal fold immobility. In patients determined to have a safe swallow, delay of injection laryngoplasty is not detrimental to swallow outcomes. Copyright © 2017 Elsevier Inc. All rights reserved.
Park, Hee-Sun; Kim, Ha-Jung; Ro, Young-Jin; Yang, Hong-Seuk; Koh, Won-Uk
2017-01-01
Abstract Rationale: Recurrent laryngeal nerve block is an uncommon complication that can occur after an interscalene brachial plexus block (ISB), which may lead to vocal cord palsy or paresis. However, if the recurrent laryngeal nerve is blocked in patients with a preexisting contralateral vocal cord palsy following neck surgery, this may lead to devastating acute respiratory failure. Thus, ISB is contraindicated in patients with contralateral vocal cord lesion. To the best of our knowledge, there are no reports of bilateral vocal cord paresis, which occurred after a continuous ISB and endotracheal intubation in a patient with no history of vocal cord injury or surgery of the neck. Patient concerns: A 59 year old woman was planned for open acromioplasty and rotator cuff repair under general anesthesia. General anesthesia was induced following an ISB using 0.2% ropivacaine and catheter insertion for postoperative pain control. Diagnoses: While recovering in the postanesthesia care unit (PACU), however, the patient complained of a sore throat and hoarseness without respiratory insufficiency. On the morning of the first postoperative day, she still complained of mild dyspnea, dysphonia, and slight aspiration. She was subsequently diagnosed with bilateral vocal cord paresis following an endoscopic laryngoscopy examination. Interventions: The continuous ISB catheter was immediately removed and the dyspnea and hoarseness symptoms improved, although mild aspiration during drinking water was still present. Outcomes: On the 4th postoperative day, a laryngoscopy examination revealed that the right vocal cord movement had returned to normal but that the left vocal cord paresis still remained. Lessons: When ISB is planned, a detailed history-taking and examination of the airway are essential for patient safety and we recommend that any local anesthetics be carefully injected under ultrasound guidance. We also recommend the use of low concentration of local anesthetics to avoid possible paralysis of the vocal cord. PMID:28403100
Murugan, Malavika; Harward, Stephen; Scharff, Constance; Mooney, Richard
2013-12-18
Mutations of the FOXP2 gene impair speech and language development in humans and shRNA-mediated suppression of the avian ortholog FoxP2 disrupts song learning in juvenile zebra finches. How diminished FoxP2 levels affect vocal control and alter the function of neural circuits important to learned vocalizations remains unclear. Here we show that FoxP2 knockdown in the songbird striatum disrupts developmental and social modulation of song variability. Recordings in anesthetized birds show that FoxP2 knockdown interferes with D1R-dependent modulation of activity propagation in a corticostriatal pathway important to song variability, an effect that may be partly attributable to reduced D1R and DARPP-32 protein levels. Furthermore, recordings in singing birds reveal that FoxP2 knockdown prevents social modulation of singing-related activity in this pathway. These findings show that reduced FoxP2 levels interfere with the dopaminergic modulation of vocal variability, which may impede song and speech development by disrupting reinforcement learning mechanisms. Copyright © 2013 Elsevier Inc. All rights reserved.
Murugan, Malavika; Harward, Stephen; Scharff, Constance; Mooney, Richard
2013-01-01
Summary Mutations of the FOXP2 gene impair speech and language development in humans and shRNA-mediated suppression of the avian orthologue FoxP2 disrupts song learning in juvenile zebra finches. How diminished FoxP2 levels affect vocal control and alter the function of neural circuits important to learned vocalizations remains unclear. Here we show that FoxP2 knockdown in the songbird striatum disrupts developmental and social modulation of song variability. Recordings in anaesthetized birds show that FoxP2 knockdown interferes with D1R-dependent modulation of activity propagation in a corticostriatal pathway important to song variability, an effect that may be partly attributable to reduced D1R and DARPP-32 protein levels. Furthermore, recordings in singing birds reveal that FoxP2 knockdown prevents social modulation of singing-related activity in this pathway. These findings show that reduced FoxP2 levels interfere with the dopaminergic modulation of vocal variability, which may impede song and speech development by disrupting reinforcement learning mechanisms. PMID:24268418
The prevalence and factors associate with vocal nodules in general population
Won, Seong Jun; Kim, Rock Bum; Kim, Jin Pyeong; Park, Jung Je; Kwon, Min Su; Woo, Seung Hoon
2016-01-01
Abstract The purpose of this study was to analyze the prevalence of vocal nodules and to identify factors related with an increased risk for vocal nodules. This study was conducted using data from the Korean National Health and Nutrition Examination Survey 2008 to 2011. The subjects consisted of 19,636 men and women aged ≥19 years. Related factors such as age, marital status, incomes, and education level were assessed in individual interviews, and health-related behaviors including smoking, alcohol, and activity were assessed with self-administered questionnaires. Also, examination survey such as laryngoscopy examination, basic physical examination, and blood sampling was conducted. The prevalence of vocal nodules was 1.31% (n = 258). Among variable factors, age, education level, and voice disorder were related with the presence of vocal nodules (P < 0.05). Other factors including sex, alcohol, smoking, physical activities, hypertension, obesity, waist circumference and metabolic syndrome, hypercholesterolemia, serum calcium, and vitamin D did not show any meaningful relationship with the presence of vocal nodules. This result may help reduce the incidence of vocal nodules and offer proper management for patients with vocal nodules, and may also facilitate efficient allocation of public health resources. PMID:27684845
DOE Office of Scientific and Technical Information (OSTI.GOV)
Al-Mamgani, Abrahim, E-mail: a.almamgani@nki.nl; Kwa, Stefan L.S.; Tans, Lisa
2015-10-01
Purpose: To report, from a retrospective analysis of prospectively collected data, on the feasibility, outcome, toxicity, and voice-handicap index (VHI) of patients with T1a glottic cancer treated by a novel intensity modulated radiation therapy technique developed at our institution to treat only the involved vocal cord: single vocal cord irradiation (SVCI). Methods and Materials: Thirty patients with T1a glottic cancer were treated by means of SVCI. Dose prescription was set to 16 × 3.63 Gy (total dose 58.08 Gy). The clinical target volume was the entire vocal cord. Setup verification was done by means of an online correction protocol using cone beam computed tomography. Datamore » for voice quality assessment were collected prospectively at baseline, end of treatment, and 4, 6, and 12 weeks and 6, 12, and 18 months after treatment using VHI questionnaires. Results: After a median follow-up of 30 months (range, 7-50 months), the 2-year local control and overall survival rates were 100% and 90% because no single local recurrence was reported and 3 patients died because of comorbidity. All patients have completed the intended treatment schedule; no treatment interruptions and no grade 3 acute toxicity were reported. Grade 2 acute dermatitis or dysphagia was reported in only 5 patients (17%). No serious late toxicity was reported; only 1 patient developed temporary grade 2 laryngeal edema, and responded to a short-course of corticosteroid. The VHI improved significantly, from 33.5 at baseline to 9.5 and 10 at 6 weeks and 18 months, respectively (P<.001). The control group, treated to the whole larynx, had comparable local control rates (92.2% vs 100%, P=.24) but more acute toxicity (66% vs 17%, P<.0001) and higher VHI scores (23.8 and 16.7 at 6 weeks and 18 months, respectively, P<.0001). Conclusion: Single vocal cord irradiation is feasible and resulted in maximal local control rate at 2 years. The deterioration in VHI scores was slight and temporary and subsequently improved to normal levels. Long-term follow-up is needed to consolidate these promising results.« less
Leblanc, Linda A; Geiger, Kaneen B; Sautter, Rachael A; Sidener, Tina M
2007-01-01
The Natural Language Paradigm (NLP) has proven effective in increasing spontaneous verbalizations for children with autism. This study investigated the use of NLP with older adults with cognitive impairments served at a leisure-based adult day program for seniors. Three individuals with limited spontaneous use of functional language participated in a multiple baseline design across participants. Data were collected on appropriate and inappropriate vocalizations with appropriate vocalizations coded as prompted or unprompted during baseline and treatment sessions. All participants experienced increases in appropriate speech during NLP with variable response patterns. Additionally, the two participants with substantial inappropriate vocalizations showed decreases in inappropriate speech. Implications for intervention in day programs are discussed.
Guzman, Marco; Ortega, Andres; Olavarria, Christian; Muñoz, Daniel; Cortés, Pedro; Azocar, Maria Josefina; Cayuleo, David; Quintana, Felipe; Silva, Catalina
2016-11-01
The present study aimed to assess supraglottic activity in theater actors and to observe whether they present differences compared with subjects with no voice training. Acoustic and perceptual analyses were also performed. A total of 20 participants were divided into two groups: an experimental group of trained theater actors, and a comparative group of subjects with no voice training. Absence of laryngeal pathology was confirmed by rigid videostroboscopy. Flexible laryngoscopy was performed to assess supraglottic activity during speaking phonatory tasks. Voice recording was also carried out. Four blinded judges were asked to assess laryngoscopic and perceptual variables using a visual analog scale. A comparison between groups, phonatory tasks, and loudness levels was performed. Multivariate linear regression showed that trained participants had a higher degree of both laryngeal and pharyngeal activities compared with untrained participants. Moreover, phonatory tasks at high intensity showed higher activity than those at medium and low intensities for most phonatory tasks and laryngoscopic parameters. Vocally trained participants evidenced higher values for all spectral variables compared with untrained participants. Actors have a greater degree of both laryngeal and pharyngeal activities than vocally untrained subjects. Apparently, this higher activity is associated to speaking voice training and not to a hyperfunctional vocal behavior. Anterior-posterior laryngeal compression is greater than medial compression. Intensity and phonatory tasks have an effect on all laryngoscopic variables. Supraglottic activity during professional speaking voice may be not necessarily a hyperfunctional behavior, but a strategy to avoid vocal fold damage while producing the desired voice quality. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
2011-01-01
Vocal production requires complex planning and coordination of respiratory, laryngeal, and vocal tract movements, which are incompletely understood in most mammals. Rats produce a variety of whistles in the ultrasonic range that are of communicative relevance and of importance as a model system, but the sources of acoustic variability were mostly unknown. The goal was to identify sources of fundamental frequency variability. Subglottal pressure, tracheal airflow, and electromyographic (EMG) data from two intrinsic laryngeal muscles were measured during 22-kHz and 50-kHz call production in awake, spontaneously behaving adult male rats. During ultrasound vocalization, subglottal pressure ranged between 0.8 and 1.9 kPa. Pressure differences between call types were not significant. The relation between fundamental frequency and subglottal pressure within call types was inconsistent. Experimental manipulations of subglottal pressure had only small effects on fundamental frequency. Tracheal airflow patterns were also inconsistently associated with frequency. Pressure and flow seem to play a small role in regulation of fundamental frequency. Muscle activity, however, is precisely regulated and very sensitive to alterations, presumably because of effects on resonance properties in the vocal tract. EMG activity of cricothyroid and thyroarytenoid muscle was tonic in calls with slow or no fundamental frequency modulations, like 22-kHz and flat 50-kHz calls. Both muscles showed brief high-amplitude, alternating bursts at rates up to 150 Hz during production of frequency-modulated 50-kHz calls. A differentiated and fine regulation of intrinsic laryngeal muscles is critical for normal ultrasound vocalization. Many features of the laryngeal muscle activation pattern during ultrasound vocalization in rats are shared with other mammals. PMID:21832032
Voice rest after vocal fold surgery: current practice and evidence.
Coombs, A C; Carswell, A J; Tierney, P A
2013-08-01
Voice rest is commonly recommended after vocal fold surgery, but there is a lack of evidence base and no standard protocol. The aim of this study was to establish common practice regarding voice rest following vocal fold surgery. An online survey was circulated via e-mail invitation to members of the ENT UK Expert Panel between October and November 2011. The survey revealed that 86.5 per cent of respondents agreed that 'complete voice rest' means no sound production at all, but there was variability in how 'relative voice rest' was defined. There was no dominant type of voice rest routinely recommended after surgery for laryngeal papillomatosis or intermediate pathologies. There was considerable variability in the duration of voice rest recommended, with no statistically significant, most popular response (except for malignant lesions). Surgeons with less than 10 years of experience were more likely to recommend fewer days of voice rest. There is a lack of consistency in advice given to patients after vocal fold surgery, in terms of both type and length of voice rest. This may arise from an absence of robust evidence on which to base practice.
Fernández-Vargas, Marcela; Johnston, Robert E
2015-01-01
Vocal signaling is one of many behaviors that animals perform during social interactions. Vocalizations produced by both sexes before mating can communicate sex, identity and condition of the caller. Adult golden hamsters produce ultrasonic vocalizations (USV) after intersexual contact. To determine whether these vocalizations are sexually dimorphic, we analyzed the vocal repertoire for sex differences in: 1) calling rates, 2) composition (structural complexity, call types and nonlinear phenomena) and 3) acoustic structure. In addition, we examined it for individual variation in the calls. The vocal repertoire was mainly composed of 1-note simple calls and at least half of them presented some degree of deterministic chaos. The prevalence of this nonlinear phenomenon was confirmed by low values of harmonic-to-noise ratio for most calls. We found modest sexual differences between repertoires. Males were more likely than females to produce tonal and less chaotic calls, as well as call types with frequency jumps. Multivariate analysis of the acoustic features of 1-note simple calls revealed significant sex differences in the second axis represented mostly by entropy and bandwidth parameters. Male calls showed lower entropy and inter-quartile bandwidth than female calls. Because the variation of acoustic structure within individuals was higher than among individuals, USV could not be reliably assigned to the correct individual. Interestingly, however, this high variability, augmented by the prevalence of chaos and frequency jumps, could be the result of increased vocal effort. Hamsters motivated to produce high calling rates also produced longer calls of broader bandwidth. Thus, the sex differences found could be the result of different sex preferences but also of a sex difference in calling motivation or condition. We suggest that variable and complex USV may have been selected to increase responsiveness of a potential mate by communicating sexual arousal and preventing habituation to the caller.
Laryngeal Electromyography for Prognosis of Vocal Fold Paralysis.
Pardo-Maza, Adriana; García-Lopez, Isabel; Santiago-Pérez, Susana; Gavilán, Javier
2017-01-01
This study aimed to determine the value of laryngeal electromyography in the prognosis of vocal fold paralysis. This is a retrospective descriptive study. This study included 80 patients diagnosed with unilateral or bilateral vocal fold paralysis on flexible laryngoscopy between 2002 and 2014 in a tertiary medical center. Laryngeal electromyography using a standardized protocol was performed; the outcome measures were classified and analyzed into two groups according to the degree of injury. Group 1 included patients with mild to moderate injury, and group 2 included patients with severe to complete injury. Prognosis was correlated with vocal fold motion recovery status with a minimum of 6 months of follow-up since the symptoms onset using positive and negative predictive values. Sixty patients showed acute or chronic recurrent laryngeal neuropathy in laryngeal electromyography. Twelve of 41 patients included in group 1 recovered motion, and 30 of 35 patients included in group 2 did not recover, resulting in 88.2% of positive predictive value and 35.7% of negative predictive value. Our data confirm that laryngeal electromyography is a useful clinical tool in predicting poor recovery in patients with vocal fold paralysis. It allows identification of candidates for early intervention. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Exploring the anatomical encoding of voice with a mathematical model of the vocal system.
Assaneo, M Florencia; Sitt, Jacobo; Varoquaux, Gael; Sigman, Mariano; Cohen, Laurent; Trevisan, Marcos A
2016-11-01
The faculty of language depends on the interplay between the production and perception of speech sounds. A relevant open question is whether the dimensions that organize voice perception in the brain are acoustical or depend on properties of the vocal system that produced it. One of the main empirical difficulties in answering this question is to generate sounds that vary along a continuum according to the anatomical properties the vocal apparatus that produced them. Here we use a mathematical model that offers the unique possibility of synthesizing vocal sounds by controlling a small set of anatomically based parameters. In a first stage the quality of the synthetic voice was evaluated. Using specific time traces for sub-glottal pressure and tension of the vocal folds, the synthetic voices generated perceptual responses, which are indistinguishable from those of real speech. The synthesizer was then used to investigate how the auditory cortex responds to the perception of voice depending on the anatomy of the vocal apparatus. Our fMRI results show that sounds are perceived as human vocalizations when produced by a vocal system that follows a simple relationship between the size of the vocal folds and the vocal tract. We found that these anatomical parameters encode the perceptual vocal identity (male, female, child) and show that the brain areas that respond to human speech also encode vocal identity. On the basis of these results, we propose that this low-dimensional model of the vocal system is capable of generating realistic voices and represents a novel tool to explore the voice perception with a precise control of the anatomical variables that generate speech. Furthermore, the model provides an explanation of how auditory cortices encode voices in terms of the anatomical parameters of the vocal system. Copyright © 2016 Elsevier Inc. All rights reserved.
Exploring vocal recovery after cranial nerve injury in Bengalese finches.
Urbano, Catherine M; Peterson, Jennifer R; Cooper, Brenton G
2013-02-08
Songbirds and humans use auditory feedback to acquire and maintain their vocalizations. The Bengalese finch (Lonchura striata domestica) is a songbird species that rapidly modifies its vocal output to adhere to an internal song memory. In this species, the left side of the bipartite vocal organ is specialized for producing louder, higher frequencies (≥2.2kHz) and denervation of the left vocal muscles eliminates these notes. Thus, the return of higher frequency notes after cranial nerve injury can be used as a measure of vocal recovery. Either the left or right side of the syrinx was denervated by resection of the tracheosyringeal portion of the hypoglossal nerve. Histologic analyses of syringeal muscle tissue showed significant muscle atrophy in the denervated side. After left nerve resection, songs were mainly composed of lower frequency syllables, but three out of five birds recovered higher frequency syllables. Right nerve resection minimally affected phonology, but it did change song syntax; syllable sequence became abnormally stereotyped after right nerve resection. Therefore, damage to the neuromuscular control of sound production resulted in reduced motor variability, and Bengalese finches are a potential model for functional vocal recovery following cranial nerve injury. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
ERIC Educational Resources Information Center
Vorperian, Houri K.; Wang, Shubing; Schimek, E. Michael; Durtschi, Reid B.; Kent, Ray D.; Gentry, Lindell R.; Chung, Moo K.
2011-01-01
Purpose: The anatomic origin for prepubertal vowel acoustic differences between male and female subjects remains unknown. The purpose of this study is to examine developmental sex differences in vocal tract (VT) length and its oral and pharyngeal portions. Method: Nine VT variables were measured from 605 imaging studies (magnetic resonance imaging…
Utterance Duration as It Relates to Communicative Variables in Infant Vocal Development
ERIC Educational Resources Information Center
Ramsdell-Hudock, Heather L.; Stuart, Andrew; Parham, Douglas F.
2018-01-01
Purpose: We aimed to provide novel information on utterance duration as it relates to vocal type, facial affect, gaze direction, and age in the prelinguistic/early linguistic infant. Method: Infant utterances were analyzed from longitudinal recordings of 15 infants at 8, 10, 12, 14, and 16 months of age. Utterance durations were measured and coded…
Learning to breathe and sing: development of respiratory-vocal coordination in young songbirds
Veit, Lena; Aronov, Dmitriy
2011-01-01
How do animals with learned vocalizations coordinate vocal production with respiration? Songbirds such as the zebra finch learn their songs, beginning with highly variable babbling vocalizations known as subsong. After several weeks of practice, zebra finches are able to produce a precisely timed pattern of syllables and silences, precisely coordinated with expiratory and inspiratory pulses (Franz M, Goller F. J Neurobiol 51: 129–141, 2002). While respiration in adult song is well described, relatively little is known about respiratory patterns in subsong or about the processes by which respiratory and vocal patterns become coordinated. To address these questions, we recorded thoracic air sac pressure in juvenile zebra finches prior to the appearance of any consistent temporal or acoustic structure in their songs. We found that subsong contains brief inspiratory pulses (50 ms) alternating with longer pulses of sustained expiratory pressure (50–500 ms). In striking contrast to adult song, expiratory pulses often contained multiple (0–8) variably timed syllables separated by expiratory gaps and were only partially vocalized. During development, expiratory pulses became shorter and more stereotyped in duration with shorter and fewer nonvocalized parts. These developmental changes eventually resulted in the production of a single syllable per expiratory pulse and a single inspiratory pulse filling each gap, forming a coordinated sequence similar to that of adult song. To examine the role of forebrain song-control nuclei in the development of respiratory patterns, we performed pressure recordings before and after lesions of nucleus HVC (proper name) and found that this manipulation reverses the developmental trends in measures of the respiratory pattern. PMID:21697438
Learning to breathe and sing: development of respiratory-vocal coordination in young songbirds.
Veit, Lena; Aronov, Dmitriy; Fee, Michale S
2011-10-01
How do animals with learned vocalizations coordinate vocal production with respiration? Songbirds such as the zebra finch learn their songs, beginning with highly variable babbling vocalizations known as subsong. After several weeks of practice, zebra finches are able to produce a precisely timed pattern of syllables and silences, precisely coordinated with expiratory and inspiratory pulses (Franz M, Goller F. J Neurobiol 51: 129-141, 2002). While respiration in adult song is well described, relatively little is known about respiratory patterns in subsong or about the processes by which respiratory and vocal patterns become coordinated. To address these questions, we recorded thoracic air sac pressure in juvenile zebra finches prior to the appearance of any consistent temporal or acoustic structure in their songs. We found that subsong contains brief inspiratory pulses (50 ms) alternating with longer pulses of sustained expiratory pressure (50-500 ms). In striking contrast to adult song, expiratory pulses often contained multiple (0-8) variably timed syllables separated by expiratory gaps and were only partially vocalized. During development, expiratory pulses became shorter and more stereotyped in duration with shorter and fewer nonvocalized parts. These developmental changes eventually resulted in the production of a single syllable per expiratory pulse and a single inspiratory pulse filling each gap, forming a coordinated sequence similar to that of adult song. To examine the role of forebrain song-control nuclei in the development of respiratory patterns, we performed pressure recordings before and after lesions of nucleus HVC (proper name) and found that this manipulation reverses the developmental trends in measures of the respiratory pattern.
Demopoulos, Carly; Hopkins, Joyce; Kopald, Brandon E; Paulson, Kim; Doyle, Lauren; Andrews, Whitney E; Lewine, Jeffrey David
2015-11-01
The primary aim of this study was to examine whether there is an association between magnetoencephalography-based (MEG) indices of basic cortical auditory processing and vocal affect recognition (VAR) ability in individuals with autism spectrum disorder (ASD). MEG data were collected from 25 children/adolescents with ASD and 12 control participants using a paired-tone paradigm to measure quality of auditory physiology, sensory gating, and rapid auditory processing. Group differences were examined in auditory processing and vocal affect recognition ability. The relationship between differences in auditory processing and vocal affect recognition deficits was examined in the ASD group. Replicating prior studies, participants with ASD showed longer M1n latencies and impaired rapid processing compared with control participants. These variables were significantly related to VAR, with the linear combination of auditory processing variables accounting for approximately 30% of the variability after controlling for age and language skills in participants with ASD. VAR deficits in ASD are typically interpreted as part of a core, higher order dysfunction of the "social brain"; however, these results suggest they also may reflect basic deficits in auditory processing that compromise the extraction of socially relevant cues from the auditory environment. As such, they also suggest that therapeutic targeting of sensory dysfunction in ASD may have additional positive implications for other functional deficits. (c) 2015 APA, all rights reserved).
Cygan, Fanny; Oudiette, Delphine; Leclair-Visonneau, Laurène; Leu-Semenescu, Smaranda; Arnulf, Isabelle
2010-01-01
Objectives: The video-polysomnographic criteria of REM sleep behavior disorder (RBD) have not been well described. We evaluated the between-night reproducibility of phasic and tonic enhanced muscle activity during REM sleep as well as the associated behaviors and vocalizations of the patients. Methods: Fifteen patients with clinical RBD underwent two consecutive video-polysomnographies. The amount of excessive phasic and tonic chin muscle activity during REM sleep was measured in 15 patients in 3-sec mini-epochs. The time spent with motor (minor, major, complex, and scenic) or vocal (sounds, mumblings, and comprehensible speeches) events was measured in 7 patients during REM sleep. Results: There was a good between-night agreement for tonic (Spearman rho = 0.55, p = 0.03; Kendall tau = 0.48, p = 0.01) but not for phasic (rho = 0.47, p = 0.1; tau = 0.31, p = 0.1) excessive chin muscle activity. On the video and audio recordings, the minor RBD behaviors tended to occur more frequently during the second night than the first, whereas the patients spoke longer during the first than the second night. Conclusion: The excessive tonic activity during REM sleep is a reliable marker of RBD. It could represent the extent of dysfunction in the permissive atonia systems. In contrast, the more variable phasic activity and motor/vocal events could be more dependent on dream content (executive systems). Citation: Cygan F; Oudiette D; Leclair-Visonneau L; Leu-Semenescu S; Arnulf I. Night-to-night variability of muscle tone, movements, and vocalizations in patients with REM sleep behavior disorder. J Clin Sleep Med 2010;6(6):551-555. PMID:21206543
A model of acoustic interspeaker variability based on the concept of formant-cavity affiliation
NASA Astrophysics Data System (ADS)
Apostol, Lian; Perrier, Pascal; Bailly, Gérard
2004-01-01
A method is proposed to model the interspeaker variability of formant patterns for oral vowels. It is assumed that this variability originates in the differences existing among speakers in the respective lengths of their front and back vocal-tract cavities. In order to characterize, from the spectral description of the acoustic speech signal, these vocal-tract differences between speakers, each formant is interpreted, according to the concept of formant-cavity affiliation, as a resonance of a specific vocal-tract cavity. Its frequency can thus be directly related to the corresponding cavity length, and a transformation model can be proposed from a speaker A to a speaker B on the basis of the frequency ratios of the formants corresponding to the same resonances. In order to minimize the number of sounds to be recorded for each speaker in order to carry out this speaker transformation, the frequency ratios are exactly computed only for the three extreme cardinal vowels [eye, aye, you] and they are approximated for the remaining vowels through an interpolation function. The method is evaluated through its capacity to transform the (F1,F2) formant patterns of eight oral vowels pronounced by five male speakers into the (F1,F2) patterns of the corresponding vowels generated by an articulatory model of the vocal tract. The resulting formant patterns are compared to those provided by normalization techniques published in the literature. The proposed method is found to be efficient, but a number of limitations are also observed and discussed. These limitations can be associated with the formant-cavity affiliation model itself or with a possible influence of speaker-specific vocal-tract geometry in the cross-sectional direction, which the model might not have taken into account.
Clinical course of acute laryngeal trauma and associated effects on phonation.
Brosch, S; Johannsen, H S
1999-01-01
We report the clinical course of blunt laryngeal trauma in three young patients. All three patients underwent several phoniatric examinations as well as indirect microlaryngoscopy and microstroboscopy. The follow-up period ranged from three to eight months. In the first case, there was isolated haemorrhage of the left vocal fold; in the second, dislocation of the arytenoid cartilage with formation of an adhesion in the area of the anterior commissure; and, in the third, non-dislocated fracture of the thyroid cartilage with development of haematoma in the right hemilarynx and transient vocal fold paralysis. One patient required surgical treatment; however, repositioning of the arytenoid cartilage, attempted seven weeks following the injury, proved unsuccessful. In conclusion, all three patients showed significant limitation of vocal fold vibration many months after trauma which was unrelated to the extent of resulting tissue damage. In all three cases, patients developed secondary posttraumatic functional dysphonia requiring treatment.
Treatment of Acute Vocal Fold Injury With Platelet-Rich Plasma.
Cobden, Serap Bulut; Oztürk, Kayhan; Duman, Selcuk; Esen, Hasan; Aktan, Tahsin Murad; Avunduk, Mustafa Cihat; Elsurer, Cagdas
2016-11-01
Platelet-rich plasma (PRP) is a reliable and has low side-effect profile and has beneficial effects on wound healing. Its investigatory effects on wound-healing process were shown on various tissues. The aim of the present study was to evaluate effectiveness of PRP application on scar tissue of acute vocal fold injury. Twenty-four Wistar rats were used in the study. The entire layer of the lamina propria down to the thyroarytenoid muscle of 10 subjects was unilaterally injured by with a microscissor. Gelfoam-absorbed PRP was applied on the injured area for 10 minutes. Control group consisted of rats unilaterally injured using a microscissor, and gelfoam with normal saline was applied on the injured area. Following sacrifice, the larynxes were carefully dissected and removed for histopathologic examination. After excised larynx experiments, serial sections were prepared from vocal fold. Hematoxylin eosin and immunohistochemical staining were done for epithelial growth factor receptor (EGFR), fibroblast growth factor receptor (FGFR1), and vascular endothelial growth factor (VEGF) staining for histopathologic examinations. There was not a significant difference between the two groups for lymphocyte. Although collagen and VEGF were higher in the study group, there was not a significant difference between the groups (P > 0.05). There was a significant difference between control and study groups for EGFR and FGFR1(P < 0.05). PRP has beneficial effects on wound healing. PRP accelerates epithelization of injured rat vocal folds by inducing EGFR secretion. PRP is an autogenous, reliable, low side-effect profile, easily harvested material. PRP may be useful to prevent scar formation. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Vocal repertoire of the social giant otter.
Leuchtenberger, Caroline; Sousa-Lima, Renata; Duplaix, Nicole; Magnusson, William E; Mourão, Guilherme
2014-11-01
According to the "social intelligence hypothesis," species with complex social interactions have more sophisticated communication systems. Giant otters (Pteronura brasiliensis) live in groups with complex social interactions. It is likely that the vocal communication of giant otters is more sophisticated than previous studies suggest. The objectives of the current study were to describe the airborne vocal repertoire of giant otters in the Pantanal area of Brazil, to analyze call types within different behavioral contexts, and to correlate vocal complexity with level of sociability of mustelids to verify whether or not the result supports the social intelligence hypothesis. The behavior of nine giant otters groups was observed. Vocalizations recorded were acoustically and statistically analyzed to describe the species' repertoire. The repertoire was comprised by 15 sound types emitted in different behavioral contexts. The main behavioral contexts of each sound type were significantly associated with the acoustic variable ordination of different sound types. A strong correlation between vocal complexity and sociability was found for different species, suggesting that the communication systems observed in the family mustelidae support the social intelligence hypothesis.
Vocal cord paralysis after aortic surgery.
DiLisio, Ralph P; Mazzeffi, Michael A; Bodian, Carol A; Fischer, Gregory W
2013-06-01
The purpose of this study was to investigate variables associated with vocal cord paralysis during complex aortic procedures. A retrospective review. A tertiary care center. Four hundred ninety-eight patients who underwent aortic surgery between 2002 and 2007. Two groups were studied. Group A patients had procedures only involving their aortic root and/or ascending aorta. Group B patients had procedures only involving their aortic arch and/or descending aorta. The incidence of vocal cord paralysis was higher (7.26% v 0.8%) in group B patients (p < 0.0001). Increasing the duration of cardiopulmonary bypass time was associated with an increased risk of vocal cord paralysis and death in both groups A and B (p = 0.0002 and 0.002, respectively). Additionally, within group B, descending aneurysms emerged as an independent risk factor associated with vocal cord paralysis (p = 0.03). Length of stay was statistically significantly longer among group A patients who suffered vocal cord paralysis (p = 0.017) and trended toward significance in group B patients who suffered vocal cord paralysis (p = 0.059). The association between tracheostomy and vocal cord paralysis among group A patients reached statistical significance (p = 0.007) and trended toward significance in group B patients (p = 0.057). Increasing duration of cardiopulmonary bypass time was associated with a higher risk of vocal cord paralysis in patients undergoing aortic surgery. Additionally, within group B patients, descending aortic aneurysm was an independent risk factor associated with vocal cord paralysis. Most importantly, vocal cord paralysis appeared to have an association between an increased length of stay and tracheostomy among a select group of patients undergoing aortic surgery. Copyright © 2013 Elsevier Inc. All rights reserved.
Memory-dependent adjustment of vocal response latencies in a territorial songbird.
Geberzahn, Nicole; Hultsch, Henrike; Todt, Dietmar
2013-06-01
Vocal interactions in songbirds can be used as a model system to investigate the interplay of intrinsic singing programmes (e.g. influences from vocal memories) and external variables (e.g. social factors). When characterizing vocal interactions between territorial rivals two aspects are important: (1) the timing of songs in relation to the conspecific's singing and (2) the use of a song pattern that matches the rival's song. Responses in both domains can be used to address a territorial rival. This study is the first to investigate the relation of the timing of vocal responses to (1) the vocal memory of a responding subject and (2) the selection of the song pattern that the subject uses as a response. To this end, we conducted interactive playback experiments with adult nightingales (Luscinia megarhynchos) that had been hand-reared and tutored in the laboratory. We analysed the subjects' vocal response latencies towards broadcast playback stimuli that they either had in their own vocal repertoire (songs shared with playback) or that they had not heard before (unknown songs). Likewise, we compared vocal response latencies between responses that matched the stimulus song and those that did not. Our findings showed that the latency of singing in response to the playback was shorter for shared versus unknown song stimuli when subjects overlapped the playback stimuli with their own song. Moreover birds tended to overlap faster when vocally matching the stimulus song rather than when replying with a non-matching song type. We conclude that memory of song patterns influenced response latencies and discuss possible mechanisms. Copyright © 2012 Elsevier Ltd. All rights reserved.
Mets, David G; Brainard, Michael S
2018-01-01
Abstract Background Vocal learning in songbirds has emerged as a powerful model for sensorimotor learning. Neurobehavioral studies of Bengalese finch (Lonchura striata domestica) song, naturally more variable and plastic than songs of other finch species, have demonstrated the importance of behavioral variability for initial learning, maintenance, and plasticity of vocalizations. However, the molecular and genetic underpinnings of this variability and the learning it supports are poorly understood. Findings To establish a platform for the molecular analysis of behavioral variability and plasticity, we generated an initial draft assembly of the Bengalese finch genome from a single male animal to 151× coverage and an N50 of 3.0 MB. Furthermore, we developed an initial set of gene models using RNA-seq data from 8 samples that comprise liver, muscle, cerebellum, brainstem/midbrain, and forebrain tissue from juvenile and adult Bengalese finches of both sexes. Conclusions We provide a draft Bengalese finch genome and gene annotation to facilitate the study of the molecular-genetic influences on behavioral variability and the process of vocal learning. These data will directly support many avenues for the identification of genes involved in learning, including differential expression analysis, comparative genomic analysis (through comparison to existing avian genome assemblies), and derivation of genetic maps for linkage analysis. Bengalese finch gene models and sequences will be essential for subsequent manipulation (molecular or genetic) of genes and gene products, enabling novel mechanistic investigations into the role of variability in learned behavior. PMID:29618046
Colquitt, Bradley M; Mets, David G; Brainard, Michael S
2018-03-01
Vocal learning in songbirds has emerged as a powerful model for sensorimotor learning. Neurobehavioral studies of Bengalese finch (Lonchura striata domestica) song, naturally more variable and plastic than songs of other finch species, have demonstrated the importance of behavioral variability for initial learning, maintenance, and plasticity of vocalizations. However, the molecular and genetic underpinnings of this variability and the learning it supports are poorly understood. To establish a platform for the molecular analysis of behavioral variability and plasticity, we generated an initial draft assembly of the Bengalese finch genome from a single male animal to 151× coverage and an N50 of 3.0 MB. Furthermore, we developed an initial set of gene models using RNA-seq data from 8 samples that comprise liver, muscle, cerebellum, brainstem/midbrain, and forebrain tissue from juvenile and adult Bengalese finches of both sexes. We provide a draft Bengalese finch genome and gene annotation to facilitate the study of the molecular-genetic influences on behavioral variability and the process of vocal learning. These data will directly support many avenues for the identification of genes involved in learning, including differential expression analysis, comparative genomic analysis (through comparison to existing avian genome assemblies), and derivation of genetic maps for linkage analysis. Bengalese finch gene models and sequences will be essential for subsequent manipulation (molecular or genetic) of genes and gene products, enabling novel mechanistic investigations into the role of variability in learned behavior.
Individual killer whale vocal variation during intra-group behavioral dynamics
NASA Astrophysics Data System (ADS)
Grebner, Dawn M.
The scientific goal of this dissertation was to carefully study the signal structure of killer whale communications and vocal complexity and link them to behavioral circumstances. The overall objective of this research sought to provide insight into killer whale call content and usage which may be conveying information to conspecifics in order to maintain group cohesion. Data were collected in the summers of 2006 and 2007 in Johnstone Strait, British Columbia. For both individuals and small groups, vocalizations were isolated using a triangular hydrophone array and the behavioral movement patterns were captured by a theodolite and video camera positioned on a cliff overlooking the hyrophone locations. This dissertation is divided into four analysis chapters. In Chapter 3, discriminant analysis was used to validate the four N04 call subtypes which were originally parsed due to variations in slope segments. The first two functions of the discriminant analysis explained 97% of the variability. Most of the variability for the N04 call was found in the front convex and the terminal portions of the call, while very little variability was found in the center region of the call. This research revealed that individual killer whales produced multiple subtypes of the N04 call. No correlations of behaviors to acoustic parameters obtained were found. The aim of the Chapter 4 was to determine if killer whale calling behavior varied prior to and after the animals had joined. Pulsed call rates were found to be greater pre- compared to post-joining events. Two-way vocal exchanges were more common occurring 74% of the time during pre-joining events. In Chapter 5, initiated and first response to calls varied between age/sex class groups when mothers were separated from an offspring. Solo mothers and calves initiated pulsed calls more often than they responded. Most of the no vocal responses were due to mothers who were foraging. Finally, observations of the frequency split in N04 calls discussed in Chapter 6 showed that the higher frequency component (HFC) was always associated with sideband 7 (SB7) of the lower frequency component (LFC). Insight into Northern Resident killer whale intra-group vocal dynamics would aid our understanding of vocal behaviors of many other marine mammal species that rely on vocal exchanges for prey capture, group movement or survival. This is the first study to focus on killer whale vocal content and usage as it pertains to intra-group dynamics for (1) mother and offspring separations and (2) for all individuals prior to joining events, as well as (3) individual usage in a diverging pulsed call. It is also the first time the N04 call has been parsed into subtypes.
Individual behavioural predictors of amphetamine-induced emission of 50 kHz vocalization in rats.
Mulvihill, Kevin G; Brudzynski, Stefan M
2018-05-11
Measurement of ultrasonic vocalizations (USVs) produced by adult rats represents a highly useful index of emotional arousal. The associations found between 50 kHz USV production and a variety of behavioural and pharmacological protocols increasingly suggests they serve as a marker of positive motivational states. This study used a powerful within-subjects design to investigate the relationships among individual differences in approach to a sweet-food reward, predisposition to emit 50 kHz USVs spontaneously, and 50 kHz USVs emission following acute systemic administration of amphetamine. Both approach motivation and predisposition to call were found to not correlate with each other but did predict 50 kHz USV response to acute amphetamine. These two behavioural phenotypes appear to represent dissociable predictors of acute amphetamine-induced emission of 50 kHz USVs in a non-sensitization paradigm. In contrast to that, a measure of sucrose preference was not found to predict 50 kHz USV emission following amphetamine. Acute amphetamine was also found to increase average sound frequency of emitted USVs and selectively increase the proportion of Trill subtype 50 kHz USVs. Together, these data demonstrate that acute amphetamine-induced 50 kHz USVs in the adult rat represent more than just a univariate motivational state and may represent the product of dissociable subsystems of emotional behavior. Copyright © 2018. Published by Elsevier B.V.
Statistical learning in songbirds: from self-tutoring to song culture.
Fehér, Olga; Ljubičić, Iva; Suzuki, Kenta; Okanoya, Kazuo; Tchernichovski, Ofer
2017-01-05
At the onset of vocal development, both songbirds and humans produce variable vocal babbling with broadly distributed acoustic features. Over development, these vocalizations differentiate into the well-defined, categorical signals that characterize adult vocal behaviour. A broadly distributed signal is ideal for vocal exploration, that is, for matching vocal production to the statistics of the sensory input. The developmental transition to categorical signals is a gradual process during which the vocal output becomes differentiated and stable. But does it require categorical input? We trained juvenile zebra finches with playbacks of their own developing song, produced just a few moments earlier, updated continuously over development. Although the vocalizations of these self-tutored (ST) birds were initially broadly distributed, birds quickly developed categorical signals, as fast as birds that were trained with a categorical, adult song template. By contrast, siblings of those birds that received no training (isolates) developed phonological categories much more slowly and never reached the same level of category differentiation as their ST brothers. Therefore, instead of simply mirroring the statistical properties of their sensory input, songbirds actively transform it into distinct categories. We suggest that the early self-generation of phonological categories facilitates the establishment of vocal culture by making the song easier to transmit at the micro level, while promoting stability of shared vocabulary at the group level over generations.This article is part of the themed issue 'New frontiers for statistical learning in the cognitive sciences'. © 2016 The Authors.
Learned vocal and breathing behavior in an enculturated gorilla.
Perlman, Marcus; Clark, Nathaniel
2015-09-01
We describe the repertoire of learned vocal and breathing-related behaviors (VBBs) performed by the enculturated gorilla Koko. We examined a large video corpus of Koko and observed 439 VBBs spread across 161 bouts. Our analysis shows that Koko exercises voluntary control over the performance of nine distinctive VBBs, which involve variable coordination of her breathing, larynx, and supralaryngeal articulators like the tongue and lips. Each of these behaviors is performed in the context of particular manual action routines and gestures. Based on these and other findings, we suggest that vocal learning and the ability to exercise volitional control over vocalization, particularly in a multimodal context, might have figured relatively early into the evolution of language, with some rudimentary capacity in place at the time of our last common ancestor with great apes.
Quantitative Tools for Examining the Vocalizations of Juvenile Songbirds
Wellock, Cameron D.; Reeke, George N.
2012-01-01
The singing of juvenile songbirds is highly variable and not well stereotyped, a feature that makes it difficult to analyze with existing computational techniques. We present here a method suitable for analyzing such vocalizations, windowed spectral pattern recognition (WSPR). Rather than performing pairwise sample comparisons, WSPR measures the typicality of a sample against a large sample set. We also illustrate how WSPR can be used to perform a variety of tasks, such as sample classification, song ontogeny measurement, and song variability measurement. Finally, we present a novel measure, based on WSPR, for quantifying the apparent complexity of a bird's singing. PMID:22701474
Diagnosing vocal cord dysfunction in young athletes.
Rhodes, Rea Kae
2008-12-01
To provide an overview of the pathophysiology, steps in making a diagnosis, differential diagnosis, and treatment methods for vocal cord dysfunction (VCD) in young athletes. Review of published literature about VCD and exercise-induced asthma (EIA) and a case study. The clinical presentation of VCD is often confusing. A young athlete who is having difficulty "catching his breath" may have more than EIA. Young athletes who have been previously diagnosed with EIA may actually have VCD. The ability to correctly differentiate VCD from other causes of respiratory distress can lead to accurate interventions, save precious time in an acute situation, and promote long-term control of this condition.
Fontan, Lionel; Fraval, Marie; Michon, Anne; Déjean, Sébastien; Welby-Gieusse, Muriel
2017-03-01
Sports and fitness instructors (SFIs) are known for being a high-risk population for voice difficulties (VD). However, past studies have encountered various methodological difficulties in determining prevalence and risk factors for VD in SFIs, such as limited population, gender and selection biases, or poor statistical power, because VD were studied as a binary variable. The present research work addresses these issues and aims at studying the prevalence of vocal problems and risk factors in French SFIs, a population in which no such study was conducted yet. Another objective is to survey the French SFIs' habits and expectations regarding vocal prevention and care. This is a cross-sectional study. Three hundred and twenty SFIs answered a questionnaire, whether in an online (n = 267) or a paper (n = 53) version. The questionnaire consisted of 31 items addressing self-reported vocal difficulties, supposed risk factors, and personal health-care history, followed by the Voice Handicap Index assessment. Prevalence of self-reported vocal difficulties is 55%. The Voice Handicap Index is significantly associated with gender, age, and variables related to work environment (noise and music) and habits (shouting, frequency of classes), as well as with daily sleeping time. Results also indicate that a minority of the SFIs (37%) received information on vocal difficulties, whereas a majority (80%) declares being interested in participating in prevention programs. This work confirms that SFIs are a high-risk population for VD, underlines the need for specific information programs in France, and provides relevant data for driving such preventive actions. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Actor vocal training for the habilitation of speech in adolescent users of cochlear implants.
Holt, Colleen M; Dowell, Richard C
2011-01-01
This study examined changes to speech production in adolescents with hearing impairment following a period of actor vocal training. In addition to vocal parameters, the study also investigated changes to psychosocial factors such as confidence, self-esteem, and anxiety. The group were adolescent users of cochlear implants (mean age at commencement of training 15.9 years), with approximately half of the group wearing a hearing aid in the contralateral ear. The mean age of implantation of the group was 7.6 years and the participants displayed a range of speech production abilities. Evaluation of posttraining outcomes was performed via a combination of perceptual and acoustic analyses. Significant posttraining changes to vocal parameters included increased pitch range and variability and decreased speaking rate. From a psychosocial perspective, posttraining stress levels were significantly lowered. This study suggested that actor vocal training may benefit young people with hearing impairment, both in the way in which they use their voices and in the way in which they view themselves.
Quantifying the influence of flow asymmetries on glottal sound sources in speech
NASA Astrophysics Data System (ADS)
Erath, Byron; Plesniak, Michael
2008-11-01
Human speech is made possible by the air flow interaction with the vocal folds. During phonation, asymmetries in the glottal flow field may arise from flow phenomena (e.g. the Coanda effect) as well as from pathological vocal fold motion (e.g. unilateral paralysis). In this study, the effects of flow asymmetries on glottal sound sources were investigated. Dynamically-programmable 7.5 times life-size vocal fold models with 2 degrees-of-freedom (linear and rotational) were constructed to provide a first-order approximation of vocal fold motion. Important parameters (Reynolds, Strouhal, and Euler numbers) were scaled to physiological values. Normal and abnormal vocal fold motions were synthesized, and the velocity field and instantaneous transglottal pressure drop were measured. Variability in the glottal jet trajectory necessitated sorting of the data according to the resulting flow configuration. The dipole sound source is related to the transglottal pressure drop via acoustic analogies. Variations in the transglottal pressure drop (and subsequently the dipole sound source) arising from flow asymmetries are discussed.
Day, Nancy F; Kimball, Todd Haswell; Aamodt, Caitlin M; Heston, Jonathan B; Hilliard, Austin T; Xiao, Xinshu; White, Stephanie A
2018-01-01
Human speech is one of the few examples of vocal learning among mammals yet ~half of avian species exhibit this ability. Its neurogenetic basis is largely unknown beyond a shared requirement for FoxP2 in both humans and zebra finches. We manipulated FoxP2 isoforms in Area X, a song-specific region of the avian striatopallidum analogous to human anterior striatum, during a critical period for song development. We delineate, for the first time, unique contributions of each isoform to vocal learning. Weighted gene coexpression network analysis of RNA-seq data revealed gene modules correlated to singing, learning, or vocal variability. Coexpression related to singing was found in juvenile and adult Area X whereas coexpression correlated to learning was unique to juveniles. The confluence of learning and singing coexpression in juvenile Area X may underscore molecular processes that drive vocal learning in young zebra finches and, by analogy, humans. PMID:29360038
Contribution of the supraglottic larynx to the vocal product: imaging and acoustic analysis
NASA Astrophysics Data System (ADS)
Gracco, L. Carol
1996-04-01
Horizontal supraglottic laryngectomy is a surgical procedure to remove a mass lesion located in the region of the pharynx superior to the true vocal folds. In contrast to full or partial laryngectomy, patients who undergo horizontal supraglottic laryngectomy often present with little or nor involvement to the true vocal folds. This population provides an opportunity to examine the acoustic consequences of altering the pharynx while sparing the laryngeal sound source. Acoustic and magnetic resonance imaging (MRI) data were acquired in a group of four patients before and after supraglottic laryngectomy. Acoustic measures included the identification of vocal tract resonances and the fundamental frequency of the vocal fold vibration. 3D reconstruction of the pharyngeal portion of each subjects' vocal tract were made from MRIs taken during phonation and volume measures were obtained. These measures reveal a variable, but often dramatic difference in the surgically-altered area of the pharynx and changes in the formant frequencies of the vowel/i/post surgically. In some cases the presence of the tumor created a deviation from the expected formant values pre-operatively with post-operative values approaching normal. Patients who also underwent radiation treatment post surgically tended to have greater constriction in the pharyngeal area of the vocal tract.
[The vocal behavior of telemarketing operators before and after a working day].
Amorim, Geová Oliveira de; Bommarito, Silvana; Kanashiro, Célia Akemi; Chiari, Brasilia Maria
2011-01-01
To evaluate the vocal behavior of receptive telemarketing operators in pre- and post-work shift moments, and to relate the results to the variable gender. Participants were 55 telemarketing operators (11 men and 44 women) working in a receptive mode in the city of Maceió (Alagoas, Brazil). A questionnaire was applied before the work shift to initially identify the vocal complaints. After that, vocal samples were recorded, comprising sustained emissions and connected speech produced 10 minutes before and 10 minutes after the workday to be later evaluated. Auditory-perceptual and acoustic analyses of voice were conducted. Vocal complaints and symptoms reported by the operators after the work shift were: dry throat (64%); neck and cervix pain (33%); hoarseness (31%); voice failure (26%); and vocal fatigue (22%).Telemarketing operators presented reduced maximum phonation time before and after the day of work (p=0.645). Data from the auditory-perceptual assessment of voice were similar in pre- and post-shift moments (p=0.645). No difference was found between moments also on acoustic analysis data (p=0.738). Telemarketing operators have high indexes of vocal symptoms after the work shift, and there are no differences between pre- and post-work shift in auditory-perceptual and acoustic assessments of voice.
Singing ability is rooted in vocal-motor control of pitch.
Hutchins, Sean; Larrouy-Maestri, Pauline; Peretz, Isabelle
2014-11-01
The inability to vocally match a pitch can be caused by poor pitch perception or by poor vocal-motor control. Although previous studies have tried to examine the relationship between pitch perception and vocal production, they have failed to control for the timbre of the target to be matched. In the present study, we compare pitch-matching accuracy with an unfamiliar instrument (the slider) and with the voice, designed such that the slider plays back recordings of the participant's own voice. We also measured pitch accuracy in singing a familiar melody ("Happy Birthday") to assess the relationship between single-pitch-matching tasks and melodic singing. Our results showed that participants (all nonmusicians) were significantly better at matching recordings of their own voices with the slider than with their voice, indicating that vocal-motor control is an important limiting factor on singing ability. We also found significant correlations between the ability to sing a melody in tune and vocal pitch matching, but not pitch matching on the slider. Better melodic singers also tended to have higher quality voices (as measured by acoustic variables). These results provide important evidence about the role of vocal-motor control in poor singing ability and demonstrate that single-pitch-matching tasks can be useful in measuring general singing abilities.
Munin, Michael C; Heman-Ackah, Yolanda D; Rosen, Clark A; Sulica, Lucian; Maronian, Nicole; Mandel, Steven; Carey, Bridget T; Craig, Earl; Gronseth, Gary
2016-06-01
The purpose of this study was to develop an evidence-based consensus statement regarding use of laryngeal electromyography (LEMG) for diagnosis and treatment of vocal fold paralysis after recurrent laryngeal neuropathy (RLN). Two questions regarding LEMG were analyzed: (1) Does LEMG predict recovery in patients with acute unilateral or bilateral vocal fold paralysis? (2) Do LEMG findings change clinical management in these individuals? A systematic review was performed using American Academy of Neurology criteria for rating of diagnostic accuracy. Active voluntary motor unit potential recruitment and presence of polyphasic motor unit potentials within the first 6 months after lesion onset predicted recovery. Positive sharp waves and/or fibrillation potentials did not predict outcome. The presence of electrical synkinesis may decrease the likelihood of recovery, based on 1 published study. LEMG altered clinical management by changing the initial diagnosis from RLN in 48% of cases. Cricoarytenoid fixation and superior laryngeal neuropathy were the most common other diagnoses observed. If prognostic information is required in a patient with vocal fold paralysis that is more than 4 weeks and less than 6 months in duration, then LEMG should be performed. LEMG may be performed to clarify treatment decisions for vocal fold immobility that is presumed to be caused by RLN. Muscle Nerve 53: 850-855, 2016. © 2016 Wiley Periodicals, Inc.
Swalve, Natashia; Mulholland, Michele M.; Schulz, Tiffany D.; Li, Ming
2015-01-01
Patients with schizophrenia smoke cigarettes at a higher rate than the general population. We hypothesized that a factor in this comorbidity is sensitivity to the reinforcing and reinforcement-enhancement effects of nicotine. Phencyclidine (PCP) was used to model behavioral changes resembling negative symptoms of schizophrenia in rats. USVs in rats have been used to measure emotional states, with 50 kHz USVs indicating positive states and 22 kHz indicating negative. Total and categorized numbers of 22 and 50 kHz ultrasonic vocalizations (USVs) and USVs during a visual stimulus (e.g. a potential measure of reinforcement-enhancement) were examined in rats following .injection ofh PCP (2.0 mg/kg), and/or nicotine (0.2 or 0.4 mg/kg) daily for 7 days. PCP was then discontinued and all rats received nicotine (0.2 mg/kg and 0.4 mg/kg) and PCP (2.0 mg/kg) on 3 challenge days. PCP acutely decreased 50 kHz vocalizations while repeated nicotine potentiated rates of vocalizations, with similar patterns during light presentations. Rats in the PCP and nicotine combination groups made more 50 kHz vocalizations compared to control groups on challenge days. We conclude that PCP may produce a reward deficit that is shown by decreased 50 kHz USVs, and behaviors post-PCP exposure may best model the comorbidity between schizophrenia and nicotine. PMID:26479849
Master, Suely; Guzman, Marco; Azócar, Maria Josefina; Muñoz, Daniel; Bortnem, Cori
2015-05-01
The present study aimed to compare actors/actresses's voices and vocally trained subjects through aerodynamic and electroglottographic (EGG) analyses. We hypothesized that glottal and breathing functions would reflect technical and physiological differences between vocally trained and untrained subjects. Forty participants with normal voices participated in this study (20 professional theater actors and 20 untrained participants). In each group, 10 male and 10 female subjects were assessed. All participants underwent aerodynamic and EGG assessment of voice. From the Phonatory Aerodynamic System, three protocols were used: comfortable sustained phonation with EGG, voice efficiency with EGG, and running speech. Contact quotient was calculated from EGG. All phonatory tasks were produced at three different loudness levels. Mean sound pressure level and fundamental frequency were also assessed. Univariate, multivariate, and correlation statistical analyses were performed. Main differences between vocally trained and untrained participants were found in the following variables: mean sound pressure level, phonatory airflow, subglottic pressure, inspiratory airflow duration, inspiratory airflow, and inspiratory volume. These variables were greater for trained participants. Mean pitch was found to be lower for trained voices. The glottal source seemed to have a weak contribution when differentiating the training status in speaking voice. More prominent changes between vocally trained and untrained participants are demonstrated in respiratory-related variables. These findings may be related to better management of breathing function (better breath support). Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Catecholaminergic contributions to vocal communication signals.
Matheson, Laura E; Sakata, Jon T
2015-05-01
Social context affects behavioral displays across a variety of species. For example, social context acutely influences the acoustic and temporal structure of vocal communication signals such as speech and birdsong. Despite the prevalence and importance of such social influences, little is known about the neural mechanisms underlying the social modulation of communication. Catecholamines are implicated in the regulation of social behavior and motor control, but the degree to which catecholamines influence vocal communication signals remains largely unknown. Using a songbird, the Bengalese finch, we examined the extent to which the social context in which song is produced affected immediate early gene expression (EGR-1) in catecholamine-synthesising neurons in the midbrain. Further, we assessed the degree to which administration of amphetamine, which increases catecholamine concentrations in the brain, mimicked the effect of social context on vocal signals. We found that significantly more catecholaminergic neurons in the ventral tegmental area and substantia nigra (but not the central grey, locus coeruleus or subcoeruleus) expressed EGR-1 in birds that were exposed to females and produced courtship song than in birds that produced non-courtship song in isolation. Furthermore, we found that amphetamine administration mimicked the effects of social context and caused many aspects of non-courtship song to resemble courtship song. Specifically, amphetamine increased the stereotypy of syllable structure and sequencing, the repetition of vocal elements and the degree of sequence completions. Taken together, these data highlight the conserved role of catecholamines in vocal communication across species, including songbirds and humans. © 2015 Federation of European Neuroscience Societies and John Wiley & Sons Ltd.
Acute exposure to vibration is an apoptosis-inducing stimulus in the vocal fold epithelium.
Novaleski, Carolyn K; Kimball, Emily E; Mizuta, Masanobu; Rousseau, Bernard
2016-10-01
Clinical voice disorders pose significant communication-related challenges to patients. The purpose of this study was to quantify the rate of apoptosis and tumor necrosis factor-alpha (TNF-α) signaling in vocal fold epithelial cells in response to increasing time-doses and cycle-doses of vibration. 20 New Zealand white breeder rabbits were randomized to three groups of time-doses of vibration exposure (30, 60, 120min) or a control group (120min of vocal fold adduction and abduction). Estimated cycle-doses of vocal fold vibration were extrapolated based on mean fundamental frequency. Laryngeal tissue specimens were evaluated for apoptosis and gene transcript and protein levels of TNF-α. Results revealed that terminal deoxynucleotidyl transferase dUTP nick end labeling (TUNEL) staining was significantly higher after 120min of vibration compared to the control. Transmission electron microscopy (TEM) revealed no significant effect of time-dose on the mean area of epithelial cell nuclei. Extrapolated cycle-doses of vibration exposure were closely related to experimental time-dose conditions, although no significant correlations were observed with TUNEL staining or mean area of epithelial cell nuclei. TUNEL staining was positively correlated with TNF-α protein expression. Our findings suggest that apoptosis can be induced in the vocal fold epithelium after 120min of modal intensity phonation. In contrast, shorter durations of vibration exposure do not result in apoptosis signaling. However, morphological features of apoptosis are not observed using TEM. Future studies are necessary to examine the contribution of abnormal apoptosis to vocal fold diseases. Copyright © 2016 Elsevier Ltd. All rights reserved.
Acute Exposure to Vibration is an Apoptosis-Inducing Stimulus in the Vocal Fold Epithelium
Novaleski, Carolyn K.; Kimball, Emily E.; Mizuta, Masanobu; Rousseau, Bernard
2016-01-01
Clinical voice disorders pose significant communication-related challenges to patients. The purpose of this study was to quantify the rate of apoptosis and tumor necrosis factor-alpha (TNF-α) signaling in vocal fold epithelial cells in response to increasing time-doses and cycle-doses of vibration. 20 New Zealand white breeder rabbits were randomized to three groups of time-doses of vibration exposure (30, 60, 120 minutes) or a control group (120 minutes of vocal fold adduction and abduction). Estimated cycle-doses of vocal fold vibration were extrapolated based on mean fundamental frequency. Laryngeal tissue specimens were evaluated for apoptosis and gene transcript and protein levels of TNF-α. Results revealed that terminal deoxynucleotidyl transferase dUTP nick end labeling (TUNEL) staining was significantly higher after 120 minutes of vibration compared to the control. Transmission electron microscopy (TEM) revealed no significant effect of time-dose on the mean area of epithelial cell nuclei. Extrapolated cycle-doses of vibration exposure were closely related to experimental time-dose conditions, although no significant correlations were observed with TUNEL staining or mean area of epithelial cell nuclei. TUNEL staining was positively correlated with TNF-α protein expression. Our findings suggest that apoptosis can be induced in the vocal fold epithelium after 120 minutes of modal intensity phonation. In contrast, shorter durations of vibration exposure do not result in apoptosis signaling. However, morphological features of apoptosis are not observed using TEM. Future studies are necessary to examine the contribution of abnormal apoptosis to vocal fold diseases. PMID:27577014
NASA Astrophysics Data System (ADS)
Mosko, J. D.; Stevens, K. N.; Griffin, G. R.
1983-08-01
Acoustical analyses were conducted of words produced by four speakers in a motion stress-inducing situation. The aim of the analyses was to document the kinds of changes that occur in the vocal utterances of speakers who are exposed to motion stress and to comment on the implications of these results for the design and development of voice interactive systems. The speakers differed markedly in the types and magnitudes of the changes that occurred in their speech. For some speakers, the stress-inducing experimental condition caused an increase in fundamental frequency, changes in the pattern of vocal fold vibration, shifts in vowel production and changes in the relative amplitudes of sounds containing turbulence noise. All speakers showed greater variability in the experimental condition than in more relaxed control situation. The variability was manifested in the acoustical characteristics of individual phonetic elements, particularly in speech sound variability observed serve to unstressed syllables. The kinds of changes and variability observed serve to emphasize the limitations of speech recognition systems based on template matching of patterns that are stored in the system during a training phase. There is need for a better understanding of these phonetic modifications and for developing ways of incorporating knowledge about these changes within a speech recognition system.
Food-Associated Calling in Gorillas (Gorilla g. gorilla) in the Wild
Luef, Eva Maria; Breuer, Thomas; Pika, Simone
2016-01-01
Many nonhuman primates produce food-associated vocalizations upon encountering or ingesting particular food. Concerning the great apes, only food-associated vocalizations of chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) have been studied in detail, providing evidence that these vocalizations can be produced flexibly in relation to a variety of factors, such as the quantity and quality of food and/or the type of audience. Only anecdotal evidence exists of eastern (Gorilla beringei) and western gorillas (Gorilla gorilla) producing food-associated vocalizations, termed singing or humming. To enable a better understanding of the context in which these calls are produced, we investigated and compared the vocal behavior of two free-ranging groups of western lowland gorillas (Gorilla g. gorilla) at Mondika, Republic of Congo. Our results show that (a) food-associated call production occurs only during feeding and not in other contexts; (b) calling is not uniformly distributed across age and sex classes; (c) calls are only produced during feeding on specific foods; and (d) normally just one individual gives calls during group feeding sessions, however, certain food types elicit simultaneous calling of two or more individuals. Our findings provide new insight into the vocal abilities of gorillas but also carry larger implications for questions concerning vocal variability among the great apes. Food-associated calls of nonhuman primates have been shown to be flexible in terms of when they are used and who they are directed at, making them interesting vocalizations from the viewpoint of language evolution. Food-associated vocalizations in great apes can offer new opportunities to investigate the phylogenetic development of vocal communication within the primate lineage and can possibly contribute novel insights into the origins of human language. PMID:26909518
Bjørgesaeter, Anders; Ugland, Karl Inne; Bjørge, Arne
2004-10-01
The male harbor seal (Phoca vitulina) produces broadband nonharmonic vocalizations underwater during the breeding season. In total, 120 vocalizations from six colonies were analyzed to provide a description of the acoustic structure and for the presence of geographic variation. The complex harbor seal vocalizations may be described by how the frequency bandwidth varies over time. An algorithm that identifies the boundaries between noise and signal from digital spectrograms was developed in order to extract a frequency bandwidth contour. The contours were used as inputs for multivariate analysis. The vocalizations' sound types (e.g., pulsed sound, whistle, and broadband nonharmonic sound) were determined by comparing the vocalizations' spectrographic representations with sound waves produced by known sound sources. Comparison between colonies revealed differences in the frequency contours, as well as some geographical variation in use of sound types. The vocal differences may reflect a limited exchange of individuals between the six colonies due to long distances and strong site fidelity. Geographically different vocal repertoires have potential for identifying discrete breeding colonies of harbor seals, but more information is needed on the nature and extent of early movements of young, the degree of learning, and the stability of the vocal repertoire. A characteristic feature of many vocalizations in this study was the presence of tonal-like introductory phrases that fit into the categories pulsed sound and whistles. The functions of these phrases are unknown but may be important in distance perception and localization of the sound source. The potential behavioral consequences of the observed variability may be indicative of adaptations to different environmental properties influencing determination of distance and direction and plausible different male mating tactics.
2016-01-01
The goal of this study is to quantify the effects of vocal fold nodules on vibratory motion in children using high-speed videoendoscopy. Differences in vibratory motion were evaluated in 20 children with vocal fold nodules (5–11 years) and 20 age and gender matched typically developing children (5–11 years) during sustained phonation at typical pitch and loudness. Normalized kinematic features of vocal fold displacements from the mid-membranous vocal fold point were extracted from the steady-state high-speed video. A total of 12 kinematic features representing spatial and temporal characteristics of vibratory motion were calculated. Average values and standard deviations (cycle-to-cycle variability) of the following kinematic features were computed: normalized peak displacement, normalized average opening velocity, normalized average closing velocity, normalized peak closing velocity, speed quotient, and open quotient. Group differences between children with and without vocal fold nodules were statistically investigated. While a moderate effect size was observed for the spatial feature of speed quotient, and the temporal feature of normalized average closing velocity in children with nodules compared to vocally normal children, none of the features were statistically significant between the groups after Bonferroni correction. The kinematic analysis of the mid-membranous vocal fold displacement revealed that children with nodules primarily differ from typically developing children in closing phase kinematics of the glottal cycle, whereas the opening phase kinematics are similar. Higher speed quotients and similar opening phase velocities suggest greater relative forces are acting on vocal fold in the closing phase. These findings suggest that future large-scale studies should focus on spatial and temporal features related to the closing phase of the glottal cycle for differentiating the kinematics of children with and without vocal fold nodules. PMID:27124157
[Phoniatric surgery and conservative treatment of vocal cord hematoma].
Milutinović, Z
1997-01-01
Functional-traumatic lesions of the vocal fold include mucous stranding, "nodular" lesions, polyps, cysts, contact hyperplasia and haematoma of the vocal fold. An acute voice overuse may result in bleeding (haematoma) within the vocal fold. This may be in the form of petechial bleeding, or a genuine haematoma develops within the tissues of the vocal fold. Haematoma may also arise as a consequence of prolonged cough, forceful vomiting, lifting of a heavy weight, various effortful activities, etc. Haematoma is usually located close to the vocal fold free edge and therefore disturbs the glottic closure during phonation. The treatment is adapted to the size and localization of haematoma, as well as to the time elapsed from onset of the lesion. Phonosurgery can be used in therapy, as well as corticosteroid treatment. A series of 102 vocal fold haematomas has been treated by phonosurgery (39) and conservative therapy (63). Phonosurgical interventions were performed by an indirect approach, by use of microstroboscopy (28 patients) and videostroboscopy (11 causes). Conservative treatment consisted of corticosteroid therapy. During a 10-year period 1550 phonosurgical operations were performed for benign lesions of the vocal fold, including 39 haematomas (2.5%). It was established that recovery of vibration pattern was significantly faster in the surgery group in comparison to the group of patients treated conservatively. All surgical patients were operated within the first several days after the onset of symptoms. In case of a vocal fold haematoma, it is very important to establish the diagnosis as soon as possible in order to start with the therapy early enough. Within the first several days after the onset (the best within 24-48 hours) a phonosurgical treatment is indicated, preferably by the use of indirect videostroboscopy. If the treatment is started later we use corticoids. However, the results are inferior as compared to surgery. We did not perform direct microlaryngoscopy in these cases, for a lack of function monitoring and possible local trauma to the tissues. In the majority of cases the voice therapy is required as well.
Final Syllable Lengthening (FSL) in infant vocalizations.
Nathani, Suneeti; Oller, D Kimbrough; Cobo-Lewis, Alan B
2003-02-01
Final Syllable Lengthening (FSL) has been extensively examined in infant vocalizations in order to determine whether its basis is biological or learned. Findings suggest there may be a U-shaped developmental trajectory for FSL. The present study sought to verify this pattern and to determine whether vocal maturity and deafness influence FSL. Eight normally hearing infants, aged 0;3 to 1;0, and eight deaf infants, aged 0;8 to 4;0, were examined at three levels of prelinguistic vocal development: precanonical, canonical, and postcanonical. FSL was found at all three levels suggesting a biological basis for this phenomenon. Individual variability was, however, considerable. Reduction in the magnitude of FSL across the three sessions provided some support for a downward trend for FSL in infancy. Findings further indicated that auditory deprivation can significantly affect temporal aspects of infant speech production.
Paul, Nilanjan; Kumar, Suman; Chatterjee, Indranil; Mukherjee, Biswarup
2011-01-01
In-depth study on laryngeal biomechanics and vocal fold vibratory patterns reveal that a single vibratory cycle can be divided into two major phases, the closed and open phase, which is subdivided into opening and closing phases. Studies reveal that the relative time course of abduction and adduction, which in turn is dependent on the relative relaxing and tensing of the vocal fold cover and body, to be the determining factor in production of a particular vocal register like the modal (or chest), falsetto, glottal fry registers. Studies further point out Electroglottography to be particularly suitable for the study of vocal vibratory patterns during register changes. However, to date, there has been limited study on quantitative parameterization of EGG wave form in vocal fry register. Moreover, contradictory findings abound in literature regarding effects of gender and vowel types on vocal vibratory patterns, especially during phonation at different registers. The present study endeavors to find out the effects of vowel and gender differences on the vocal fold vibratory patterns in different registers and how these would be reflected in standard EGG parameters of Contact Quotient (CQ) and Contact Index (CI), taking into consideration the Indian sociolinguistic context. Electroglottographic recordings of 10 young adults (5 males and 5 females) were taken while the subjects phonated the three vowels /a/,/i/,/u/ each in two vocal registers, modal and vocal fry. Obtained raw EGG were further normalized using the Derived EGG algorithm and theCQ and CI values were derived. Obtained data were subject to statistical analysis using the 3-way ANOVA with gender, vowel and vocal register as the three variables. Post-hoc Dunnett C multiple comparison analysis were also performed. Results reveal that CQ values are significantly higher in vocal fry than modal phonation for both males and females, indicating a relatively hyperconstricted vocal system during vocal fry. The males have significantly greater CQ values than females both at modal and vocal fry phonations which indicate that the males are predisposed to greater vocal fold constriction. Females demonstrated no significant increase in CI values in vocal fry state; and in some cases actually decrease in the CI values which suggest an inherently distinct vocal fold physiological adjustment from that in males. No vowel effects were found in any conditions. Perturbation values (CQP and CIP) are significantly more in vocal fry register than in modal register, and the increase was more in case of females than males. The findings give strong evidence to certain hypotheses in literature regarding effects of vowel, gender and phonatory register on vocal fold vibratory patterns.
Hamaguchi, Kosuke; Mooney, Richard
2012-01-01
Complex brain functions, such as the capacity to learn and modulate vocal sequences, depend on activity propagation in highly distributed neural networks. To explore the synaptic basis of activity propagation in such networks, we made dual in vivo intracellular recordings in anesthetized zebra finches from the input (nucleus HVC) and output (lateral magnocellular nucleus of the anterior nidopallium (LMAN)) neurons of a songbird cortico-basal ganglia (BG) pathway necessary to the learning and modulation of vocal motor sequences. These recordings reveal evidence of bidirectional interactions, rather than only feedforward propagation of activity from HVC to LMAN, as had been previously supposed. A combination of dual and triple recording configurations and pharmacological manipulations was used to map out circuitry by which activity propagates from LMAN to HVC. These experiments indicate that activity travels to HVC through at least two independent ipsilateral pathways, one of which involves fast signaling through a midbrain dopaminergic cell group, reminiscent of recurrent mesocortical loops described in mammals. We then used in vivo pharmacological manipulations to establish that augmented LMAN activity is sufficient to restore high levels of sequence variability in adult birds, suggesting that recurrent interactions through highly distributed forebrain – midbrain pathways can modulate learned vocal sequences. PMID:22915110
Acoustic characteristics of simulated respiratory-induced vocal tremor.
Lester, Rosemary A; Story, Brad H
2013-05-01
The purpose of this study was to investigate the relation of respiratory forced oscillation to the acoustic characteristics of vocal tremor. Acoustical analyses were performed to determine the characteristics of the intensity and fundamental frequency (F0) for speech samples obtained by Farinella, Hixon, Hoit, Story, and Jones (2006) using a respiratory forced oscillation paradigm with 5 healthy adult males to simulate vocal tremor involving respiratory pressure modulation. The analyzed conditions were sustained productions of /a/ with amplitudes of applied pressure of 0, 1, 2, and 4 cmH2O and a rate of 5 Hz. Forced oscillation of the respiratory system produced modulation of the intensity and F0 for all participants. Variability was observed between participants and conditions in the change in intensity and F0 per unit of pressure change, as well as in the mean intensity and F0. However, the extent of modulation of intensity and F0 generally increased as the applied pressure increased, as would be expected. These findings suggest that individuals develop idiosyncratic adaptations to pressure modulations, which are important to understanding aspects of variability in vocal tremor, and highlight the need to assess all components of the speech mechanism that may be directly or indirectly affected by tremor.
Measures of native and non-native rhythm in a quantity language.
Stockmal, Verna; Markus, Dace; Bond, Dzintra
2005-01-01
The traditional phonetic classification of language rhythm as stress-timed or syllable-timed is attributed to Pike. Recently, two different proposals have been offered for describing the rhythmic structure of languages from acoustic-phonetic measurements. Ramus has suggested a metric based on the proportion of vocalic intervals and the variability (SD) of consonantal intervals. Grabe has proposed Pairwise Variability Indices (nPVI, rPVI) calculated from the differences in vocalic and consonantal durations between successive syllables. We have calculated both the Ramus and Grabe metrics for Latvian, traditionally considered a syllable rhythm language, and for Latvian as spoken by Russian learners. Native speakers and proficient learners were very similar whereas low-proficiency learners showed high variability on some properties. The metrics did not provide an unambiguous classification of Latvian.
Sigafoos, Jeff; Didden, Robert; O'Reilly, Mark
2003-01-01
We evaluated the role of digitized speech output on the maintenance of requesting and frequency of vocalizations in three children with developmental disabilities. The children were taught to request access to preferred objects using an augmentative communication speech-generating device (SGD). Following acquisition, rates of requesting and vocalizations were compared across two conditions (speech output on versus speech output off) that were alternated on a session-by-session basis. There were no major or consistent differences across the two conditions for the three children, suggesting that access to preferred objects was the critical variable maintaining use of the SGDs. The results also suggest that feedback in the form of digitized speech from the SGD did not inhibit vocalizations. One child began to speak single words during the latter part of the study, suggesting that in some cases AAC intervention involving SGDs may facilitate speech.
Biosimulation of Inflammation and Healing in Surgically Injured Vocal Folds
Li, Nicole Y. K.; Vodovotz, Yoram; Hebda, Patricia A.; Abbott, Katherine Verdolini
2010-01-01
Objectives The pathogenesis of vocal fold scarring is complex and remains to be deciphered. The current study is part of research endeavors aimed at applying systems biology approaches to address the complex biological processes involved in the pathogenesis of vocal fold scarring and other lesions affecting the larynx. Methods We developed a computational agent-based model (ABM) to quantitatively characterize multiple cellular and molecular interactions involved in inflammation and healing in vocal fold mucosa after surgical trauma. The ABM was calibrated with empirical data on inflammatory mediators (eg, tumor necrosis factor) and extracellular matrix components (eg, hyaluronan) from published studies on surgical vocal fold injury in the rat population. Results The simulation results reproduced and predicted trajectories seen in the empirical data from the animals. Moreover, the ABM studies suggested that hyaluronan fragments might be the clinical surrogate of tissue damage, a key variable that in these simulations both is enhanced by and further induces inflammation. Conclusions A relatively simple ABM such as the one reported in this study can provide new understanding of laryngeal wound healing and generate working hypotheses for further wet-lab studies. PMID:20583741
Combining Multiobjective Optimization and Cluster Analysis to Study Vocal Fold Functional Morphology
Palaparthi, Anil; Riede, Tobias
2017-01-01
Morphological design and the relationship between form and function have great influence on the functionality of a biological organ. However, the simultaneous investigation of morphological diversity and function is difficult in complex natural systems. We have developed a multiobjective optimization (MOO) approach in association with cluster analysis to study the form-function relation in vocal folds. An evolutionary algorithm (NSGA-II) was used to integrate MOO with an existing finite element model of the laryngeal sound source. Vocal fold morphology parameters served as decision variables and acoustic requirements (fundamental frequency, sound pressure level) as objective functions. A two-layer and a three-layer vocal fold configuration were explored to produce the targeted acoustic requirements. The mutation and crossover parameters of the NSGA-II algorithm were chosen to maximize a hypervolume indicator. The results were expressed using cluster analysis and were validated against a brute force method. Results from the MOO and the brute force approaches were comparable. The MOO approach demonstrated greater resolution in the exploration of the morphological space. In association with cluster analysis, MOO can efficiently explore vocal fold functional morphology. PMID:24771563
Biosimulation of inflammation and healing in surgically injured vocal folds.
Li, Nicole Y K; Vodovotz, Yoram; Hebda, Patricia A; Abbott, Katherine Verdolini
2010-06-01
The pathogenesis of vocal fold scarring is complex and remains to be deciphered. The current study is part of research endeavors aimed at applying systems biology approaches to address the complex biological processes involved in the pathogenesis of vocal fold scarring and other lesions affecting the larynx. We developed a computational agent-based model (ABM) to quantitatively characterize multiple cellular and molecular interactions involved in inflammation and healing in vocal fold mucosa after surgical trauma. The ABM was calibrated with empirical data on inflammatory mediators (eg, tumor necrosis factor) and extracellular matrix components (eg, hyaluronan) from published studies on surgical vocal fold injury in the rat population. The simulation results reproduced and predicted trajectories seen in the empirical data from the animals. Moreover, the ABM studies suggested that hyaluronan fragments might be the clinical surrogate of tissue damage, a key variable that in these simulations both is enhanced by and further induces inflammation. A relatively simple ABM such as the one reported in this study can provide new understanding of laryngeal wound healing and generate working hypotheses for further wet-lab studies.
Berg Soto, Alvaro; Marsh, Helene; Everingham, Yvette; Smith, Joshua N; Parra, Guido J; Noad, Michael
2014-08-01
Australian snubfin and Indo-Pacific humpback dolphins co-occur throughout most of their range in coastal waters of tropical Australia. Little is known of their ecology or acoustic repertoires. Vocalizations from humpback and snubfin dolphins were recorded in two locations along the Queensland coast during 2008 and 2010 to describe their vocalizations and evaluate the acoustic differences between these two species. Broad vocalization types were categorized qualitatively. Both species produced click trains burst pulses and whistles. Principal component analysis of the nine acoustic variables extracted from the whistles produced nine principal components that were input into discriminant function analyses to classify 96% of humpback dolphin whistles and about 78% of snubfin dolphin calls correctly. Results indicate clear acoustic differences between the vocal whistle repertoires of these two species. A stepwise routine identified two principal components as significantly distinguishable between whistles of each species: frequency parameters and frequency trend ratio. The capacity to identify these species using acoustic monitoring techniques has the potential to provide information on presence/absence, habitat use and relative abundance for each species.
Vocal fold pseudocyst: results of 46 cases undergoing a uniform treatment algorithm.
Estes, Christine; Sulica, Lucian
2014-05-01
To describe treatment results and identify predictors of the need for surgical intervention in patients with vocal fold pseudocyst. Retrospective cohort study with longitudinal followup via survey. Clinical records were reviewed for demographic information, VHI-10 score, and degree of severity of dysphonia. Videostroboscopic examinations were evaluated for presence of vocal fold pseudocyst, along with additional clinical variables, including laterality, reactive lesion, paresis, varix, and hemorrhage. Follow-up surveys were sent to all participants to evaluate current VHI-10 score and degree of vocal limitation. Results were analyzed to determine predictors of surgery and recurrence of pathology. Forty-six patients (41F:5M) with pseudocyst (40 unilateral: 6 bilateral) were reviewed. Twenty-three (50%) had reactive lesions, nineteen (41%) had paresis by clinical criteria, 10 (22%) had varices, and 6 (13%) had hemorrhage on examination. All underwent initial behavioral management (2-12 sessions of voice therapy; mean of 8 sessions). Seventeen (37%) eventually required surgical intervention. No demographic or clinical variables proved predictive of surgical intervention. Follow-up surveys were completed by 63% of patients, and 79% agreed with the statement that they were not professionally limited by their voices. This experience supports behavioral management as an initial intervention in patients with pseudocyst, sufficient by itself to restore vocal function in approximately two out of three patients. Neither initial severity nor any of the studied clinical findings predicted the need for surgery. The large majority of patients with pseudocyst are able to be treated effectively without impact in their professional function. © 2013 The American Laryngological, Rhinological and Otological Society, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bouchard, Kristofer E.; Conant, David F.; Anumanchipalli, Gopala K.
A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial-especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship acrossmore » speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics.« less
Laukkanen, Anne-Maria; Titze, Ingo R.; Hoffman, Henry; Finnegan, Eileen
2015-01-01
Voice training exploits semiocclusives, which increase vocal tract interaction with the source. Modeling results suggest that vocal economy (maximum flow declination rate divided by maximum area declination rate, MADR) is improved by matching the glottal and vocal tract impedances. Changes in MADR may be correlated with thyroarytenoid (TA) muscle activity. Here the effects of impedance matching are studied for laryngeal muscle activity and glottal resistance. One female repeated [pa:p:a] before and immediately after (a) phonation into different-sized tubes and (b) voiced bilabial fricative [β:]. To allow estimation of subglottic pressure from the oral pressure, [p] was inserted also in the repetitions of the semiocclusions. Airflow was registered using a flow mask. EMG was registered from TA, cricothyroid (CT) and lateral cricoarytenoid (LCA) muscles. Phonation was simulated using a 7 × 5 × 5 point-mass model of the vocal folds, allowing inputs of simulated laryngeal muscle activation. The variables were TA, CT and LCA activities. Increased vocal tract impedance caused the subject to raise TA activity compared to CT and LCA activities. Computer simulation showed that higher glottal economy and efficiency (oral radiated power divided by aerodynamic power) were obtained with a higher TA/CT ratio when LCA activity was tuned for ideal adduction. PMID:19011306
Anumanchipalli, Gopala K.; Dichter, Benjamin; Chaisanguanthum, Kris S.; Johnson, Keith; Chang, Edward F.
2016-01-01
A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial—especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship across speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics. PMID:27019106
Vasconcelos, Maria J M; Ventura, Sandra M R; Freitas, Diamantino R S; Tavares, João Manuel R S
2012-03-01
The morphological and dynamic characterisation of the vocal tract during speech production has been gaining greater attention due to the motivation of the latest improvements in magnetic resonance (MR) imaging; namely, with the use of higher magnetic fields, such as 3.0 Tesla. In this work, the automatic study of the vocal tract from 3.0 Tesla MR images was assessed through the application of statistical deformable models. Therefore, the primary goal focused on the analysis of the shape of the vocal tract during the articulation of European Portuguese sounds, followed by the evaluation of the results concerning the automatic segmentation, i.e. identification of the vocal tract in new MR images. In what concerns speech production, this is the first attempt to automatically characterise and reconstruct the vocal tract shape of 3.0 Tesla MR images by using deformable models; particularly, by using active and appearance shape models. The achieved results clearly evidence the adequacy and advantage of the automatic analysis of the 3.0 Tesla MR images of these deformable models in order to extract the vocal tract shape and assess the involved articulatory movements. These achievements are mostly required, for example, for a better knowledge of speech production, mainly of patients suffering from articulatory disorders, and to build enhanced speech synthesizer models.
Bouchard, Kristofer E.; Conant, David F.; Anumanchipalli, Gopala K.; ...
2016-03-28
A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial-especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship acrossmore » speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics.« less
Costa, Giulia; Morelli, Micaela
2015-01-01
Background: Rats emit 50kHz ultrasonic vocalizations (USVs) in response to either natural or pharmacological pleasurable stimuli, and these USVs have emerged as a new behavioral measure for investigating the motivational properties of drugs. Earlier studies have indicated that activation of the dopaminergic system is critically involved in 50kHz USV emissions. However, evidence also exists that non-dopaminergic neurotransmitters participate in this behavioral response. Methods: To ascertain whether glutamate transmission plays a role in 50kHz USV emissions stimulated by amphetamine, rats received five amphetamine (1–2mg/kg, i.p.) administrations on alternate days in a test cage, either alone or combined with the glutamate N-methyl-D-aspartate receptor antagonist MK-801 (0.1–0.5mg/kg, i.p.). Seven days after treatment discontinuation, rats were re-exposed to the test cage to assess drug conditioning, and afterwards received a drug challenge. USVs and locomotor activity were evaluated, along with immunofluorescence for Zif-268 in various brain regions and spontaneous alternation in a Y maze. Results: Amphetamine-treated rats displayed higher 50kHz USV emissions and locomotor activity than vehicle-treated rats, and emitted conditioned vocalizations on test cage re-exposure. Rats co-administered amphetamine and MK-801 displayed lower and dose-dependent 50kHz USV emissions, but not lower locomotor activity, during repeated treatment and challenge, and scarce conditioned vocalization compared with amphetamine-treated rats. These effects were associated with lower levels of Zif-268 after amphetamine challenge and spontaneous alternation deficits. Conclusions: These results indicate that glutamate transmission participates in the acute, long-term, and conditioned effects of amphetamine on 50kHz USVs, possibly by influencing amphetamine-induced long-term neuronal changes and/or amphetamine-associated memories. PMID:25991653
Harbor Seal (Phoca vitulina) Reproductive Advertisement Behavior and the Effects of Vessel Noise
NASA Astrophysics Data System (ADS)
Matthews, Leanna P.
Harbor seals (Phoca vitulina) are a widely distributed pinniped species that mate underwater. Similar to other aquatically mating pinnipeds, male harbor seals produce vocalizations during the breeding season that function in male-male interactions and possibly as an attractant for females. I investigated multiple aspects of these reproductive advertisement displays in a population of harbor seals in Glacier Bay National Park and Preserve, Alaska. First, I looked at vocal production as a function of environmental variables, including season, daylight, and tidal state. Vocalizations were highly seasonal and detection of these vocalizations peaked in June and July, which correspond with the estimated time of breeding. Vocalizations also varied with light, with the lowest probability of detection during the day and the highest probability of detection at night. The high probability of detection corresponded to when females are known to forage. These results are similar to the vocal behavior of previously studied populations. However, unlike previously studied populations, the detection of harbor seal breeding vocalizations did not vary with tidal state. This is likely due to the location of the hydrophone, as it was not near the haul out and depth was therefore not significantly influenced by changes in tidal height. I also investigated the source levels and call parameters of vocalizations, as well as call rate and territoriality. The average source level of harbor seal breeding vocalizations was 144 dB re 1 ?Pa at 1 m and measurements ranged from 129 to 149 dB re 1 ?Pa. Analysis of call parameters indicated that vocalizations of harbor seals in Glacier Bay were similar in duration to other populations, but were much lower in frequency. During the breeding season, there were two discrete calling areas that likely represent two individual males; the average call rate in these display areas was approximately 1 call per minute. The harbor seal breeding season also overlaps with peak tourism in Glacier Bay, and the majority of tourists visit the park on a motorized vessel. Because of this overlap, I investigated the impacts of vessel noise on the vocal behavior of individual males. In the presence of vessel noise, male harbor seals increase the amplitude of their vocalizations, decrease the duration, and increase the minimum frequency. These vocal shifts are similar to studies of noise impacts on other species across taxa, but it is unknown how this could impact the reproductive success of male harbor seals. Finally, I looked at the role of female preference for male vocalizations. Using playbacks of male vocalizations to captive female harbor seals, I found that females have a higher response to vocalizations that correspond to dominant males. Females were less responsive to subordinate male vocalizations, which had a shorter duration and a higher frequency. Given that male harbor seals decrease the duration and increase the frequency of vocalizations in the presence of noise, it is possible that these vocalizations become less attractive in noise.
Sielska-Badurek, Ewelina M; Sobol, Maria; Jędra, Katarzyna; Rzepakowska, Anna; Osuch-Wójcikiewicz, Ewa; Niemczyk, Kazimierz
2017-09-01
Injection laryngoplasty (glottis augmentation) is the preferred method in surgical management of unilateral vocal fold paralysis (UVFP). Traditionally, these procedures are performed in the operating room. Nowadays, however, these procedures have moved into the office. To evaluate the voice quality after transoral injection laryngoplasty under local anaesthesia in patients with unilateral vocal fold paralysis. Fourteen subjects (5 women and 9 men) with unilateral vocal fold paresis (9 with right vocal fold paresis and 5 with left vocal fold paresis) were included in the study. The mean age of the group was 57.8 ±19.0 years (32-83 years). All of the injection laryngoplasties were performed transorally, under local anaesthesia. The injection material was calcium hydroxylapatite. Before and 1, 3 and 6 months after the procedure the following variables were evaluated: voice perception, videostroboscopy, acoustic analysis, aerodynamic evaluation, and the subjective rating of the voice quality by the patient. After injection laryngoplasty, complete glottal closure was achieved or there was a significant improvement in the glottal closure of each subject. We noted great improvement in the post-injection objective and subjective voice outcomes and patients reported improvement in the voice-related quality of life. The transoral approach for injection laryngoplasty under local anaesthesia is an effective and safe way to treat incomplete glottal closure in patients with UVFP. The transoral approach is an efficient alternative to other surgical techniques used for vocal fold injection.
Azul, David
2016-11-01
Transmasculine people assigned female gender at birth but who do not identify with this classification have traditionally received little consideration in the voice literature. Existing analyses tend to be focused on evaluating speaker voice characteristics, whereas other factors that contribute to the production of vocal gender have remained underexplored. Most studies rely on researcher-centred perspectives, whereas very little is known about how transmasculine people themselves experience and make sense of their vocal situations. To explore how participants described their subjective gender positionings; which gender attributions they wished to receive from others; which gender they self-attributed to their voices; which gender attributions they had received from others; and how far participants were satisfied with the gender-related aspects of their vocal situations. Transcripts of semi-structured interviews with 14 German-speaking transmasculine people served as the original data corpus. Sections in which participants described the gender-related aspects of their vocal situations and that were relevant to the current research objectives were selected and explored using qualitative content analysis. The analysis revealed diverse accounts pertaining to the factors that contribute to the production of vocal gender for individual participants and variable levels of satisfaction with vocal gender presentation and attribution. Transmasculine people need to be regarded as a heterogeneous population and clinical practice needs to follow a client-centred, individualized approach. © 2016 Royal College of Speech and Language Therapists.
Sielska-Badurek, Ewelina M.; Jędra, Katarzyna; Rzepakowska, Anna; Osuch-Wójcikiewicz, Ewa; Niemczyk, Kazimierz
2017-01-01
Introduction Injection laryngoplasty (glottis augmentation) is the preferred method in surgical management of unilateral vocal fold paralysis (UVFP). Traditionally, these procedures are performed in the operating room. Nowadays, however, these procedures have moved into the office. Aim To evaluate the voice quality after transoral injection laryngoplasty under local anaesthesia in patients with unilateral vocal fold paralysis. Material and methods Fourteen subjects (5 women and 9 men) with unilateral vocal fold paresis (9 with right vocal fold paresis and 5 with left vocal fold paresis) were included in the study. The mean age of the group was 57.8 ±19.0 years (32–83 years). All of the injection laryngoplasties were performed transorally, under local anaesthesia. The injection material was calcium hydroxylapatite. Before and 1, 3 and 6 months after the procedure the following variables were evaluated: voice perception, videostroboscopy, acoustic analysis, aerodynamic evaluation, and the subjective rating of the voice quality by the patient. Results After injection laryngoplasty, complete glottal closure was achieved or there was a significant improvement in the glottal closure of each subject. We noted great improvement in the post-injection objective and subjective voice outcomes and patients reported improvement in the voice-related quality of life. Conclusions The transoral approach for injection laryngoplasty under local anaesthesia is an effective and safe way to treat incomplete glottal closure in patients with UVFP. The transoral approach is an efficient alternative to other surgical techniques used for vocal fold injection. PMID:29062449
Treatment outcomes for professional voice users.
Wingate, Judith M; Brown, William S; Shrivastav, Rahul; Davenport, Paul; Sapienza, Christine M
2007-07-01
Professional voice users comprise 25% to 35% of the U.S. working population. Their voice problems may interfere with job performance and impact costs for both employers and employees. The purpose of this study was to examine treatment outcomes of two specific rehabilitation programs for a group of professional voice users. Eighteen professional voice users participated in this study; half had complaints of throat pain or vocal fatigue (Dysphonia Group), and half were found to have benign vocal fold lesions (Lesion Group). One group received 5 weeks of expiratory muscle strength training followed by six sessions of traditional voice therapy. Treatment order was reversed for the second group. The study was designed as a repeated measures study with independent variables of treatment order, laryngeal diagnosis (lesion vs non-lesion), gender, and time. Dependent variables included maximum expiratory pressure (MEP), Voice Handicap Index (VHI) score, Vocal Rating Scale (VRS) score, Voice Effort Scale score, phonetogram measures, subglottal pressures, and acoustic and perceptual measures. Results showed significant improvements in MEP, VHI scores, and VRS scores, subglottal pressure for loud intensity, phonetogram area, and dynamic range. No significant difference was found between laryngeal diagnosis groups. A significant difference was not observed for treatment order. It was concluded that the combined treatment was responsible for the improvements observed. The results indicate that a combined modality treatment may be successful in the remediation of vocal problems for professional voice users.
Cygan, Fanny; Oudiette, Delphine; Leclair-Visonneau, Laurène; Leu-Semenescu, Smaranda; Arnulf, Isabelle
2010-12-15
The video-polysomnographic criteria of REM sleep behavior disorder (RBD) have not been well described. We evaluated the between-night reproducibility of phasic and tonic enhanced muscle activity during REM sleep as well as the associated behaviors and vocalizations of the patients. Fifteen patients with clinical RBD underwent two consecutive video-polysomnographies. The amount of excessive phasic and tonic chin muscle activity during REM sleep was measured in 15 patients in 3-sec mini-epochs. The time spent with motor (minor, major, complex, and scenic) or vocal (sounds, mumblings, and comprehensible speeches) events was measured in 7 patients during REM sleep. There was a good between-night agreement for tonic (Spearman rho = 0.55, p = 0.03; Kendall tau = 0.48, p = 0.01) but not for phasic (rho = 0.47, p = 0.1; tau = 0.31, p = 0.1) excessive chin muscle activity. On the video and audio recordings, the minor RBD behaviors tended to occur more frequently during the second night than the first, whereas the patients spoke longer during the first than the second night. The excessive tonic activity during REM sleep is a reliable marker of RBD. It could represent the extent of dysfunction in the permissive atonia systems. In contrast, the more variable phasic activity and motor/vocal events could be more dependent on dream content (executive systems).
Iyer, Suneeti Nathani; Oller, D. Kimbrough
2010-01-01
Little research has been conducted on the development of suprasegmental characteristics of vocalizations in typically developing infants (TDI) and the role of audition in the development of these characteristics. The purpose of the present study was to examine the longitudinal development of fundamental frequency (F0) in eight TDI and eight infants with severe-to-profound hearing loss matched for level of vocal development. Results revealed no significant changes in F0 with advances in pre-language vocal development for TDI. Infants with hearing loss, however, showed a statistically reliable higher variability of F0 than TDI, when age was accounted for as a covariate. The results suggest development of F0 may be strongly influenced by audition. PMID:19031191
Vocal handicap index in popular and erudite professional singers.
Loiola-Barreiro, Camila Miranda; Silva, Marta Assumpção de Andrada E
To compare the voice handicap index of popular and erudite professional singers according to gender, age, professional experience time, and presence or absence of self-reported vocal complaints. One hundred thirty-two professional singers, 74 popular and 58 erudite, who responded to a questionnaire with regards to identification, age, gender, professional experience time in singing, musical genres (for popular singers), vocal classification (for erudite singers), presence of self-reported vocal complaints, and the specific protocols for popular (Modern Singing Handicap Index - MSHI) and erudite (Classical Singing Handicap Index - CSHI) singing. Higher proportion of women and higher incidence of vocal complaints were observed in the popular singers compared with the erudite singers. Most of the popular singers belonged to the genre of Brazilian Popular Music. Regarding the classification of erudite singers, there was greater participation of sopranos and tenors. No statistical differences were observed with respect to age and professional experience time between the groups. Comparison of the MSHI and CSHI scores showed no statistically significant difference between these scores and genre or age in both groups of singers. Professional experience time was related to the total score and the subscales disability and impairment in the MSHI, only for popular singers with vocal complaints. There was no correlation between these variables and the CSHI for erudite singers. The impact of vocal difficulty/problem interferes differently in these two musical genres when related to vocal complaint and professional experience time. The MSHI and CSHI protocols proved to be important tools not only for the identification of problems, but also for the understanding of how these individuals relate their voices with this occupational activity.
Watts, Christopher; Barnes-Burroughs, Kathryn; Estis, Julie; Blanton, Debra
2006-03-01
A growing body of contemporary research has investigated differences between trained and untrained singing voices. However, few studies have separated untrained singers into those who do and do not express abilities related to singing talent, including accurate pitch control and production of a pleasant timbre (voice quality). This investigation studied measures of the singing power ratio (SPR), which is a quantitative measure of the resonant quality of the singing voice. SPR reflects the amplification or suppression in the vocal tract of the harmonics produced by the sound source. This measure was acquired from the voices of untrained talented and nontalented singers as a means to objectively investigate voice quality differences. Measures of SPR were acquired from vocal samples with fast Fourier transform (FFT) power spectra to analyze the amplitude level of the partials in the acoustic spectrum. Long-term average spectra (LTAS) were also analyzed. Results indicated significant differences in SPR between groups, which suggest that vocal tract resonance, and its effect on perceived vocal timbre or quality, may be an important variable related to the perception of singing talent. LTAS confirmed group differences in the tuning of vocal tract harmonics.
An Automated Procedure for Evaluating Song Imitation
Mandelblat-Cerf, Yael; Fee, Michale S.
2014-01-01
Songbirds have emerged as an excellent model system to understand the neural basis of vocal and motor learning. Like humans, songbirds learn to imitate the vocalizations of their parents or other conspecific “tutors.” Young songbirds learn by comparing their own vocalizations to the memory of their tutor song, slowly improving until over the course of several weeks they can achieve an excellent imitation of the tutor. Because of the slow progression of vocal learning, and the large amounts of singing generated, automated algorithms for quantifying vocal imitation have become increasingly important for studying the mechanisms underlying this process. However, methodologies for quantifying song imitation are complicated by the highly variable songs of either juvenile birds or those that learn poorly because of experimental manipulations. Here we present a method for the evaluation of song imitation that incorporates two innovations: First, an automated procedure for selecting pupil song segments, and, second, a new algorithm, implemented in Matlab, for computing both song acoustic and sequence similarity. We tested our procedure using zebra finch song and determined a set of acoustic features for which the algorithm optimally differentiates between similar and non-similar songs. PMID:24809510
Effects of subsampling of passive acoustic recordings on acoustic metrics.
Thomisch, Karolin; Boebel, Olaf; Zitterbart, Daniel P; Samaran, Flore; Van Parijs, Sofie; Van Opzeeland, Ilse
2015-07-01
Passive acoustic monitoring is an important tool in marine mammal studies. However, logistics and finances frequently constrain the number and servicing schedules of acoustic recorders, requiring a trade-off between deployment periods and sampling continuity, i.e., the implementation of a subsampling scheme. Optimizing such schemes to each project's specific research questions is desirable. This study investigates the impact of subsampling on the accuracy of two common metrics, acoustic presence and call rate, for different vocalization patterns (regimes) of baleen whales: (1) variable vocal activity, (2) vocalizations organized in song bouts, and (3) vocal activity with diel patterns. To this end, above metrics are compared for continuous and subsampled data subject to different sampling strategies, covering duty cycles between 50% and 2%. The results show that a reduction of the duty cycle impacts negatively on the accuracy of both acoustic presence and call rate estimates. For a given duty cycle, frequent short listening periods improve accuracy of daily acoustic presence estimates over few long listening periods. Overall, subsampling effects are most pronounced for low and/or temporally clustered vocal activity. These findings illustrate the importance of informed decisions when applying subsampling strategies to passive acoustic recordings or analyses for a given target species.
Intra- and intergroup vocal behavior in resident killer whales, Orcinus orca.
Weiss, Brigitte M; Symonds, Helena; Spong, Paul; Ladich, Friedrich
2007-12-01
Vocal communication within and between groups of individuals has been described extensively in birds and terrestrial mammals, however, little is known about how cetaceans utilize their sounds in their natural environment. Resident killer whales, Orcinus orca, live in highly stable matrilines and exhibit group-specific vocal dialects. Single call types cannot exclusively be associated with particular behaviors and calls are thought to function in group identification and intragroup communication. In the present study call usage of three closely related matrilines of the Northern resident community was compared in various intra- and intergroup contexts. In two out of the three matrilines significant changes in vocal behavior depending both on the presence and identity of accompanying whales were found. Most evidently, family-specific call subtypes, as well as aberrant and variable calls, were emitted at higher rates, whereas "low arousal" call types were used less in the presence of matrilines from different pods, subclans, or clans. Ways in which the observed changes may function both in intra- and intergroup communication.
An investigation of articulatory setting using real-time magnetic resonance imaging
Ramanarayanan, Vikram; Goldstein, Louis; Byrd, Dani; Narayanan, Shrikanth S.
2013-01-01
This paper presents an automatic procedure to analyze articulatory setting in speech production using real-time magnetic resonance imaging of the moving human vocal tract. The procedure extracts frames corresponding to inter-speech pauses, speech-ready intervals and absolute rest intervals from magnetic resonance imaging sequences of read and spontaneous speech elicited from five healthy speakers of American English and uses automatically extracted image features to quantify vocal tract posture during these intervals. Statistical analyses show significant differences between vocal tract postures adopted during inter-speech pauses and those at absolute rest before speech; the latter also exhibits a greater variability in the adopted postures. In addition, the articulatory settings adopted during inter-speech pauses in read and spontaneous speech are distinct. The results suggest that adopted vocal tract postures differ on average during rest positions, ready positions and inter-speech pauses, and might, in that order, involve an increasing degree of active control by the cognitive speech planning mechanism. PMID:23862826
Northrup, Jessie B.; Iverson, Jana M.
2015-01-01
This study examined vocal coordination during mother-infant interactions in the infant siblings (high risk infants; HR) of children with autism spectrum disorder (ASD), a population at heightened risk for developing language delays. Vocal coordination between mothers and HR infants was compared to a group of low risk (LR; no first- or second-degree relative with ASD) dyads, and used to predict later language development. Nine-month-old infants were videotaped at home playing with their mothers, and interactions were coded for the frequency and timing of vocalizations. Percent infant simultaneous speech was predictive of later language delay (LD), and dyads with LD infants were less coordinated with one another in average latency to respond than dyads with non-delayed (ND) infants. The degree of coordination between mothers and infants on this variable predicted a continuous measure of language development in the third year. This research underscores the importance of understanding early development in the context of interaction. PMID:26345517
Vocal economy in vocally trained actresses and untrained female subjects.
Master, Suely; Guzman, Marco; Dowdall, Jayme
2013-11-01
Vocally trained actresses are expected to have more vocal economy than nonactresses. Therefore, we hypothesize that there will be differences in the electroglottogram-based voice economy parameter quasi-output cost ratio (QOCR) between actresses and nonactresses. This difference should remain across different levels of intensity. A total of 30 actresses and 30 nonactresses were recruited for this study. Participants from both groups were required to sustain the vowels /a/, /i/, and /u/, in habitual, moderate, and high intensity levels. Acoustic variables such as sound pressure level (SPL), fundamental frequency (F0), and glottal contact quotient (CQ) were obtained. The QOCR was then calculated. There were no significant differences among the groups for QOCR. Positive correlations were observed for QOCR versus SPL and QOCR versus F0 in all intensity levels. Negative correlation was found between QOCR and CQ in all intensity levels. Considering the differences among intensity levels, from habitual to moderate and from moderate to loud, only the CQ did not differ significantly. The QOCR, SPL, and F0 presented significant differences throughout the different intensity levels. The QOCR did not reflect the level of vocal training when comparing trained and nontrained female subjects in the present study. Both groups demonstrated more vocal economy in moderate and high intensity levels owing to more voice output without an increase in glottal adduction. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
RIEDE, TOBIAS
2014-01-01
Rodents produce highly variable ultrasound whistles as communication signals unlike many other mammals, who employ flow-induced vocal fold oscillations to produce sound. The role of larynx muscles in controlling sound features across different call types in ultrasound vocalization (USV) was investigated using laryngeal muscle electromyographic (EMG) activity, subglottal pressure measurements and vocal sound output in awake and spontaneously behaving Sprague–Dawley rats. Results support the hypothesis that glottal shape determines fundamental frequency. EMG activities of thyroarytenoid and cricothyroid muscles were aligned with call duration. EMG intensity increased with fundamental frequency. Phasic activities of both muscles were aligned with fast changing fundamental frequency contours, for example in trills. Activities of the sternothyroid and sternohyoid muscles, two muscles involved in vocal production in other mammals, are not critical for the production of rat USV. To test how stereotypic laryngeal and respiratory activity are across call types and individuals, sets of ten EMG and subglottal pressure parameters were measured in six different call types from six rats. Using discriminant function analysis, on average 80% of parameter sets were correctly assigned to their respective call type. This was significantly higher than the chance level. Since fundamental frequency features of USV are tightly associated with stereotypic activity of intrinsic laryngeal muscles and muscles contributing to build-up of subglottal pressure, USV provide insight into the neurophysiological control of peripheral vocal motor patterns. PMID:23423862
Portillo, María Priscilla; Rojas, Sandra; Guzman, Marco; Quezada, Camilo
2018-03-01
The present study aimed to observe whether physiological warm-up and traditional singing warm-up differently affect aerodynamic, electroglottographic, acoustic, and self-perceived parameters of voice in Contemporary Commercial Music singers. Thirty subjects were asked to perform a 15-minute session of vocal warm-up. They were randomly assigned to one of two types of vocal warm-up: physiological (based on semi-occluded exercises) or traditional (singing warm-up based on open vowel [a:]). Aerodynamic, electroglottographic, acoustic, and self-perceived voice quality assessments were carried out before (pre) and after (post) warm-up. No significant differences were found when comparing both types of vocal warm-up methods, either in subjective or in objective measures. Furthermore, the main positive effect observed in both groups when comparing pre and post conditions was a better self-reported quality of voice. Additionally, significant differences were observed for sound pressure level (decrease), glottal airflow (increase), and aerodynamic efficiency (decrease) in the traditional warm-up group. Both traditional and physiological warm-ups produce favorable voice sensations. Moreover, there are no evident differences in aerodynamic and electroglottographic variables when comparing both types of vocal warm-ups. Some changes after traditional warm-up (decreased intensity, increased airflow, and decreased aerodynamic efficiency) could imply an early stage of vocal fatigue. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Koda, Hiroki; Lemasson, Alban; Oyakawa, Chisako; Rizaldi; Pamungkas, Joko; Masataka, Nobuo
2013-01-01
Mother-infant vocal interactions play a crucial role in the development of human language. However, comparatively little is known about the maternal role during vocal development in nonhuman primates. Here, we report the first evidence of mother-daughter vocal interactions contributing to vocal development in gibbons, a singing and monogamous ape species. Gibbons are well known for their species-specific duets sung between mates, yet little is known about the role of intergenerational duets in gibbon song development. We observed singing interactions between free-ranging mothers and their sub-adult daughters prior to emigration. Daughters sang simultaneously with their mothers at different rates. First, we observed significant acoustic variation between daughters. Co-singing rates between mother and daughter were negatively correlated with the temporal precision of the song’s synchronization. In addition, songs of daughters who co-sang less with their mothers were acoustically more similar to the maternal song than any other adult female’s song. All variables have been reported to be influenced by social relationships of pairs. Therefore those correlations would be mediated by mother-daughter social relationship, which would be modifiable in daughter’s development. Here we hypothesized that daughters who co-sing less often, well-synchronize, and converge acoustically with the maternal acoustic pattern would be at a more advanced stage of social independence in sub-adult females prior to emigration. Second, we observed acoustic matching between mothers and daughters when co-singing, suggesting short-term vocal flexibility. Third, we found that mothers adjusted songs to a more stereotyped pattern when co-singing than when singing alone. This vocal adjustment was stronger for mothers with daughters who co-sang less. These results indicate the presence of socially mediated vocal flexibility in gibbon sub-adults and adults, and that mother-daughter co-singing interactions may enhance vocal development. More comparative work, notably longitudinal and experimental, is now needed to clarify maternal roles during song development. PMID:23951160
Coding of vocalizations by single neurons in ventrolateral prefrontal cortex.
Plakke, Bethany; Diltz, Mark D; Romanski, Lizabeth M
2013-11-01
Neuronal activity in single prefrontal neurons has been correlated with behavioral responses, rules, task variables and stimulus features. In the non-human primate, neurons recorded in ventrolateral prefrontal cortex (VLPFC) have been found to respond to species-specific vocalizations. Previous studies have found multisensory neurons which respond to simultaneously presented faces and vocalizations in this region. Behavioral data suggests that face and vocal information are inextricably linked in animals and humans and therefore may also be tightly linked in the coding of communication calls in prefrontal neurons. In this study we therefore examined the role of VLPFC in encoding vocalization call type information. Specifically, we examined previously recorded single unit responses from the VLPFC in awake, behaving rhesus macaques in response to 3 types of species-specific vocalizations made by 3 individual callers. Analysis of responses by vocalization call type and caller identity showed that ∼19% of cells had a main effect of call type with fewer cells encoding caller. Classification performance of VLPFC neurons was ∼42% averaged across the population. When assessed at discrete time bins, classification performance reached 70 percent for coos in the first 300 ms and remained above chance for the duration of the response period, though performance was lower for other call types. In light of the sub-optimal classification performance of the majority of VLPFC neurons when only vocal information is present, and the recent evidence that most VLPFC neurons are multisensory, the potential enhancement of classification with the addition of accompanying face information is discussed and additional studies recommended. Behavioral and neuronal evidence has shown a considerable benefit in recognition and memory performance when faces and voices are presented simultaneously. In the natural environment both facial and vocalization information is present simultaneously and neural systems no doubt evolved to integrate multisensory stimuli during recognition. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives". Copyright © 2013 Elsevier B.V. All rights reserved.
Koda, Hiroki; Lemasson, Alban; Oyakawa, Chisako; Rizaldi; Pamungkas, Joko; Masataka, Nobuo
2013-01-01
Mother-infant vocal interactions play a crucial role in the development of human language. However, comparatively little is known about the maternal role during vocal development in nonhuman primates. Here, we report the first evidence of mother-daughter vocal interactions contributing to vocal development in gibbons, a singing and monogamous ape species. Gibbons are well known for their species-specific duets sung between mates, yet little is known about the role of intergenerational duets in gibbon song development. We observed singing interactions between free-ranging mothers and their sub-adult daughters prior to emigration. Daughters sang simultaneously with their mothers at different rates. First, we observed significant acoustic variation between daughters. Co-singing rates between mother and daughter were negatively correlated with the temporal precision of the song's synchronization. In addition, songs of daughters who co-sang less with their mothers were acoustically more similar to the maternal song than any other adult female's song. All variables have been reported to be influenced by social relationships of pairs. Therefore those correlations would be mediated by mother-daughter social relationship, which would be modifiable in daughter's development. Here we hypothesized that daughters who co-sing less often, well-synchronize, and converge acoustically with the maternal acoustic pattern would be at a more advanced stage of social independence in sub-adult females prior to emigration. Second, we observed acoustic matching between mothers and daughters when co-singing, suggesting short-term vocal flexibility. Third, we found that mothers adjusted songs to a more stereotyped pattern when co-singing than when singing alone. This vocal adjustment was stronger for mothers with daughters who co-sang less. These results indicate the presence of socially mediated vocal flexibility in gibbon sub-adults and adults, and that mother-daughter co-singing interactions may enhance vocal development. More comparative work, notably longitudinal and experimental, is now needed to clarify maternal roles during song development.
NASA Astrophysics Data System (ADS)
Volodin, Ilya A.; Volodina, Elena V.; Frey, Roland; Kirilyuk, Vadim E.; Naidenko, Sergey V.
2017-06-01
In neonate ruminants, the acoustic structure of vocalizations may depend on sex, vocal anatomy, hormonal profiles and body mass and on environmental factors. In neonate wild-living Mongolian gazelles Procapra gutturosa, hand-captured during biomedical monitoring in the Daurian steppes at the Russian-Mongolian border, we spectrographically analysed distress calls and measured body mass of 22 individuals (6 males, 16 females). For 20 (5 male, 15 female) of these individuals, serum testosterone levels were also analysed. In addition, we measured relevant dimensions of the vocal apparatus (larynx, vocal folds, vocal tract) in one stillborn male Mongolian gazelle specimen. Neonate distress calls of either sex were high in maximum fundamental frequency (800-900 Hz), but the beginning and minimum fundamental frequencies were significantly lower in males than in females. Body mass was larger in males than in females. The levels of serum testosterone were marginally higher in males. No correlations were found between either body mass or serum testosterone values and any acoustic variable for males and females analysed together or separately. We discuss that the high-frequency calls of neonate Mongolian gazelles are more typical for closed-habitat neonate ruminants, whereas other open-habitat neonate ruminants (goitred gazelle Gazella subgutturosa, saiga antelope Saiga tatarica and reindeer Rangifer tarandus) produce low-frequency (<200 Hz) distress calls. Proximate cause for the high fundamental frequency of distress calls of neonate Mongolian gazelles is their very short, atypical vocal folds (4 mm) compared to the 7-mm vocal folds of neonate goitred gazelles, producing distress calls as low as 120 Hz.
Cooper, Timothy; Dziegielewski, Peter T; Singh, Praby; Seemann, Robert
2016-11-01
To present a case of bilateral vocal fold immobility (BVCI) in a patient with acromegaly and review the current literature describing this presentation. Case report and literature review. Academic tertiary care center. English language literature search of online journal databases. A 56-year-old man presented with 3 months of progressive stridor and shortness of breath. Transnasal flexible endoscopy revealed BVCI. A tracheostomy was performed to secure his airway. Further history was suggestive of acromegaly and imaging demonstrated a pituitary macroadenoma. The diagnosis of acromegaly was made. The patient was treated with octreotide followed by an endoscopic trans sphenoidal resection of the pituitary adenoma. Sixteen months after his initial presentation, a right laser arytenoidectomy was performed and the patient was subsequently decannulated. In the literature to date, 11 cases of BVCI in acromegaly have been reported. These patients often present with stridor and require a tracheostomy. With treatment of their acromegaly, these patients may regain vocal fold mobility and may be decannulated. Acromegaly with BVCI is a rare presentation. Acute management of the airway of patients with acromegaly presenting with BVCI typically requires a tracheostomy. A period of 15 months should be allowed for restoration of vocal fold mobility before airway opening procedures are considered. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Interspeaker Variability in Hard Palate Morphology and Vowel Production
ERIC Educational Resources Information Center
Lammert, Adam; Proctor, Michael; Narayanan, Shrikanth
2013-01-01
Purpose: Differences in vocal tract morphology have the potential to explain interspeaker variability in speech production. The potential acoustic impact of hard palate shape was examined in simulation, in addition to the interplay among morphology, articulation, and acoustics in real vowel production data. Method: High-front vowel production from…
Increasing Vocal Variability in Children with Autism Using a Lag Schedule of Reinforcement
ERIC Educational Resources Information Center
Esch, John W.; Esch, Barbara E.; Love, Jessa R.
2009-01-01
Variability has been demonstrated to be an operant dimension of behavior (Neuringer, 2002; Page & Neuringer, 1985). Recently, lag schedules have been used to demonstrate operant variability of verbal behavior in persons with a diagnosis of autism (e.g., Lee, McComas, & Jawor, 2002). The current study evaluated the effects of a Lag 1 schedule on…
2018-01-01
This study tested the hypothesis that object-based attention modulates the discrimination of level increments in stop-consonant noise bursts. With consonant-vowel-consonant (CvC) words consisting of an ≈80-dB vowel (v), a pre-vocalic (Cv) and a post-vocalic (vC) stop-consonant noise burst (≈60-dB SPL), we measured discrimination thresholds (LDTs) for level increments (ΔL) in the noise bursts presented either in CvC context or in isolation. In the 2-interval 2-alternative forced-choice task, each observation interval presented a CvC word (e.g., /pæk/ /pæk/), and normal-hearing participants had to discern ΔL in the Cv or vC burst. Based on the linguistic word labels, the auditory events of each trial were perceived as two auditory objects (Cv-v-vC and Cv-v-vC) that group together the bursts and vowels, hindering selective attention to ΔL. To discern ΔL in Cv or vC, the events must be reorganized into three auditory objects: the to-be-attended pre-vocalic (Cv–Cv) or post-vocalic burst pair (vC–vC), and the to-be-ignored vowel pair (v–v). Our results suggest that instead of being automatic this reorganization requires training, in spite of using familiar CvC words. Relative to bursts in isolation, bursts in context always produced inferior ΔL discrimination accuracy (a context effect), which depended strongly on the acoustic separation between the bursts and the vowel, being much keener for the object apart from (post-vocalic) than for the object adjoining (pre-vocalic) the vowel (a temporal-position effect). Variability in CvC dimensions that did not alter the noise-burst perceptual grouping had minor effects on discrimination accuracy. In addition to being robust and persistent, these effects are relatively general, evincing in forced-choice tasks with one or two observation intervals, with or without variability in the temporal position of ΔL, and with either fixed or roving CvC standards. The results lend support to the hypothesis. PMID:29364931
Striatal FoxP2 Is Actively Regulated during Songbird Sensorimotor Learning
Teramitsu, Ikuko; Poopatanapong, Amy; Torrisi, Salvatore; White, Stephanie A.
2010-01-01
Background Mutations in the FOXP2 transcription factor lead to language disorders with developmental onset. Accompanying structural abnormalities in cortico-striatal circuitry indicate that at least a portion of the behavioral phenotype is due to organizational deficits. We previously found parallel FoxP2 expression patterns in human and songbird cortico/pallio-striatal circuits important for learned vocalizations, suggesting that FoxP2's function in birdsong may generalize to speech. Methodology/Principal Findings We used zebra finches to address the question of whether FoxP2 is additionally important in the post-organizational function of these circuits. In both humans and songbirds, vocal learning depends on auditory guidance to achieve and maintain optimal vocal output. We tested whether deafening prior to or during the sensorimotor phase of song learning disrupted FoxP2 expression in song circuitry. As expected, the songs of deafened juveniles were abnormal, however basal FoxP2 levels were unaffected. In contrast, when hearing or deaf juveniles sang for two hours in the morning, FoxP2 was acutely down-regulated in the striatal song nucleus, area X. The extent of down-regulation was similar between hearing and deaf birds. Interestingly, levels of FoxP2 and singing were correlated only in hearing birds. Conclusions/Significance Hearing appears to link FoxP2 levels to the amount of vocal practice. As juvenile birds spent more time practicing than did adults, their FoxP2 levels are likely to be low more often. Behaviorally-driven reductions in the mRNA encoding this transcription factor could ultimately affect downstream molecules that function in vocal exploration, especially during sensorimotor learning. PMID:20062527
Vocal communication in African elephants (Loxodonta africana).
Soltis, Joseph
2010-01-01
Research on vocal communication in African elephants has increased in recent years, both in the wild and in captivity, providing an opportunity to present a comprehensive review of research related to their vocal behavior. Current data indicate that the vocal repertoire consists of perhaps nine acoustically distinct call types, "rumbles" being the most common and acoustically variable. Large vocal production anatomy is responsible for the low-frequency nature of rumbles, with fundamental frequencies in the infrasonic range. Additionally, resonant frequencies of rumbles implicate the trunk in addition to the oral cavity in shaping the acoustic structure of rumbles. Long-distance communication is thought possible because low-frequency sounds propagate more faithfully than high-frequency sounds, and elephants respond to rumbles at distances of up to 2.5 km. Elephant ear anatomy appears designed for detecting low frequencies, and experiments demonstrate that elephants can detect infrasonic tones and discriminate small frequency differences. Two vocal communication functions in the African elephant now have reasonable empirical support. First, closely bonded but spatially separated females engage in rumble exchanges, or "contact calls," that function to coordinate movement or reunite animals. Second, both males and females produce "mate attraction" rumbles that may advertise reproductive states to the opposite sex. Additionally, there is evidence that the structural variation in rumbles reflects the individual identity, reproductive state, and emotional state of callers. Growth in knowledge about the communication system of the African elephant has occurred from a rich combination of research on wild elephants in national parks and captive elephants in zoological parks.
Brown, Jennifer A; Derksen, Frederik J; Stick, John A; Hartmann, William M; Robinson, N Edward
2005-01-01
To report the effect of unilateral laser vocal cordectomy on respiratory noise and airway function in horses with experimentally induced laryngeal hemiplegia (LH). Experimental study. Six Standardbred horses without upper airway abnormalities at rest or during high-speed treadmill exercise. Respiratory sounds and inspiratory trans-upper airway pressure (P(Ui)) were measured before (baseline) and 14 days after induction of LH by left recurrent laryngeal neurectomy, and again 30, 60, 90, and 120 days after endoscopically assisted laser cordectomy of the left vocal cord. Data were collected with the horses exercising on a treadmill at a speed producing maximum heart rate (HR(max)). In horses exercising at HR(max), induction of LH caused a significant increase in P(Ui), sound level (SL), and the sound intensity of formant 2 (F(2)) and 3 (F(3)). The sound intensity of formant 1 (F(1)) was unaffected by induction of LH. Laser vocal cordectomy had no effect on SL, or on the sound intensity of F(1) and F(3). At 30, 60, 90, and 120 days after surgery, P(Ui) and the sound intensity of F(2) were significantly reduced, but these variables remained significantly different from baseline values. Unilateral laser vocal cordectomy did not effectively improve upper airway noise in horses with LH. The procedure decreased upper airway obstruction to the same degree as bilateral ventriculocordectomy. Currently, laser vocal cordectomy cannot be recommended for the treatment of upper airway noise in horses with LH.
The Effects of Hormonal Contraception on the Voice: History of Its Evolution in the Literature.
Rodney, Jennifer P; Sataloff, Robert Thayer
2016-11-01
Women of reproductive age commonly use hormonal contraceptives, the vocal effects of which have been studied. Otolaryngologists should be aware of this relationship to make recommendations on hormonal contraception as it relates to each patient's voice requirements. A comprehensive literature review of PubMed was completed. The terms "contraception," "vocal folds," "vocal cords," and "voice" were searched in various combinations. Articles from 1971 to 2015 that addressed the effects of contraception on the vocal folds were included. In total, 24 articles were available for review. Historically, contraception was believed to affect the voice negatively. However, more recent studies using low-dose oral contraceptive pills (OCPs) show that they stabilize the voice. However, stabilization generally occurs only during sustained vowel production; connected speech appears unaffected. Therefore, singers may be the only population that experiences clinically increased vocal stability as a result of taking hormonal contraceptives. Only combined OCPs have been studied; other forms of hormonal contraception have not been evaluated for effects on the voice. Significant variability exists between studies in the physical attributes of patients and parameters tested. Hormonal contraception likely has no clinically perceptible effects on the speaking voice. Singers may experience increased vocal stability with low-dose, combined OCP use. Other available forms of contraception have not been studied. Greater consistency in methodology is needed in future research, and other forms of hormonal contraception require study. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Tissue-Engineered Vocal Fold Mucosa Implantation in Rabbits.
Shiba, Travis L; Hardy, Jordan; Luegmair, Georg; Zhang, Zhaoyan; Long, Jennifer L
2016-04-01
To assess phonatory function and wound healing of a tissue-engineered vocal fold mucosa (TE-VFM) in rabbits. An "artificial" vocal fold would be valuable for reconstructing refractory scars and resection defects, particularly one that uses readily available autologous cells and scaffold. This work implants a candidate TE-VFM after resecting native epithelium and lamina propria in rabbits. Prospective animal study. Research laboratory. Rabbit adipose-derived stem cells were isolated and cultured in three-dimensional fibrin scaffolds to form TE-VFM. Eight rabbits underwent laryngofissure, unilateral European Laryngologic Society type 2 cordectomy, and immediate reconstruction with TE-VFM. After 4 weeks, larynges were excised, phonated, and examined by histology. Uniform TE-VFM implants were created, with rabbit mesenchymal cells populated throughout fibrin hydrogels. Rabbits recovered uneventfully after implantation. Phonation was achieved in all, with mucosal waves evident at the implant site. Histology after 4 weeks showed resorbed fibrin matrix, continuous epithelium, and mildly increased collagen relative to contralateral unoperated vocal folds. Elastic fiber appearance was highly variable. Inflammatory cell infiltrate was limited to animals receiving sex-mismatched implants. TE-VFMs were successfully implanted into 8 rabbits, with minor evidence of scar formation and immune reaction. Vibration was preserved 4 weeks after resecting and reconstructing the complete vocal fold cover layer. Further studies will investigate the mechanism and durability of improvement. TE-VFM with autologous cells is a promising new approach for vocal fold reconstruction. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
Feng, Jun-Juan; Cui, Liang-Wei; Ma, Chang-Yong; Fei, Han-Lan; Fan, Peng-Fei
2014-01-01
Vocal individuality and stability has been used to conduct population surveys, monitor population dynamics, and detect dispersal patterns in avian studies. To our knowledge, it has never been used in these kinds of studies among primates. The cao vit gibbon is a critically endangered species with only one small population living in a karst forest along China-Vietnam border. Due to the difficult karst terrain, an international border, long life history, and similarity in male morphology, detailed monitoring of population dynamics and dispersal patterns are not possible using traditional observation methods. In this paper, we test individuality and stability in male songs of cao vit gibbons. We then discuss the possibility of using vocal individuality for population surveys and monitoring population dynamics and dispersal patterns. Significant individuality of vocalization was detected in all 9 males, and the correct rate of individual identification yielded by discriminant function analysis using a subset of variables was satisfactory (>90%). Vocal stability over 2-6 years was also documented in 4 males. Several characters of cao vit gibbons allowed long-term population monitoring using vocal recordings in both China and Vietnam: 1) regular loud calls, 2) strong individuality and stability in male songs, 3) stable territories, and 4) long male tenure. During the course of this research, we also observed one male replacement (confirmed by vocal analysis). This time- and labor-saving method might be the most effective way to detect dispersal patterns in this transboundary population.
Temporal Lobe Epilepsy Alters Auditory-motor Integration For Voice Control
Li, Weifeng; Chen, Ziyi; Yan, Nan; Jones, Jeffery A.; Guo, Zhiqiang; Huang, Xiyan; Chen, Shaozhen; Liu, Peng; Liu, Hanjun
2016-01-01
Temporal lobe epilepsy (TLE) is the most common drug-refractory focal epilepsy in adults. Previous research has shown that patients with TLE exhibit decreased performance in listening to speech sounds and deficits in the cortical processing of auditory information. Whether TLE compromises auditory-motor integration for voice control, however, remains largely unknown. To address this question, event-related potentials (ERPs) and vocal responses to vocal pitch errors (1/2 or 2 semitones upward) heard in auditory feedback were compared across 28 patients with TLE and 28 healthy controls. Patients with TLE produced significantly larger vocal responses but smaller P2 responses than healthy controls. Moreover, patients with TLE exhibited a positive correlation between vocal response magnitude and baseline voice variability and a negative correlation between P2 amplitude and disease duration. Graphical network analyses revealed a disrupted neuronal network for patients with TLE with a significant increase of clustering coefficients and path lengths as compared to healthy controls. These findings provide strong evidence that TLE is associated with an atypical integration of the auditory and motor systems for vocal pitch regulation, and that the functional networks that support the auditory-motor processing of pitch feedback errors differ between patients with TLE and healthy controls. PMID:27356768
Vocal cord dysfunction in a child.
Juliá, J C; Martorell, A; Armengot, M A; Lluch, R; Boluda, C F; Cerdá, J C; Alvarez, V
1999-07-01
Vocal cord dysfunction (VCD) involves paradoxical adduction of the vocal cord during the respiratory cycle. This usually occurs during inspiration, but can also be seen in expiration. Vocal cord appositioning produces airflow obstruction sufficient to cause wheezing, shortness of breath, chest tightness, and coughing. These symptoms often imitate the respiratory alterations of asthma, thus leading to inappropriate treatment; intubation or tracheotomy may prove necessary. An 11-year-old girl was admitted with intractable dyspnea. She had been diagnosed with atopic asthma, although she failed to respond to an increase in antiasthma medication, including high-dose oral steroids. Flow-volume loops were abnormal, with evidence of variable extrathoracic airway obstruction, manifested as a flat inspiratory loop. No structural abnormalities were seen with either computed tomography (CT) or magnetic resonance imaging (MRI). Fibroscopy revealed paradoxical adduction of the vocal cords during the respiratory cycle, no obstructive disorder being observed. After the diagnosis of VCD, the clinical manifestations resolved with psychiatric treatment. Adduction was not demonstrable at repeat fibroscopy after treatment. VCD may simulate bronchial asthma; it may also be associated with that disorder, thus masking the diagnosis. It should be suspected in patients with recurrent wheezing who fail to respond to usual asthma treatment. An early diagnosis avoids unnecessary aggressive management. Treatment should consist of respiratory and phonatory exercises; psychotherapy may be useful.
Comparing Measures of Voice Quality From Sustained Phonation and Continuous Speech.
Gerratt, Bruce R; Kreiman, Jody; Garellek, Marc
2016-10-01
The question of what type of utterance-a sustained vowel or continuous speech-is best for voice quality analysis has been extensively studied but with equivocal results. This study examines whether previously reported differences derive from the articulatory and prosodic factors occurring in continuous speech versus sustained phonation. Speakers with voice disorders sustained vowels and read sentences. Vowel samples were excerpted from the steadiest portion of each vowel in the sentences. In addition to sustained and excerpted vowels, a 3rd set of stimuli was created by shortening sustained vowel productions to match the duration of vowels excerpted from continuous speech. Acoustic measures were made on the stimuli, and listeners judged the severity of vocal quality deviation. Sustained vowels and those extracted from continuous speech contain essentially the same acoustic and perceptual information about vocal quality deviation. Perceived and/or measured differences between continuous speech and sustained vowels derive largely from voice source variability across segmental and prosodic contexts and not from variations in vocal fold vibration in the quasisteady portion of the vowels. Approaches to voice quality assessment by using continuous speech samples average across utterances and may not adequately quantify the variability they are intended to assess.
Kendall, Katherine A; Leonard, Rebecca J
2011-01-01
Up to one-third of patients presenting with adductor spasmodic dysphonia will have an associated vocal tremor. These patients may not respond fully to treatment using thyroarytenoid (TA) muscle botulinum toxin (Botox) injection. Treatment failures are attributed to the involvement of multiple muscle groups in the tremor. This study evaluates the results of combined interarytenoid (IA) and TA muscle Botox injection in a group of 27 patients with adductor spasmodic dysphonia and vocal tremor and in four patients with severe vocal tremor alone. Patient-satisfaction data were reviewed retrospectively. Pre- and postinjection acoustic data were collected prospectively. Acoustic measures of fundamental frequency and cycle-by-cycle variability in frequency (jitter) and intensity (shimmer) were obtained from 15 patients' sustained vowel productions. Measures were collected after TA muscle injection, alone, and after combined TA and IA (TA+IA) muscle injections. In addition, two experienced voice clinicians blindly assessed tremor severity from recordings made for each patient in the two conditions. Patients were also queried regarding their satisfaction with the results of the injections and whether they desired to continue receiving TA+IA treatment. Significant improvement in all acoustic measures except for % jitter was observed after the TA+IA muscle injections. Listeners identified voice samples after TA+IA muscle injections as demonstrating less tremor in 73% of the paired comparisons. Sixty-seven percent of the patients with spasmodic dysphonia and vocal tremor wished to continue to receive IA muscle injections. Only one patient with severe vocal tremor wished to continue with injections. The addition of an IA muscle Botox injection to the treatment of patients with a combination adductor spasmodic dysphonia and vocal tremor may improve voice outcomes. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Soltis, Joseph M.; Savage, Anne; Leong, Kirsten M.
2004-05-01
The most commonly occurring elephant vocalization is the rumble, a frequency-modulated call with infrasonic components. Upwards of ten distinct rumble subtypes have been proposed, but little quantitative work on the acoustic properties of rumbles has been conducted. Rumble vocalizations (N=269) from six females housed at Disney's Animal Kingdom were analyzed. Vocalizations were recorded from microphones in collars around subject necks, and rumbles were digitized and measured using SIGNAL software. Sixteen acoustic variables were measured for each call, extracting both source and filter features. Multidimensional scaling analysis indicates that there are no acoustically distinct rumble subtypes, but that there is quantitative variation across rumbles. Discriminant function analysis showed that the acoustic characteristics of rumbles differ across females. A classification success rate of 65% was achieved when assigning unselected rumbles to one of the six females (test set =64 calls) according to the functions derived from the originally selected calls (training set =205 calls). The rumble is best viewed as a single call type with graded variation, but information regarding individual identity is encoded in female rumbles.
An approach for automatic classification of grouper vocalizations with passive acoustic monitoring.
Ibrahim, Ali K; Chérubin, Laurent M; Zhuang, Hanqi; Schärer Umpierre, Michelle T; Dalgleish, Fraser; Erdol, Nurgun; Ouyang, B; Dalgleish, A
2018-02-01
Grouper, a family of marine fishes, produce distinct vocalizations associated with their reproductive behavior during spawning aggregation. These low frequencies sounds (50-350 Hz) consist of a series of pulses repeated at a variable rate. In this paper, an approach is presented for automatic classification of grouper vocalizations from ambient sounds recorded in situ with fixed hydrophones based on weighted features and sparse classifier. Group sounds were labeled initially by humans for training and testing various feature extraction and classification methods. In the feature extraction phase, four types of features were used to extract features of sounds produced by groupers. Once the sound features were extracted, three types of representative classifiers were applied to categorize the species that produced these sounds. Experimental results showed that the overall percentage of identification using the best combination of the selected feature extractor weighted mel frequency cepstral coefficients and sparse classifier achieved 82.7% accuracy. The proposed algorithm has been implemented in an autonomous platform (wave glider) for real-time detection and classification of group vocalizations.
Contextual effects of noise on vocalization encoding in primary auditory cortex
Ni, Ruiye; Bender, David A.; Shanechi, Amirali M.; Gamble, Jeffrey R.
2016-01-01
Robust auditory perception plays a pivotal function for processing behaviorally relevant sounds, particularly with distractions from the environment. The neuronal coding enabling this ability, however, is still not well understood. In this study, we recorded single-unit activity from the primary auditory cortex (A1) of awake marmoset monkeys (Callithrix jacchus) while delivering conspecific vocalizations degraded by two different background noises: broadband white noise and vocalization babble. Noise effects on neural representation of target vocalizations were quantified by measuring the responses' similarity to those elicited by natural vocalizations as a function of signal-to-noise ratio. A clustering approach was used to describe the range of response profiles by reducing the population responses to a summary of four response classes (robust, balanced, insensitive, and brittle) under both noise conditions. This clustering approach revealed that, on average, approximately two-thirds of the neurons change their response class when encountering different noises. Therefore, the distortion induced by one particular masking background in single-unit responses is not necessarily predictable from that induced by another, suggesting the low likelihood of a unique group of noise-invariant neurons across different background conditions in A1. Regarding noise influence on neural activities, the brittle response group showed addition of spiking activity both within and between phrases of vocalizations relative to clean vocalizations, whereas the other groups generally showed spiking activity suppression within phrases, and the alteration between phrases was noise dependent. Overall, the variable single-unit responses, yet consistent response types, imply that primate A1 performs scene analysis through the collective activity of multiple neurons. NEW & NOTEWORTHY The understanding of where and how auditory scene analysis is accomplished is of broad interest to neuroscientists. In this paper, we systematically investigated neuronal coding of multiple vocalizations degraded by two distinct noises at various signal-to-noise ratios in nonhuman primates. In the process, we uncovered heterogeneity of single-unit representations for different auditory scenes yet homogeneity of responses across the population. PMID:27881720
Watts, Christopher R
2016-01-01
Reduced vocal intensity is a core impairment of hypokinetic dysarthria in Parkinson's disease (PD). Speech treatments have been developed to rehabilitate the vocal subsystems underlying this impairment. Intensive treatment programs requiring high-intensity voice and speech exercises with clinician-guided prompting and feedback have been established as effective for improving vocal function. Less is known, however, regarding long-term outcomes of clinical benefit in speakers with PD who receive these treatments. A retrospective cohort design was utilized. Data from 78 patient files across a three year period were analyzed. All patients received a structured, intensive program of voice therapy focusing on speaking intent and loudness. The dependent variable for all analyses was vocal intensity in decibels (dBSPL). Vocal intensity during sustained vowel production, reading, and novel conversational speech was compared at pre-treatment, post-treatment, six month follow-up, and twelve month follow-up periods. Statistically significant increases in vocal intensity were found at post-treatment, 6 months, and 12 month follow-up periods with intensity gains ranging from 5 to 17 dB depending on speaking condition and measurement period. Significant treatment effects were found in all three speaking conditions. Effect sizes for all outcome measures were large, suggesting a strong degree of practical significance. Significant increases in vocal intensity measured at 6 and 12 moth follow-up periods suggested that the sample of patients maintained treatment benefit for up to a year. These findings are supported by outcome studies reporting treatment outcomes within a few months post-treatment, in addition to prior studies that have reported long-term outcome results. The positive treatment outcomes experienced by the PD cohort in this study are consistent with treatment responses subsequent to other treatment approaches which focus on high-intensity, clinician guided motor learning for voice and speech production in PD. Theories regarding the underlying neurophysiological response to treatment will be discussed.
Learned Vocal Variation Is Associated with Abrupt Cryptic Genetic Change in a Parrot Species Complex
Ribot, Raoul F. H.; Buchanan, Katherine L.; Endler, John A.; Joseph, Leo; Bennett, Andrew T. D.; Berg, Mathew L.
2012-01-01
Contact zones between subspecies or closely related species offer valuable insights into speciation processes. A typical feature of such zones is the presence of clinal variation in multiple traits. The nature of these traits and the concordance among clines are expected to influence whether and how quickly speciation will proceed. Learned signals, such as vocalizations in species having vocal learning (e.g. humans, many birds, bats and cetaceans), can exhibit rapid change and may accelerate reproductive isolation between populations. Therefore, particularly strong concordance among clines in learned signals and population genetic structure may be expected, even among continuous populations in the early stages of speciation. However, empirical evidence for this pattern is often limited because differences in vocalisations between populations are driven by habitat differences or have evolved in allopatry. We tested for this pattern in a unique system where we may be able to separate effects of habitat and evolutionary history. We studied geographic variation in the vocalizations of the crimson rosella (Platycercus elegans) parrot species complex. Parrots are well known for their life-long vocal learning and cognitive abilities. We analysed contact calls across a ca 1300 km transect encompassing populations that differed in neutral genetic markers and plumage colour. We found steep clinal changes in two acoustic variables (fundamental frequency and peak frequency position). The positions of the two clines in vocal traits were concordant with a steep cline in microsatellite-based genetic variation, but were discordant with the steep clines in mtDNA, plumage and habitat. Our study provides new evidence that vocal variation, in a species with vocal learning, can coincide with areas of restricted gene flow across geographically continuous populations. Our results suggest that traits that evolve culturally can be strongly associated with reduced gene flow between populations, and therefore may promote speciation, even in the absence of other barriers. PMID:23227179
Contextual effects of noise on vocalization encoding in primary auditory cortex.
Ni, Ruiye; Bender, David A; Shanechi, Amirali M; Gamble, Jeffrey R; Barbour, Dennis L
2017-02-01
Robust auditory perception plays a pivotal function for processing behaviorally relevant sounds, particularly with distractions from the environment. The neuronal coding enabling this ability, however, is still not well understood. In this study, we recorded single-unit activity from the primary auditory cortex (A1) of awake marmoset monkeys (Callithrix jacchus) while delivering conspecific vocalizations degraded by two different background noises: broadband white noise and vocalization babble. Noise effects on neural representation of target vocalizations were quantified by measuring the responses' similarity to those elicited by natural vocalizations as a function of signal-to-noise ratio. A clustering approach was used to describe the range of response profiles by reducing the population responses to a summary of four response classes (robust, balanced, insensitive, and brittle) under both noise conditions. This clustering approach revealed that, on average, approximately two-thirds of the neurons change their response class when encountering different noises. Therefore, the distortion induced by one particular masking background in single-unit responses is not necessarily predictable from that induced by another, suggesting the low likelihood of a unique group of noise-invariant neurons across different background conditions in A1. Regarding noise influence on neural activities, the brittle response group showed addition of spiking activity both within and between phrases of vocalizations relative to clean vocalizations, whereas the other groups generally showed spiking activity suppression within phrases, and the alteration between phrases was noise dependent. Overall, the variable single-unit responses, yet consistent response types, imply that primate A1 performs scene analysis through the collective activity of multiple neurons. The understanding of where and how auditory scene analysis is accomplished is of broad interest to neuroscientists. In this paper, we systematically investigated neuronal coding of multiple vocalizations degraded by two distinct noises at various signal-to-noise ratios in nonhuman primates. In the process, we uncovered heterogeneity of single-unit representations for different auditory scenes yet homogeneity of responses across the population. Copyright © 2017 the American Physiological Society.
CMT2C with vocal cord paresis associated with short stature and mutations in the TRPV4 gene
Chen, D.-H.; Sul, Y.; Weiss, M.; Hillel, A.; Lipe, H.; Wolff, J.; Matsushita, M.; Raskind, W.; Bird, T.
2010-01-01
Background: Recently, mutations in the transient receptor potential cation channel, subfamily V, member 4 gene (TRPV4) have been reported in Charcot-Marie-Tooth Type 2C (CMT2C) with vocal cord paresis. Other mutations in this same gene have been described in separate families with various skeletal dysplasias. Further clarification is needed of the different phenotypes associated with this gene. Methods: We performed clinical evaluation, electrophysiology, and genetic analysis of the TRPV4 gene in 2 families with CMT2C. Results: Two multigenerational families had a motor greater than sensory axonal neuropathy associated with variable vocal cord paresis. The vocal cord paresis varied from absent to severe, requiring permanent tracheotomy in 2 subjects. One family with mild neuropathy also manifested pronounced short stature, more than 2 SD below the average height for white Americans. There was one instance of dolichocephaly. A novel S542Y mutation in the TRPV4 gene was identified in this family. The other family had a more severe, progressive, motor neuropathy with sensory loss, but less remarkable short stature and an R315W mutation in TRPV4. Third cranial nerve involvement and sleep apnea occurred in one subject in each family. Conclusion: CMT2C with axonal neuropathy, vocal cord paresis, and short stature is a unique syndrome associated with mutations in the TRPV4 gene. Mutations in TRPV4 can cause abnormalities in bone, peripheral nerve, or both and may result in highly variable orthopedic and neurologic phenotypes. GLOSSARY CMAP = compound muscle action potential; CMT = Charcot-Marie-Tooth; CMT2C = Charcot-Marie-Tooth Type 2C; HMSN = hereditary motor and sensory neuropathy; NCV = nerve conduction velocity; RFLP = restriction fragment length polymorphism; SMA = spinal muscular atrophy; SNAP = sensory nerve action potential; SPSMA = scapuloperoneal spinal muscular atrophy. PMID:21115951
Southeast Pacific atmospheric composition and variability sampled along 20˚S during VOCALS-REx
DOE Office of Scientific and Technical Information (OSTI.GOV)
Allen, G.; Kleinman, L.; Coe, H.
2011-01-10
The VAMOS Ocean-Climate-Atmosphere-Land Regional Experiment (VOCALS-REx) was conducted from 15 October to 15 November 2008 in the South East Pacific region to investigate interactions between land, sea and atmosphere in this unique tropical eastern ocean environment and to improve the skill of global and regional models in representing the region. This study synthesises selected aircraft, ship and surface site observations from VOCALS-REx to statistically summarise and characterise the atmospheric composition and variability of the Marine Boundary Layer (MBL) and Free Troposphere (FT) along the 20{sup o} S parallel between 70{sup o} W and 85{sup o} W. Significant zonal gradients inmore » mean MBL sub-micron aerosol particle size and composition, carbon monoxide, ozone and sulphur dioxide were seen over the campaign, with a generally more variable and polluted coastal environment and a less variable, more pristine remote maritime regime. Gradients are observed to be associated with strong gradients in cloud droplet number. The FT is often more polluted in terms of trace gases than the MBL in the mean; however increased variability in the FT composition suggests an episodic nature to elevated concentrations. This is consistent with a complex vertical interleaving of airmasses with diverse sources and hence pollutant concentrations as seen by generalised back trajectory analysis, which suggests contributions from both local and long-range sources. Furthermore, back trajectory analysis demonstrates that the observed zonal gradients both in the boundary layer and the free troposphere are characteristic of marked changes in airmass history with distance offshore - coastal boundary layer airmasses having been in recent contact with the local land surface and remote maritime airmasses having resided over ocean for in excess of ten days. Boundary layer composition to the east of 75{sup o} W was observed to be dominated by coastal emissions from sources to the west of the Andes, with evidence for diurnal pumping of the Andean boundary layer above the height of the marine capping inversion. The climatology presented here aims to provide a valuable dataset to inform model simulation and future process studies, particularly in the context of aerosol-cloud interaction and further evaluation of dynamical processes in the SEP region for conditions analogous to those during VOCALS-REx.« less
Speech task effects on acoustic measure of fundamental frequency in Cantonese-speaking children.
Ma, Estella P-M; Lam, Nina L-N
2015-12-01
Speaking fundamental frequency (F0) is a voice measure frequently used to document changes in vocal performance over time. Knowing the intra-subject variability of speaking F0 has implications on its clinical usefulness. The present study examined the speaking F0 elicited from three speech tasks in Cantonese-speaking children. The study also compared the variability of speaking F0 elicited from different speech tasks. Fifty-six vocally healthy Cantonese-speaking children (31 boys and 25 girls) aged between 7.0 and 10.11 years participated. For each child, speaking F0 was elicited using speech tasks at three linguistic levels (sustained vowel /a/ prolongation, reading aloud a sentence and passage). Two types of variability, within-session (trial-to-trial) and across-session (test-retest) variability, were compared across speech tasks. Significant differences in mean speaking F0 values were found between speech tasks. Mean speaking F0 value elicited from sustained vowel phonations was significantly higher than those elicited from the connected speech tasks. The variability of speaking F0 was higher in sustained vowel prolongation than that in connected speech. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
PVCM, PVCD, EPL, and irritable larynx syndrome: what are we talking about and how do we treat it?
Andrianopoulos, M V; Gallivan, G J; Gallivan, K H
2000-12-01
Paroxysmal vocal cord movement/motion (PVCM), paroxysmal vocal cord dysfunction (PVCD), episodic paroxysmal laryngospasm (EPL), and irritable larynx syndrome (ILS) are terms used to describe laryngeal dysfunction masquerading as asthma, upper airway obstruction, or functional and organic voice disorders. The differential diagnosis of PVCM, PVCD, EPL, and ILS is critical to successful medical and behavioral management of the patient. During the past 10 years, 27 subjects, ages 15-79 years, were identified to have paroxysms of inspiratory stridor, acute respiratory distress, associated aphonia and dysphonia, resulting in misdiagnosis and unnecessary emergency treatments, including endotracheal intubation, cardiopulmonary resuscitation, massive pharmacotherapy, or tracheostomy. A multifactorial management program is proposed utilizing principles of motor learning, neurolinguistic programming model, respiratory and phonatory synchronization, relaxation techniques, concurrent monitoring of behavioral adjustments, and formal psychological counseling.
Respiratory retraining therapy in long-term treatment of paradoxical vocal fold dysfunction.
Nacci, Andrea; Fattori, Bruno; Segnini, Giovanni; Dallan, Iacopo; Panicucci, Erica; Rocchi, Valeria; Ursino, Francesco
2011-01-01
Paradoxical vocal fold dysfunction (PVFD) is a disorder in the larynx featuring involuntary adduction of the vocal folds during the inspiratory phase of breathing. The symptoms include acute episodes of dyspnea and bouts of coughing. To date, there is no universally acknowledged treatment for PVFD, though respiratory retraining therapy is the treatment of choice. The purpose of this work was to evaluate the results of long-term respiratory retraining therapy in cases of PVFD. We treated 20 patients with PVFD for 2 years: 10 subjects were submitted to a cycle of respiratory retraining therapy every 12 months (receiving a total of 3 cycles) while 10 were given a cycle every 3 months (for a total of 9 cycles) no matter what their clinical conditions were. The results show that long-term respiratory retraining is particularly efficacious if the cycles of treatment are repeated, no matter what clinical conditions are present. In fact, when only one cycle of retraining treatment is given a year, there is initial improvement followed by progressive worsening. Long-term respiratory rehabilitation is effective, especially if the treatment is given at least once every 3 months. Copyright © 2010 S. Karger AG, Basel.
Owen, Yvonne; Amory, Jonathan R
2011-01-01
Traditional techniques used to capture New World monkeys, such as net capture, can induce high levels of acute stress detrimental to welfare. Alternatively, training nonhuman animals via operant conditioning to voluntarily participate in husbandry and/or veterinary practices is accepted as a humane process that can reduce stress and improve welfare. This study details the use of operant conditioning using positive reinforcement training (PRT) and target training to train a family of 5 captive red-bellied tamarins (Saguinus labiatus) in a wildlife park to voluntarily enter a transportation box and remain calm for 1 min after 54 training sessions. Observations of 2 unrelated net-capture processes provided measures of locomotion and vocalizations as indicators of stress behavior that were compared with those of the trained tamarins. Net-captured monkeys exhibited rapid erratic locomotion and emitted long, high-frequency vocalizations during capture whereas the trained tamarins exhibited minimal locomotion and emitted only 4 brief vocalizations (root mean square 35 dB) during capture. This indicates that the use of PRT considerably reduced potential for stress and improved welfare during the capture and containment of the tamarins. Copyright © Taylor & Francis Group, LLC
Effect of singing training on total laryngectomees wearing a tracheoesophageal voice prosthesis.
Onofre, Fernanda; Ricz, Hilton Marcos Alves; Takeshita-Monaretti, Telma Kioko; Prado, Maria Yuka de Almeida; Aguiar-Ricz, Lílian Neto
2013-02-01
To assess the effect of a program of singing training on the voice of total laryngectomees wearing tracheoesophageal voice prosthesis, considering the quality of alaryngeal phonation, vocal extension and the musical elements of tunning and legato. Five laryngectomees wearing tracheoesophageal voice prosthesis completed the singing training program over a period of three months, with exploration of the strengthening of the respiratory muscles and vocalization and with evaluation of perceptive-auditory and singing voice being performed before and after 12 sessions of singing therapy. After the program of singing voice training, the quality of tracheoesophageal voice showed improvement or the persistence of the general degree of dysphonia for the emitted vowels and for the parameters of roughness and breathiness. For the vowel "a", the pitch was displaced to grave in two participants and to acute in one, and remained adequate in the others. A similar situation was observed also for the vowel "i". After the singing program, all participants presented tunning and most of them showed a greater presence of legato. The vocal extension improved in all participants. Singing training seems to have a favorable effect on the quality of tracheoesophageal phonation and on singing voice.
Arruda, Polyanna; Diniz da Rosa, Marine Raquel; Almeida, Larissa Nadjara Alves; de Araujo Pernambuco, Leandro; Almeida, Anna Alice
2018-03-07
Estradiol production varies cyclically, changes in levels are hypothesized to affect the voice. The main objective of this study was to investigate vocal acoustic and auditory-perceptual characteristics during fluctuations in the levels of the hormone estradiol during the menstrual cycle. A total of 44 volunteers aged between 18 and 45 were selected. Of these, 27 women with regular menstrual cycles comprised the test group (TG) and 17 combined oral contraceptive users comprised the control group (CG). The study was performed in two phases. In phase 1, anamnesis was performed. Subsequently, the TG underwent blood sample collection for measurement of estradiol levels and voice recording for later acoustic and auditory-perceptual analysis. The CG underwent only voice recording. Phase 2 involved the same measurements as phase 1 for each group. Variables were evaluated using descriptive and inferential analysis to compare groups and phases and to determine relationships between variables. Voice changes were found during the menstrual cycle, and such changes were determined to be related to variations in estradiol levels. Impaired voice quality was observed to be associated with decreased levels of estradiol. The CG did not demonstrate significant vocal changes during phases 1 and 2. The TG showed significant increases in vocal parameters of roughness, tension, and instability during phase 2 (the period of low estradiol levels) when compared with the CG. Low estradiol levels were also found to be negatively correlated with the parameters of tension, instability, and jitter and positively correlated with fundamental voice frequency. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Retrieving Tract Variables From Acoustics: A Comparison of Different Machine Learning Strategies.
Mitra, Vikramjit; Nam, Hosung; Espy-Wilson, Carol Y; Saltzman, Elliot; Goldstein, Louis
2010-09-13
Many different studies have claimed that articulatory information can be used to improve the performance of automatic speech recognition systems. Unfortunately, such articulatory information is not readily available in typical speaker-listener situations. Consequently, such information has to be estimated from the acoustic signal in a process which is usually termed "speech-inversion." This study aims to propose and compare various machine learning strategies for speech inversion: Trajectory mixture density networks (TMDNs), feedforward artificial neural networks (FF-ANN), support vector regression (SVR), autoregressive artificial neural network (AR-ANN), and distal supervised learning (DSL). Further, using a database generated by the Haskins Laboratories speech production model, we test the claim that information regarding constrictions produced by the distinct organs of the vocal tract (vocal tract variables) is superior to flesh-point information (articulatory pellet trajectories) for the inversion process.
Dietrich, Maria; Verdolini Abbott, Katherine; Gartner-Schmidt, Jackie; Rosen, Clark A
2008-07-01
The study's objectives were to investigate (1) the frequency of perceived stress, anxiety, and depression for patients with common voice disorders, (2) the distribution of these variables by diagnosis, and (3) the distribution of the variables by gender. Retrospective data were derived from self-report questionnaires assessing recent stress (Perceived Stress Scale-10), anxiety, and depression (Hospital Anxiety and Depression Scale) in a cohort of new patients presenting to a voice clinic. Data are presented on 160 patients with muscle tension dysphonia (MTD), benign vocal fold lesions, paradoxical vocal fold movement disorder (PVFMD), or glottal insufficiency. Pooled data indicated that average stress, anxiety, and depression scores were similar to those found for the healthy population. However, 25.0%, 36.9%, and 31.2% of patients showed elevated stress, anxiety, and depression scores, respectively, compared to norms. Patients with PVFMD had the most frequent occurrence-and patients with glottal insufficiency had the least frequent occurrence of elevated stress, anxiety, and depression. Stress and depression were more common with MTD than with lesions, whereas reverse results were obtained for anxiety. More females than males had elevated stress, anxiety, and depression scores. The data are consistent with suggestions that stress, anxiety, and depression may be common among some patients with PVFMD, MTD, and vocal fold lesions and more common for women than men. However, individual variability in the data set was large. Further studies should evaluate the specific role of these conditions for selected categories of voice disorders in susceptible individuals.
A Computerized Tomography Study of Vocal Tract Setting in Hyperfunctional Dysphonia and in Belting.
Saldias, Marcelo; Guzman, Marco; Miranda, Gonzalo; Laukkanen, Anne-Maria
2018-04-03
Vocal tract setting in hyperfunctional patients is characterized by a high larynx and narrowing of the epilaryngeal and pharyngeal region. Similar observations have been made for various singing styles, eg, belting. The voice quality in belting has been described to be loud, speech like, and high pitched. It is also often described as sounding "pressed" or "tense". The above mentioned has led to the hypothesis that belting may be strenuous to the vocal folds. However, singers and teachers of belting do not regard belting as particularly strenuous. This study investigates possible similarities and differences between hyperfunctional voice production and belting. This study concerns vocal tract setting. Four male patients with hyperfunctional dysphonia and one male contemporary commercial music singer were registered with computerized tomography while phonating on [a:] in their habitual speaking pitch. Additionally, the singer used the pitch G4 in belting. The scannings were studied in sagittal and transversal dimensions by measuring lengths, widths, and areas. Various similarities were found between belting and hyperfunction: high vertical larynx position, small hypopharyngeal width, and epilaryngeal outlet. On the other hand, belting differed from dysphonia (in addition to higher pitch) by a wider lip and jaw opening, and larger volumes of the oral cavity. Belting takes advantage of "megaphone shape" of the vocal tract. Future studies should focus on modeling and simulation to address sound energy transfer. Also, they should consider aerodynamic variables and vocal fold vibration to evaluate the "price of decibels" in these phonation types. Copyright © 2018. Published by Elsevier Inc.
Van Stan, Jarrad H.; Mehta, Daryush D.; Zeitels, Steven M.; Burns, James A.; Barbu, Anca M.; Hillman, Robert E.
2015-01-01
Objectives Clinical management of phonotraumatic vocal fold lesions (nodules, polyps) is based largely on assumptions that abnormalities in habitual levels of sound pressure level (SPL), fundamental frequency (f0), and/or amount of voice use play a major role in lesion development and chronic persistence. This study used ambulatory voice monitoring to evaluate if significant differences in voice use exist between patients with phonotraumatic lesions and normal matched controls. Methods Subjects were 70 adult females: 35 with vocal fold nodules or polyps and 35 age-, sex-, and occupation-matched normal individuals. Weeklong summary statistics of voice use were computed from anterior neck surface acceleration recorded using a smartphone-based ambulatory voice monitor. Results Paired t-tests and Kolmogorov-Smirnov tests resulted in no statistically significant differences between patients and matched controls regarding average measures of SPL, f0, vocal dose measures, and voicing/voice rest periods. Paired t-tests comparing f0 variability between the groups resulted in statistically significant differences with moderate effect sizes. Conclusions Individuals with phonotraumatic lesions did not exhibit differences in average ambulatory measures of vocal behavior when compared with matched controls. More refined characterizations of underlying phonatory mechanisms and other potentially contributing causes are warranted to better understand risk factors associated with phonotraumatic lesions. PMID:26024911
Texting while driving using Google Glass™: Promising but not distraction-free.
He, Jibo; Choi, William; McCarley, Jason S; Chaparro, Barbara S; Wang, Chun
2015-08-01
Texting while driving is risky but common. This study evaluated how texting using a Head-Mounted Display, Google Glass, impacts driving performance. Experienced drivers performed a classic car-following task while using three different interfaces to text: fully manual interaction with a head-down smartphone, vocal interaction with a smartphone, and vocal interaction with Google Glass. Fully manual interaction produced worse driving performance than either of the other interaction methods, leading to more lane excursions and variable vehicle control, and higher workload. Compared to texting vocally with a smartphone, texting using Google Glass produced fewer lane excursions, more braking responses, and lower workload. All forms of texting impaired driving performance compared to undistracted driving. These results imply that the use of Google Glass for texting impairs driving, but its Head-Mounted Display configuration and speech recognition technology may be safer than texting using a smartphone. Copyright © 2015 Elsevier Ltd. All rights reserved.
Dimensionality in voice quality.
Bele, Irene Velsvik
2007-05-01
This study concerns speaking voice quality in a group of male teachers (n = 35) and male actors (n = 36), as the purpose was to investigate normal and supranormal voices. The goal was the development of a method of valid perceptual evaluation for normal to supranormal and resonant voices. The voices (text reading at two loudness levels) had been evaluated by 10 listeners, for 15 vocal characteristics using VA scales. In this investigation, the results of an exploratory factor analysis of the vocal characteristics used in this method are presented, reflecting four dimensions of major importance for normal and supranormal voices. Special emphasis is placed on the effects on voice quality of a change in the loudness variable, as two loudness levels are studied. Furthermore, the vocal characteristics Sonority and Ringing voice quality are paid special attention, as the essence of the term "resonant voice" was a basic issue throughout a doctoral dissertation where this study was included.
Effect of body position on vocal tract acoustics: Acoustic pharyngometry and vowel formants.
Vorperian, Houri K; Kurtzweil, Sara L; Fourakis, Marios; Kent, Ray D; Tillman, Katelyn K; Austin, Diane
2015-08-01
The anatomic basis and articulatory features of speech production are often studied with imaging studies that are typically acquired in the supine body position. It is important to determine if changes in body orientation to the gravitational field alter vocal tract dimensions and speech acoustics. The purpose of this study was to assess the effect of body position (upright versus supine) on (1) oral and pharyngeal measurements derived from acoustic pharyngometry and (2) acoustic measurements of fundamental frequency (F0) and the first four formant frequencies (F1-F4) for the quadrilateral point vowels. Data were obtained for 27 male and female participants, aged 17 to 35 yrs. Acoustic pharyngometry showed a statistically significant effect of body position on volumetric measurements, with smaller values in the supine than upright position, but no changes in length measurements. Acoustic analyses of vowels showed significantly larger values in the supine than upright position for the variables of F0, F3, and the Euclidean distance from the centroid to each corner vowel in the F1-F2-F3 space. Changes in body position affected measurements of vocal tract volume but not length. Body position also affected the aforementioned acoustic variables, but the main vowel formants were preserved.
Soltis, Joseph; Blowers, Tracy E; Savage, Anne
2011-02-01
As in other mammals, there is evidence that the African elephant voice reflects affect intensity, but it is less clear if positive and negative affective states are differentially reflected in the voice. An acoustic comparison was made between African elephant "rumble" vocalizations produced in negative social contexts (dominance interactions), neutral social contexts (minimal social activity), and positive social contexts (affiliative interactions) by four adult females housed at Disney's Animal Kingdom®. Rumbles produced in the negative social context exhibited higher and more variable fundamental frequencies (F(0)) and amplitudes, longer durations, increased voice roughness, and higher first formant locations (F1), compared to the neutral social context. Rumbles produced in the positive social context exhibited similar shifts in most variables (F(0 )variation, amplitude, amplitude variation, duration, and F1), but the magnitude of response was generally less than that observed in the negative context. Voice roughness and F(0) observed in the positive social context remained similar to that observed in the neutral context. These results are most consistent with the vocal expression of affect intensity, in which the negative social context elicited higher intensity levels than the positive context, but differential vocal expression of positive and negative affect cannot be ruled out.
Synthetic, multi-layer, self-oscillating vocal fold model fabrication.
Murray, Preston R; Thomson, Scott L
2011-12-02
Sound for the human voice is produced via flow-induced vocal fold vibration. The vocal folds consist of several layers of tissue, each with differing material properties. Normal voice production relies on healthy tissue and vocal folds, and occurs as a result of complex coupling between aerodynamic, structural dynamic, and acoustic physical phenomena. Voice disorders affect up to 7.5 million annually in the United States alone and often result in significant financial, social, and other quality-of-life difficulties. Understanding the physics of voice production has the potential to significantly benefit voice care, including clinical prevention, diagnosis, and treatment of voice disorders. Existing methods for studying voice production include in vivo experimentation using human and animal subjects, in vitro experimentation using excised larynges and synthetic models, and computational modeling. Owing to hazardous and difficult instrument access, in vivo experiments are severely limited in scope. Excised larynx experiments have the benefit of anatomical and some physiological realism, but parametric studies involving geometric and material property variables are limited. Further, they are typically only able to be vibrated for relatively short periods of time (typically on the order of minutes). Overcoming some of the limitations of excised larynx experiments, synthetic vocal fold models are emerging as a complementary tool for studying voice production. Synthetic models can be fabricated with systematic changes to geometry and material properties, allowing for the study of healthy and unhealthy human phonatory aerodynamics, structural dynamics, and acoustics. For example, they have been used to study left-right vocal fold asymmetry, clinical instrument development, laryngeal aerodynamics, vocal fold contact pressure, and subglottal acoustics (a more comprehensive list can be found in Kniesburges et al.) Existing synthetic vocal fold models, however, have either been homogenous (one-layer models) or have been fabricated using two materials of differing stiffness (two-layer models). This approach does not allow for representation of the actual multi-layer structure of the human vocal folds that plays a central role in governing vocal fold flow-induced vibratory response. Consequently, one- and two-layer synthetic vocal fold models have exhibited disadvantages such as higher onset pressures than what are typical for human phonation (onset pressure is the minimum lung pressure required to initiate vibration), unnaturally large inferior-superior motion, and lack of a "mucosal wave" (a vertically-traveling wave that is characteristic of healthy human vocal fold vibration). In this paper, fabrication of a model with multiple layers of differing material properties is described. The model layers simulate the multi-layer structure of the human vocal folds, including epithelium, superficial lamina propria (SLP), intermediate and deep lamina propria (i.e., ligament; a fiber is included for anterior-posterior stiffness), and muscle (i.e., body) layers. Results are included that show that the model exhibits improved vibratory characteristics over prior one- and two-layer synthetic models, including onset pressure closer to human onset pressure, reduced inferior-superior motion, and evidence of a mucosal wave.
Toward dynamic magnetic resonance imaging of the vocal tract during speech production.
Ventura, Sandra M Rua; Freitas, Diamantino Rui S; Tavares, João Manuel R S
2011-07-01
The most recent and significant magnetic resonance imaging (MRI) improvements allow for the visualization of the vocal tract during speech production, which has been revealed to be a powerful tool in dynamic speech research. However, a synchronization technique with enhanced temporal resolution is still required. The study design was transversal in nature. Throughout this work, a technique for the dynamic study of the vocal tract with MRI by using the heart's signal to synchronize and trigger the imaging-acquisition process is presented and described. The technique in question is then used in the measurement of four speech articulatory parameters to assess three different syllables (articulatory gestures) of European Portuguese Language. The acquired MR images are automatically reconstructed so as to result in a variable sequence of images (slices) of different vocal tract shapes in articulatory positions associated with Portuguese speech sounds. The knowledge obtained as a result of the proposed technique represents a direct contribution to the improvement of speech synthesis algorithms, thereby allowing for novel perceptions in coarticulation studies, in addition to providing further efficient clinical guidelines in the pursuit of more proficient speech rehabilitation processes. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Morphological Variation in the Adult Hard Palate and Posterior Pharyngeal Wall
Lammert, Adam; Proctor, Michael; Narayanan, Shrikanth
2013-01-01
Purpose Adult human vocal tracts display considerable morphological variation across individuals, but the nature and extent of this variation has not been extensively studied for many vocal tract structures. There exists a need to analyze morphological variation and, even more basically, to develop a methodology for morphological analysis of the vocal tract. Such analysis will facilitate fundamental characterization of the speech production system, with broad implications from modeling to explaining inter-speaker variability. Method A data-driven methodology to automatically analyze the extent and variety of morphological variation is proposed and applied to a diverse subject pool of 36 adults. Analysis is focused on two key aspects of vocal tract structure: the midsagittal shape of the hard palate and the posterior pharyngeal wall. Result Palatal morphology varies widely in its degree of concavity, but also in anteriority and sharpness. Pharyngeal wall morphology, by contrast, varies mostly in terms of concavity alone. The distribution of morphological characteristics is complex, and analysis suggests that certain variations may be categorical in nature. Conclusion Major modes of morphological variation are identified, including their relative magnitude, distribution and categorical nature. Implications of these findings for speech articulation strategies and speech acoustics are discussed. PMID:23690566
Moving to the Beat and Singing are Linked in Humans
Dalla Bella, Simone; Berkowska, Magdalena; Sowiński, Jakub
2015-01-01
The abilities to sing and to move to the beat of a rhythmic auditory stimulus emerge early during development, and both engage perceptual, motor, and sensorimotor processes. These similarities between singing and synchronization to a beat may be rooted in biology. Patel (2008) has suggested that motor synchronization to auditory rhythms may have emerged during evolution as a byproduct of selection for vocal learning (“vocal learning and synchronization hypothesis”). This view predicts a strong link between vocal performance and synchronization skills in humans. Here, we tested this prediction by asking occasional singers to tap along with auditory pulse trains and to imitate familiar melodies. Both vocal imitation and synchronization skills were measured in terms of accuracy and precision or consistency. Accurate and precise singers tapped more in the vicinity of the pacing stimuli (i.e., they were more accurate) than less accurate and less precise singers. Moreover, accurate singers were more consistent when tapping to the beat. These differences cannot be ascribed to basic motor skills or to motivational factors. Individual differences in terms of singing proficiency and synchronization skills may reflect the variability of a shared sensorimotor translation mechanism. PMID:26733370
Kogan, J A; Margoliash, D
1998-04-01
The performance of two techniques is compared for automated recognition of bird song units from continuous recordings. The advantages and limitations of dynamic time warping (DTW) and hidden Markov models (HMMs) are evaluated on a large database of male songs of zebra finches (Taeniopygia guttata) and indigo buntings (Passerina cyanea), which have different types of vocalizations and have been recorded under different laboratory conditions. Depending on the quality of recordings and complexity of song, the DTW-based technique gives excellent to satisfactory performance. Under challenging conditions such as noisy recordings or presence of confusing short-duration calls, good performance of the DTW-based technique requires careful selection of templates that may demand expert knowledge. Because HMMs are trained, equivalent or even better performance of HMMs can be achieved based only on segmentation and labeling of constituent vocalizations, albeit with many more training examples than DTW templates. One weakness in HMM performance is the misclassification of short-duration vocalizations or song units with more variable structure (e.g., some calls, and syllables of plastic songs). To address these and other limitations, new approaches for analyzing bird vocalizations are discussed.
Abnormal stress responsivity in a rodent developmental disruption model of schizophrenia.
Zimmerman, Eric C; Bellaire, Mark; Ewing, Samuel G; Grace, Anthony A
2013-10-01
Although numerous studies have implicated stress in the pathophysiology of schizophrenia, less is known about how the effects of stress interact with genetic, developmental, and/or environmental determinants to promote disease progression. In particular, it has been proposed that in humans, stress exposure in adolescence could combine with a predisposition towards increased stress sensitivity, leading to prodromal symptoms and eventually psychosis. However, the neurobiological substrates for this interaction are not fully characterized. Previous work in our lab has demonstrated that rats born to dams administered with the DNA-methylating agent methylazoxymethanol acetate (MAM) at gestational day 17 exhibit as adults behavioral and anatomical abnormalities consistent with those observed in patients with schizophrenia. Here, we examined behavioral and neuroendocrine responses to stress in the MAM model of schizophrenia. MAM-treated male rats were exposed to acute and repeated footshock stress at prepubertal, peripubteral, and adult ages. Ultrasonic vocalizations (USVs), freezing, and corticosterone responses were quantified. We found that juvenile MAM-treated rats emitted significantly more calls, spent more time vocalizing, emitted calls at a higher rate, and showed more freezing in response to acute footshock stress when compared with their saline (SAL) treated counterparts, and that this difference is not present in older animals. In addition, adolescent MAM-treated animals displayed a blunted HPA axis corticosterone response to acute footshock that did not adapt after 10 days of stress exposure. These data demonstrate abnormal stress responsivity in the MAM model of schizophrenia and suggest that these animals are more sensitive to the effects of stress in youth.
Kurauchi, Isao; Shigemi, Kazutaka; Kabuki, Yusuke; Hamasu, Kousuke; Yamane, Haruka; Aoki, Mami; Kawada, Yoko; Morishita, Koji; Denbow, D Michael; Furuse, Mitsuhiro
2010-02-01
To clarify whether L-ornithine and/or its metabolite involves sedative and hypnotic effects under social separation stress, the effects of intracerebroventricular (i.c.v.) injection of L-ornithine and polyamines (putrescine, spermidine and spermine) were compared in chicks. Birds were injected i.c.v. with 0.5 mumol of L-ornithine, putrescine, spermidine, spermine or saline (control). After injection, chicks were immediately separated from the flock and monitored for the number of distress vocalizations and various postures. L-Ornithine greatly attenuated the stress response and caused sedative and hypnotic effects. Among the polyamines, only putrescine attenuated distress vocalizations but did not induce sleep. In conclusion, the sedative and hypnotic effect of L-ornithine was mainly induced by L-ornithine itself, while the polyamines contributed to the sedative, but not hypnotic, effect under social separation stress.
Air Swallowing Caused Recurrent Ileus in Tourette’s Syndrome
Frye, Richard E.; Hait, Elizabeth J.
2008-01-01
This report describes an adolescent boy who has Tourette’s syndrome and developed a subtle but significant increase in vocal tics after an 8-month respite. The increase in vocal tics was associated with an acute increase in psychological stressors and resulted in recurrent air swallowing, which, in turn, caused abdominal cramping, eructation, and flatus, eventually leading to aeroenteria. Air swallowing was recognized only after a second hospital admission for recurrent ileus. Air swallowing and associated symptoms were mitigated by reinstitution of psychopharmacologic treatment and an increase in the patient’s self-awareness of the air-swallowing behavior. Clinically significant air swallowing has not been described previously in Tourette syndrome or a tic disorder. This case is important for pediatricians and pediatric gastroenterologists because either may be the first to evaluate a child or an adolescent with unexplained recurrent ileus. This report also documents the importance of the connection between the brain and the body. PMID:16651280
ERIC Educational Resources Information Center
Anderson, Bruce; And Others
1980-01-01
Examines trends in popular music by analyzing the number one recordings from 1940 to 1977 in terms of the relationships among four variables: manufacturer (market concentration), song type (musical genre), artist type (performance mode), and lyric content (vocal message). (JMF)
Time-of-day effects on voice range profile performance in young, vocally untrained adult females.
van Mersbergen, M R; Verdolini, K; Titze, I R
1999-12-01
Time-of-day effects on voice range profile performance were investigated in 20 vocally healthy untrained women between the ages of 18 and 35 years. Each subject produced two complete voice range profiles: one in the morning and one in the evening, about 36 hours apart. The order of morning and evening trials was counterbalanced across subjects. Dependent variables were (1) average minimum and average maximum intensity, (2) Voice range profile area and (3) center of gravity (median semitone pitch and median intensity). In this study, the results failed to reveal any clear evidence of time-of-day effects on voice range profile performance, for any of the dependent variables. However, a reliable interaction of time-of-day and trial order was obtained for average minimum intensity. Investigation of other subject populations, in particular trained vocalists or those with laryngeal lesions, is required for any generalization of the results.
Long-term functional outcome after unilateral cordectomy.
Keilmann, Annerose; Napiontek, Ulrike; Engel, Christiane; Nakarat, Todsaporn; Schneider, Astrid; Mann, Wolf
2011-01-01
Our aim was to longitudinally analyze the vocal outcome after endoscopic CO(2) laser resection of early glottic carcinoma. Sixteen patients treated with laser surgery for T1 or T2 tumors of the vocal cords received voice therapy and were examined 1, 2, 3, 4.5, 6 and 12 months postoperatively. Besides videolaryngostroboscopy, each examination included history, phonetogram of the speaking and the singing voice, language-specific hoarseness diagram and a questionnaire (Voice Handicap Index 12 in German). Objective parameters demonstrated a broad variability with a slight tendency of improvement over time. For the maximal phonation time a nearly constant improvement was seen. After an initial improvement deterioration for subjective assessment in the Voice Handicap Index was noted in most patients 3-6 months postoperatively. The functional outcome after cordectomy is variable. MESSAGE OF THE PAPER: Discrepancies between objective findings and patient satisfaction over time have to be considered after cordectomy. Copyright © 2010 S. Karger AG, Basel.
The shear modulus of the human vocal fold, preliminary results from 20 larynxes.
Goodyer, Eric; Hemmerich, Sandra; Müller, Frank; Kobler, James B; Hess, Markus
2007-01-01
Quantification of the elastic properties of the human vocal fold provides invaluable data for researchers deriving mathematical models of phonation, developing tissue engineering therapies, and as normative data for comparison between healthy and scarred tissue. This study measured the shear modulus of excised cadaver vocal folds from 20 subjects. Twenty freshly excised human larynxes were evaluated less than four days post-mortem. They were split along the saggital plane and mounted without tension. Shear modulus was obtained by two different methods. For method 1 cyclical shear stress was applied transversely to the mid-membranous portion of the vocal fold, and shear modulus derived by applying a simple shear model. For method 2 the apparatus was configured as an indentometer, and shear modulus obtained from the stress/strain data by applying an established analytical technique. Method 1 shear model for male larynxes yielded a range from 246 to 3,356 Pa, with a mean value of 1,008 and SD of 380. The range for female larynxes was 286-3,332 Pa, with a mean value of 1,237 and SD of 768. Method 2 indentometer model for male larynxes yielded a range from 552 to 2,741 Pa, with a mean value of 1,000 and SD of 460. The range for female larynxes was 509-1,989 Pa, with a mean value of 1,332 and SD of 428. We have successfully demonstrated two methodologies that are capable of directly measuring the shear modulus of the human vocal fold, without dissecting out the vocal fold cover tissue. The sample size of nine female and 11 male larynxes is too small to validate a general conclusion. The high degree of variability in this small cohort of subjects indicates that factors such as age, health status, and post-mortem delay may be significant; and that there is range of 'normality' for vocal fold tissue.
Ferrone, Carol; Galgano, Jessica; Ramig, Lorraine Olson
2011-05-01
To test the hypothesis that extensive use of La MaMa vocal technique may result in symptoms of vocal abuse, an evaluation of the acoustic and perceptual characteristics of voice for eight performers from the Great Jones Repertory Company of the La MaMa Experimental Theater was conducted. This vocal technique includes wide ranges of frequency from 46 to 2003 Hz and vocal intensity that is sustained at 90-108 dB sound pressure level with a mouth-to-microphone distance of 30 cm for 3-4 hours per performance. The actors rehearsed for 4 hours per day, 5 days per week for 14 weeks before the series of performances. Thirty-nine performances were presented in 6 weeks. Three pretraining, three posttraining, and two postperformance series data collection sessions were carried out for each performer. Speech samples were gathered using the CSL 4500 and analyzed using Real-Time Pitch program and Multidimensional Voice Program. Acoustic analysis was performed on 48 tokens of sustained vowel phonation for each subject. Statistical analysis was performed using the Friedman test of related samples. Perceptual analysis included professional listeners rating voice quality in pretraining, posttraining, and postperformance samples of the Rainbow Passage and sample lines from the plays. The majority of professional listeners (11/12) judged that this technique would result in symptoms of vocal abuse; however, acoustic data revealed statistically stable or improved measurements for all subjects in most dependent acoustic variables when compared with both posttraining and postperformance trials. These findings add support to the notion that a technique that may be perceived as vocally abusive, generating 90-100 dB sound pressure level and sustained over 6 weeks of performances, actually resulted in improved vocal strength and flexibility. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
2014-01-01
Background Whether listening to background music enhances verbal learning performance is still a matter of dispute. In this study we investigated the influence of vocal and instrumental background music on verbal learning. Methods 226 subjects were randomly assigned to one of five groups (one control group and 4 experimental groups). All participants were exposed to a verbal learning task. One group served as control group while the 4 further groups served as experimental groups. The control group learned without background music while the 4 experimental groups were exposed to vocal or instrumental musical pieces during learning with different subjective intensity and valence. Thus, we employed 4 music listening conditions (vocal music with high intensity: VOC_HIGH, vocal music with low intensity: VOC_LOW, instrumental music with high intensity: INST_HIGH, instrumental music with low intensity: INST_LOW) and one control condition (CONT) during which the subjects learned the word lists. Since it turned out that the high and low intensity groups did not differ in terms of the rated intensity during the main experiment these groups were lumped together. Thus, we worked with 3 groups: one control group and two groups, which were exposed to background music (vocal and instrumental) during verbal learning. As dependent variable, the number of learned words was used. Here we measured immediate recall during five learning sessions (recall 1 – recall 5) and delayed recall for 15 minutes (recall 6) and 14 days (recall 7) after the last learning session. Results Verbal learning improved during the first 5 recall sessions without any strong difference between the control and experimental groups. Also the delayed recalls were similar for the three groups. There was only a trend for attenuated verbal learning for the group passively listened to vocals. This learning attenuation diminished during the following learning sessions. Conclusions The exposure to vocal or instrumental background music during encoding did not influence verbal learning. We suggest that the participants are easily able to cope with this background stimulation by ignoring this information channel in order to focus on the verbal learning task. PMID:24670048
The Use of a Discrimination-Training Procedure to Teach Mand Variability to Children with Autism
ERIC Educational Resources Information Center
Brodhead, Matthew T.; Higbee, Thomas S.; Gerencser, Kristina R.; Akers, Jessica S.
2016-01-01
We investigated the effects of a script-fading and discrimination-training procedure on mand variability in preschoolers with autism. Participants were taught to vary their vocal mands in the presence of written scripts, a green placemat, and a lag schedule of reinforcement. They were also taught to engage in repetitive mands in the presence of…
A novel model for examining recovery of phonation after vocal nerve damage.
Bhama, Prabhat K; Hillel, Allen D; Merati, Albert L; Perkel, David J
2011-05-01
Recurrent laryngeal nerve injury remains a dominant clinical issue in laryngology. To date, no animal model of laryngeal reinnervation has offered an outcome measure that can reflect the degree of recovery based on vocal function. We present an avian model system for studying recovery of learned vocalizations after nerve injury. Prospective animal study. Digital recordings of bird song were made from 11 adult male zebra finches; nine birds underwent bilateral crushing of the nerve supplying the vocal organ, and two birds underwent sham surgery. Songs from all the birds were then recorded regularly and analyzed based on temporal and spectral characteristics using computer software. Indices were calculated to indicate the degree of similarity between preoperative and postoperative song. Nerve crush caused audible differences in song quality and significant drops (P<0.05) in measured spectral and, to a lesser degree, temporal indices. Spectral indices recovered significantly (mean=43.0%; standard deviation [SD]=40.7; P<0.02), and there was an insignificant trend toward recovery of temporal index (mean=28.0%; SD=41.4; P=0.0771). In five of the nine (56%) birds, there was a greater than 50% recovery of spectral indices within a 4-week period. Two birds exhibited substantially less recovery of spectral indices and two birds had a persistent decline in spectral indices. Recovery of temporal index was highly variable as well, ranging from persistent further declines of 45.1% to recovery of 87%. Neither sham bird exhibited significant (P>0.05) differences in song after nerve crush. The songbird model system allows functional analysis of learned vocalization after surgical damage to vocal nerves. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Miller, Nicola A; Gregory, Jennifer S; Aspden, Richard M; Stollery, Peter J; Gilbert, Fiona J
2014-09-01
The shape of the vocal tract and associated structures (eg, tongue and velum) is complicated and varies according to development and function. This variability challenges interpretation of voice experiments. Quantifying differences between shapes and understanding how vocal structures move in relation to each other is difficult using traditional linear and angle measurements. With statistical shape models, shape can be characterized in terms of independent modes of variation. Here, we build an active shape model (ASM) to assess morphologic and pitch-related functional changes affecting vocal structures and the airway. Using a cross-sectional study design, we obtained six midsagittal magnetic resonance images from 10 healthy adults (five men and five women) at rest, while breathing out, and while listening to, and humming low and high notes. Eighty landmark points were chosen to define the shape of interest and an ASM was built using these (60) images. Principal component analysis was used to identify independent modes of variation, and statistical analysis was performed using one-way repeated-measures analysis of variance. Twenty modes of variation were identified with modes 1 and 2 accounting for half the total variance. Modes 1 and 9 were significantly associated with humming low and high notes (P < 0.001) and showed coordinated changes affecting the cervical spine, vocal structures, and airway. Mode 2 highlighted wide structural variations between subjects. This study highlights the potential of active shape modeling to advance understanding of factors underlying morphologic and pitch-related functional variations affecting vocal structures and the airway in health and disease. Copyright © 2014 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Scheerer, Nichole E; Jones, Jeffery A
2014-12-01
Speech production requires the combined effort of a feedback control system driven by sensory feedback, and a feedforward control system driven by internal models. However, the factors that dictate the relative weighting of these feedback and feedforward control systems are unclear. In this event-related potential (ERP) study, participants produced vocalisations while being exposed to blocks of frequency-altered feedback (FAF) perturbations that were either predictable in magnitude (consistently either 50 or 100 cents) or unpredictable in magnitude (50- and 100-cent perturbations varying randomly within each vocalisation). Vocal and P1-N1-P2 ERP responses revealed decreases in the magnitude and trial-to-trial variability of vocal responses, smaller N1 amplitudes, and shorter vocal, P1 and N1 response latencies following predictable FAF perturbation magnitudes. In addition, vocal response magnitudes correlated with N1 amplitudes, vocal response latencies, and P2 latencies. This pattern of results suggests that after repeated exposure to predictable FAF perturbations, the contribution of the feedforward control system increases. Examination of the presentation order of the FAF perturbations revealed smaller compensatory responses, smaller P1 and P2 amplitudes, and shorter N1 latencies when the block of predictable 100-cent perturbations occurred prior to the block of predictable 50-cent perturbations. These results suggest that exposure to large perturbations modulates responses to subsequent perturbations of equal or smaller size. Similarly, exposure to a 100-cent perturbation prior to a 50-cent perturbation within a vocalisation decreased the magnitude of vocal and N1 responses, but increased P1 and P2 latencies. Thus, exposure to a single perturbation can affect responses to subsequent perturbations. © 2014 Federation of European Neuroscience Societies and John Wiley & Sons Ltd.
Activity propagation in an avian basal ganglia-thalamo-cortical circuit essential for vocal learning
Kojima, Satoshi; Doupe, Allison J.
2009-01-01
In mammalian basal ganglia-thalamo-cortical circuits, GABAergic pallidal neurons are thought to ‘gate’ or modulate excitation in thalamus with their strong inhibitory inputs, and thus signal to cortex by pausing and permitting thalamic neurons to fire in response to excitatory drive. In contrast, in a homologous circuit specialized for vocal learning in songbirds, evidence suggests that pallidal neurons signal by eliciting postinhibitory rebound spikes in thalamus, which could occur even without any excitatory drive to thalamic neurons. To test whether songbird pallidal neurons can also communicate with thalamus by gating excitatory drive, as well as by postinhibitory rebound, we examined the activity of thalamic relay neurons in response to acute inactivation of the basal ganglia structure Area X; Area X contains the pallidal neurons that project to thalamus. Although inactivation of Area X should eliminate rebound-mediated spiking in thalamus, this manipulation tonically increases the firing rate of thalamic relay neurons, providing evidence that songbird pallidal neurons can gate tonic thalamic excitatory drive. We also found that the increased thalamic activity was fed forward to its target in the avian equivalent of cortex, which includes neurons that project to the vocal premotor area. These data raise the possibility that basal ganglia circuits can signal to cortex through thalamus both by generating postinhibitory rebound and by gating excitatory drive, and may switch between these modes depending on the statistics of pallidal firing. Moreover, these findings provide insight into the strikingly different disruptive effects of basal ganglia and ‘cortical’ lesions on songbird vocal learning. PMID:19369547
Lim, Jae-Yol; Kim, Han Su; Kim, Young-Ho; Kim, Kwang-Moon; Choi, Hong-Shik
2008-03-01
Both PMMA (polymethylmetacrylate) microspheres (PM) and stabilized hyaluronic acid (HA) are recently used for facial augmentation. The aim of this study was to test functional effect, durability, and safety of the injection of these two materials into true vocal folds, and test their availability as injection laryngoplasty materials in vivo canine model. The study was carried out with 16 beagle dogs (8 males and 8 females, average weight of 12.4 kg). No biological difference was detected between two groups; PM (Artecoll) injection and HA (Restylane) injection group. After inducing complete unilateral paralysis of the recurrent laryngeal nerve of the dogs, either PM or HA was injected into the paralyzed vocal fold. One, 3, 6, and 9 months after the injection, clinical outcomes and videostroboscopic findings were evaluated by investigators who were blind to the injection materials. Histological study and microscopic computerized augmentation dimension analysis were also performed. In HA injection group, up to 30% the HA was gradually resorbed over time. However, in PM group, the dimension of the augmented region after 9 months was similar to that after 1 month. In both groups, the mucosal waves of the vocal folds decreased in amplitude and periodicity, but they were still well detected during the follow-up periods. Acute immune reaction to HA was not detected, but some degree of foreign body reaction occurred in PM injection group. Both PM and HA are safe and relatively durable in vocal folds and they are considered as useful candidates for injection laryngoplasty.
[Identification of emotions in patients with low-grade gliomas versus cerebrovascular accidents].
du Boullay, V; Plaza, M; Capelle, L; Chaby, L
2013-03-01
Facial and vocal emotions contribute to sustain efficient social relationships. Brain disease may impair their identification. In the case of slow-growth tumors (Low Grade Gliomas [LGG]) or sudden stroke (cerebrovascular accidents [CVA]), the lesions induce contrasted plasticity and reorganisation processes. We compared the facial, vocal and intermodal identification of six emotions (happiness, fear, angriness, sadness, disgust and neutral) of three groups: patients with LGG before and after tumor resection, patients with CVA and control subjects. In LGG patients, the results revealed less efficient performances after tumor resection and in CVA patients weak performances regarding negative emotions. The intermodal condition (simultaneous visual and vocal association) improved performances in all groups and enabled equivalent performance in CVA subjects compared with control subjects. The intergroup differences may be related to variable brain plasticity as a function of type and rapidity of brain injury. Intermodal processing appears to be a compensatory condition. Copyright © 2012. Published by Elsevier Masson SAS.
Glottal contact quotient in Mediterranean tongue trill.
Hamdan, Abdul-latif; Nassar, Jihad; Al Zaghal, Zeid; El-Khoury, Eleine; Bsat, Marwa; Tabri, Dollen
2012-09-01
To determine the effect of a Mediterranean tongue trill (Zalghouta) on estimated glottal closed quotient (CQ). A total of 10 female subjects participated in this study. Vocal fold CQ was measured for both sustained vowel [a] and the tongue trill named Zlaghouta using electroglottography. Frequencies and means (±standard deviation) were used to describe categorical and continuous variables, respectively. The Wilcoxon signed nonparametric test was used to determine any significant changes in CQ means pretrill and during trill. There was a significant decrease in the mean CQ during the Mediterranean tongue trill (Zalghouta) versus the sustained vowel [a] (P value of 0.002) by 15.98. When stratified by training status there was a significant decrease for the trained group (P value of 0.031) and a nonsignificant decrease for the untrained group (P value of 0.125). Zalghouta induces a decrease in the vocal fold contact time. The potential benefits of the Mediterranean tongue trill in vocal therapy need consideration. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Savitha, D; Sejil, T V; Rao, Shwetha; Roshan, C J; Roshan, C J
2013-01-01
The purpose of the study was to investigate the effect of vocal and instrumental music on various physiological parameters during submaximal exercise. Each subject underwent three sessions of exercise protocol without music, with vocal music, and instrumental versions of same piece of music. The protocol consisted of 10 min treadmill exercise at 70% HR(max) and 20 min of recovery. Minute to minute heart rate and breath by breath recording of respiratory parameters, rate of energy expenditure and perceived exertion levels were measured. Music, irrespective of the presence or absence of lyrics, enabled the subjects to exercise at a significantly lower heart rate and oxygen consumption, reduced the metabolic cost and perceived exertion levels of exercise (P < 0.05). There was faster recovery of systolic and diastolic blood pressures and exertion levels during the post exercise period. Music having a relaxant effect could have probably increased the parasympathetic activation leading to these effects.
I feel your voice. Cultural differences in the multisensory perception of emotion.
Tanaka, Akihiro; Koizumi, Ai; Imai, Hisato; Hiramatsu, Saori; Hiramoto, Eriko; de Gelder, Beatrice
2010-09-01
Cultural differences in emotion perception have been reported mainly for facial expressions and to a lesser extent for vocal expressions. However, the way in which the perceiver combines auditory and visual cues may itself be subject to cultural variability. Our study investigated cultural differences between Japanese and Dutch participants in the multisensory perception of emotion. A face and a voice, expressing either congruent or incongruent emotions, were presented on each trial. Participants were instructed to judge the emotion expressed in one of the two sources. The effect of to-be-ignored voice information on facial judgments was larger in Japanese than in Dutch participants, whereas the effect of to-be-ignored face information on vocal judgments was smaller in Japanese than in Dutch participants. This result indicates that Japanese people are more attuned than Dutch people to vocal processing in the multisensory perception of emotion. Our findings provide the first evidence that multisensory integration of affective information is modulated by perceivers' cultural background.
Hall, Joseph E.; Suehiro, Atsushi; Branski, Ryan C.; Garrett, C. Gaelyn; Rousseau, Bernard
2015-01-01
Objective To investigate the hypothesis that prophylactic triamcinolone modulates acute vocal fold inflammatory and profibrotic signaling during acute phonotrauma. Study Design In vivo rabbit phonation model. Setting Academic medical center. Subjects and Methods Forty New Zealand white breeder rabbits were randomly assigned to 1 of 4 groups: control (no intervention), no treatment (30 minutes of raised intensity phonation), sham treatment (bilateral intralaryngeal triamcinolone acetonide injection at 0 μg/25 μL followed by 30 minutes of raised intensity phonation), or steroid treatment (bilateral intralaryngeal triamcinolone acetonide injection at 400 μg/25 μL followed by 30 minutes of raised intensity phonation). Quantitative polymerase chain reaction (qPCR) was used to investigate gene expression levels of cyclooxygenase-2 (COX-2), interleukin (IL)–1β, and transforming growth factor (TGF)–β1. Results Results revealed a significant main effect for COX-2 (P = .002). Post hoc testing revealed that rabbits receiving no treatment (15.10) had higher COX-2 gene expression than control (5.90; P <.001). There were no significant differences in COX-2 expression between treatment groups. Results revealed a significant main effect for IL-1β (P < .001). Post hoc testing revealed that rabbits receiving no treatment (14.70) had higher IL-1β gene expression than control (6.30) (P = .001). There were no significant differences in IL-1β gene expression between treatment groups. There were no significant differences in TGF-β1 gene expression (P = .525) between treatment and control groups. Conclusion Given conflicting evidence, further studies are necessary to investigate vocal fold steroid injections prior to and following the induction of phonotrauma. Prophylactic administration of triamcinolone immediately prior to acute phonotrauma resulted in no significant changes in COX-2, IL-1β, and TGF-β1 gene transcript levels. PMID:22399283
Mapping the Early Language Environment Using All-Day Recordings and Automated Analysis.
Gilkerson, Jill; Richards, Jeffrey A; Warren, Steven F; Montgomery, Judith K; Greenwood, Charles R; Kimbrough Oller, D; Hansen, John H L; Paul, Terrance D
2017-05-17
This research provided a first-generation standardization of automated language environment estimates, validated these estimates against standard language assessments, and extended on previous research reporting language behavior differences across socioeconomic groups. Typically developing children between 2 to 48 months of age completed monthly, daylong recordings in their natural language environments over a span of approximately 6-38 months. The resulting data set contained 3,213 12-hr recordings automatically analyzed by using the Language Environment Analysis (LENA) System to generate estimates of (a) the number of adult words in the child's environment, (b) the amount of caregiver-child interaction, and (c) the frequency of child vocal output. Child vocalization frequency and turn-taking increased with age, whereas adult word counts were age independent after early infancy. Child vocalization and conversational turn estimates predicted 7%-16% of the variance observed in child language assessment scores. Lower socioeconomic status (SES) children produced fewer vocalizations, engaged in fewer adult-child interactions, and were exposed to fewer daily adult words compared with their higher socioeconomic status peers, but within-group variability was high. The results offer new insight into the landscape of the early language environment, with clinical implications for identification of children at-risk for impoverished language environments.
Laryngoscopic, acoustic, perceptual, and functional assessment of voice in rock singers.
Guzman, Marco; Barros, Macarena; Espinoza, Fernanda; Herrera, Alejandro; Parra, Daniela; Muñoz, Daniel; Lloyd, Adam
2013-01-01
The present study aimed to vocally assess a group of rock singers who use growl voice and reinforced falsetto. A group of 21 rock singers and a control group of 18 pop singers were included. Singing and speaking voice was assessed through acoustic, perceptual, functional and laryngoscopic analysis. No significant differences were observed between groups in most of the analyses. Acoustic and perceptual analysis of the experimental group demonstrated normality of speaking voice. Endoscopic evaluation showed that most rock singers presented during singing voice a high vertical laryngeal position, pharyngeal compression and laryngeal supraglottic compression. Supraglottic activity during speaking voice tasks was also observed. However, overall vocal fold integrity was demonstrated in most of the participants. Slightly abnormal observations were demonstrated in few of them. Singing voice handicap index revealed that the most affected variable was the physical sphere, followed by the social and emotional spheres. Although growl voice and reinforced falsetto represent laryngeal and pharyngeal hyperfunctional activity, they did not seem to contribute to the presence of any major vocal fold disorder in our subjects. Nevertheless, we cannot rule out the possibility that more evident vocal fold disorders could be found in singers who use these techniques more often and during a longer period of time.
Koda, Hiroki; Tokuda, Isao T; Wakita, Masumi; Ito, Tsuyoshi; Nishimura, Takeshi
2015-06-01
Whistle-like high-pitched "phee" calls are often used as long-distance vocal advertisements by small-bodied marmosets and tamarins in the dense forests of South America. While the source-filter theory proposes that vibration of the vocal fold is modified independently from the resonance of the supralaryngeal vocal tract (SVT) in human speech, a source-filter coupling that constrains the vibration frequency to SVT resonance effectively produces loud tonal sounds in some musical instruments. Here, a combined approach of acoustic analyses and simulation with helium-modulated voices was used to show that phee calls are produced principally with the same mechanism as in human speech. The animal keeps the fundamental frequency (f0) close to the first formant (F1) of the SVT, to amplify f0. Although f0 and F1 are primarily independent, the degree of their tuning can be strengthened further by a flexible source-filter interaction, the variable strength of which depends upon the cross-sectional area of the laryngeal cavity. The results highlight the evolutionary antiquity and universality of the source-filter model in primates, but the study can also explore the diversification of vocal physiology, including source-filter interaction and its anatomical basis in non-human primates.
Tervo, Outi M; Parks, Susan E; Miller, Lee A
2009-09-01
Singing behavior has been described from bowhead whales in the Bering Sea during their annual spring migration and from Davis Strait during their spring feeding season. It has been suggested that this spring singing behavior is a remnant of the singing during the winter breeding season, though no winter recordings are available. In this study, the authors describe recordings made during the winter and spring months of bowhead whales in Disko Bay, Western-Greenland. A total of 7091 bowhead whale sounds were analyzed to describe the vocal repertoire, the singing behavior, and the changes in vocal behavior from February to May. The vocal signals could be divided into simple (frequency-modulated) calls (n=483), complex (amplitude-modulated) calls (n=635), and song notes (n=5973). Recordings from the end of February to middle of March were characterized by higher call rates with a greater diversity of call types than recordings made later in the season. This study is the first description of bowhead song from the stock in Western-Greenland during both the winter and spring months, and provides support for the hypothesis that song during the winter months contains more song notes than song from the spring making the winter song more variable.
Transmasculine People's Voice Function: A Review of the Currently Available Evidence.
Azul, David; Nygren, Ulrika; Södersten, Maria; Neuschaefer-Rube, Christiane
2017-03-01
This study aims to evaluate the currently available discursive and empirical data relating to those aspects of transmasculine people's vocal situations that are not primarily gender-related, to identify restrictions to voice function that have been observed in this population, and to make suggestions for future voice research and clinical practice. We conducted a comprehensive review of the voice literature. Publications were identified by searching six electronic databases and bibliographies of relevant articles. Twenty-two publications met inclusion criteria. Discourses and empirical data were analyzed for factors and practices that impact on voice function and for indications of voice function-related problems in transmasculine people. The quality of the evidence was appraised. The extent and quality of studies investigating transmasculine people's voice function was found to be limited. There was mixed evidence to suggest that transmasculine people might experience restrictions to a range of domains of voice function, including vocal power, vocal control/stability, glottal function, pitch range/variability, vocal endurance, and voice quality. More research into the different factors and practices affecting transmasculine people's voice function that takes account of a range of parameters of voice function and considers participants' self-evaluations is needed to establish how functional voice production can be best supported in this population. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Histological evaluation of the equine larynx after unilateral laser-assisted ventriculocordectomy.
Robinson, P; Williams, K J; Sullins, K E; Arnoczky, S P; Stick, J A; Robinson, N E; de Feijter-Rupp, H; Derksen, F J
2007-05-01
Trans-endoscopic laser surgery, such as unilateral laser-assisted ventriculocordectomy (LVC), has gained popularity in the treatment of RLN because a laryngotomy incision or general anaesthesia are not required. However, removal of the vocal fold and ventricle takes considerable laser energy and could cause collateral tissue damage, including injury to the adjacent laryngeal cartilages. To document the histological effects of laser surgery on laryngeal tissues in horses that have undergone LVC for the treatment of laryngeal hemiplegia (LH). Six horses were used: 4 with experimentally induced LH that had subsequently undergone LVC 6 months prior to euthanasia; and, 2 horses were used as controls. One of the control horses with naturally occurring LH was used to study the effect of neuropathy alone, whereas the other was subjected to euthanasia immediately following LVC to evaluate the acute effect of laser surgery. Using a band saw, each larynx was sectioned transversely at 5 mm intervals and evaluated histologically. Acutely, LVC caused thermal damage to adjacent soft tissues but did not affect the histology of the laryngeal cartilages. Six months after LVC, laryngeal cartilages were histologically normal and there was squamous metaplasia of the repaired laryngeal mucosa, resulting in restitution of the mucosal integrity. Using a diode laser in contact fashion at 20 W, LVC can be used to remove the laryngeal vocal fold and ventricle without causing laryngeal cartilage damage. Laryngeal chondritis is an unlikely consequence of LVC.
Hirano, Tetsushi; Yanai, Shogo; Takada, Tadashi; Yoneda, Naoki; Omotehara, Takuya; Kubota, Naoto; Minami, Kiichi; Yamamoto, Anzu; Mantani, Youhei; Yokoyama, Toshifumi; Kitagawa, Hiroshi; Hoshi, Nobuhiko
2018-01-05
Neonicotinoids are novel systemic pesticides acting as agonists on the nicotinic acetylcholine receptors (nAChRs) of insects. Experimental studies have revealed that neonicotinoids pose potential risks for the nervous systems of non-target species, but the brain regions responsible for their behavioral effects remain incompletely understood. This study aimed to assess the neurobehavioral effects of clothianidin (CTD), a later neonicotinoid developed in 2001 and widely used worldwide, and to explore the target regions of neonicotinoids in the mammalian brain. A single-administration of 5 or 50mg/kg CTD to male C57BL/6N mice at or below the no-observed-adverse-effect level (NOAEL) induced an acute increase in anxiety during the elevated plus-maze test. In addition, mice in the CTD-administered group spontaneously emitted human-audible vocalizations (4-16kHz), which are behavioral signs of aversive emotions, and showed increased numbers of c-fos immunoreactive cells in the paraventricular thalamic nucleus and dentate gyrus of the hippocampus. In conclusion, mice exposed to NOAEL-dose CTD would be rendered vulnerable to a novel environment via the activation of thalamic and hippocampal regions related to stress responses. These findings should provide critical insight into the neurobehavioral effects of neonicotinoids on mammals. Copyright © 2017 Elsevier B.V. All rights reserved.
Bradt, Joke; Norris, Marisol; Shim, Minjung; Gracely, Edward J.; Gerrity, Patricia
2017-01-01
Background To date, research on music for pain management has focused primarily on listening to pre-recorded music for acute pain. Research is needed on the impact of active music therapy interventions on chronic pain management. Objective The aim of this mixed methods research study was to determine feasibility and estimates of effect of vocal music therapy for chronic pain management. Methods Fifty-five inner city adults, predominantly African-Americans, with chronic pain were randomized to an 8-week vocal music therapy treatment group or waitlist control group. Consent and attrition rates, treatment compliance, and instrument appropriateness/burden were tracked. Physical functioning (pain interference and general activities), self-efficacy, emotional functioning, pain intensity, pain coping, and participant perception of change were measured at baseline, 4, 8, and 12 weeks. Focus groups were conducted at the 12-week follow-up. Results The consent rate was 77%. The attrition rate was 27% at follow-up. We established acceptability of the intervention. Large effect sizes were obtained for self-efficacy at weeks 8 and 12; a moderate effect size was found for pain interference at week 8; no improvements were found for general activities and emotional functioning. Moderate effect sizes were obtained for pain intensity and small effect sizes for coping, albeit not statistically significant. Qualitative findings suggested that the treatment resulted in enhanced self-management, motivation, empowerment, a sense of belonging, and reduced isolation. Conclusions This study suggests that vocal music therapy may be effective in building essential stepping stones for effective chronic pain management namely enhanced self-efficacy, motivation, empowerment, and social engagement. PMID:27090149
Bradt, Joke; Norris, Marisol; Shim, Minjung; Gracely, Edward J; Gerrity, Patricia
2016-01-01
To date, research on music for pain management has focused primarily on listening to prerecorded music for acute pain. Research is needed on the impact of active music therapy interventions on chronic pain management. The aim of this mixed methods research study was to determine feasibility and estimates of effect of vocal music therapy for chronic pain management. Fifty-five inner-city adults, predominantly African Americans, with chronic pain were randomized to an 8-week vocal music therapy treatment group or waitlist control group. Consent and attrition rates, treatment compliance, and instrument appropriateness/burden were tracked. Physical functioning (pain interference and general activities), self-efficacy, emotional functioning, pain intensity, pain coping, and participant perception of change were measured at baseline, 4, 8, and 12 weeks. Focus groups were conducted at the 12-week follow-up. The consent rate was 77%. The attrition rate was 27% at follow-up. We established acceptability of the intervention. Large effect sizes were obtained for self-efficacy at weeks 8 and 12; a moderate effect size was found for pain interference at week 8; no improvements were found for general activities and emotional functioning. Moderate effect sizes were obtained for pain intensity and small effect sizes for coping, albeit not statistically significant. Qualitative findings suggested that the treatment resulted in enhanced self-management, motivation, empowerment, a sense of belonging, and reduced isolation. This study suggests that vocal music therapy may be effective in building essential stepping-stones for effective chronic pain management, namely enhanced self-efficacy, motivation, empowerment, and social engagement. © the American Music Therapy Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Exploring Attitudes of Indian Classical Singers Toward Seeking Vocal Health Care.
Gunjawate, Dhanshree R; Aithal, Venkataraja U; Guddattu, Vasudeva; Kishore, Amrutha; Bellur, Rajashekhar
2016-11-01
The attitude of Indian classical singers toward seeking vocal health care is a dimension yet to be explored. The current study was aimed to determine the attitudes of these singers toward seeking vocal health care and further understand the influence of age and gender. Cross-sectional. A 10-item self-report questionnaire adapted from a study on contemporary commercial music singers was used. An additional question was added to ask if the singer was aware about the profession and role of speech-language pathologists (SLPs). The questionnaire was administered on 55 randomly selected self-identified trained Indian classical singers who rated the items using a five-point Likert scale. Demographic variables were summarized using descriptive statistics and t test was used to compare the mean scores between genders and age groups. Of the singers, 78.2% were likely to see a doctor for heath-related problems, whereas 81.8% were unlikely to seek medical care for voice-related problems; the difference was statistically significant (P < 0.001). Responses for the questions assessing the attitudes toward findings from medical examination by a specialist revealed a statistically significant difference (P = 0.02) between the genders. Age did not have a significant influence on the responses. Only 23.6% of the respondents were aware about the profession and the role of SLPs. The findings are in tune with western literature reporting hesitation of singers toward seeking vocal health care and draws attention of SLPs to promote their role in vocal health awareness and management. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Chen, Wenli; Woo, Peak; Murry, Thomas
2017-09-01
High-speed videoendoscopy captures the cycle-to-cycle vibratory motion of each individual vocal fold in normal and severely disordered phonation. Therefore, it provides a direct method to examine the specific vibratory changes following vocal fold surgery. The purpose of this study was to examine the vocal fold vibratory pattern changes in the surgically treated pathologic vocal fold and the contralateral vocal fold in three vocal pathologies: vocal polyp (n = 3), paresis or paralysis (n = 3), and scar (n = 3). Digital kymography was used to extract high-speed kymographic vocal fold images at the mid-membranous region of the vocal fold. Spectral analysis was subsequently applied to the digital kymography to quantify the cycle-to-cycle movements of each vocal fold, expressed as a spectrum. Surgical modification resulted in significantly improved spectral power of the treated pathologic vocal fold. Furthermore, the contralateral vocal fold also presented with improved spectral power irrespective of vocal pathology. In comparison with normal vocal fold spectrum, postsurgical vocal fold vibrations continued to demonstrate decreased vibratory amplitude in both vocal folds. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Kojima, Tsuyoshi; Van Deusen, Mark; Jerome, W. Gray; Garrett, C. Gaelyn; Sivasankar, M. Preeti; Novaleski, Carolyn K.; Rousseau, Bernard
2014-01-01
Because the vocal folds undergo repeated trauma during continuous cycles of vibration, the epithelium is routinely susceptible to damage during phonation. Excessive and prolonged vibration exposure is considered a significant predisposing factor in the development of vocal fold pathology. The purpose of the present study was to quantify the extent of epithelial surface damage following increased time and magnitude doses of vibration exposure using an in vivo rabbit phonation model. Forty-five New Zealand white breeder rabbits were randomized to nine groups and received varying phonation time-doses (30, 60, or 120 minutes) and magnitude-doses (control, modal intensity phonation, or raised intensity phonation) of vibration exposure. Scanning electron microscopy and transmission electron microscopy was used to quantify the degree of epithelial surface damage. Results revealed a significant reduction in microprojection density, microprojection height, and depth of the epithelial surface with increasing time and phonation magnitudes doses, signifying increased epithelial surface damage risk with excessive and prolonged vibration exposure. Destruction to the epithelial cell surface may provide significant insight into the disruption of cell function following prolonged vibration exposure. One important goal achieved in the present study was the quantification of epithelial surface damage using objective imaging criteria. These data provide an important foundation for future studies of long-term tissue recovery from excessive and prolonged vibration exposure. PMID:24626217
Laryngeal complications after type 1 thyroplasty.
Cotter, C S; Avidano, M A; Crary, M A; Cassisi, N J; Gorham, M M
1995-12-01
Type I thyroplasty has become a primary surgical choice for voice restoration in patients with glottal incompetence. This study examines factors associated with laryngeal complications after type I thyroplasty. Ten laryngoscopic variables were analyzed from preoperative, intraoperative, and postoperative videolaryngoscopies of 51 patients undergoing 58 medialization procedures. Ten patient and operative variables were examined by medical record review. Major complications were defined as wound hemorrhage, airway obstruction, or prosthesis extrusion. Minor complications were defined as vocal fold hematoma without airway obstruction or prosthesis movement. The major complication rate was 8.6%, and the minor complication rate was 29%. No delayed hemorrhage or airway obstruction occurred. Prosthesis extrusion occurred in five (8.6%) patients 1 week to 5 months after surgery. Extrusion was associated with suboptimal prosthesis placement in 80% of cases. Two patients retained excellent glottal closure despite extrusion. Vocal fold hematoma was identified in 14 (24%) cases and resolved within 1 week. Prosthesis movement occurred in three (5%) patients 1 week to 6 months after surgery and resulted in poor glottal closure. All patients with prosthesis extrusion or movement were female. Type I thyroplasty remains a safe outpatient procedure with few major complications. Prosthesis extrusion was associated with suboptimal prosthesis placement and may or may not result in poor glottal closure. Minor vocal fold hematomas were relatively frequent, resolved rapidly, and were not associated with airway obstruction. Female patients may be more prone to complications because of their small laryngeal size.
Kaneko, Mami; Hitomi, Takefumi; Takekawa, Takashi; Tsuji, Takuya; Kishimoto, Yo; Hirano, Shigeru
2017-09-26
Injury to the superior laryngeal nerve can result in dysphonia, and in particular, loss of vocal range. It can be an especially difficult problem to address with either voice therapy or surgical intervention. Some clinicians and scientists suggest that combining vocal exercises with adjunctive neuromuscular electrical stimulation may enhance the positive effects of voice therapy for superior laryngeal nerve paresis (SLNP). However, the effects of voice therapy without neuromuscular electrical stimulation are unknown. The purpose of this retrospective study was to demonstrate the clinical effectiveness of voice therapy for rehabilitating chronic SLNP dysphonia in two subjects, using interspike interval (ISI) variability of laryngeal motor units by laryngeal electromyography (LEMG). Both patients underwent LEMG and were diagnosed with having 70% recruitment of the cricothyroid muscle, and 70% recruitment of the cricothyroid and thyroarytenoid muscles, respectively. Both patients received voice therapy for 3 months. Grade, roughness, breathiness, asthenia, and strain (GRBAS) scale, stroboscopic examination, aerodynamic assessment, acoustic analysis, and Voice Handicap Index-10 were performed before and after voice therapy. Mean ISI variability during steady phonation was also assessed. After voice therapy, both patients showed improvement in vocal assessments by acoustic, aerodynamic, GRBAS, and Voice Handicap Index-10 analysis. LEMG indicated shortened ISIs in both cases. This study suggests that voice therapy for chronic SLNP dysphonia can be useful for improving SLNP and voice quality. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Vocal cord dysfunction related to water-damaged buildings.
Cummings, Kristin J; Fink, Jordan N; Vasudev, Monica; Piacitelli, Chris; Kreiss, Kathleen
2013-01-01
Vocal cord dysfunction (VCD) is the intermittent paradoxical adduction of the vocal cords during respiration, resulting in variable upper airway obstruction. Exposure to damp indoor environments is associated with adverse respiratory health outcomes, including asthma, but its role in the development of VCD is not well described. We describe the spectrum of respiratory illness in occupants of 2 water-damaged office buildings. The National Institute for Occupational Safety and Health conducted a health hazard evaluation that included interviews with managers, a maintenance officer, a remediation specialist who had evaluated the 2 buildings, employees, and consulting physicians. In addition, medical records and reports of building evaluations were reviewed. Diagnostic evaluations for VCD had been conducted at the Asthma and Allergy Center of the Medical College of Wisconsin. Two cases of VCD were temporally related to occupancy of water-damaged buildings. The patients experienced cough, chest tightness, dyspnea, wheezing, and hoarseness when in the buildings. Spirometry was normal. Methacholine challenge did not show bronchial hyperreactivity but did elicit symptoms of VCD and inspiratory flow-volume loop truncation. Direct laryngoscopy revealed vocal cord adduction during inspiration. Coworkers developed upper and lower respiratory symptoms; their diagnoses included sinusitis and asthma, consistent with recognized effects of exposure to indoor dampness. Building evaluations provided evidence of water damage and mold growth. VCD can occur with exposure to water-damaged buildings and should be considered in exposed patients with asthma-like symptoms. Published by Elsevier Inc.
Chan, Roger W; Rodriguez, Maritza L
2008-08-01
Previous studies reporting the linear viscoelastic shear properties of the human vocal fold cover or mucosa have been based on torsional rheometry, with measurements limited to low audio frequencies, up to around 80 Hz. This paper describes the design and validation of a custom-built, controlled-strain, linear, simple-shear rheometer system capable of direct empirical measurements of viscoelastic shear properties at phonatory frequencies. A tissue specimen was subjected to simple shear between two parallel, rigid acrylic plates, with a linear motor creating a translational sinusoidal displacement of the specimen via the upper plate, and the lower plate transmitting the harmonic shear force resulting from the viscoelastic response of the specimen. The displacement of the specimen was measured by a linear variable differential transformer whereas the shear force was detected by a piezoelectric transducer. The frequency response characteristics of these system components were assessed by vibration experiments with accelerometers. Measurements of the viscoelastic shear moduli (G' and G") of a standard ANSI S2.21 polyurethane material and those of human vocal fold cover specimens were made, along with estimation of the system signal and noise levels. Preliminary results showed that the rheometer can provide valid and reliable rheometric data of vocal fold lamina propria specimens at frequencies of up to around 250 Hz, well into the phonatory range.
Dosage dependent effect of high-resistance straw exercise in dysphonic and non-dysphonic women.
Paes, Sabrina Mazzer; Behlau, Mara
2017-03-09
to study the dosage dependent effect of high-resistance straw exercise in women with behavioral dysphonia and in vocally healthy women. 25 dysphonic women (DG), with average age of 35 years (SD = 10.5) and 30 vocally healthy women (VHG), with average age of 31.6 years (SD = 10.3). The participants produced a continuous sound into a thin high-resistance straw for seven minutes, being interrupted after the first, third, fifth and seventh minute. At each interval, speech samples were recorded (sustained vowel and counting up to 20) and subsequently acoustically analyzed. Each participant reported the effort necessary to perform exercise and to speak, indicating their ratings on visual analog scales (VAS). with regard to the DG, the exercise caused positive vocal changes, especially between the third and fifth minute: less phonatory effort, increase in MPT, and reduction of F0 variability; these voice parameters deteriorated after five minutes. This fact associated with the increased effort to perform the exercise indicates a possible overload of the phonatory system. As to the VHG, MPT improved after one minute of exercise, while the other parameters did not change over time, probably due to the fact that the voices were not deviant; seven minutes did not seem to impose an overload in this population. positive vocal changes were observed with the high-resistance straw exercise; however, there are dosage restrictions, especially for dysphonic women.
Effects of spectral and temporal disruption on cortical encoding of gerbil vocalizations
Ter-Mikaelian, Maria; Semple, Malcolm N.
2013-01-01
Animal communication sounds contain spectrotemporal fluctuations that provide powerful cues for detection and discrimination. Human perception of speech is influenced both by spectral and temporal acoustic features but is most critically dependent on envelope information. To investigate the neural coding principles underlying the perception of communication sounds, we explored the effect of disrupting the spectral or temporal content of five different gerbil call types on neural responses in the awake gerbil's primary auditory cortex (AI). The vocalizations were impoverished spectrally by reduction to 4 or 16 channels of band-passed noise. For this acoustic manipulation, an average firing rate of the neuron did not carry sufficient information to distinguish between call types. In contrast, the discharge patterns of individual AI neurons reliably categorized vocalizations composed of only four spectral bands with the appropriate natural token. The pooled responses of small populations of AI cells classified spectrally disrupted and natural calls with an accuracy that paralleled human performance on an analogous speech task. To assess whether discharge pattern was robust to temporal perturbations of an individual call, vocalizations were disrupted by time-reversing segments of variable duration. For this acoustic manipulation, cortical neurons were relatively insensitive to short reversal lengths. Consistent with human perception of speech, these results indicate that the stable representation of communication sounds in AI is more dependent on sensitivity to slow temporal envelopes than on spectral detail. PMID:23761696
Yanagihara, Shin; Hessler, Neal A.
2011-01-01
Reactivations of waking experiences during sleep have been considered fundamental neural processes for memory consolidation. In songbirds, evidence suggests the importance of sleep-related neuronal activity in song system motor pathway nuclei for both juvenile vocal learning and maintenance of adult song. Like those in singing motor nuclei, neurons in the basal ganglia nucleus Area X, part of the basal ganglia-thalamocortical circuit essential for vocal plasticity, exhibit singing-related activity. It is unclear, however, whether Area X neurons show any distinctive spiking activity during sleep similar to that during singing. Here we demonstrate that, during sleep, Area X pallidal neurons exhibit phasic spiking activity, which shares some firing properties with activity during singing. Shorter interspike intervals that almost exclusively occurred during singing in awake periods were also observed during sleep. The level of firing variability was consistently higher during singing and sleep than during awake non-singing states. Moreover, deceleration of firing rate, which is considered to be an important firing property for transmitting signals from Area X to the thalamic nucleus DLM, was observed mainly during sleep as well as during singing. These results suggest that songbird basal ganglia circuitry may be involved in the off-line processing potentially critical for vocal learning during sensorimotor learning phase. PMID:21991379
Vibratory regime classification of infant phonation.
Buder, Eugene H; Chorna, Lesya B; Oller, D Kimbrough; Robinson, Rebecca B
2008-09-01
Infant phonation is highly variable in many respects, including the basic vibratory patterns by which the vocal tissues create acoustic signals. Previous studies have identified the regular occurrence of nonmodal phonation types in normal infant phonation. The glottis is like many oscillating systems that, because of nonlinear relationships among the elements, may vibrate in ways representing the deterministic patterns classified theoretically within the mathematical framework of nonlinear dynamics. The infant's preverbal vocal explorations present such a variety of phonations that it may be possible to find effectively all the classes of vibration predicted by nonlinear dynamic theory. The current report defines acoustic criteria for an important subset of such vibratory regimes, and demonstrates that analysts can be trained to reliably use these criteria for a classification that includes all instances of infant phonation in the recorded corpora. The method is thus internally comprehensive in the sense that all phonations are classified, but it is not exhaustive in the sense that all vocal qualities are thereby represented. Using the methods thus developed, this study also demonstrates that the distributions of these phonation types vary significantly across sessions of recording in the first year of life, suggesting developmental changes. The method of regime classification is thus capable of tracking changes that may be indicative of maturation of the mechanism, the learning of categories of phonatory control, and the possibly varying use of vocalizations across social contexts.
Violence in schools and the voice of teachers.
Dornelas, Rodrigo; Santos, Thaynara Alves Dos; Oliveira, Daniela Sena de; Irineu, Roxane de Alencar; Brito, Aline; Silva, Kelly
2017-08-10
To correlate self-reporting of voice disorders with habits that impact voice production and situations of violence experienced by teachers. The study involved 41 elementary-school teachers of rural and urban areas. Two instruments were used for data collection: The Vocal Production Condition - Teacher (CPV-P) questionnaire and the Screening Index for Voice Disorders - ITDV. The chi-square test was used to verify association among variables with a significance level of 5%. The sample consisted of 8 men and 33 women aged 25-66 years with a median of 39 years. Regarding vocal habits, 33 people (80.5%) mentioned the screaming as usual practice, 40 people (97.5%) declared they talk a lot. As for voice care, 31 people (73.1%) reported drinking water while using their voice. As for the ITDV total score, 30 teachers (73.1%) were above the score threshold set for predisposition to vocal disorders. Statistical analysis revealed a significant association between female participants and complaint of graffiti writings as a type of violence. No significant correlation between the ITDV results with gender and the ITDV with forms of violence evaluated in the study was indicated. Self-reporting of voice disorders showed no significant relationship with acts of violence. However, analysis of the context of violence in schools and vocal problems are issues worthy of attention, particularly the observed naturalization of gender inssues, which is seldom problematized.
Provine, Robert R.; Emmorey, Karen
2008-01-01
The placement of laughter in the speech of hearing individuals is not random but “punctuates” speech, occurring during pauses and at phrase boundaries where punctuation would be placed in a transcript of a conversation. For speakers, language is dominant in the competition for the vocal tract since laughter seldom interrupts spoken phrases. For users of American Sign Language, however, laughter and language do not compete in the same way for a single output channel. This study investigated whether laughter occurs simultaneously with signing, or punctuates signing, as it does speech, in 11 signed conversations (with two to five participants) that had at least one instance of audible, vocal laughter. Laughter occurred 2.7 times more often during pauses and at phrase boundaries than simultaneously with a signed utterance. Thus, the production of laughter involves higher order cognitive or linguistic processes rather than the low-level regulation of motor processes competing for a single vocal channel. In an examination of other variables, the social dynamics of deaf and hearing people were similar, with “speakers” (those signing) laughing more than their audiences and females laughing more than males. PMID:16891353
Speech vs. singing: infants choose happier sounds
Corbeil, Marieve; Trehub, Sandra E.; Peretz, Isabelle
2013-01-01
Infants prefer speech to non-vocal sounds and to non-human vocalizations, and they prefer happy-sounding speech to neutral speech. They also exhibit an interest in singing, but there is little knowledge of their relative interest in speech and singing. The present study explored infants' attention to unfamiliar audio samples of speech and singing. In Experiment 1, infants 4–13 months of age were exposed to happy-sounding infant-directed speech vs. hummed lullabies by the same woman. They listened significantly longer to the speech, which had considerably greater acoustic variability and expressiveness, than to the lullabies. In Experiment 2, infants of comparable age who heard the lyrics of a Turkish children's song spoken vs. sung in a joyful/happy manner did not exhibit differential listening. Infants in Experiment 3 heard the happily sung lyrics of the Turkish children's song vs. a version that was spoken in an adult-directed or affectively neutral manner. They listened significantly longer to the sung version. Overall, happy voice quality rather than vocal mode (speech or singing) was the principal contributor to infant attention, regardless of age. PMID:23805119
Provine, Robert R; Emmorey, Karen
2006-01-01
The placement of laughter in the speech of hearing individuals is not random but "punctuates" speech, occurring during pauses and at phrase boundaries where punctuation would be placed in a transcript of a conversation. For speakers, language is dominant in the competition for the vocal tract since laughter seldom interrupts spoken phrases. For users of American Sign Language, however, laughter and language do not compete in the same way for a single output channel. This study investigated whether laughter occurs simultaneously with signing, or punctuates signing, as it does speech, in 11 signed conversations (with two to five participants) that had at least one instance of audible, vocal laughter. Laughter occurred 2.7 times more often during pauses and at phrase boundaries than simultaneously with a signed utterance. Thus, the production of laughter involves higher order cognitive or linguistic processes rather than the low-level regulation of motor processes competing for a single vocal channel. In an examination of other variables, the social dynamics of deaf and hearing people were similar, with "speakers" (those signing) laughing more than their audiences and females laughing more than males.
Dawel, Amy; O'Kearney, Richard; McKone, Elinor; Palermo, Romina
2012-11-01
The present meta-analysis aimed to clarify whether deficits in emotion recognition in psychopathy are restricted to certain emotions and modalities or whether they are more pervasive. We also attempted to assess the influence of other important variables: age, and the affective factor of psychopathy. A systematic search of electronic databases and a subsequent manual search identified 26 studies that included 29 experiments (N = 1376) involving six emotion categories (anger, disgust, fear, happiness, sadness, surprise) across three modalities (facial, vocal, postural). Meta-analyses found evidence of pervasive impairments across modalities (facial and vocal) with significant deficits evident for several emotions (i.e., not only fear and sadness) in both adults and children/adolescents. These results are consistent with recent theorizing that the amygdala, which is believed to be dysfunctional in psychopathy, has a broad role in emotion processing. We discuss limitations of the available data that restrict the ability of meta-analysis to consider the influence of age and separate the sub-factors of psychopathy, highlighting important directions for future research. Copyright © 2012 Elsevier Ltd. All rights reserved.
Briseño-Jaramillo, Margarita; Biquand, Véronique; Estrada, Alejandro; Lemasson, Alban
2017-05-01
Alouatta species utter the most powerful primate vocalizations in the Neotropics and are well-known for their loud and long-lasting male howling bouts. However, the diversity of acoustic structures used in these howling bouts, as well as in non-howling contexts, and the relative contribution of the different group members to the entire vocal repertoire, needed to be explored further. This report provides the first detailed description of the vocal repertoire of black howler monkeys (Alouatta pigra), focusing on acoustic structures and contexts of emission of both loud and soft calls as well as on the contribution rate of males and females to the different call types. Three free-ranging social groups of black howler monkeys living in Palenque National Park, Mexico were monitored. We identified twelve acoustically discriminable call types, eight described previously and four described here for the first time. A few call types were systematically emitted either isolated or during howling bouts, but most of them could be heard in both calling contexts. Three call types were emitted only by females and two only by males. Adult males' call rates (for the seven shared call types) were higher than those of females but only when considering calls emitted within howling bouts. Our contextual analysis enabled us to divide call types into potential functional categories, according to their degree of contribution, to intra-group versus inter-group interactions and to neutral-positive versus negative situations. We then discussed how socio-ecological factors, notably sex differences in social behaviors, may explain the variability found in the vocal repertoire of this species and compared our findings with the literature on other primate species. © 2017 Wiley Periodicals, Inc.
Farran, Alexandra J E; Teller, Sean S; Jha, Amit K; Jiao, Tong; Hule, Rohan A; Clifton, Rodney J; Pochan, Darrin P; Duncan, Randall L; Jia, Xinqiao
2010-04-01
Vocal fold diseases and disorders are difficult to treat surgically or therapeutically. Tissue engineering offers an alternative strategy for the restoration of functional vocal folds. As a first step toward vocal fold tissue engineering, we investigated the responses of primary vocal fold fibroblasts (PVFFs) to two types of collagen and hyaluronic acid (HA)-based hydrogels that are compositionally similar, but structurally variable and mechanically different. Type A hydrogels were composed of mature collagen fibers reinforced by oxidized HA, whereas type B hydrogels contained immature collagen fibrils interpenetrated in an amorphous, covalently cross-linked HA matrix. PVFFs encapsulated in either matrix adopted a fibroblastic morphology and expressed genes related to important extracellular matrix proteins. DNA analysis indicated a linear growth profile for cells encapsulated in type B gels from day 0 to 21, in contrast to an initial dormant, nonproliferative period from day 0 to 3 experienced by cells in type A gels. At the end of the culture, similar DNA content was detected in both types of constructs. A reduction in collagen content was observed for both types of constructs after 28 days of culture, with type A constructs generally retaining higher amounts of collagen than type B constructs. The HA content in the constructs decreased steadily throughout the culture, with type A constructs consistently exhibiting less HA than type B constructs. Using the torsional wave analysis, we found that the elastic moduli for type A constructs decreased sharply during the first week of culture, followed by 2 weeks of matrix stabilization without significant changes in matrix stiffness. Conversely, the elastic modulus for type B constructs increased moderately over time. It is postulated that PVFFs residing in gels alter the matrix organization, chemical compositions, and viscoelasticity through cell-mediated remodeling processes.
Faham, Maryam; Jalilevand, Nahid; Torabinezhad, Farhad; Silverman, Erin Pearson; Ahmadi, Akram; Anaraki, Zahra Ghayoumi; Jafari, Narges
2017-07-01
Teachers are at high risk of developing voice problems because of the excessive vocal demands necessitated by their profession. Teachers' self-assessment of vocal complaints, combined with subjective and objective measures of voice, may enable better therapeutic decision-making. This investigation compared audio-perceptual assessment and acoustic variables in teachers with and without voice complaints. Ninety-nine teachers completed this cross-sectional study and were assigned to one of two groups: those "with voice complaint (VC)" and those "without voice complaint (W-VC)." Voice samples were collected during reading, counting, and vowel prolongation tasks. Teachers were also asked to document any voice symptoms they experienced. Voice samples were analyzed using Dr. Speech program (4th version; Tiger Ltd., USA), and labeled "normal" or "abnormal" according to the "grade" dimension "G" from GRBAS scale. Twenty-one teachers were assigned to the VC group based on self-assessment data. There were statistically significant differences between the two groups with regard to self-reported voice symptoms of hoarseness, breathiness, pitch breaks, and vocal fatigue (P < 0.05). Fourteen participants in the VC group and 40 from the W-VC group were determined to demonstrate "abnormal" vocal quality on perceptual assessment. Only harmonic-to-noise ratio was significantly higher for the W-VC group (ES = 0.55). Teachers with and without voice complaints differed in the incidence, but not type of voice symptoms. Teachers' voice complaints did not correspond to perceptual and acoustic measures. This suggests a potential unmet need for teachers to receive further education on voice disorders. Copyright © 2017 The Voice Foundation. All rights reserved.
Artificially lengthened and constricted vocal tract in vocal training methods.
Bele, Irene Velsvik
2005-01-01
It is common practice in vocal training to make use of vocal exercise techniques that involve partial occlusion of the vocal tract. Various techniques are used; some of them form an occlusion within the front part of the oral cavity or at the lips. Another vocal exercise technique involves lengthening the vocal tract; for example, the method of phonation into small tubes. This essay presents some studies made on the effects of various vocal training methods that involve an artificially lengthened and constricted vocal tract. The influence of sufficient acoustic impedance on vocal fold vibration and economical voice production is presented.
The Influence of Interactive Context on Prelinguistic Vocalizations and Maternal Responses
ERIC Educational Resources Information Center
Gros-Louis, Julie; West, Meredith J.; King, Andrew P.
2016-01-01
Many studies have documented influences of maternal responsiveness on cognitive and language development. Given the bidirectionality of interactions in caregiver-infant dyads, it is important to understand how infant behavior elicits variable responses. Prior studies have shown that mothers respond differentially to features of prelinguistic…
Observational Data on the Psychological Experience of Childbirth.
ERIC Educational Resources Information Center
Standley, Kay
This paper explores the effects of the childbirth environment and demographic variables on the laboring woman's expressions of tension and pain. Four observable physical measures were combined into a behavioral index of pain and tension: irregular breathing, tension of the upper extremities (rigid muscle flexion), vocalizations of pain (a cry,…
Analyzing Stimulus-Stimulus Pairing Effects on Preferences for Speech Sounds
ERIC Educational Resources Information Center
Petursdottir, Anna Ingeborg; Carp, Charlotte L.; Matthies, Derek W.; Esch, Barbara E.
2011-01-01
Several studies have demonstrated effects of stimulus-stimulus pairing (SSP) on children's vocalizations, but numerous treatment failures have also been reported. The present study attempted to isolate procedural variables related to failures of SSP to condition speech sounds as reinforcers. Three boys diagnosed with autism-spectrum disorders…
Reynolds, Conner D; Nolan, Suzanne O; Huebschman, Jessica L; Hodges, Samantha L; Lugo, Joaquin N
2017-07-01
Early-life seizures are known to cause long-term deficits in social behavior, learning, and memory, however little is known regarding their acute impact. Ultrasonic vocalization (USV) recordings have been developed as a tool for investigating early communicative deficits in mice. Previous investigation from our lab found that postnatal day (PD) 10 seizures cause male-specific suppression of 50-kHz USVs on PD12 in 129 SvEvTac mouse pups. The present study extends these findings by spectrographic characterization of USVs following neonatal seizures. On PD10, male C57BL/6 pups were administered intraperitoneal injections of kainic acid or physiological saline. On PD12, isolation-induced recordings were captured using a broad-spectrum ultrasonic microphone. Status epilepticus significantly suppressed USV quantity (p=0.001) and total duration (p<0.05). Seizure pups also utilized fewer complex calls than controls (p<0.05). There were no changes in call latency or inter-call intervals. Spectrographic analysis revealed increased peak amplitude for complex, downward, short, two-syllable, and upward calls, as well as reduced mean duration for short and two-syllable calls in seizure mice. This investigation provides the first known spectrographic characterization of USVs following early-life seizures. These findings also enhance evidence for USVs as an indicator of select communicative impairment. Copyright © 2017 Elsevier Inc. All rights reserved.
A new measure of child vocal reciprocity in children with autism spectrum disorder.
Harbison, Amy L; Woynaroski, Tiffany G; Tapp, Jon; Wade, Joshua W; Warlaumont, Anne S; Yoder, Paul J
2018-06-01
Children's vocal development occurs in the context of reciprocal exchanges with a communication partner who models "speechlike" productions. We propose a new measure of child vocal reciprocity, which we define as the degree to which an adult vocal response increases the probability of an immediately following child vocal response. Vocal reciprocity is likely to be associated with the speechlikeness of vocal communication in young children with autism spectrum disorder (ASD). Two studies were conducted to test the utility of the new measure. The first used simulated vocal samples with randomly sequenced child and adult vocalizations to test the accuracy of the proposed index of child vocal reciprocity. The second was an empirical study of 21 children with ASD who were preverbal or in the early stages of language development. Daylong vocal samples collected in the natural environment were computer analyzed to derive the proposed index of child vocal reciprocity, which was highly stable when derived from two daylong vocal samples and was associated with speechlikeness of vocal communication. This association was significant even when controlling for chance probability of child vocalizations to adult vocal responses, probability of adult vocalizations, or probability of child vocalizations. A valid measure of children's vocal reciprocity might eventually improve our ability to predict which children are on track to develop useful speech and/or are most likely to respond to language intervention. A link to a free, publicly-available software program to derive the new measure of child vocal reciprocity is provided. Autism Res 2018, 11: 903-915. © 2018 International Society for Autism Research, Wiley Periodicals, Inc. Children and adults often engage in back-and-forth vocal exchanges. The extent to which they do so is believed to support children's early speech and language development. Two studies tested a new measure of child vocal reciprocity using computer-generated and real-life vocal samples of young children with autism collected in natural settings. The results provide initial evidence of accuracy, test-retest reliability, and validity of the new measure of child vocal reciprocity. A sound measure of children's vocal reciprocity might improve our ability to predict which children are on track to develop useful speech and/or are most likely to respond to language intervention. A free, publicly-available software program and manuals are provided. © 2018 International Society for Autism Research, Wiley Periodicals, Inc.
Broaddus-Lawrence, P L; Treole, K; McCabe, R B; Allen, R L; Toppin, L
2000-03-01
The purpose of the present study was to determine the effects of vocal hygiene education on the vocal hygiene behaviors and perceptual vocal characteristics of untrained singers. Eleven adult untrained singers served as subjects. They attended four 1-hour class sessions on vocal hygiene, including anatomy and physiology of the phonatory mechanism, vocally abusive behaviors, voice disorders commonly seen in singers, and measures to prevent voice disorders. Pre- and postinstruction surveys were used to record subjects' vocal abuses and their perceptions of their speaking and singing voice. They also rated their perceived value of vocal hygiene education. Results revealed minimal changes in vocal hygiene behaviors and perceptual voice characteristics. The subjects did report a high degree of benefit and learning, however.
Acoustic Analysis and Electroglottography in Elite Vocal Performers.
Villafuerte-Gonzalez, Rocio; Valadez-Jimenez, Victor M; Sierra-Ramirez, Jose A; Ysunza, Pablo Antonio; Chavarria-Villafuerte, Karen; Hernandez-Lopez, Xochiquetzal
2017-05-01
Acoustic analysis of voice (AAV) and electroglottography (EGG) have been used for assessing vocal quality in patients with voice disorders. The effectiveness of these procedures for detecting mild disturbances in vocal quality in elite vocal performers has been controversial. To compare acoustic parameters obtained by AAV and EGG before and after vocal training to determine the effectiveness of these procedures for detecting vocal improvements in elite vocal performers. Thirty-three elite vocal performers were studied. The study group included 14 males and 19 females, ages 18-40 years, without a history of voice disorders. Acoustic parameters were obtained through AAV and EGG before and after vocal training using the Linklater method. Nonsignificant differences (P > 0.05) were found between values of fundamental frequency (F 0 ), shimmer, and jitter obtained by both procedures before vocal training. Mean F 0 was similar after vocal training. Jitter percentage as measured by AAV showed nonsignificant differences (P > 0.05) before and after vocal training. Shimmer percentage as measured by AAV demonstrated a significant reduction (P < 0.05) after vocal training. As measured by EGG after vocal training, shimmer and jitter were significantly reduced (P < 0.05); open quotient was significantly increased (P < 0.05); and irregularity was significantly reduced (P < 0.05). AAV and EGG were effective for detecting improvements in vocal function after vocal training in male and female elite vocal performers undergoing vocal training. EGG demonstrated better efficacy for detecting improvements and provided additional parameters as compared to AAV. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Differential Expression of Glutamate Receptors in Avian Neural Pathways for Learned Vocalization
WADA, KAZUHIRO; SAKAGUCHI, HIRONOBU; JARVIS, ERICH D.; HAGIWARA, MASATOSHI
2008-01-01
Learned vocalization, the substrate for human language, is a rare trait. It is found in three distantly related groups of birds—parrots, hummingbirds, and songbirds. These three groups contain cerebral vocal nuclei for learned vocalization not found in their more closely related vocal nonlearning relatives. Here, we cloned 21 receptor subunits/subtypes of all four glutamate receptor families (AMPA, kainate, NMDA, and metabotropic) and examined their expression in vocal nuclei of songbirds. We also examined expression of a subset of these receptors in vocal nuclei of hummingbirds and parrots, as well as in the brains of dove species as examples of close vocal nonlearning relatives. Among the 21 subunits/subtypes, 19 showed higher and/or lower prominent differential expression in songbird vocal nuclei relative to the surrounding brain subdivisions in which the vocal nuclei are located. This included relatively lower levels of all four AMPA subunits in lMAN, strikingly higher levels of the kainite subunit GluR5 in the robust nucleus of the arcopallium (RA), higher and lower levels respectively of the NMDA subunits NR2A and NR2B in most vocal nuclei and lower levels of the metabotropic group I subtypes (mGluR1 and -5) in most vocal nuclei and the group II subtype (mGluR2), showing a unique expression pattern of very low levels in RA and very high levels in HVC. The splice variants of AMPA subunits showed further differential expression in vocal nuclei. Some of the receptor subunits/subtypes also showed differential expression in hummingbird and parrot vocal nuclei. The magnitude of differential expression in vocal nuclei of all three vocal learners was unique compared with the smaller magnitude of differences found for nonvocal areas of vocal learners and vocal nonlearners. Our results suggest that evolution of vocal learning was accompanied by differential expression of a conserved gene family for synaptic transmission and plasticity in vocal nuclei. They also suggest that neural activity and signal transduction in vocal nuclei of vocal learners will be different relative to the surrounding brain areas. PMID:15236466
Neural Correlates of the Lombard Effect in Primate Auditory Cortex
Eliades, Steven J.
2012-01-01
Speaking is a sensory-motor process that involves constant self-monitoring to ensure accurate vocal production. Self-monitoring of vocal feedback allows rapid adjustment to correct perceived differences between intended and produced vocalizations. One important behavior in vocal feedback control is a compensatory increase in vocal intensity in response to noise masking during vocal production, commonly referred to as the Lombard effect. This behavior requires mechanisms for continuously monitoring auditory feedback during speaking. However, the underlying neural mechanisms are poorly understood. Here we show that when marmoset monkeys vocalize in the presence of masking noise that disrupts vocal feedback, the compensatory increase in vocal intensity is accompanied by a shift in auditory cortex activity toward neural response patterns seen during vocalizations under normal feedback condition. Furthermore, we show that neural activity in auditory cortex during a vocalization phrase predicts vocal intensity compensation in subsequent phrases. These observations demonstrate that the auditory cortex participates in self-monitoring during the Lombard effect, and may play a role in the compensation of noise masking during feedback-mediated vocal control. PMID:22855821
The perceptual features of vocal fatigue as self-reported by a group of actors and singers.
Kitch, J A; Oates, J
1994-09-01
Performers (10 actors/10 singers) rated via a self-report questionnaire the severity of their voice-related changes when vocally fatigued. Similar frequency patterns and perceptual features of vocal fatigue were found across subjects. Actors rated "power" aspects (e.g., voice projection) and singers rated vocal dynamic aspects (e.g., pitch range) of their voices as most affected when vocally fatigued. Vocal fatigue was evidenced by changes in kinesthetic/proprioceptive sensations and vocal dynamics. The causes and context of vocal fatigue were vocal misuse, being "run down," high performance demands, and using high pitch/volume levels. Further research is needed to delineate the perceptual features of "normal" levels of vocal fatigue and its possible causes.
Reinforcement of Infant Vocalizations through Contingent Vocal Imitation
ERIC Educational Resources Information Center
Pelaez, Martha; Virues-Ortega, Javier; Gewirtz, Jacob L.
2011-01-01
Maternal vocal imitation of infant vocalizations is highly prevalent during face-to-face interactions of infants and their caregivers. Although maternal vocal imitation has been associated with later verbal development, its potentially reinforcing effect on infant vocalizations has not been explored experimentally. This study examined the…
2018-01-01
Human vocal development is dependent on learning by imitation through social feedback between infants and caregivers. Recent studies have revealed that vocal development is also influenced by parental feedback in marmoset monkeys, suggesting vocal learning mechanisms in nonhuman primates. Marmoset infants that experience more contingent vocal feedback than their littermates develop vocalizations more rapidly, and infant marmosets with limited parental interaction exhibit immature vocal behavior beyond infancy. However, it is yet unclear whether direct parental interaction is an obligate requirement for proper vocal development because all monkeys in the aforementioned studies were able to produce the adult call repertoire after infancy. Using quantitative measures to compare distinct call parameters and vocal sequence structure, we show that social interaction has a direct impact not only on the maturation of the vocal behavior but also on acoustic call structures during vocal development. Monkeys with limited parental interaction during development show systematic differences in call entropy, a measure for maturity, compared with their normally raised siblings. In addition, different call types were occasionally uttered in motif-like sequences similar to those exhibited by vocal learners, such as birds and humans, in early vocal development. These results indicate that a lack of parental interaction leads to long-term disturbances in the acoustic structure of marmoset vocalizations, suggesting an imperative role for social interaction in proper primate vocal development. PMID:29651461
Gultekin, Yasemin B; Hage, Steffen R
2018-04-01
Human vocal development is dependent on learning by imitation through social feedback between infants and caregivers. Recent studies have revealed that vocal development is also influenced by parental feedback in marmoset monkeys, suggesting vocal learning mechanisms in nonhuman primates. Marmoset infants that experience more contingent vocal feedback than their littermates develop vocalizations more rapidly, and infant marmosets with limited parental interaction exhibit immature vocal behavior beyond infancy. However, it is yet unclear whether direct parental interaction is an obligate requirement for proper vocal development because all monkeys in the aforementioned studies were able to produce the adult call repertoire after infancy. Using quantitative measures to compare distinct call parameters and vocal sequence structure, we show that social interaction has a direct impact not only on the maturation of the vocal behavior but also on acoustic call structures during vocal development. Monkeys with limited parental interaction during development show systematic differences in call entropy, a measure for maturity, compared with their normally raised siblings. In addition, different call types were occasionally uttered in motif-like sequences similar to those exhibited by vocal learners, such as birds and humans, in early vocal development. These results indicate that a lack of parental interaction leads to long-term disturbances in the acoustic structure of marmoset vocalizations, suggesting an imperative role for social interaction in proper primate vocal development.
A Survey of the Research on Sex Differences in Nonverbal Communication.
ERIC Educational Resources Information Center
Blahna, Loretta J.
Although the bulk of recent research on nonverbal communication has involved studies of the functions of nonverbal behavior (emotion conveying, regulation, and adaption), a few studies have focused on the differences in nonverbal communication variables between men and women. These differences have been found in vocal patterns, intensities, length…
The aquatic frog, Xenopus tropicalis, is being developed for use as a model amphibian species for inclusion in the EPA’s Endocrine Disruptor Screening Program. Current toxicity test designs do not incorporate measures of fecundity due to high variability in the responses of frog...
Measurement of Phonated Intervals during Four Fluency-Inducing Conditions
ERIC Educational Resources Information Center
Davidow, Jason H.; Bothe, Anne K.; Andreatta, Richard D.; Ye, Jun
2009-01-01
Purpose: Previous investigations of persons who stutter have demonstrated changes in vocalization variables during fluency-inducing conditions (FICs). A series of studies has also shown that a reduction in short intervals of phonation, those from 30 to 200 ms, is associated with decreased stuttering. The purpose of this study, therefore, was to…
Clinical dysphagia risk predictors after prolonged orotracheal intubation
de Medeiros, Gisele Chagas; Sassi, Fernanda Chiarion; Mangilli, Laura Davison; Zilberstein, Bruno; de Andrade, Claudia Regina Furquim
2014-01-01
OBJECTIVES: To elucidate independent risk factors for dysphagia after prolonged orotracheal intubation. METHODS: The participants were 148 consecutive patients who underwent clinical bedside swallowing assessments from September 2009 to September 2011. All patients had received prolonged orotracheal intubations and were admitted to one of several intensive care units of a large Brazilian school hospital. The correlations between the conducted water swallow test results and dysphagia risk levels were analyzed for statistical significance. RESULTS: Of the 148 patients included in the study, 91 were male and 57 were female (mean age, 53.64 years). The univariate analysis results indicated that specific variables, including extraoral loss, multiple swallows, cervical auscultation, vocal quality, cough, choking, and other signs, were possible significant high-risk indicators of dysphagia onset. The multivariate analysis results indicated that cervical auscultation and coughing were independent predictive variables for high dysphagia risk. CONCLUSIONS: Patients displaying extraoral loss, multiple swallows, cervical auscultation, vocal quality, cough, choking and other signs should benefit from early swallowing evaluations. Additionally, early post-extubation dysfunction recognition is paramount in reducing the morbidity rate in this high-risk population. PMID:24473554
Kreiman, Jody; Shue, Yen-Liang; Chen, Gang; Iseli, Markus; Gerratt, Bruce R.; Neubauer, Juergen; Alwan, Abeer
2012-01-01
Increases in open quotient are widely assumed to cause changes in the amplitude of the first harmonic relative to the second (H1*–H2*), which in turn correspond to increases in perceived vocal breathiness. Empirical support for these assumptions is rather limited, and reported relationships among these three descriptive levels have been variable. This study examined the empirical relationship among H1*–H2*, the glottal open quotient (OQ), and glottal area waveform skewness, measured synchronously from audio recordings and high-speed video images of the larynges of six phonetically knowledgeable, vocally healthy speakers who varied fundamental frequency and voice qualities quasi-orthogonally. Across speakers and voice qualities, OQ, the asymmetry coefficient, and fundamental frequency accounted for an average of 74% of the variance in H1*–H2*. However, analyses of individual speakers showed large differences in the strategies used to produce the same intended voice qualities. Thus, H1*–H2* can be predicted with good overall accuracy, but its relationship to phonatory characteristics appears to be speaker dependent. PMID:23039455
Clinical dysphagia risk predictors after prolonged orotracheal intubation.
Medeiros, Gisele Chagas de; Sassi, Fernanda Chiarion; Mangilli, Laura Davison; Zilberstein, Bruno; Andrade, Claudia Regina Furquim de
2014-01-01
To elucidate independent risk factors for dysphagia after prolonged orotracheal intubation. The participants were 148 consecutive patients who underwent clinical bedside swallowing assessments from September 2009 to September 2011. All patients had received prolonged orotracheal intubations and were admitted to one of several intensive care units of a large Brazilian school hospital. The correlations between the conducted water swallow test results and dysphagia risk levels were analyzed for statistical significance. Of the 148 patients included in the study, 91 were male and 57 were female (mean age, 53.64 years). The univariate analysis results indicated that specific variables, including extraoral loss, multiple swallows, cervical auscultation, vocal quality, cough, choking, and other signs, were possible significant high-risk indicators of dysphagia onset. The multivariate analysis results indicated that cervical auscultation and coughing were independent predictive variables for high dysphagia risk. Patients displaying extraoral loss, multiple swallows, cervical auscultation, vocal quality, cough, choking and other signs should benefit from early swallowing evaluations. Additionally, early post-extubation dysfunction recognition is paramount in reducing the morbidity rate in this high-risk population.
Vazan, Rastislav; Filcikova, Diana; Mravec, Boris
2017-12-01
The effect of Stroop test (ST)-induced stress on autonomic nervous system activity is often examined via measurement of heart rate variability (HRV). However, HRV may be significantly affected by interfering factors, including vocalization and inappropriate body position. Surprisingly, published studies ignore these issues, so the aim of our study was to test the innovative procedure for correct HRV measurement in individuals exposed to ST. Moreover, we examined possible gender differences in HRV and heart rate (HR) during ST. Healthy participants (21 men, 34 women) were placed in a supine position (no orthostatic activation) and then exposed to a 5 min relaxation period and a 5 min period of computerized ST (no vocalization). We found significant differences in HR and HRV parameters in both genders during ST when compared to relaxation (baseline) values. On the other hand, there were no significant differences in HR and HRV between males and females (except for a higher baseline HR in women). Also, reactivity to ST stress (difference between baseline and ST value) showed no gender differences. In conclusion, when performed in a supine position and without vocalization, the ST induces significant stress-related changes of HR and HRV in both genders, with no gender differences in the magnitude of reaction to ST stress. This experimental procedure can be used for correct examination of mental stress-related changes in the autonomic nervous system, and is particularly useful for examining mixed-gender experimental groups. Copyright © 2017 Elsevier B.V. All rights reserved.
2015-01-01
Abstract Vowels provide the acoustic foundation of communication through speech and song, but little is known about how the brain orchestrates their production. Positron emission tomography was used to study regional cerebral blood flow (rCBF) during sustained production of the vowel /a/. Acoustic and blood flow data from 13, normal, right-handed, native speakers of American English were analyzed to identify CBF patterns that predicted the stability of the first and second formants of this vowel. Formants are bands of resonance frequencies that provide vowel identity and contribute to voice quality. The results indicated that formant stability was directly associated with blood flow increases and decreases in both left- and right-sided brain regions. Secondary brain regions (those associated with the regions predicting formant stability) were more likely to have an indirect negative relationship with first formant variability, but an indirect positive relationship with second formant variability. These results are not definitive maps of vowel production, but they do suggest that the level of motor control necessary to produce stable vowels is reflected in the complexity of an underlying neural system. These results also extend a systems approach to functional image analysis, previously applied to normal and ataxic speech rate that is solely based on identifying patterns of brain activity associated with specific performance measures. Understanding the complex relationships between multiple brain regions and the acoustic characteristics of vocal stability may provide insight into the pathophysiology of the dysarthrias, vocal disorders, and other speech changes in neurological and psychiatric disorders. PMID:25295385
de la Hoz, Rafael E; Shohet, Michael R; Bienenfeld, Laura A; Afilaka, Aboaba A; Levin, Stephen M; Herbert, Robin
2008-03-01
Vocal cord dysfunction (VCD) is a condition characterized by paradoxical partial adduction of the vocal cords on inspiration. It has been associated with exposures to irritants, as well as with psychological illnesses and conditions. Workers who participated in the recovery of the WTC disaster site were exposed to a large amount of irritants as well as considerable psychological stressors. We describe the clinical characteristics of 10 symptomatic former WTC workers diagnosed with this condition, as well as the frequency of spirometric findings suggestive of variable extrathoracic obstruction. Workers who became symptomatic after their WTC work experience have been evaluated clinically by a multidisciplinary team at an academic medical center. The evaluation included history, physical examination, chest radiograph, blood tests, and pre- and post-bronchodilator spirometry in all patients. Additional evaluations and diagnostic tests included otolaryngological evaluation with flexible rhinolaryngoscopy and stroboscopy, gastroenterological and psychiatric evaluations. A randomly selected sample of 172 spirometry results were reviewed for evidence of inspiratory flow limitation. Variable extrathoracic obstruction was found in 18.6% of the spirometries. Ten patients were diagnosed with VCD. In addition to symptoms suggestive of co-morbid conditions (particularly rhinitis and acid reflux disease), most of the 10 patients had (1) hoarseness, (2) dyspnea that was not associated with bronchial hyperreactivity, or (3) dyspnea associated with asthma, with either mild bronchial hyperreactivity and/or poor response to asthma treatment. VCD appears to be part of the spectrum of airway disorders caused by occupational exposures at the WTC disaster site. Further study of this association is warranted.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Saide, Pablo; Spak, S. N.; Carmichael, Gregory
2012-03-30
We evaluate a regional-scale simulation with the WRF-Chem model for the VAMOS (Variability of the American Monsoon Systems) Ocean-Cloud-Atmosphere-Land Study Regional Experiment (VOCALS-REx), which sampled the Southeast Pacific's persistent stratocumulus deck. Evaluation of VOCALS-REx ship-based and aircraft observations focuses on analyzing how aerosol loading affects marine boundary layer (MBL) dynamics and cloud microphysics. We compare local time series and campaign averaged longitudinal gradients, and highlight differences in model simulations with (W) and without wet (NW) deposition processes. The higher aerosol loadings in the NW case produce considerable changes in MBL dynamics and cloud microphysics, in accordance with the established conceptualmore » model of aerosol indirect effects. These include increase in cloud albedo, increase in MBL and cloud heights, drizzle suppression, increase in liquid water content, and increase in cloud lifetime. Moreover, better statistical representation of aerosol mass and number concentration improves model fidelity in reproducing observed spatial and temporal variability in cloud properties, including top and base height, droplet concentration, water content, rain rate, optical depth (COD) and liquid water path (LWP). Together, these help to quantify confidence in WRF-Chem's modeled aerosol-cloud interactions, while identifying structural and parametric uncertainties including: irreversibility in rain wet removal; overestimation of marine DMS and sea salt emissions and accelerated aqueous sulfate conversion. Our findings suggest that WRF-Chem simulates marine cloud-aerosol interactions at a level sufficient for applications in forecasting weather and air quality and studying aerosol climate forcing, including the reliability required for policy analysis and geo-engineering applications.« less
Vocal Fold Bowing in Elderly Male Monozygotic Twins: A Case Study
Tanner, Kristine; Sauder, Cara; Thibeault, Susan L.; Dromey, Christopher; Smith, Marshall E.
2009-01-01
Objectives This study examined case histories, diagnostic features, and treatment response in two 79-year-old male monozygotic (identical) twins with vocal fold bowing, exploring both genetic and environmental factors. Study Design Case study. Methods DNA concordance was examined via cheek swab. Case histories, videostroboscopy, auditory- and visual-perceptual assessment, electromyography, acoustic measures, and Voice Handicap ratings were undertaken. Both twins underwent surgical intervention and subsequent voice therapy. Results Monozygosity was confirmed for DNA polymorphisms, with 10 of 10 concordance for STR DNA markers. For both twins, auditory and visual-perceptual assessments indicated severe bowing, hoarseness and breathiness, although Twin 1 was judged to be extremely severe. Differences in RMS amplitudes were observed for TA and LCA muscles, with smaller relative amplitudes observed for the Twin 1 versus Twin 2. No consistent voice improvement was observed following surgical intervention(s), despite improved mid-membranous vocal fold closure. Marked reductions in Voice Handicap Index total scores were observed following behavioral voice therapy, coinciding with increased mid-membranous and posterior laryngeal (interarytenoid) glottal closure. No substantive differences in acoustic measures were observed. Conclusions Vocal fold bowing was more severe for Twin 1 versus Twin 2 despite identical heritability factors. Overall voice improvement with treatment was greater for Twin 2 than Twin 1. Environmental factors might partially account for the differences observed between the twins, including variability in their responsiveness to behavioral voice therapy. Voice therapy was useful in improving mid-membranous and posterior laryngeal closure, although dysphonia remained severe in both cases. PMID:19664899
Vibratory Regime Classification of Infant Phonation
Buder, Eugene H.; Chorna, Lesya B.; Oller, D. Kimbrough; Robinson, Rebecca B.
2008-01-01
Infant phonation is highly variable in many respects, including the basic vibratory patterns by which the vocal tissues create acoustic signals. Previous studies have identified the regular occurrence of non-modal phonation types in normal infant phonation. The glottis is like many oscillating systems that, because of non-linear relationships among the elements, may vibrate in ways representing the deterministic patterns classified theoretically within the mathematical framework of non-linear dynamics. The infant’s pre-verbal vocal explorations present such a variety of phonations that it may be possible to find effectively all the classes of vibration predicted by non-linear dynamic theory. The current report defines acoustic criteria for an important subset of such vibratory regimes, and demonstrates that analysts can be trained to reliably use these criteria for a classification that includes all instances of infant phonation in the recorded corpora. The method is thus internally comprehensive in the sense that all phonations are classified, but it is not exhaustive in the sense that all vocal qualities are thereby represented. Using the methods thus developed, this study also demonstrates that the distributions of these phonation types vary significantly across sessions of recording in the first year of life, suggesting developmental changes. The method of regime classification is thus capable of tracking changes that may be indicative of maturation of the mechanism, the learning of categories of phonatory control, and the possibly varying use of vocalizations across social contexts. PMID:17509829
Kang, Jing; Xue, Chao; Chou, Adriana; Scholp, Austin; Gong, Ting; Zhang, Yi; Chen, Zhen; Jiang, Jack J
2018-02-05
The aim of this study was to quantify the effects of traditional and physiological warm-up exercises and to determine the optimal duration of these methods using acoustic and aerodynamic metrics. Twenty-six subjects were recruited to participate in both straw phonation exercises (physiological vocal warm-up) and traditional singing exercises (traditional vocal warm-up) for 20 minutes each, 24 hours apart. Phonation threshold pressure (PTP), fundamental frequency, jitter, shimmer, and noise-to-harmonics ratio were measured before the intervention (m0), as well as after 5 minutes (m5), 10 minutes (m10), 15 minutes (m15), and 20 minutes (m20) of intervention. PTP decreased significantly after straw phonation and reached a minimum value at 10 minutes (P < 0.001) and remained stable in traditional singing exercises. There were significant differences in fundamental frequency and shimmer from m0 to m15 and m20 in the traditional singing group (P = 0.001, P = 0.001, P = 0.001, and P = 0.002, respectively). No significant changes in acoustic parameters were observed after straw phonation. Both straw phonation exercises and traditional singing exercises are effective for voice warm-up. Straw phonation improves the subjects' fatigue resistance and vocal economy, resulting in a reduced PTP, whereas traditional singing exercises focus on technical singing skills, leading to an improvement of acoustic variables. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Histopathologic study of human vocal fold mucosa unphonated over a decade.
Sato, Kiminori; Umeno, Hirohito; Ono, Takeharu; Nakashima, Tadashi
2011-12-01
Mechanotransduction caused by vocal fold vibration could possibly be an important factor in the maintenance of extracellular matrices and layered structure of the human adult vocal fold mucosa as a vibrating tissue after the layered structure has been completed. Vocal fold stellate cells (VFSCs) in the human maculae flavae of the vocal fold mucosa are inferred to be involved in the metabolism of extracellular matrices of the vocal fold mucosa. Maculae flavae are also considered to be an important structure in the growth and development of the human vocal fold mucosa. Tension caused by phonation (vocal fold vibration) is hypothesized to stimulate the VFSCs to accelerate production of extracellular matrices. A human adult vocal fold mucosa unphonated over a decade was investigated histopathologically. Vocal fold mucosa unphonated for 11 years and 2 months of a 64-year-old male with cerebral hemorrhage was investigated by light and electron microscopy. The vocal fold mucosae (including maculae flavae) were atrophic. The vocal fold mucosa did not have a vocal ligament, Reinke's space or a layered structure. The lamina propria appeared as a uniform structure. Morphologically, the VFSCs synthesized fewer extracellular matrices, such as fibrous protein and glycosaminoglycan. Consequently, VFSCs appeared to decrease their level of activity.
Coos, booms, and hoots: The evolution of closed-mouth vocal behavior in birds.
Riede, Tobias; Eliason, Chad M; Miller, Edward H; Goller, Franz; Clarke, Julia A
2016-08-01
Most birds vocalize with an open beak, but vocalization with a closed beak into an inflating cavity occurs in territorial or courtship displays in disparate species throughout birds. Closed-mouth vocalizations generate resonance conditions that favor low-frequency sounds. By contrast, open-mouth vocalizations cover a wider frequency range. Here we describe closed-mouth vocalizations of birds from functional and morphological perspectives and assess the distribution of closed-mouth vocalizations in birds and related outgroups. Ancestral-state optimizations of body size and vocal behavior indicate that closed-mouth vocalizations are unlikely to be ancestral in birds and have evolved independently at least 16 times within Aves, predominantly in large-bodied lineages. Closed-mouth vocalizations are rare in the small-bodied passerines. In light of these results and body size trends in nonavian dinosaurs, we suggest that the capacity for closed-mouth vocalization was present in at least some extinct nonavian dinosaurs. As in birds, this behavior may have been limited to sexually selected vocal displays, and hence would have co-occurred with open-mouthed vocalizations. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.
The Vocal Repertoire of Adult and Neonate Giant Otters (Pteronura brasiliensis)
Mumm, Christina A. S.; Knörnschild, Mirjam
2014-01-01
Animals use vocalizations to exchange information about external events, their own physical or motivational state, or about individuality and social affiliation. Infant babbling can enhance the development of the full adult vocal repertoire by providing ample opportunity for practice. Giant otters are very social and frequently vocalizing animals. They live in highly cohesive groups, generally including a reproductive pair and their offspring born in different years. This basic social structure may vary in the degree of relatedness of the group members. Individuals engage in shared group activities and different social roles and thus, the social organization of giant otters provides a basis for complex and long-term individual relationships. We recorded and analysed the vocalizations of adult and neonate giant otters from wild and captive groups. We classified the adult vocalizations according to their acoustic structure, and described their main behavioural context. Additionally, we present the first description of vocalizations uttered in babbling bouts of new born giant otters. We expected to find 1) a sophisticated vocal repertoire that would reflect the species’ complex social organisation, 2) that giant otter vocalizations have a clear relationship between signal structure and function, and 3) that the vocal repertoire of new born giant otters would comprise age-specific vocalizations as well as precursors of the adult repertoire. We found a vocal repertoire with 22 distinct vocalization types produced by adults and 11 vocalization types within the babbling bouts of the neonates. A comparison within the otter subfamily suggests a relation between vocal and social complexity, with the giant otters being the socially and vocally most complex species. PMID:25391142
Garcia, Elisângela Zacanti; Yamashita, Hélio Kiitiro; Garcia, Davi Sousa; Padovani, Marina Martins Pereira; Azevedo, Renata Rangel; Chiari, Brasília Maria
2016-01-01
Cone beam computed tomography (CBCT), which represents an alternative to traditional computed tomography and magnetic resonance imaging, may be a useful instrument to study vocal tract physiology related to vocal exercises. This study aims to evaluate the applicability of CBCT to the assessment of variations in the vocal tract of healthy individuals before and after vocal exercises. Voice recordings and CBCT images before and after vocal exercises performed by 3 speech-language pathologists without vocal complaints were collected and compared. Each participant performed 1 type of exercise, i.e., Finnish resonance tube technique, prolonged consonant "b" technique, or chewing technique. The analysis consisted of an acoustic analysis and tomographic imaging. Modifications of the vocal tract settings following vocal exercises were properly detected by CBCT, and changes in the acoustic parameters were, for the most part, compatible with the variations detected in image measurements. CBCT was shown to be capable of properly assessing the changes in vocal tract settings promoted by vocal exercises. © 2017 S. Karger AG, Basel.
Iconicity can ground the creation of vocal symbols.
Perlman, Marcus; Dale, Rick; Lupyan, Gary
2015-08-01
Studies of gestural communication systems find that they originate from spontaneously created iconic gestures. Yet, we know little about how people create vocal communication systems, and many have suggested that vocalizations do not afford iconicity beyond trivial instances of onomatopoeia. It is unknown whether people can generate vocal communication systems through a process of iconic creation similar to gestural systems. Here, we examine the creation and development of a rudimentary vocal symbol system in a laboratory setting. Pairs of participants generated novel vocalizations for 18 different meanings in an iterative 'vocal' charades communication game. The communicators quickly converged on stable vocalizations, and naive listeners could correctly infer their meanings in subsequent playback experiments. People's ability to guess the meanings of these novel vocalizations was predicted by how close the vocalization was to an iconic 'meaning template' we derived from the production data. These results strongly suggest that the meaningfulness of these vocalizations derived from iconicity. Our findings illuminate a mechanism by which iconicity can ground the creation of vocal symbols, analogous to the function of iconicity in gestural communication systems.
Automatic classification of animal vocalizations
NASA Astrophysics Data System (ADS)
Clemins, Patrick J.
2005-11-01
Bioacoustics, the study of animal vocalizations, has begun to use increasingly sophisticated analysis techniques in recent years. Some common tasks in bioacoustics are repertoire determination, call detection, individual identification, stress detection, and behavior correlation. Each research study, however, uses a wide variety of different measured variables, called features, and classification systems to accomplish these tasks. The well-established field of human speech processing has developed a number of different techniques to perform many of the aforementioned bioacoustics tasks. Melfrequency cepstral coefficients (MFCCs) and perceptual linear prediction (PLP) coefficients are two popular feature sets. The hidden Markov model (HMM), a statistical model similar to a finite autonoma machine, is the most commonly used supervised classification model and is capable of modeling both temporal and spectral variations. This research designs a framework that applies models from human speech processing for bioacoustic analysis tasks. The development of the generalized perceptual linear prediction (gPLP) feature extraction model is one of the more important novel contributions of the framework. Perceptual information from the species under study can be incorporated into the gPLP feature extraction model to represent the vocalizations as the animals might perceive them. By including this perceptual information and modifying parameters of the HMM classification system, this framework can be applied to a wide range of species. The effectiveness of the framework is shown by analyzing African elephant and beluga whale vocalizations. The features extracted from the African elephant data are used as input to a supervised classification system and compared to results from traditional statistical tests. The gPLP features extracted from the beluga whale data are used in an unsupervised classification system and the results are compared to labels assigned by experts. The development of a framework from which to build animal vocalization classifiers will provide bioacoustics researchers with a consistent platform to analyze and classify vocalizations. A common framework will also allow studies to compare results across species and institutions. In addition, the use of automated classification techniques can speed analysis and uncover behavioral correlations not readily apparent using traditional techniques.
Gaskill, Christopher S; Quinney, Dana M
2012-05-01
Phonation into narrow tubes or straws has been used as a voice training and voice therapy technique and belongs to a group of techniques known as semi-occluded vocal tract exercises. The use of what are called resonance tubes has received renewed attention in the voice research literature, in both theoretical and empirical studies. The assumption is that the partially occluded and lengthened vocal tract alters supraglottal acoustics in such a way as to allow phonation near a lowered first vocal tract formant, which has been suggested as a way to bring about a more efficient glottal closure pattern for sustained oscillation. In this study, two groups of male participants, 10 with no vocal training and 10 with classical vocal training, phonated into a resonance tube for approximately 1 minute. Electroglottography was used to estimate glottal contact quotient (CQ) during spoken /a/ vowels before tube phonation, during tube phonation, and again during spoken /a/ vowels after tube phonation. Half of each group of participants was made to keep pitch and loudness consistent for all phases of the experiment, replicating the method of a previous study by this author. The other half was instructed to practice phonating into the resonance tube before collecting data and was encouraged to find a pitch and loudness combination that maximized ease of phonation and a sense of forward oral resonance. Glottal CQ altered considerably from baseline for almost all participants during tube phonation, with a larger variability than that during vowel production. Small differences in glottal CQ were found as a function of training and instruction, with most participants' CQ increasing during tube phonation. A small post-tube phonation effect was found primarily for the trained and instructed group. Secondary single-subject analyses revealed large intersubject variation, highlighting the highly individualized response to the resonance tube task. Continued study of resonance tubes is recommended, comparing both male and female as well as vocally trained and untrained participants. Future studies should continue to examine systematic variations in task instruction, length of practice, and resonance tube dimensions. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Phylogenetic signal in the acoustic parameters of the advertisement calls of four clades of anurans.
Gingras, Bruno; Mohandesan, Elmira; Boko, Drasko; Fitch, W Tecumseh
2013-07-01
Anuran vocalizations, especially their advertisement calls, are largely species-specific and can be used to identify taxonomic affiliations. Because anurans are not vocal learners, their vocalizations are generally assumed to have a strong genetic component. This suggests that the degree of similarity between advertisement calls may be related to large-scale phylogenetic relationships. To test this hypothesis, advertisement calls from 90 species belonging to four large clades (Bufo, Hylinae, Leptodactylus, and Rana) were analyzed. Phylogenetic distances were estimated based on the DNA sequences of the 12S mitochondrial ribosomal RNA gene, and, for a subset of 49 species, on the rhodopsin gene. Mean values for five acoustic parameters (coefficient of variation of root-mean-square amplitude, dominant frequency, spectral flux, spectral irregularity, and spectral flatness) were computed for each species. We then tested for phylogenetic signal on the body-size-corrected residuals of these five parameters, using three statistical tests (Moran's I, Mantel, and Blomberg's K) and three models of genetic distance (pairwise distances, Abouheif's proximities, and the variance-covariance matrix derived from the phylogenetic tree). A significant phylogenetic signal was detected for most acoustic parameters on the 12S dataset, across statistical tests and genetic distance models, both for the entire sample of 90 species and within clades in several cases. A further analysis on a subset of 49 species using genetic distances derived from rhodopsin and from 12S broadly confirmed the results obtained on the larger sample, indicating that the phylogenetic signals observed in these acoustic parameters can be detected using a variety of genetic distance models derived either from a variable mitochondrial sequence or from a conserved nuclear gene. We found a robust relationship, in a large number of species, between anuran phylogenetic relatedness and acoustic similarity in the advertisement calls in a taxon with no evidence for vocal learning, even after correcting for the effect of body size. This finding, covering a broad sample of species whose vocalizations are fairly diverse, indicates that the intense selection on certain call characteristics observed in many anurans does not eliminate all acoustic indicators of relatedness. Our approach could potentially be applied to other vocal taxa.
Master, Suely; Guzman, Marco; Carlos de Miranda, Helder; Lloyd, Adam
2013-03-01
Previous studies with long-term average spectrum (LTAS) showed the importance of the glottal source for understanding the projected voices of actresses. In this study, electroglottographic (EGG) analysis was used to investigate the contribution of the glottal source to the projected voice, comparing actresses and nonactresses' voices, in different levels of intensity. Thirty actresses and 30 nonactresses sustained vowels in habitual, moderate, and loud intensity levels. The EGG variables were contact quotient (CQ), closing quotient (QCQ), and opening quotient (QOQ). Other variables were sound pressure level (SPL) and fundamental frequency (F0). A KayPENTAX EGG was used. Variables were inputted in a general linear model. Actresses showed significantly higher values for SPL, in all levels, and both groups increased SPL significantly while changing from habitual to moderate and further to loud. There were no significant differences between groups for EGG quotients. There were significant differences between the levels only for F0 and CQ for both groups. SPL was significantly higher among actresses in all intensity levels, but in the EGG analysis, no differences were found. This apparently weak contribution of the glottal source in the supposedly projected voices of actresses, contrary to previous LTAS studies, might be because of a higher subglottal pressure or perhaps greater vocal tract contribution in SPL. Results from the present study suggest that trained subjects did not produce a significant higher SPL than untrained individuals by increasing the cost in terms of higher vocal fold collision and hence more impact stress. Future researches should explore the difference between trained and nontrained voices by aerodynamic measurements to evaluate the relationship between physiologic findings and the acoustic and EGG data. Moreover, further studies should consider both types of vocal tasks, sustained vowel and running speech, for both EGG and LTAS analysis. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Dependence of phonation threshold pressure on vocal tract acoustics and vocal fold tissue mechanics.
Chan, Roger W; Titze, Ingo R
2006-04-01
Analytical and computer simulation studies have shown that the acoustic impedance of the vocal tract as well as the viscoelastic properties of vocal fold tissues are critical for determining the dynamics and the energy transfer mechanism of vocal fold oscillation. In the present study, a linear, small-amplitude oscillation theory was revised by taking into account the propagation of a mucosal wave and the inertive reactance (inertance) of the supraglottal vocal tract as the major energy transfer mechanisms for flow-induced self-oscillation of the vocal fold. Specifically, analytical results predicted that phonation threshold pressure (Pth) increases with the viscous shear properties of the vocal fold, but decreases with vocal tract inertance. This theory was empirically tested using a physical model of the larynx, where biological materials (fat, hyaluronic acid, and fibronectin) were implanted into the vocal fold cover to investigate the effect of vocal fold tissue viscoelasticity on Pth. A uniform-tube supraglottal vocal tract was also introduced to examine the effect of vocal tract inertance on Pth. Results showed that Pth decreased with the inertive impedance of the vocal tract and increased with the viscous shear modulus (G") or dynamic viscosity (eta') of the vocal fold cover, consistent with theoretical predictions. These findings supported the potential biomechanical benefits of hyaluronic acid as a surgical bioimplant for repairing voice disorders involving the superficial layer of the lamina propria, such as scarring, sulcus vocalis, atrophy, and Reinke's edema.
Van Lierde, Kristiane M; D'haeseleer, Evelien; Wuyts, Floris L; De Ley, Sophia; Geldof, Ruben; De Vuyst, Julie; Sofie, Claeys
2010-09-01
The purpose of the present cross-sectional study was to determine the objective vocal quality and the vocal characteristics (vocal risk factors, vocal and corporal complaints) in 197 female students in speech-language pathology during the 4 years of study. The objective vocal quality was measured by means of the Dysphonia Severity Index (DSI). Perceptual voice assessment, the Voice Handicap Index (VHI), questionnaires addressing vocal risks, and vocal and corporal complaints during and/or after voice usage were performed. Speech-language pathology (SLP) students have a borderline vocal quality corresponding to a DSI% of 68. The analysis of variance revealed no significant change of the objective vocal quality between the first bachelor year and the master year. No psychosocial handicapping effect of the voice was observed by means of the VHI total, though there was an effect at the functional VHI level in addition to some vocal complaints. Ninety-three percent of the student SLPs reported the presence of corporal pain during and/or after speaking. In particular, sore throat and headache were mentioned as the prevalent corporal pain symptoms. A longitudinal study of the objective vocal quality of the same subjects during their career as an SLP might provide new insights. 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
The mimetic repertoire of the spotted bowerbird Ptilonorhynchus maculatus
NASA Astrophysics Data System (ADS)
Kelley, Laura A.; Healy, Susan D.
2011-06-01
Although vocal mimicry in songbirds is well documented, little is known about the function of such mimicry. One possibility is that the mimic produces the vocalisations of predatory or aggressive species to deter potential predators or competitors. Alternatively, these sounds may be learned in error as a result of their acoustic properties such as structural simplicity. We determined the mimetic repertoires of a population of male spotted bowerbirds Ptilonorhynchus maculatus, a species that mimics predatory and aggressive species. Although male mimetic repertoires contained an overabundance of vocalisations produced by species that were generally aggressive, there was also a marked prevalence of mimicry of sounds that are associated with alarm such as predator calls, alarm calls and mobbing calls, irrespective of whether the species being mimicked was aggressive or not. We propose that it may be the alarming context in which these sounds are first heard that may lead both to their acquisition and to their later reproduction. We suggest that enhanced learning capability during acute stress may explain vocal mimicry in many species that mimic sounds associated with alarm.
Chronic Left Lower Lobe Pulmonary Infiltrates During Military Deployment.
Hunninghake, John C; Skabelund, Andrew J; Morris, Michael J
2016-08-01
Deployment to Southwest Asia is associated with increased airborne hazards such as geologic dusts, burn pit smoke, vehicle exhaust, or air pollution. There are numerous ongoing studies to evaluate the potential effects of inhaled particulate matter on reported increases in acute and chronic respiratory symptoms. Providers need to be aware of potential causes of pulmonary disease such as acute eosinophilic pneumonia, asthma, and vocal cord dysfunction that have been associated with deployment. Other pulmonary disorders such as interstitial lung disease are infrequently reported. Not all deployment-related respiratory complaints may result from deployment airborne hazards and a broad differential should be considered. We present the case of a military member with a prolonged deployment found to have lobar infiltrates secondary to pulmonary vein stenosis from treatment for atrial fibrillation. Reprint & Copyright © 2016 Association of Military Surgeons of the U.S.
University Vocal Training and Vocal Health of Music Educators and Music Therapists
ERIC Educational Resources Information Center
Baker, Vicki D.; Cohen, Nicki
2017-01-01
The purpose of this study was to describe the university vocal training and vocal health of music educators and music therapists. The participants (N = 426), music educators (n = 351) and music therapists (n = 75), completed a survey addressing demographics, vocal training, voice usage, and vocal health. Both groups reported singing at least 50%…
Monkey vocal tracts are speech-ready.
Fitch, W Tecumseh; de Boer, Bart; Mathur, Neil; Ghazanfar, Asif A
2016-12-01
For four decades, the inability of nonhuman primates to produce human speech sounds has been claimed to stem from limitations in their vocal tract anatomy, a conclusion based on plaster casts made from the vocal tract of a monkey cadaver. We used x-ray videos to quantify vocal tract dynamics in living macaques during vocalization, facial displays, and feeding. We demonstrate that the macaque vocal tract could easily produce an adequate range of speech sounds to support spoken language, showing that previous techniques based on postmortem samples drastically underestimated primate vocal capabilities. Our findings imply that the evolution of human speech capabilities required neural changes rather than modifications of vocal anatomy. Macaques have a speech-ready vocal tract but lack a speech-ready brain to control it.
Vocal Dose Measures: Quantifying Accumulated Vibration Exposure in Vocal Fold Tissues
Titze, Ingo R.; Švec, Jan G.; Popolo, Peter S.
2011-01-01
To measure the exposure to self-induced tissue vibration in speech, three vocal doses were defined and described: distance dose, which accumulates the distance that tissue particles of the vocal folds travel in an oscillatory trajectory; energy dissipation dose, which accumulates the total amount of heat dissipated over a unit volume of vocal fold tissues; and time dose, which accumulates the total phonation time. These doses were compared to a previously used vocal dose measure, the vocal loading index, which accumulates the number of vibration cycles of the vocal folds. Empirical rules for viscosity and vocal fold deformation were used to calculate all the doses from the fundamental frequency (F0) and sound pressure level (SPL) values of speech. Six participants were asked to read in normal, monotone, and exaggerated speech and the doses associated with these vocalizations were calculated. The results showed that large F0 and SPL variations in speech affected the dose measures, suggesting that accumulation of phonation time alone is insufficient. The vibration exposure of the vocal folds in normal speech was related to the industrial limits for hand-transmitted vibration, in which the safe distance dose was derived to be about 500 m. This limit was found rather low for vocalization; it was related to a comparable time dose of about 17 min of continuous vocalization, or about 35 min of continuous reading with normal breathing and unvoiced segments. The voicing pauses in normal speech and dialogue effectively prolong the safe time dose. The derived safety limits for vocalization will likely require refinement based on a more detailed knowledge of the differences in hand and vocal fold tissue morphology and their response to vibrational stress, and on the effect of recovery of the vocal fold tissue during voicing pauses. PMID:12959470
Neural FoxP2 and FoxP1 expression in the budgerigar, an avian species with adult vocal learning.
Hara, Erina; Perez, Jemima M; Whitney, Osceola; Chen, Qianqian; White, Stephanie A; Wright, Timothy F
2015-04-15
Vocal learning underlies acquisition of both language in humans and vocal signals in some avian taxa. These bird groups and humans exhibit convergent developmental phases and associated brain pathways for vocal communication. The transcription factor FoxP2 plays critical roles in vocal learning in humans and songbirds. Another member of the forkhead box gene family, FoxP1 also shows high expression in brain areas involved in vocal learning and production. Here, we investigate FoxP2 and FoxP1 mRNA and protein in adult male budgerigars (Melopsittacus undulatus), a parrot species that exhibits vocal learning as both juveniles and adults. To examine these molecules in adult vocal learners, we compared their expression patterns in the budgerigar striatal nucleus involved in vocal learning, magnocellular nucleus of the medial striatum (MMSt), across birds with different vocal states, such as vocalizing to a female (directed), vocalizing alone (undirected), and non-vocalizing. We found that both FoxP2 mRNA and protein expressions were consistently lower in MMSt than in the adjacent striatum regardless of the vocal states, whereas previous work has shown that songbirds exhibit down-regulation in the homologous region, Area X, only after singing alone. In contrast, FoxP1 levels were high in MMSt compared to the adjacent striatum in all groups. Taken together these results strengthen the general hypothesis that FoxP2 and FoxP1 have specialized expression in vocal nuclei across a range of taxa, and suggest that the adult vocal plasticity seen in budgerigars may be a product of persistent down-regulation of FoxP2 in MMSt. Copyright © 2015 Elsevier B.V. All rights reserved.
Neural FoxP2 and FoxP1 expression in the budgerigar, an avian species with adult vocal learning
Hara, Erina; Perez, Jemima M.; Whitney, Osceola; Chen, Qianqian; White, Stephanie A.; Wright, Timothy F.
2015-01-01
Vocal learning underlies acquisition of both language in humans and vocal signals in some avian taxa. These bird groups and humans exhibit convergent developmental phases and associated brain pathways for vocal communication. The transcription factor FoxP2 plays critical roles in vocal learning in humans and songbirds. Another member of the forkhead box gene family, FoxP1 also shows high expression in brain areas involved in vocal learning and production. Here, we investigate FoxP2 and FoxP1 mRNA and protein in adult male budgerigars (Melopsittacus undulatus), a parrot species that exhibits vocal learning as both juveniles and adults. To examine these molecules in adult vocal learners, we compared their expression patterns in the budgerigar striatal nucleus involved in vocal learning, magnocellular nucleus of the medial striatum (MMSt), across birds with different vocal states, such as vocalizing to a female (directed), vocalizing alone (undirected), and non-vocalizing. We found that both FoxP2 mRNA and protein expressions were consistently lower in MMSt than in the adjacent striatum regardless of the vocal states, whereas previous work has shown that songbirds exhibit downregulation in the homologous region, Area X, only after singing alone. In contrast, FoxP1 levels were high in MMSt compared to the adjacent striatum in all groups. Taken together these results strengthen the general hypothesis that FoxP2 and FoxP1 have specialized expression in vocal nuclei across a range of taxa, and suggest that the adult vocal plasticity seen in budgerigars may be a product of persistent down-regulation of FoxP2 in MMSt. PMID:25601574
Vocal dose measures: quantifying accumulated vibration exposure in vocal fold tissues.
Titze, Ingo R; Svec, Jan G; Popolo, Peter S
2003-08-01
To measure the exposure to self-induced tissue vibration in speech, three vocal doses were defined and described: distance dose, which accumulates the distance that tissue particles of the vocal folds travel in an oscillatory trajectory; energy dissipation dose, which accumulates the total amount of heat dissipated over a unit volume of vocal fold tissues; and time dose, which accumulates the total phonation time. These doses were compared to a previously used vocal dose measure, the vocal loading index, which accumulates the number of vibration cycles of the vocal folds. Empirical rules for viscosity and vocal fold deformation were used to calculate all the doses from the fundamental frequency (F0) and sound pressure level (SPL) values of speech. Six participants were asked to read in normal, monotone, and exaggerated speech and the doses associated with these vocalizations were calculated. The results showed that large F0 and SPL variations in speech affected the dose measures, suggesting that accumulation of phonation time alone is insufficient. The vibration exposure of the vocal folds in normal speech was related to the industrial limits for hand-transmitted vibration, in which the safe distance dose was derived to be about 500 m. This limit was found rather low for vocalization; it was related to a comparable time dose of about 17 min of continuous vocalization, or about 35 min of continuous reading with normal breathing and unvoiced segments. The voicing pauses in normal speech and dialogue effectively prolong the safe time dose. The derived safety limits for vocalization will likely require refinement based on a more detailed knowledge of the differences in hand and vocal fold tissue morphology and their response to vibrational stress, and on the effect of recovery of the vocal fold tissue during voicing pauses.
Sielska-Badurek, Ewelina; Osuch-Wójcikiewicz, Ewa; Sobol, Maria; Kazanecka, Ewa; Niemczyk, Kazimierz
2017-01-01
This study investigated vocal function knowledge and vocal tract sensorimotor self-awareness and the impact of functional voice rehabilitation on vocal function knowledge and self-awareness. This is a prospective, randomized study. Twenty singers (study group [SG]) completed a questionnaire before and after functional voice rehabilitation. Twenty additional singers, representing the control group, also completed the questionnaire without functional voice rehabilitation at a 3-month interval. The questionnaire consisted of three parts. The first part evaluated the singers' attitude to the anatomical and physiological knowledge of the vocal tract and their self-esteem of the knowledge level. The second part assessed the theoretical knowledge of the singers' vocal tract physiology. The third part of the questionnaire assessed singers' sensorimotor self-awareness of the vocal tract. The results showed that most singers indicated that knowledge of the vocal tract's anatomy and physiology is useful (59% SG, 67% control group). However, 75% of all participants defined their knowledge of the vocal tract's anatomy and physiology as weak or inadequate. In the SG, vocal function knowledge at the first assessment was 45%. After rehabilitation, the level increased to 67.7%. Vocal tract sensorimotor self-awareness initially was 38.9% in SG but rose to 66.7%. Findings of the study suggest that classical singers lack knowledge about the physiology of the vocal mechanism, especially the breathing patterns. In addition, they have low sensorimotor self-awareness of their vocal tract. The results suggest that singers would benefit from receiving services from phoniatrists and speech-language pathologists during their voice training. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Audio-vocal interaction in single neurons of the monkey ventrolateral prefrontal cortex.
Hage, Steffen R; Nieder, Andreas
2015-05-06
Complex audio-vocal integration systems depend on a strong interconnection between the auditory and the vocal motor system. To gain cognitive control over audio-vocal interaction during vocal motor control, the PFC needs to be involved. Neurons in the ventrolateral PFC (VLPFC) have been shown to separately encode the sensory perceptions and motor production of vocalizations. It is unknown, however, whether single neurons in the PFC reflect audio-vocal interactions. We therefore recorded single-unit activity in the VLPFC of rhesus monkeys (Macaca mulatta) while they produced vocalizations on command or passively listened to monkey calls. We found that 12% of randomly selected neurons in VLPFC modulated their discharge rate in response to acoustic stimulation with species-specific calls. Almost three-fourths of these auditory neurons showed an additional modulation of their discharge rates either before and/or during the monkeys' motor production of vocalization. Based on these audio-vocal interactions, the VLPFC might be well positioned to combine higher order auditory processing with cognitive control of the vocal motor output. Such audio-vocal integration processes in the VLPFC might constitute a precursor for the evolution of complex learned audio-vocal integration systems, ultimately giving rise to human speech. Copyright © 2015 the authors 0270-6474/15/357030-11$15.00/0.
Multiple Coordination Patterns in Infant and Adult Vocalizations
Abney, Drew H.; Warlaumont, Anne S.; Oller, D. Kimbrough; Wallot, Sebastian; Kello, Christopher T.
2017-01-01
The study of vocal coordination between infants and adults has led to important insights into the development of social, cognitive, emotional and linguistic abilities. We used an automatic system to identify vocalizations produced by infants and adults over the course of the day for fifteen infants studied longitudinally during the first two years of life. We measured three different types of vocal coordination: coincidence-based, rate-based, and cluster-based. Coincidence-based and rate-based coordination are established measures in the developmental literature. Cluster-based coordination is new and measures the strength of matching in the degree to which vocalization events occur in hierarchically nested clusters. We investigated whether various coordination patterns differ as a function of vocalization type, whether different coordination patterns provide unique information about the dynamics of vocal interaction, and how the various coordination patterns each relate to infant age. All vocal coordination patterns displayed greater coordination for infant speech-related vocalizations, adults adapted the hierarchical clustering of their vocalizations to match that of infants, and each of the three coordination patterns had unique associations with infant age. Altogether, our results indicate that vocal coordination between infants and adults is multifaceted, suggesting a complex relationship between vocal coordination and the development of vocal communication. PMID:29375276
Behroozmand, Roozbeh; Karvelis, Laura; Liu, Hanjun; Larson, Charles R.
2009-01-01
Objective The present study investigated whether self-vocalization enhances auditory neural responsiveness to voice pitch feedback perturbation and how this vocalization-induced neural modulation can be affected by the extent of the feedback deviation. Method Event related potentials (ERPs) were recorded in 15 subjects in response to +100, +200 and +500 cents pitch-shifted voice auditory feedback during active vocalization and passive listening to the playback of the self-produced vocalizations. Result The amplitude of the evoked P1 (latency: 73.51 ms) and P2 (latency: 199.55 ms) ERP components in response to feedback perturbation were significantly larger during vocalization than listening. The difference between P2 peak amplitudes during vocalization vs. listening was shown to be significantly larger for +100 than +500 cents stimulus. Conclusion Results indicate that the human auditory cortex is more responsive to voice F0 feedback perturbations during vocalization than passive listening. Greater vocalization-induced enhancement of the auditory responsiveness to smaller feedback perturbations may imply that the audio-vocal system detects and corrects for errors in vocal production that closely match the expected vocal output. Significance Findings of this study support previous suggestions regarding the enhanced auditory sensitivity to feedback alterations during self-vocalization, which may serve the purpose of feedback-based monitoring of one’s voice. PMID:19520602
Limiting parental feedback disrupts vocal development in marmoset monkeys
Gultekin, Yasemin B.; Hage, Steffen R.
2017-01-01
Vocalizations of human infants undergo dramatic changes across the first year by becoming increasingly mature and speech-like. Human vocal development is partially dependent on learning by imitation through social feedback between infants and caregivers. Recent studies revealed similar developmental processes being influenced by parental feedback in marmoset monkeys for apparently innate vocalizations. Marmosets produce infant-specific vocalizations that disappear after the first postnatal months. However, it is yet unclear whether parental feedback is an obligate requirement for proper vocal development. Using quantitative measures to compare call parameters and vocal sequence structure we show that, in contrast to normally raised marmosets, marmosets that were separated from parents after the third postnatal month still produced infant-specific vocal behaviour at subadult stages. These findings suggest a significant role of social feedback on primate vocal development until the subadult stages and further show that marmoset monkeys are a compelling model system for early human vocal development. PMID:28090084
Fukushima, Makoto; Saunders, Richard C; Fujii, Naotaka; Averbeck, Bruno B; Mishkin, Mortimer
2014-01-01
Vocal production is an example of controlled motor behavior with high temporal precision. Previous studies have decoded auditory evoked cortical activity while monkeys listened to vocalization sounds. On the other hand, there have been few attempts at decoding motor cortical activity during vocal production. Here we recorded cortical activity during vocal production in the macaque with a chronically implanted electrocorticographic (ECoG) electrode array. The array detected robust activity in motor cortex during vocal production. We used a nonlinear dynamical model of the vocal organ to reduce the dimensionality of `Coo' calls produced by the monkey. We then used linear regression to evaluate the information in motor cortical activity for this reduced representation of calls. This simple linear model accounted for circa 65% of the variance in the reduced sound representations, supporting the feasibility of using the dynamical model of the vocal organ for decoding motor cortical activity during vocal production.
Female mice ultrasonically interact with males during courtship displays
Neunuebel, Joshua P; Taylor, Adam L; Arthur, Ben J; Egnor, SE Roian
2015-01-01
During courtship males attract females with elaborate behaviors. In mice, these displays include ultrasonic vocalizations. Ultrasonic courtship vocalizations were previously attributed to the courting male, despite evidence that both sexes produce virtually indistinguishable vocalizations. Because of this similarity, and the difficulty of assigning vocalizations to individuals, the vocal contribution of each individual during courtship is unknown. To address this question, we developed a microphone array system to localize vocalizations from socially interacting, individual adult mice. With this system, we show that female mice vocally interact with males during courtship. Males and females jointly increased their vocalization rates during chases. Furthermore, a female's participation in these vocal interactions may function as a signal that indicates a state of increased receptivity. Our results reveal a novel form of vocal communication during mouse courtship, and lay the groundwork for a mechanistic dissection of communication during social behavior. DOI: http://dx.doi.org/10.7554/eLife.06203.001 PMID:26020291
Putman, Peter; Hermans, Erno J; Koppeschaar, Hans; van Schijndel, Alexandra; van Honk, Jack
2007-08-01
Chronically elevated HPA activity has often been associated with fear and anxiety, but there is evidence that single administrations of glucocorticoids may acutely reduce fear. Moreover, peri-traumatic cortisol elevation may protect against development of post-traumatic stress disorder. Hypervigilant processing of threat information plays a role in anxiety disorders and although relations with HPA functioning have been established, causality of these relations remains unclear. Presently, self-reported anxiety and response time patterns on a masked emotional Stroop task with fearful faces were measured in 20 healthy young men after double-blind, placebo-controlled oral administration of 40 mg cortisol. The masked fearful Stroop task measures vocal colornaming response latencies for pictures of neutral and fearful faces presented below the threshold for conscious perception. Results showed increased response times on trials for fearful compared to neutral faces after placebo, but this emotional Stroop effect was acutely abolished by cortisol administration. This effect was most pronounced in subjects with heightened anxiety levels. This is the first evidence showing that exogenous cortisol acutely reduces anxiety-driven selective attention to threat. These results extend earlier findings of acute fear reduction after glucocorticoid administration. This suggests interactions of HPA functioning and vigilant attention in the pathogenesis of anxiety disorders. Possible neuroendocrine mechanisms of action are discussed.
[Temperament of children with vocal fold nodules].
Wei, Youhua; Wang, Zhinan; Xu, Zhongqiang; Chen, Ping; Hao, Lili
2009-11-01
To examine the temperament of children with vocal fold nodules. To compare the temperament dimension and temperamental types of 42 children with vocal fold nodules with 46 vocally normal children, using Chinese children's Temperament Problem Screening system (CCTPSs). The children with vocal fold nodules differed significantly from the comparison group in their temperament dimension's adaptability, intensity of reaction, mood value, persistency and temperamental types. There are more difficult and slow-to-warm-up children in patients with vocal fold nodules than vocally normal children.
Yu, Chengzhu; Hansen, John H L
2017-03-01
Human physiology has evolved to accommodate environmental conditions, including temperature, pressure, and air chemistry unique to Earth. However, the environment in space varies significantly compared to that on Earth and, therefore, variability is expected in astronauts' speech production mechanism. In this study, the variations of astronaut voice characteristics during the NASA Apollo 11 mission are analyzed. Specifically, acoustical features such as fundamental frequency and phoneme formant structure that are closely related to the speech production system are studied. For a further understanding of astronauts' vocal tract spectrum variation in space, a maximum likelihood frequency warping based analysis is proposed to detect the vocal tract spectrum displacement during space conditions. The results from fundamental frequency, formant structure, as well as vocal spectrum displacement indicate that astronauts change their speech production mechanism when in space. Moreover, the experimental results for astronaut voice identification tasks indicate that current speaker recognition solutions are highly vulnerable to astronaut voice production variations in space conditions. Future recommendations from this study suggest that successful applications of speaker recognition during extended space missions require robust speaker modeling techniques that could effectively adapt to voice production variation caused by diverse space conditions.
Ortiz-Ramírez, Marco F; Andersen, Michael J; Zaldívar-Riverón, Alejandro; Ornelas, Juan Francisco; Navarro-Sigüenza, Adolfo G
2016-01-01
Montane barriers influence the evolutionary history of lineages by promoting isolation of populations. The effects of these historical processes are evident in patterns of differentiation among extant populations, which are often expressed as genetic and behavioral variation between populations. We investigated the effects of geographic barriers on the evolutionary history of a Mesoamerican bird by studying patterns of genetic and vocal variation in the Ruddy-capped Nightingale-Thrush (Turdidae: Catharus frantzii), a non-migratory oscine bird that inhabits montane forests from central Mexico to Panama. We reconstructed the phylogeographic history and estimated divergence times between populations using Bayesian and maximum likelihood methods. We found strong support for the existence of four mitochondrial lineages of C. frantzii corresponding to isolated mountain ranges: Sierra Madre Oriental; Sierra Madre del Sur; the highlands of Chiapas, Guatemala, and El Salvador; and the Talamanca Cordillera. Vocal features in C. frantzii were highly variable among the four observed clades, but vocal variation and genetic variation were uncorrelated. Song variation in C. frantzii suggests that sexual selection and cultural drift could be important factors driving song differentiation in C. frantzii. Copyright © 2015 Elsevier Inc. All rights reserved.
Vassilieva, Anna B; Gogoleva, Svetlana S; Poyarkov, Nikolay A Jr
2016-06-24
We present new data on the distribution, reproduction, larval morphology and vocalization of Rhacophorus helenae (Rhacophoridae), a narrowly distributed frog from southern Vietnam. Two new populations of R. helenae were discovered during field surveys in the lowland monsoon forests in Dong Nai and Ba Ria-Vung Tau provinces in 2010-2013. Spawning was observed in May 2013. Egg clutches containing small (2.3±0.1 mm) unpigmented eggs were embedded in a foam nest and suspended high on trees above temporary ponds. The tadpoles of R. helenae have a morphology typical of pond-dwelling Rhacophorus larvae with a moderate tail length and a labial tooth row formula of 5(2-5)/3. Postmetamorphic juveniles differed from adult frogs in the features of their coloration and less developed webbing. The complex vocal repertoire of R. helenae included five types of tonal, wideband and pulsed calls and several transitional signal types differentiated by frequency and amplitude parameters. Calls were uttered as singular signals (pulsed calls) or within non-stereotyped series of variable duration (other call types). The complex structure of the advertisement call markedly distinguishes R. helenae from other members of the Rhacophorus reinwardtii species complex.
LAURIELLO, M.; ANGELONE, A.M.; RIENZO BUSINCO, L.; PASSALI, D.; BELLUSSI, L.M.; PASSALI, F.M.
2011-01-01
SUMMARY Aim of the present study was to investigate the prevalence of allergy in patients affected by both organic and/or functional vocal fold disorders. The secondary aim was to assess the correlation between sex and allergy in dysphonic patients. A retrospective chart review was performed on dysphonic patients. A total of 76 patients underwent fiberoptic endoscopy to assess the objective picture. Logistic regression analyses have been conducted to assess the association between sex and the outcome variables. The laryngoscopic examination revealed the presence of poor glottic closure in 32.9%, hyperkinesias in 11.8%, redness in 11.84%, polyps in 5.3%, oedema in 3.95%, vocal fold hypertrophy in 5.3%, nodules in 42.1%, cordectomy in 2.6%. Allergic rhinitis was present in 56.6%, milk intolerance in 13.2%, asthma in 9.2%, atopic dermatitis in 3.9%, drugs intolerance in 11.8%. A total of 76.32% patients presenting with dysphonia were allergic. A statistically significant association was found between female sex and presence of allergy. In conclusion, allergy testing should be performed routinely on female professional voice users. Mild respiratory disorders must be taken into serious consideration in female professional voice users, who may primarily complain of vocal dysfunction rather than upper and lower respiratory diseases. PMID:22064840
Wyman, Megan T.; Charlton, Benjamin D.; Locatelli, Yann; Reby, David
2011-01-01
Males of all polygynous deer species (Cervinae) give conspicuous calls during the reproductive season. The extreme interspecific diversity that characterizes these vocalizations suggests that they play a strong role in species discrimination. However, interbreeding between several species of Cervinae indicates permeable interspecific reproductive barriers. This study examines the contribution of vocal behavior to female species discrimination and mating preferences in two closely related polygynous deer species known to hybridize in the wild after introductions. Specifically, we investigate the reaction of estrous female red deer (Cervus elaphus) to playbacks of red deer vs. sika deer (Cervus nippon) male mating calls, with the prediction that females will prefer conspecific calls. While on average female red deer preferred male red deer roars, two out of twenty females spent more time in close proximity to the speaker broadcasting male sika deer moans. We suggest that this absence of strict vocal preference for species-specific mating calls may contribute to the permeability of pre-zygotic reproductive barriers observed between these species. Our results also highlight the importance of examining inter-individual variation when studying the role of female preferences in species discrimination and intraspecific mate selection. PMID:21887242
Precursors of Dancing and Singing to Music in Three- to Four-Months-Old Infants
Fujii, Shinya; Watanabe, Hama; Oohashi, Hiroki; Hirashima, Masaya; Nozaki, Daichi; Taga, Gentaro
2014-01-01
Dancing and singing to music involve auditory-motor coordination and have been essential to our human culture since ancient times. Although scholars have been trying to understand the evolutionary and developmental origin of music, early human developmental manifestations of auditory-motor interactions in music have not been fully investigated. Here we report limb movements and vocalizations in three- to four-months-old infants while they listened to music and were in silence. In the group analysis, we found no significant increase in the amount of movement or in the relative power spectrum density around the musical tempo in the music condition compared to the silent condition. Intriguingly, however, there were two infants who demonstrated striking increases in the rhythmic movements via kicking or arm-waving around the musical tempo during listening to music. Monte-Carlo statistics with phase-randomized surrogate data revealed that the limb movements of these individuals were significantly synchronized to the musical beat. Moreover, we found a clear increase in the formant variability of vocalizations in the group during music perception. These results suggest that infants at this age are already primed with their bodies to interact with music via limb movements and vocalizations. PMID:24837135
Precursors of dancing and singing to music in three- to four-months-old infants.
Fujii, Shinya; Watanabe, Hama; Oohashi, Hiroki; Hirashima, Masaya; Nozaki, Daichi; Taga, Gentaro
2014-01-01
Dancing and singing to music involve auditory-motor coordination and have been essential to our human culture since ancient times. Although scholars have been trying to understand the evolutionary and developmental origin of music, early human developmental manifestations of auditory-motor interactions in music have not been fully investigated. Here we report limb movements and vocalizations in three- to four-months-old infants while they listened to music and were in silence. In the group analysis, we found no significant increase in the amount of movement or in the relative power spectrum density around the musical tempo in the music condition compared to the silent condition. Intriguingly, however, there were two infants who demonstrated striking increases in the rhythmic movements via kicking or arm-waving around the musical tempo during listening to music. Monte-Carlo statistics with phase-randomized surrogate data revealed that the limb movements of these individuals were significantly synchronized to the musical beat. Moreover, we found a clear increase in the formant variability of vocalizations in the group during music perception. These results suggest that infants at this age are already primed with their bodies to interact with music via limb movements and vocalizations.
Auditory and audio-vocal responses of single neurons in the monkey ventral premotor cortex.
Hage, Steffen R
2018-03-20
Monkey vocalization is a complex behavioral pattern, which is flexibly used in audio-vocal communication. A recently proposed dual neural network model suggests that cognitive control might be involved in this behavior, originating from a frontal cortical network in the prefrontal cortex and mediated via projections from the rostral portion of the ventral premotor cortex (PMvr) and motor cortex to the primary vocal motor network in the brainstem. For the rapid adjustment of vocal output to external acoustic events, strong interconnections between vocal motor and auditory sites are needed, which are present at cortical and subcortical levels. However, the role of the PMvr in audio-vocal integration processes remains unclear. In the present study, single neurons in the PMvr were recorded in rhesus monkeys (Macaca mulatta) while volitionally producing vocalizations in a visual detection task or passively listening to monkey vocalizations. Ten percent of randomly selected neurons in the PMvr modulated their discharge rate in response to acoustic stimulation with species-specific calls. More than four-fifths of these auditory neurons showed an additional modulation of their discharge rates either before and/or during the monkeys' motor production of the vocalization. Based on these audio-vocal interactions, the PMvr might be well positioned to mediate higher order auditory processing with cognitive control of the vocal motor output to the primary vocal motor network. Such audio-vocal integration processes in the premotor cortex might constitute a precursor for the evolution of complex learned audio-vocal integration systems, ultimately giving rise to human speech. Copyright © 2018 Elsevier B.V. All rights reserved.
Iwaki, Shinobu; Maeda, Tatsuyoshi; Saito, Miki; Otsuki, Naoki; Takahashi, Miki; Wakui, Emi; Shinomiya, Hirotaka; Morimoto, Koichi; Inoue, Hiroyuki; Masuoka, Hiroo; Miyauchi, Akira; Nibu, Ken-Ichi
2017-03-01
Quality of voice after immediate recurrent laryngeal nerve (RLN) reconstruction in thyroid cancers has not been thoroughly studied. Thirteen patients with fixed vocal cords (fixed vocal cord group) and 8 patients with intact or impaired mobile vocal cords (mobile vocal cord group) who had immediate RLN reconstruction simultaneously with total thyroidectomy, and patients who had arytenoid adduction and thyroplasty for vocal cord paralysis caused by previous surgery (arytenoid adduction thyroplasty group) were enrolled in this study. Preoperative phonation efficiency index was significantly lower (p = .008) in the fixed vocal cord group than in the mobile vocal cord group. One year after surgery, all voice parameters of the patients in the fixed vocal cord group had improved, compared with their preoperative data. The fixed vocal cord group had attained satisfactory voice qualities equivalent to those of the mobile vocal cord group in terms of various voice parameters. The present results support the idea that immediate RLN reconstruction at the time of surgery for thyroid cancers may spare the need for subsequent arytenoid adduction thyroplasty even in the patients with preoperatively fixed vocal cords. © 2016 Wiley Periodicals, Inc. Head Neck 39: 427-431, 2017. © 2016 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Švancara, P.; Horáček, J.; Švec, J. G.
The study presents a three-dimensional (3D) finite element (FE) model of the flow-induced self-oscillation of the human vocal folds in interaction with acoustics of simplified vocal tract models. The 3D vocal tract models of the acoustic spaces shaped for simulation of phonation of Czech vowels [a:], [i:] and [u:] were created by converting the data from the magnetic resonance images (MRI). For modelling of the fluid-structure interaction, explicit coupling scheme with separated solvers for fluid and structure domain was utilized. The FE model comprises vocal folds pretension before starting phonation, large deformations of the vocal fold tissue, vocal-fold collisions, fluid-structure interaction, morphing the fluid mesh according to the vocal-fold motion (Arbitrary Lagrangian-Eulerian approach), unsteady viscous compressible airflow described by the Navier-Stokes equations and airflow separation. The developed FE model enables to study the relationship between flow-induced vibrations of the vocal folds and acoustic wave propagation in the vocal tract and can also be used to simulate for example pathological changes in the vocal fold tissue and their influence on the voice production.
The effect of voice amplification on occupational vocal dose in elementary school teachers.
Gaskill, Christopher S; O'Brien, Shenendoah G; Tinter, Sara R
2012-09-01
Two elementary school teachers, one with and one without a history of vocal complaints, wore a vocal dosimeter all day at school for a 3-week period. In the second week, each teacher wore a portable voice amplifier. Each teacher showed a reduction in vocal intensity during the week of amplification, with a larger effect for the teacher with vocal difficulties. This teacher also showed a decrease in hourly vocal fold distance dose as measured by the dosimeter despite incurring longer phonation times. Fundamental frequency and vocal fold cycle dose did not appear to be affected by the use of amplification during the teaching day. Both teachers showed evidence of a possible moderate effect of adjusting vocal intensity in the week after amplification, possibly as a means to recalibrate their perceived vocal loudness. This study demonstrates the usefulness of both vocal dosimetry and amplification in monitoring and modifying vocal dose in an occupational setting and reinforces previous data suggesting the effectiveness of amplification in reducing the vocal load in schoolteachers. Implications of the data for future research regarding prevention and treatment of occupational voice disorders are discussed. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Harold, Meredith Poore; Barlow, Steven M.
2012-01-01
The vocalizations and jaw kinematics of 30 infants aged 6–8 months were recorded using a Motion Analysis System and audiovisual technologies. This study represents the first attempt to determine the effect of play environment on infants’ rate of vocalization and jaw movement. Four play conditions were compared: watching videos, social contingent reinforcement and vocal modeling with an adult, playing alone with small toys, and playing alone with large toys. The fewest vocalizations and spontaneous movement were observed when infants were watching videos or interacting with an adult. Infants vocalized most when playing with large toys. The small toys, which naturally elicited gross motor movement (e.g., waving, banging, shaking), educed fewer vocalizations. This study was also the first to quantify the kinematics of vocalized and non-vocalized jaw movements of 6–8 month-old infants. Jaw kinematics did not differentiate infants who produced canonical syllables from those who did not. All infants produced many jaw movements without vocalization. However, during vocalization, infants were unlikely to move their jaw. This contradicts current theories that infant protophonic vocalizations are jaw dominant. Results of the current study can inform socio-linguistic and kinematic theories of canonical babbling. PMID:23261792
Kuo, Chung-Feng Jeffrey; Chu, Yueng-Hsiang; Wang, Po-Chun; Lai, Chun-Yu; Chu, Wen-Lin; Leu, Yi-Shing; Wang, Hsing-Won
2013-12-01
The human larynx is an important organ for voice production and respiratory mechanisms. The vocal cord is approximated for voice production and open for breathing. The videolaryngoscope is widely used for vocal cord examination. At present, physicians usually diagnose vocal cord diseases by manually selecting the image of the vocal cord opening to the largest extent (abduction), thus maximally exposing the vocal cord lesion. On the other hand, the severity of diseases such as vocal palsy, atrophic vocal cord is largely dependent on the vocal cord closing to the smallest extent (adduction). Therefore, diseases can be assessed by the image of the vocal cord opening to the largest extent, and the seriousness of breathy voice is closely correlated to the gap between vocal cords when closing to the smallest extent. The aim of the study was to design an automatic vocal cord image selection system to improve the conventional selection process by physicians and enhance diagnosis efficiency. Also, due to the unwanted fuzzy images resulting from examination process caused by human factors as well as the non-vocal cord images, texture analysis is added in this study to measure image entropy to establish a screening and elimination system to effectively enhance the accuracy of selecting the image of the vocal cord closing to the smallest extent. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Responses of primate frontal cortex neurons during natural vocal communication.
Miller, Cory T; Thomas, A Wren; Nummela, Samuel U; de la Mothe, Lisa A
2015-08-01
The role of primate frontal cortex in vocal communication and its significance in language evolution have a controversial history. While evidence indicates that vocalization processing occurs in ventrolateral prefrontal cortex neurons, vocal-motor activity has been conjectured to be primarily subcortical and suggestive of a distinctly different neural architecture from humans. Direct evidence of neural activity during natural vocal communication is limited, as previous studies were performed in chair-restrained animals. Here we recorded the activity of single neurons across multiple regions of prefrontal and premotor cortex while freely moving marmosets engaged in a natural vocal behavior known as antiphonal calling. Our aim was to test whether neurons in marmoset frontal cortex exhibited responses during vocal-signal processing and/or vocal-motor production in the context of active, natural communication. We observed motor-related changes in single neuron activity during vocal production, but relatively weak sensory responses for vocalization processing during this natural behavior. Vocal-motor responses occurred both prior to and during call production and were typically coupled to the timing of each vocalization pulse. Despite the relatively weak sensory responses a population classifier was able to distinguish between neural activity that occurred during presentations of vocalization stimuli that elicited an antiphonal response and those that did not. These findings are suggestive of the role that nonhuman primate frontal cortex neurons play in natural communication and provide an important foundation for more explicit tests of the functional contributions of these neocortical areas during vocal behaviors. Copyright © 2015 the American Physiological Society.
Responses of primate frontal cortex neurons during natural vocal communication
Thomas, A. Wren; Nummela, Samuel U.; de la Mothe, Lisa A.
2015-01-01
The role of primate frontal cortex in vocal communication and its significance in language evolution have a controversial history. While evidence indicates that vocalization processing occurs in ventrolateral prefrontal cortex neurons, vocal-motor activity has been conjectured to be primarily subcortical and suggestive of a distinctly different neural architecture from humans. Direct evidence of neural activity during natural vocal communication is limited, as previous studies were performed in chair-restrained animals. Here we recorded the activity of single neurons across multiple regions of prefrontal and premotor cortex while freely moving marmosets engaged in a natural vocal behavior known as antiphonal calling. Our aim was to test whether neurons in marmoset frontal cortex exhibited responses during vocal-signal processing and/or vocal-motor production in the context of active, natural communication. We observed motor-related changes in single neuron activity during vocal production, but relatively weak sensory responses for vocalization processing during this natural behavior. Vocal-motor responses occurred both prior to and during call production and were typically coupled to the timing of each vocalization pulse. Despite the relatively weak sensory responses a population classifier was able to distinguish between neural activity that occurred during presentations of vocalization stimuli that elicited an antiphonal response and those that did not. These findings are suggestive of the role that nonhuman primate frontal cortex neurons play in natural communication and provide an important foundation for more explicit tests of the functional contributions of these neocortical areas during vocal behaviors. PMID:26084912
How small could a pup sound? The physical bases of signaling body size in harbor seals
Gross, Stephanie; Garcia, Maxime; Rubio-Garcia, Ana; de Boer, Bart
2017-01-01
Abstract Vocal communication is a crucial aspect of animal behavior. The mechanism which most mammals use to vocalize relies on three anatomical components. First, air overpressure is generated inside the lower vocal tract. Second, as the airstream goes through the glottis, sound is produced via vocal fold vibration. Third, this sound is further filtered by the geometry and length of the upper vocal tract. Evidence from mammalian anatomy and bioacoustics suggests that some of these three components may covary with an animal’s body size. The framework provided by acoustic allometry suggests that, because vocal tract length (VTL) is more strongly constrained by the growth of the body than vocal fold length (VFL), VTL generates more reliable acoustic cues to an animal’s size. This hypothesis is often tested acoustically but rarely anatomically, especially in pinnipeds. Here, we test the anatomical bases of the acoustic allometry hypothesis in harbor seal pups Phoca vitulina. We dissected and measured vocal tract, vocal folds, and other anatomical features of 15 harbor seals post-mortem. We found that, while VTL correlates with body size, VFL does not. This suggests that, while body growth puts anatomical constraints on how vocalizations are filtered by harbor seals’ vocal tract, no such constraints appear to exist on vocal folds, at least during puppyhood. It is particularly interesting to find anatomical constraints on harbor seals’ vocal tracts, the same anatomical region partially enabling pups to produce individually distinctive vocalizations. PMID:29492005
Dynamic interactions between musical, cardiovascular, and cerebral rhythms in humans.
Bernardi, Luciano; Porta, Cesare; Casucci, Gaia; Balsamo, Rossella; Bernardi, Nicolò F; Fogari, Roberto; Sleight, Peter
2009-06-30
Reactions to music are considered subjective, but previous studies suggested that cardiorespiratory variables increase with faster tempo independent of individual preference. We tested whether compositions characterized by variable emphasis could produce parallel instantaneous cardiovascular/respiratory responses and whether these changes mirrored music profiles. Twenty-four young healthy subjects, 12 musicians (choristers) and 12 nonmusician control subjects, listened (in random order) to music with vocal (Puccini's "Turandot") or orchestral (Beethoven's 9th Symphony adagio) progressive crescendos, more uniform emphasis (Bach cantata), 10-second period (ie, similar to Mayer waves) rhythmic phrases (Giuseppe Verdi's arias "Va pensiero" and "Libiam nei lieti calici"), or silence while heart rate, respiration, blood pressures, middle cerebral artery flow velocity, and skin vasomotion were recorded.Common responses were recognized by averaging instantaneous cardiorespiratory responses regressed against changes in music profiles and by coherence analysis during rhythmic phrases. Vocal and orchestral crescendos produced significant (P=0.05 or better) correlations between cardiovascular or respiratory signals and music profile, particularly skin vasoconstriction and blood pressures, proportional to crescendo, in contrast to uniform emphasis, which induced skin vasodilation and reduction in blood pressures. Correlations were significant both in individual and group-averaged signals. Phrases at 10-second periods by Verdi entrained the cardiovascular autonomic variables. No qualitative differences in recorded measurements were seen between musicians and nonmusicians. Music emphasis and rhythmic phrases are tracked consistently by physiological variables. Autonomic responses are synchronized with music, which might therefore convey emotions through autonomic arousal during crescendos or rhythmic phrases.
Efficacy of Cool-Down Exercises In the Practice Regimen of Elite Singers
NASA Astrophysics Data System (ADS)
Gottliebson, Renee O.
Cool-down exercises are routinely prescribed for singers, yet few data exist about the efficacy of active recovery or cooling down of the vocal mechanism. The purpose of the present study was to compare three aspects of vocal function after using different recovery methods following rigorous voice use. Vocal function was assessed using (1) phonation threshold pressure (PTP); (2) acoustic measures (accuracy of tone production, duration of notes and duration of intervals between notes); and (3) measures of subjective perception: perceived phonatory effort (PPE) and Singing Voice Handicap Index (SVHI). Data were collected after 10-minutes of cool-down exercises, complete voice rest, and conversation immediately following a 50-minute voice lesson. Data were collected again 12-24 hours later. Participants included actively performing elite singers (7 women, 2 men) enrolled in the graduate program (M.M., D.M.A.) at the University of Cincinnati's College-Conservatory of Music. While it was expected that PTP estimates after cool downs would be significantly lower than baselines and the other conditions, it turns out that PTP estimates after cool downs were significantly higher at the 80% level of the pitch range. Statistically significant correlations between PTP estimates and PPE scores were found when comparing levels of the participants' pitch ranges (10%, 20%, 80%). Mean PPE scores were highest at the 80% level of the pitch range. The acoustic measures yielded variable results. Cool-down exercises did not result in significantly more accurate tone production and shorter staccato note duration and duration of intervals between staccato notes as compared to baselines and recovery conditions. Instead, participants demonstrated greater accuracy of tone production during baselines and lesser accuracy after voice rest. Staccato notes were significantly shorter in duration after the conversation condition as compared to voice rest. Duration between staccato notes was significantly shorter 12-24 hours after voice rest compared to baselines and the other follow-up conditions. SVHI mean scores were higher during baselines than after the recovery conditions and during follow-up sessions. Statistical significance is noted in comparison of mean SVHI scores 12-24 hours after cool downs (overall lowest mean score) and baselines. The relationship between vocal cool downs and their aerodynamic and acoustic effects remains unclear. What was found was that perhaps the perceived benefit of vocal cool downs is not apparent immediately after their use, but is evident 12-24 hours later. While it appears that conversation may be an acceptable form of active vocal recovery, cool-down exercises may be most beneficial as they raise a conscious awareness of optimum, resonant voice use which may carryover into conversational speech. Future research may benefit from examination of long-term use of vocal cool-down exercises in subsequent vocal performance.
Roy, Sabyasachi; Zhao, Lingyun; Wang, Xiaoqin
2016-11-30
Although evidence from human studies has long indicated the crucial role of the frontal cortex in speech production, it has remained uncertain whether the frontal cortex in nonhuman primates plays a similar role in vocal communication. Previous studies of prefrontal and premotor cortices of macaque monkeys have found neural signals associated with cue- and reward-conditioned vocal production, but not with self-initiated or spontaneous vocalizations (Coudé et al., 2011; Hage and Nieder, 2013), which casts doubt on the role of the frontal cortex of the Old World monkeys in vocal communication. A recent study of marmoset frontal cortex observed modulated neural activities associated with self-initiated vocal production (Miller et al., 2015), but it did not delineate whether these neural activities were specifically attributed to vocal production or if they may result from other nonvocal motor activity such as orofacial motor movement. In the present study, we attempted to resolve these issues and examined single neuron activities in premotor cortex during natural vocal exchanges in the common marmoset (Callithrix jacchus), a highly vocal New World primate. Neural activation and suppression were observed both before and during self-initiated vocal production. Furthermore, by comparing neural activities between self-initiated vocal production and nonvocal orofacial motor movement, we identified a subpopulation of neurons in marmoset premotor cortex that was activated or suppressed by vocal production, but not by orofacial movement. These findings provide clear evidence of the premotor cortex's involvement in self-initiated vocal production in natural vocal behaviors of a New World primate. Human frontal cortex plays a crucial role in speech production. However, it has remained unclear whether the frontal cortex of nonhuman primates is involved in the production of self-initiated vocalizations during natural vocal communication. Using a wireless multichannel neural recording technique, we observed in the premotor cortex neural activation and suppression both before and during self-initiated vocalizations when marmosets, a highly vocal New World primate species, engaged in vocal exchanges with conspecifics. A novel finding of the present study is the discovery of a subpopulation of premotor cortex neurons that was activated by vocal production, but not by orofacial movement. These observations provide clear evidence of the premotor cortex's involvement in vocal production in a New World primate species. Copyright © 2016 the authors 0270-6474/16/3612168-12$15.00/0.
Prosody in Infant-Directed Speech Is Similar across Western and Traditional Cultures
ERIC Educational Resources Information Center
Broesch, Tanya L.; Bryant, Gregory A.
2015-01-01
When speaking to infants, adults typically alter the acoustic properties of their speech in a variety of ways compared with how they speak to other adults; for example, they use higher pitch, increased pitch range, more pitch variability, and slower speech rate. Research shows that these vocal changes happen similarly across industrialized…
Human Language Technology: Opportunities and Challenges
2005-01-01
because of the connections to and reliance on signal processing. Audio diarization critically includes indexing of speakers [12], since speaker ...to reduce inter- speaker variability in training. Standard techniques include vocal-tract length normalization, adaptation of acoustic models using...maximum likelihood linear regression (MLLR), and speaker -adaptive training based on MLLR. The acoustic models are mixtures of Gaussians, typically with
ERIC Educational Resources Information Center
Streufert, Sara Rebecca
2012-01-01
In recent years, scholars have become more vocal regarding counselors and counseling psychologists' responsibilities to advance efforts for social change (Goodman et al., 2004; Speight & Vera, 2004; Vera & Speight, 2004). As a result, empirical investigations have started to evaluate variables that may contribute to trainees and mental…
Phase-Specific Vocalizations of Male Mice at the Initial Encounter during the Courtship Sequence
Matsumoto, Yui K.; Okanoya, Kazuo
2016-01-01
Mice produce ultrasonic vocalizations featuring a variety of syllables. Vocalizations are observed during social interactions. In particular, males produce numerous syllables during courtship. Previous studies have shown that vocalizations change according to sexual behavior, suggesting that males vary their vocalizations depending on the phase of the courtship sequence. To examine this process, we recorded large sets of mouse vocalizations during male–female interactions and acoustically categorized these sounds into 12 vocal types. We found that males emitted predominantly short syllables during the first minute of interaction, more long syllables in the later phases, and mainly harmonic sounds during mounting. These context- and time-dependent changes in vocalization indicate that vocal communication during courtship in mice consists of at least three stages and imply that each vocalization type has a specific role in a phase of the courtship sequence. Our findings suggest that recording for a sufficiently long time and taking the phase of courtship into consideration could provide more insights into the role of vocalization in mouse courtship behavior in future study. PMID:26841117
Iconicity can ground the creation of vocal symbols
Perlman, Marcus; Dale, Rick; Lupyan, Gary
2015-01-01
Studies of gestural communication systems find that they originate from spontaneously created iconic gestures. Yet, we know little about how people create vocal communication systems, and many have suggested that vocalizations do not afford iconicity beyond trivial instances of onomatopoeia. It is unknown whether people can generate vocal communication systems through a process of iconic creation similar to gestural systems. Here, we examine the creation and development of a rudimentary vocal symbol system in a laboratory setting. Pairs of participants generated novel vocalizations for 18 different meanings in an iterative ‘vocal’ charades communication game. The communicators quickly converged on stable vocalizations, and naive listeners could correctly infer their meanings in subsequent playback experiments. People's ability to guess the meanings of these novel vocalizations was predicted by how close the vocalization was to an iconic ‘meaning template’ we derived from the production data. These results strongly suggest that the meaningfulness of these vocalizations derived from iconicity. Our findings illuminate a mechanism by which iconicity can ground the creation of vocal symbols, analogous to the function of iconicity in gestural communication systems. PMID:26361547
Viscoelastic properties of rabbit vocal folds after augmentation.
Hertegård, Stellan; Dahlqvist, Ake; Laurent, Claude; Borzacchiello, Assunta; Ambrosio, Luigi
2003-03-01
Vocal fold function is closely related to tissue viscoelasticity. Augmentation substances may alter the viscoelastic properties of vocal fold tissues and hence their vibratory capacity. We sought to investigate the viscoelastic properties of rabbit vocal folds in vitro after injections of various augmentation substances. Polytetrafluoroethylene (Teflon), cross-linked collagen (Zyplast), and cross-linked hyaluronan, hylan b gel (Hylaform) were injected into the lamina propria and the thyroarytenoid muscle of rabbit vocal folds. Dynamic viscosity of the injected vocal fold as a function of frequency was measured with a Bohlin parallel-plate rheometer during small-amplitude oscillation. All injected vocal folds showed a decreasing dynamic viscosity with increasing frequency. Vocal fold samples injected with hylan b gel showed the lowest dynamic viscosity, quite close to noninjected control samples. Vocal folds injected with polytetrafluoroethylene showed the highest dynamic viscosity followed by the collagen samples. The data indicated that hylan b gel in short-term renders the most natural viscoelastic properties to the vocal fold among the substances tested. This is of importance to restore/preserve the vibratory capacity of the vocal folds when glottal insufficiency is treated with injections.
Central pattern generators for social vocalization: Androgen-dependent neurophysiological mechanisms
Bass, Andrew H.; Remage-Healey, Luke
2008-01-01
Historically, most studies of vertebrate central pattern generators (CPGs) have focused on mechanisms for locomotion and respiration. Here, we highlight new results for ectothermic vertebrates, namely teleost fish and amphibians, showing how androgenic steroids can influence the temporal patterning of CPGs for social vocalization. Investigations of vocalizing teleosts show how androgens can rapidly (within minutes) modulate the neurophysiological output of the vocal CPG (fictive vocalizations that mimic the temporal properties of natural vocalizations) inclusive of their divergent actions between species, as well as intraspecific differences between male reproductive morphs. Studies of anuran amphibians (frogs) demonstrate that long-term steroid treatments (wks) can masculinize the fictive vocalizations of females, inclusive of its sensitivity to rapid modulation by serotonin. Given the conserved organization of vocal control systems across vertebrate groups, the vocal CPGs of fish and amphibians provide tractable models for identifying androgen-dependent events that are fundamental to the mechanisms of vocal motor patterning. These basic mechanisms can also inform our understanding of the more complex CPGs for vocalization, and social behaviors in general, that have evolved among birds and mammals. PMID:18262186
On the role of the reticular formation in vocal pattern generation.
Jürgens, Uwe; Hage, Steffen R
2007-09-04
This review is an attempt to localize the brain region responsible for pattern generation of species-specific vocalizations. A catalogue is set up, listing the criteria considered to be essential for a vocal pattern generator. According to this catalogue, a vocal pattern generator should show vocalization-correlated activity, starting before vocal onset and reflecting specific acoustic features of the vocalization. Artificial activation by electrical or glutamatergic stimulation should produce artificially sounding vocalization. Lesioning is expected to have an inhibitory or deteriorating effect on vocalization. Anatomically, a vocal pattern generator can be assumed to have direct or, at least, oligosynaptic connections with all the motoneuron pools involved in phonation. A survey of the literature reveals that the only area meeting all these criteria is a region, reaching from the parvocellular pontine reticular formation just above the superior olive through the lateral reticular formation around the facial nucleus and nucleus ambiguus down to the caudalmost medulla, including the dorsal and ventral reticular nuclei and nucleus retroambiguus. It is proposed that vocal pattern generation takes place within this whole region.
Samlan, Robin A.; Story, Brad H.
2011-01-01
Purpose To relate vocal fold structure and kinematics to two acoustic measures: cepstral peak prominence (CPP) and the amplitude of the first harmonic relative to the second (H1-H2). Method A computational, kinematic model of the medial surfaces of the vocal folds was used to specify features of vocal fold structure and vibration in a manner consistent with breathy voice. Four model parameters were altered: degree of vocal fold adduction, surface bulging, vibratory nodal point, and supraglottal constriction. CPP and H1-H2 were measured from simulated glottal area, glottal flow and acoustic waveforms and related to the underlying vocal fold kinematics. Results CPP decreased with increased separation of the vocal processes, whereas the nodal point location had little effect. H1-H2 increased as a function of separation of the vocal processes in the range of 1–1.5 mm and decreased with separation > 1.5 mm. Conclusions CPP is generally a function of vocal process separation. H1*-H2* will increase or decrease with vocal process separation based on vocal fold shape, pivot point for the rotational mode, and supraglottal vocal tract shape, limiting its utility as an indicator of breathy voice. Future work will relate the perception of breathiness to vocal fold kinematics and acoustic measures. PMID:21498582
Tan, Melin; Pitman, Michael J
2011-07-01
We present a patient with a novel finding of bilateral mucosal bridges, bilateral type III trans-vocal fold sulci vocales, and a vocal fold polyp. Although sulci and mucosal bridges occur in the vocal folds, it is rare to find multiples of these lesions in a single patient, and it is even more uncommon when they occur in conjunction with a vocal fold polyp. To our knowledge, this is the first description of a vocal fold polyp in combination with multiple vocal fold bridges and multiple type III sulci vocales in a single patient. To describe and visually present the diagnosis and treatment of a patient with an intracordal polyp, bilateral mucosal bridges, as well as bilateral type III trans-vocal fold sulci vocales. Presentation of a set of high definition intraoperative photos displaying the extent of the vocal fold lesions and the resection of the intracordal polyp. This patient presented with only 6 months of significant dysphonia. It was felt that the recent change in voice was because of the polyp and not the bridges or sulci vocales. Considering the patient's presentation and the possible morbidity of resection of mucosal bridges and sulci, only the polyp was excised. Postoperatively, the patient's voice returned to his acceptable mild baseline dysphonia, and the benefit has persisted 6 months postoperatively. The combination of bilateral mucosal bridges, bilateral type III sulcus vocalis, and an intracordal polyp in one patient is rare if not novel. Treatment of the polyp alone returned the patient's voice to his lifelong baseline of mild dysphonia. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Adapted to Roar: Functional Morphology of Tiger and Lion Vocal Folds
Klemuk, Sarah A.; Riede, Tobias; Walsh, Edward J.; Titze, Ingo R.
2011-01-01
Vocal production requires active control of the respiratory system, larynx and vocal tract. Vocal sounds in mammals are produced by flow-induced vocal fold oscillation, which requires vocal fold tissue that can sustain the mechanical stress during phonation. Our understanding of the relationship between morphology and vocal function of vocal folds is very limited. Here we tested the hypothesis that vocal fold morphology and viscoelastic properties allow a prediction of fundamental frequency range of sounds that can be produced, and minimal lung pressure necessary to initiate phonation. We tested the hypothesis in lions and tigers who are well-known for producing low frequency and very loud roaring sounds that expose vocal folds to large stresses. In histological sections, we found that the Panthera vocal fold lamina propria consists of a lateral region with adipocytes embedded in a network of collagen and elastin fibers and hyaluronan. There is also a medial region that contains only fibrous proteins and hyaluronan but no fat cells. Young's moduli range between 10 and 2000 kPa for strains up to 60%. Shear moduli ranged between 0.1 and 2 kPa and differed between layers. Biomechanical and morphological data were used to make predictions of fundamental frequency and subglottal pressure ranges. Such predictions agreed well with measurements from natural phonation and phonation of excised larynges, respectively. We assume that fat shapes Panthera vocal folds into an advantageous geometry for phonation and it protects vocal folds. Its primary function is probably not to increase vocal fold mass as suggested previously. The large square-shaped Panthera vocal fold eases phonation onset and thereby extends the dynamic range of the voice. PMID:22073246
Vocal Tones Influence Young Children’s Responses to Prohibitions
Dahl, Audun; Tran, Amy Q.
2016-01-01
Vocal reactions to child transgressions convey information about the nature of those transgressions. The present research investigated children’s ability to make use of such vocal reactions. Study 1 investigated infants’ compliance with a vocal prohibition telling them to stay away from a toy. Compared to younger infants, older infants showed greater compliance with prohibitions elicited by moral (interpersonal harm) transgressions, but not with prohibitions elicited by pragmatic (inconvenience) transgressions. Study 2 investigated preschoolers’ use of firm-stern vocalizations (associated with moral transgressions) and positive vocalizations (associated with pragmatic transgressions). Most children guessed that the firm-stern vocalizations were uttered in response to a moral transgression and the positive vocalization were uttered in response to a pragmatic transgression. These two studies suggest that children use vocal tones, along with other experiences, to guide their compliance with and interpretation of prohibitions. PMID:27518810
Maurer, D; Hess, M; Gross, M
1996-12-01
Theoretic investigations of the "source-filter" model have indicated a pronounced acoustic interaction of glottal source and vocal tract. Empirical investigations of formant pattern variations apart from changes in vowel identity have demonstrated a direct relationship between the fundamental frequency and the patterns. As a consequence of both findings, independence of phonation and articulation may be limited in the speech process. Within the present study, possible interdependence of phonation and phoneme was investigated: vocal fold vibrations and larynx position for vocalizations of different vowels in a healthy man and woman were examined by high-speed light-intensified digital imaging. We found 1) different movements of the vocal folds for vocalizations of different vowel identities within one speaker and at similar fundamental frequency, and 2) constant larynx position within vocalization of one vowel identity, but different positions for vocalizations of different vowel identities. A possible relationship between the vocal fold vibrations and the phoneme is discussed.
Prat, Yosef; Taub, Mor; Pratt, Ester; Yovel, Yossi
2017-10-03
Animal acoustic communication research depends on our ability to record the vocal behaviour of different species. Only rarely do we have the opportunity to continuously follow the vocal behaviour of a group of individuals of the same species for a long period of time. Here, we provide a database of Egyptian fruit bat vocalizations, which were continuously recorded in the lab in several groups simultaneously for more than a year. The dataset includes almost 300,000 files, a few seconds each, containing social vocalizations and representing the complete vocal repertoire used by the bats in the experiment period. Around 90,000 files are annotated with details about the individuals involved in the vocal interactions, their behaviours and the context. Moreover, the data include the complete vocal ontogeny of pups, from birth to adulthood, in different conditions (e.g., isolated or in a group). We hope that this comprehensive database will stimulate studies that will enhance our understanding of bat, and mammal, social vocal communication.
Harold, Meredith Poore; Barlow, Steven M
2013-02-01
The vocalizations and jaw kinematics of 30 infants aged 6-8 months were recorded using a Motion Analysis System and audiovisual technologies. This study represents the first attempt to determine the effect of play environment on infants' rate of vocalization and jaw movement. Four play conditions were compared: watching videos, social contingent reinforcement and vocal modeling with an adult, playing alone with small toys, and playing alone with large toys. The fewest vocalizations and spontaneous movement were observed when infants were watching videos or interacting with an adult. Infants vocalized most when playing with large toys. The small toys, which naturally elicited gross motor movement (e.g., waving, banging, shaking), educed fewer vocalizations. This study was also the first to quantify the kinematics of vocalized and non-vocalized jaw movements of 6-8 month-old infants. Jaw kinematics did not differentiate infants who produced canonical syllables from those who did not. All infants produced many jaw movements without vocalization. However, during vocalization, infants were unlikely to move their jaw. This contradicts current theories that infant protophonic vocalizations are jaw-dominant. Results of the current study can inform socio-linguistic and kinematic theories of canonical babbling. Copyright © 2012 Elsevier Inc. All rights reserved.
The Risk of Vocal Fold Atrophy after Serial Corticosteroid Injections of the Vocal Fold.
Shi, Lucy L; Giraldez-Rodriguez, Laureano A; Johns, Michael M
2016-11-01
The aim of this study was to illustrate the risk of vocal fold atrophy in patients who receive serial subepithelial steroid injections for vocal fold scar. This study is a retrospective case report of two patients who underwent a series of weekly subepithelial infusions of 10 mg/mL dexamethasone for benign vocal fold lesion. Shortly after the procedures, both patients developed a weak and breathy voice. The first patient was a 53-year-old man with radiation-induced vocal fold stiffness. Six injections were performed unilaterally, and 1 week later, he developed unilateral vocal fold atrophy with new glottal insufficiency. The second patient was a 67-year-old woman with severe vocal fold inflammation related to laryngitis and calcinosis, Raynaud's phenomenon, esophagean dysmotility, sclerodactyly, and telangiectasia (CREST) syndrome. Five injections were performed bilaterally, and 1 week later, she developed bilateral vocal fold atrophy with a large midline glottal gap during phonation. In both cases, the steroid-induced vocal atrophy resolved spontaneously after 4 months. Serial subepithelial steroid infusions of the vocal folds, although safe in the majority of patients, carry the risk of causing temporary vocal fold atrophy when given at short intervals. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Histopathologic investigations of the unphonated human child vocal fold mucosa.
Sato, Kiminori; Umeno, Hirohito; Nakashima, Tadashi; Nonaka, Satoshi; Harabuchi, Yasuaki
2012-01-01
Vocal fold stellate cells (VFSCs) in the maculae flavae (MFe) located at both ends of the vocal fold mucosa are inferred to be involved in the metabolism of extracellular matrices. MFe are also considered to be an important structure in the growth and development of the human vocal fold mucosa. Tension caused by phonation (vocal fold vibration) is hypothesized to stimulate VFSCs to accelerate production of extracellular matrices. Human child vocal fold mucosae unphonated since birth were investigated histologically. Histologic analysis of human child vocal fold mucosa. Vocal fold mucosae, which have remained unphonated since birth, of two children (7 and 12 years old) with cerebral palsy were investigated by light and electron microscopy and compared with normal subjects. Vocal fold mucosae and MFe were hypoplastic and rudimentary and did not have a vocal ligament, Reinke's space, or the layered structure. The lamina propria appeared as a uniform structure. Some VFSCs in the MFe showed degeneration and not many vesicles were present at the periphery of the cytoplasm. The VFSCs synthesized fewer extracellular matrices, such as fibrous protein and glycosaminoglycan. The VFSCs appeared to have decreased activity. Vocal fold vibration (phonation) after birth is an important factor in the growth and development of the human vocal fold mucosa. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Viscoelasticity of rabbit vocal folds after injection augmentation.
Dahlqvist, Ake; Gärskog, Ola; Laurent, Claude; Hertegård, Stellan; Ambrosio, Luigi; Borzacchiello, Assunta
2004-01-01
Vocal fold function is related to the viscoelasticity of the vocal fold tissue. Augmentation substances used for injection treatment of voice insufficiency may alter the viscoelastic properties of vocal folds and their vibratory capacity. The objective was to compare the mechanical properties (viscoelasticity) of various injectable substances and the viscoelasticity of rabbit vocal folds, 6 months after injection with one of these substances. Animal model. Cross-linked collagen (Zyplast), double cross-linked hyaluronan (hylan B gel), dextranomers in hyaluronan (DHIA), and polytetrafluoroethylene (Teflon) were injected into rabbit vocal folds. Six months after the injection, the animals were killed and the right- and left-side vocal folds were removed. Dynamic viscosity of the injected substances and the vocal folds was measured with a Bohlin parallel-plate rheometer during small-amplitude oscillation. All injected vocal folds showed a decreasing dynamic viscosity with increasing frequency. Hylan B gel and DiHA showed the lowest dynamic viscosity values, and vocal folds injected with these substances also showed the lowest dynamic viscosity (similar to noninjected control samples). Teflon (and vocal folds injected with Teflon) showed the highest dynamic viscosity values, followed by the collagen samples. Substances with low viscoelasticity alter the mechanical properties of the vocal fold to a lesser degree than substances with a high viscoelasticity. The data indicated that hylan B gel and DiHA render the most natural viscoelastic properties to the vocal folds. These substances seem to be appropriate for preserving or restoring the vibratory capacity of the vocal folds when glottal insufficiency is treated with augmentative injections.
Neural Representation of a Target Auditory Memory in a Cortico-Basal Ganglia Pathway
Bottjer, Sarah W.
2013-01-01
Vocal learning in songbirds, like speech acquisition in humans, entails a period of sensorimotor integration during which vocalizations are evaluated via auditory feedback and progressively refined to achieve an imitation of memorized vocal sounds. This process requires the brain to compare feedback of current vocal behavior to a memory of target vocal sounds. We report the discovery of two distinct populations of neurons in a cortico-basal ganglia circuit of juvenile songbirds (zebra finches, Taeniopygia guttata) during vocal learning: (1) one in which neurons are selectively tuned to memorized sounds and (2) another in which neurons are selectively tuned to self-produced vocalizations. These results suggest that neurons tuned to learned vocal sounds encode a memory of those target sounds, whereas neurons tuned to self-produced vocalizations encode a representation of current vocal sounds. The presence of neurons tuned to memorized sounds is limited to early stages of sensorimotor integration: after learning, the incidence of neurons encoding memorized vocal sounds was greatly diminished. In contrast to this circuit, neurons known to drive vocal behavior through a parallel cortico-basal ganglia pathway show little selective tuning until late in learning. One interpretation of these data is that representations of current and target vocal sounds in the shell circuit are used to compare ongoing patterns of vocal feedback to memorized sounds, whereas the parallel core circuit has a motor-related role in learning. Such a functional subdivision is similar to mammalian cortico-basal ganglia pathways in which associative-limbic circuits mediate goal-directed responses, whereas sensorimotor circuits support motor aspects of learning. PMID:24005299
Schneider, David M; Woolley, Sarah M N
2010-06-01
Many social animals including songbirds use communication vocalizations for individual recognition. The perception of vocalizations depends on the encoding of complex sounds by neurons in the ascending auditory system, each of which is tuned to a particular subset of acoustic features. Here, we examined how well the responses of single auditory neurons could be used to discriminate among bird songs and we compared discriminability to spectrotemporal tuning. We then used biologically realistic models of pooled neural responses to test whether the responses of groups of neurons discriminated among songs better than the responses of single neurons and whether discrimination by groups of neurons was related to spectrotemporal tuning and trial-to-trial response variability. The responses of single auditory midbrain neurons could be used to discriminate among vocalizations with a wide range of abilities, ranging from chance to 100%. The ability to discriminate among songs using single neuron responses was not correlated with spectrotemporal tuning. Pooling the responses of pairs of neurons generally led to better discrimination than the average of the two inputs and the most discriminating input. Pooling the responses of three to five single neurons continued to improve neural discrimination. The increase in discriminability was largest for groups of neurons with similar spectrotemporal tuning. Further, we found that groups of neurons with correlated spike trains achieved the largest gains in discriminability. We simulated neurons with varying levels of temporal precision and measured the discriminability of responses from single simulated neurons and groups of simulated neurons. Simulated neurons with biologically observed levels of temporal precision benefited more from pooling correlated inputs than did neurons with highly precise or imprecise spike trains. These findings suggest that pooling correlated neural responses with the levels of precision observed in the auditory midbrain increases neural discrimination of complex vocalizations.
Rumbach, Anna F
2013-11-01
To determine the anatomical and physiological nature of voice problems and their treatment in those group fitness instructors (GFIs) who have sought a medical diagnosis; the impact of voice disorders on quality of life and their contribution to activity limitations and participation restrictions; and the perceived attitudes and level of support from the industry at large in response to instructor's voice disorders and need for treatment. Prospective self-completion questionnaire design. Thirty-eight individuals (3 males and 35 females) currently active in the Australian fitness industry who had been diagnosed with a voice disorder completed an online self-completion questionnaire administered via SurveyMonkey. Laryngeal pathology included vocal fold nodules (N = 24), vocal fold cysts (N = 2), vocal fold hemorrhage (N = 1), and recurrent chronic laryngitis (N = 3). Eight individuals reported vocal strain and muscle tension dysphonia without concurrent vocal fold pathology. Treatment methods were variable, with 73.68% (N = 28) receiving voice therapy alone, 7.89% (N = 3) having voice therapy in combination with surgery, and 10.53% (N = 4) having voice therapy in conjunction with medication. Three individuals (7.89%) received no treatment for their voice disorder. During treatment, 82% of the cohort altered their teaching practices. Half of the cohort reported that their voice problems led to social withdrawal, decreased job satisfaction, and emotional distress. Greater than 65% also reported being dissatisfied with the level of industry and coworker support during the period of voice recovery. This study identifies that GFIs are susceptible to a number of voice disorders that impact their social and professional lives, and there is a need for more proactive training and advice on voice care for instructors, as well as those in management positions within the industry to address mixed approaches and opinions regarding the importance of voice care. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Vocal Tremor Analysis with the Vocal Demodulator.
ERIC Educational Resources Information Center
Winholtz, William S.; Ramig, Lorraine Olson
1992-01-01
This paper describes the Vocal Demodulator as a new device for analysis of vocal tremor. The Vocal Demodulator produces amplitude-demodulated and frequency-demodulated outputs and measures the frequency and level of low-frequency tremor components in sustained phonation. The paper describes quantification of the demodulation process, validation…
Wang, Weian; Lu, Rong
2013-06-01
To investigate the effect of laryngoscopic surgery combined with nasal endoscopic system for the treatment of vocal cords benign lesions. Fifty-two patients admitted to our department with vocal cords benign lesions (including vocal polyps, vocal nodules, vocal cord cyst) underwent laryngoscopic surgery combined with nasal endoscopic system. All patients were treated successfully once and for all without any significant postoperative complication. The laryngoscopic surgery combined with nasal endoscopic system is a safe, minimally invasive and simple method for the treatment of benign lesions of vocal cords.
Dysphonia and vocal fold telangiectasia in hereditary hemorrhagic telangiectasia.
Chang, Joseph; Yung, Katherine C
2014-11-01
This case report is the first documentation of dysphonia and vocal fold telangiectasia as a complication of hereditary hemorrhagic telangiectasia (HHT). Case report of a 40-year-old man with HHT presenting with 2 years of worsening hoarseness. Hoarseness corresponded with a period of anticoagulation. Endoscopy revealed vocal fold scarring, vocal fold telangiectasias, and plica ventricular is suggestive of previous submucosal vocal fold hemorrhage and subsequent counterproductive compensation with ventricular phonation. Hereditary hemorrhagic telangiectasia may present as dysphonia with vocal fold telangiectasias and place patients at risk of vocal fold hemorrhage. © The Author(s) 2014.
Vocal learning in elephants: neural bases and adaptive context
Stoeger, Angela S; Manger, Paul
2014-01-01
In the last decade clear evidence has accumulated that elephants are capable of vocal production learning. Examples of vocal imitation are documented in African (Loxodonta africana) and Asian (Elephas maximus) elephants, but little is known about the function of vocal learning within the natural communication systems of either species. We are also just starting to identify the neural basis of elephant vocalizations. The African elephant diencephalon and brainstem possess specializations related to aspects of neural information processing in the motor system (affecting the timing and learning of trunk movements) and the auditory and vocalization system. Comparative interdisciplinary (from behavioral to neuroanatomical) studies are strongly warranted to increase our understanding of both vocal learning and vocal behavior in elephants. PMID:25062469
Vocal fry may undermine the success of young women in the labor market.
Anderson, Rindy C; Klofstad, Casey A; Mayew, William J; Venkatachalam, Mohan
2014-01-01
Vocal fry is speech that is low pitched and creaky sounding, and is increasingly common among young American females. Some argue that vocal fry enhances speaker labor market perceptions while others argue that vocal fry is perceived negatively and can damage job prospects. In a large national sample of American adults we find that vocal fry is interpreted negatively. Relative to a normal speaking voice, young adult female voices exhibiting vocal fry are perceived as less competent, less educated, less trustworthy, less attractive, and less hirable. The negative perceptions of vocal fry are stronger for female voices relative to male voices. These results suggest that young American females should avoid using vocal fry speech in order to maximize labor market opportunities.
Universal mechanisms of sound production and control in birds and mammals
Elemans, C.P.H; Rasmussen, J.H.; Herbst, C.T.; Düring, D.N.; Zollinger, S.A.; Brumm, H.; Srivastava, K.; Svane, N.; Ding, M.; Larsen, O.N.; Sober, S.J.; Švec, J.G.
2015-01-01
As animals vocalize, their vocal organ transforms motor commands into vocalizations for social communication. In birds, the physical mechanisms by which vocalizations are produced and controlled remain unresolved because of the extreme difficulty in obtaining in vivo measurements. Here, we introduce an ex vivo preparation of the avian vocal organ that allows simultaneous high-speed imaging, muscle stimulation and kinematic and acoustic analyses to reveal the mechanisms of vocal production in birds across a wide range of taxa. Remarkably, we show that all species tested employ the myoelastic-aerodynamic (MEAD) mechanism, the same mechanism used to produce human speech. Furthermore, we show substantial redundancy in the control of key vocal parameters ex vivo, suggesting that in vivo vocalizations may also not be specified by unique motor commands. We propose that such motor redundancy can aid vocal learning and is common to MEAD sound production across birds and mammals, including humans. PMID:26612008
Universal mechanisms of sound production and control in birds and mammals.
Elemans, C P H; Rasmussen, J H; Herbst, C T; Düring, D N; Zollinger, S A; Brumm, H; Srivastava, K; Svane, N; Ding, M; Larsen, O N; Sober, S J; Švec, J G
2015-11-27
As animals vocalize, their vocal organ transforms motor commands into vocalizations for social communication. In birds, the physical mechanisms by which vocalizations are produced and controlled remain unresolved because of the extreme difficulty in obtaining in vivo measurements. Here, we introduce an ex vivo preparation of the avian vocal organ that allows simultaneous high-speed imaging, muscle stimulation and kinematic and acoustic analyses to reveal the mechanisms of vocal production in birds across a wide range of taxa. Remarkably, we show that all species tested employ the myoelastic-aerodynamic (MEAD) mechanism, the same mechanism used to produce human speech. Furthermore, we show substantial redundancy in the control of key vocal parameters ex vivo, suggesting that in vivo vocalizations may also not be specified by unique motor commands. We propose that such motor redundancy can aid vocal learning and is common to MEAD sound production across birds and mammals, including humans.
Correlation between vocal tract symptoms and modern singing handicap index in church gospel singers.
Pinheiro, Joel; Silverio, Kelly Cristina Alves; Siqueira, Larissa Thaís Donalonso; Ramos, Janine Santos; Brasolotto, Alcione Ghedini; Zambon, Fabiana; Behlau, Mara
2017-08-24
To verify the correlation between vocal tract discomfort symptoms and perceived voice handicaps in gospel singers, analyzing possible differences according to gender. 100 gospel singers volunteered, 50 male and 50 female. All participants answered two questionnaires: Vocal Tract Discomfort (VTD) scale and the Modern Singing Handicap Index (MSHI) that investigates the vocal handicap perceived by singers, linking the results of both instruments (p<0.05). Women presented more perceived handicaps and also more frequent and higher intensity vocal tract discomfort. Furthermore, the more frequent and intense the vocal tract symptoms, the higher the vocal handicap for singing. Female gospel singers present higher frequency and intensity of vocal tract discomfort symptoms, as well as higher voice handicap for singing than male gospel singers. The higher the frequency and intensity of the laryngeal symptoms, the higher the vocal handicap will be.
FE Modelling of the Fluid-Structure-Acoustic Interaction for the Vocal Folds Self-Oscillation
NASA Astrophysics Data System (ADS)
Švancara, Pavel; Horáček, J.; Hrůza, V.
The flow induced self-oscillation of the human vocal folds in interaction with acoustic processes in the simplified vocal tract model was explored by three-dimensional (3D) finite element (FE) model. Developed FE model includes vocal folds pretension before phonation, large deformations of the vocal fold tissue, vocal folds contact, fluid-structure interaction, morphing the fluid mesh according the vocal folds motion (Arbitrary Lagrangian-Eulerian approach), unsteady viscous compressible airflow described by the Navier-Stokes equations and airflow separation during the glottis closure. Iterative partitioned approach is used for modelling the fluid-structure interaction. Computed results prove that the developed model can be used for simulation of the vocal folds self-oscillation and resulting acoustic waves. The developed model enables to numerically simulate an influence of some pathological changes in the vocal fold tissue on the voice production.
Vocal development in a Waddington landscape
Teramoto, Yayoi; Takahashi, Daniel Y; Holmes, Philip; Ghazanfar, Asif A
2017-01-01
Vocal development is the adaptive coordination of the vocal apparatus, muscles, the nervous system, and social interaction. Here, we use a quantitative framework based on optimal control theory and Waddington’s landscape metaphor to provide an integrated view of this process. With a biomechanical model of the marmoset monkey vocal apparatus and behavioral developmental data, we show that only the combination of the developing vocal tract, vocal apparatus muscles and nervous system can fully account for the patterns of vocal development. Together, these elements influence the shape of the monkeys’ vocal developmental landscape, tilting, rotating or shifting it in different ways. We can thus use this framework to make quantitative predictions regarding how interfering factors or experimental perturbations can change the landscape within a species, or to explain comparative differences in vocal development across species DOI: http://dx.doi.org/10.7554/eLife.20782.001 PMID:28092262
Domanski, Mark C; Preciado, Diego A
2012-01-01
Phrenic nerve pacing can be used to treat congenital central hypoventilation syndrome (CCHS). We report how the lack of normal vocal cord tone during phrenic paced respiration can result in passive vocal cord collapse and produce obstructive symptoms. We describe a case of passive vocal cord collapse during phrenic nerve paced respiration in a patient with CCHS. As far as we know, this is the first report of this etiology of airway obstruction. The patient, a 7-year-old with CCHS and normal waking vocal cord movement, continued to require nightly continuous positive airway pressure (CPAP) despite successful utilization of phrenic nerve pacers. On direct laryngoscopy, the patient's larynx was observed while the diaphragmatic pacers were sequentially engaged. No abnormal vocal cord stimulation was witnessed during engaging of either phrenic nerve stimulator. However, the lack of normal inspiratory vocal cord abduction during phrenic nerve-paced respiration resulted in vocal cord collapse and partial obstruction due to passive adduction of the vocal cords through the Bernoulli effect. Bilateral phrenic nerve stimulation resulted in more vocal cord collapse than unilateral stimulation. The lack of vocal cord abduction on inspiration presents a limit to phrenic nerve pacers.
Cazau, Dorian; Adam, Olivier; Aubin, Thierry; Laitman, Jeffrey T; Reidenberg, Joy S
2016-10-10
Although mammalian vocalizations are predominantly harmonically structured, they can exhibit an acoustic complexity with nonlinear vocal sounds, including deterministic chaos and frequency jumps. Such sounds are normative events in mammalian vocalizations, and can be directly traceable to the nonlinear nature of vocal-fold dynamics underlying typical mammalian sound production. In this study, we give qualitative descriptions and quantitative analyses of nonlinearities in the song repertoire of humpback whales from the Ste Marie channel (Madagascar) to provide more insight into the potential communication functions and underlying production mechanisms of these features. A low-dimensional biomechanical modeling of the whale's U-fold (vocal folds homolog) is used to relate specific vocal mechanisms to nonlinear vocal features. Recordings of living humpback whales were searched for occurrences of vocal nonlinearities (instabilities). Temporal distributions of nonlinearities were assessed within sound units, and between different songs. The anatomical production sources of vocal nonlinearities and the communication context of their occurrences in recordings are discussed. Our results show that vocal nonlinearities may be a communication strategy that conveys information about the whale's body size and physical fitness, and thus may be an important component of humpback whale songs.
NASA Astrophysics Data System (ADS)
Cazau, Dorian; Adam, Olivier; Aubin, Thierry; Laitman, Jeffrey T.; Reidenberg, Joy S.
2016-10-01
Although mammalian vocalizations are predominantly harmonically structured, they can exhibit an acoustic complexity with nonlinear vocal sounds, including deterministic chaos and frequency jumps. Such sounds are normative events in mammalian vocalizations, and can be directly traceable to the nonlinear nature of vocal-fold dynamics underlying typical mammalian sound production. In this study, we give qualitative descriptions and quantitative analyses of nonlinearities in the song repertoire of humpback whales from the Ste Marie channel (Madagascar) to provide more insight into the potential communication functions and underlying production mechanisms of these features. A low-dimensional biomechanical modeling of the whale’s U-fold (vocal folds homolog) is used to relate specific vocal mechanisms to nonlinear vocal features. Recordings of living humpback whales were searched for occurrences of vocal nonlinearities (instabilities). Temporal distributions of nonlinearities were assessed within sound units, and between different songs. The anatomical production sources of vocal nonlinearities and the communication context of their occurrences in recordings are discussed. Our results show that vocal nonlinearities may be a communication strategy that conveys information about the whale’s body size and physical fitness, and thus may be an important component of humpback whale songs.
Improvement of Vocal Pathologies Diagnosis Using High-Speed Videolaryngoscopy
Tsuji, Domingos Hiroshi; Hachiya, Adriana; Dajer, Maria Eugenia; Ishikawa, Camila Cristina; Takahashi, Marystella Tomoe; Montagnoli, Arlindo Neto
2014-01-01
Introduction The study of the dynamic properties of vocal fold vibration is important for understanding the vocal production mechanism and the impact of organic and functional changes. The advent of high-speed videolaryngoscopy (HSV) has provided the possibility of seeing the real cycle of vocal fold vibration in detail through high sampling rate of successive frames and adequate spatial resolution. Objective To describe the technique, advantages, and limitations of using HSV and digital videokymography in the diagnosis of vocal pathologies. Methods We used HSV and digital videokymography to evaluate one normophonic individual and four patients with vocal fold pathologies (nodules, unilateral paralysis of the left vocal fold, intracordal cyst, and adductor spasmodic dysphonia). The vocal fold vibration parameters (glottic closure, vibrational symmetry, periodicity, mucosal wave, amplitude, and glottal cycle phases) were assessed. Results Differences in the vocal vibration parameters were observed and correlated with the pathophysiology. Conclusion HSV is the latest diagnostic tool in visual examination of vocal behavior and has considerable potential to refine our knowledge regarding the vocal fold vibration and voice production, as well as regarding the impact of pathologic conditions have on the mechanism of phonation. PMID:25992109
Samlan, Robin A; Story, Brad H
2011-10-01
To relate vocal fold structure and kinematics to 2 acoustic measures: cepstral peak prominence (CPP) and the amplitude of the first harmonic relative to the second (H1-H2). The authors used a computational, kinematic model of the medial surfaces of the vocal folds to specify features of vocal fold structure and vibration in a manner consistent with breathy voice. Four model parameters were altered: degree of vocal fold adduction, surface bulging, vibratory nodal point, and supraglottal constriction. CPP and H1-H2 were measured from simulated glottal area, glottal flow, and acoustic waveforms and were related to the underlying vocal fold kinematics. CPP decreased with increased separation of the vocal processes, whereas the nodal point location had little effect. H1-H2 increased as a function of separation of the vocal processes in the range of 1.0 mm to 1.5 mm and decreased with separation > 1.5 mm. CPP is generally a function of vocal process separation. H1*-H2* (see paragraph 6 of article text for an explanation of the asterisks) will increase or decrease with vocal process separation on the basis of vocal fold shape, pivot point for the rotational mode, and supraglottal vocal tract shape, limiting its utility as an indicator of breathy voice. Future work will relate the perception of breathiness to vocal fold kinematics and acoustic measures.
Vessel noise cuts down communication space for vocalizing fish and marine mammals.
Putland, Rosalyn L; Merchant, Nathan D; Farcas, Adrian; Radford, Craig A
2018-04-01
Anthropogenic noise across the world's oceans threatens the ability of vocalizing marine species to communicate. Some species vocalize at key life stages or whilst foraging, and disruption to the acoustic habitat at these times could lead to adverse consequences at the population level. To investigate the risk of these impacts, we investigated the effect of vessel noise on the communication space of the Bryde's whale Balaenoptera edeni, an endangered species which vocalizes at low frequencies, and bigeye Pempheris adspersa, a nocturnal fish species which uses contact calls to maintain group cohesion while foraging. By combining long-term acoustic monitoring data with AIS vessel-tracking data and acoustic propagation modelling, the impact of vessel noise on their communication space was determined. Routine vessel passages cut down communication space by up to 61.5% for bigeyes and 87.4% for Bryde's whales. This influence of vessel noise on communication space exceeded natural variability for between 3.9 and 18.9% of the monitoring period. Additionally, during the closest point of approach of a large commercial vessel, <10 km from the listening station, the communication space of both species was reduced by a maximum of 99% compared to the ambient soundscape. These results suggest that vessel noise reduces communication space beyond the evolutionary context of these species and may have chronic effects on these populations. To combat this risk, we propose the application or extension of ship speed restrictions in ecologically significant areas, since our results indicate a reduction in sound source levels for vessels transiting at lower speeds. © 2017 John Wiley & Sons Ltd.
Romagna, Alexander; Rachinger, Walter; Schwartz, Christoph; Mehrkens, Jan-Hinnerk; Betz, Christian; Briegel, Josef; Schnell, Oliver; Tonn, Jörg-Christian; Schichor, Christian; Thon, Niklas
2015-09-01
The 10th cranial nerve (CN X) is at risk during surgery in the lower cerebellopontine angle (CPA). To evaluate endotracheal surface electrodes for assessment of CN X motor function during CPA surgery. Twenty patients were enrolled. Electrophysiological recordings were analyzed and retrospectively correlated with clinical, imaging, and intraoperative data. Recordings from endotracheal surface electrodes were reliable and eligible for analyses in 17 patients; in 3 patients, no surface electrode compound motor action potentials (CMAPs) could be obtained. Those patients with sufficient recordings underwent surgery in the CPA for tumors in 14 patients and for nontumor pathologies in 3 patients. In 12 patients, bipolar stimulation of motor rootlets in the CPA resulted in simultaneous CMAPs recorded from both surface electrodes and needle electrodes placed in the soft palate. Coactivation was particularly seen in patients with an intricate relationship between lower cranial nerves and tumor formations (n = 9/10). Amplitudes and latencies of vocal cord CMAPs showed high interindividual but low intraindividual variability. Parameters were not well correlated with the type of surgery (tumor vs nontumor surgery) and lower CN anatomy (displaced vs undisplaced). In 2 patients, vocal cord CMAPs were lost during tumor surgery, which was associated with postoperative dysphagia and hoarseness in 1 patient. Endotracheal surface electrodes allow identification of vocal cord motor rootlets in the CPA. Worsening of CMAP parameters might indicate functional impairment. These aspects support the use of endotracheal surface electrodes in selected patients in whom the vagus nerve might be at risk during CPA surgery.
Unilateral Vocal Fold Paralysis: A Systematic Review of Speech-Language Pathology Management.
Walton, Chloe; Conway, Erin; Blackshaw, Helen; Carding, Paul
2017-07-01
Dysphonia due to unilateral vocal fold paralysis (UVFP) can be characterized by hoarseness and weakness, resulting in a significant impact on patients' activity and participation. Voice therapy provided by a speech-language pathologist is designed to maximize vocal function and improve quality of life. The purpose of this paper is to systematically review literature surrounding the effectiveness of speech-language pathology intervention for the management of UVFP in adults. This is a systematic review. Electronic databases were searched using a range of key terms including dysphonia, vocal fold paralysis, and speech-language pathology. Eligible articles were extracted and reviewed by the authors for risk of bias, methodology, treatment efficacy, and clinical outcomes. Of the 3311 articles identified, 12 met the inclusion criteria: seven case series and five comparative studies. All 12 studies subjectively reported positive effects following the implementation of voice therapy for UVFP; however, the heterogeneity of participant characteristics, voice therapy, and voice outcome resulted in a low level of evidence. There is presently a lack of methodological rigor and clinical efficacy in the speech-language pathology management of dysphonia arising from UVFP in adults. Reasons for this reduced efficacy can be attributed to the following: (1) no standardized speech-language pathology intervention; (2) no consistency of assessment battery; (3) the variable etiology and clinical presentation of UVFP; and (4) inconsistent timing, frequency, and intensity of treatment. Further research is required to develop the evidence for the management of UVFP incorporating controlled treatment protocols and more rigorous clinical methodology. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Novel underwater soundscape: acoustic repertoire of plainfin midshipman fish.
McIver, Eileen L; Marchaterre, Margaret A; Rice, Aaron N; Bass, Andrew H
2014-07-01
Toadfishes are among the best-known groups of sound-producing (vocal) fishes and include species commonly known as toadfish and midshipman. Although midshipman have been the subject of extensive investigation of the neural mechanisms of vocalization, this is the first comprehensive, quantitative analysis of the spectro-temporal characters of their acoustic signals and one of the few for fishes in general. Field recordings of territorial, nest-guarding male midshipman during the breeding season identified a diverse vocal repertoire composed of three basic sound types that varied widely in duration, harmonic structure and degree of amplitude modulation (AM): 'hum', 'grunt' and 'growl'. Hum duration varied nearly 1000-fold, lasting for minutes at a time, with stable harmonic stacks and little envelope modulation throughout the sound. By contrast, grunts were brief, ~30-140 ms, broadband signals produced both in isolation and repetitively as a train of up to 200 at intervals of ~0.5-1.0 s. Growls were also produced alone or repetitively, but at variable intervals of the order of seconds with durations between those of grunts and hums, ranging 60-fold from ~200 ms to 12 s. Growls exhibited prominent harmonics with sudden shifts in pulse repetition rate and highly variable AM patterns, unlike the nearly constant AM of grunt trains and flat envelope of hums. Behavioral and neurophysiological studies support the hypothesis that each sound type's unique acoustic signature contributes to signal recognition mechanisms. Nocturnal production of these sounds against a background chorus dominated constantly for hours by a single sound type, the multi-harmonic hum, reveals a novel underwater soundscape for fish. © 2014. Published by The Company of Biologists Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Saide P. E.; Springston S.; Spak, S. N.
2012-03-29
We evaluate a regional-scale simulation with the WRF-Chem model for the VAMOS (Variability of the American Monsoon Systems) Ocean-Cloud-Atmosphere-Land Study Regional Experiment (VOCALS-REx), which sampled the Southeast Pacific's persistent stratocumulus deck. Evaluation of VOCALS-REx ship-based and three aircraft observations focuses on analyzing how aerosol loading affects marine boundary layer (MBL) dynamics and cloud microphysics. We compare local time series and campaign-averaged longitudinal gradients, and highlight differences in model simulations with (W) and without (NW) wet deposition processes. The higher aerosol loadings in the NW case produce considerable changes in MBL dynamics and cloud microphysics, in accordance with the established conceptualmore » model of aerosol indirect effects. These include increase in cloud albedo, increase in MBL and cloud heights, drizzle suppression, increase in liquid water content, and increase in cloud lifetime. Moreover, better statistical representation of aerosol mass and number concentration improves model fidelity in reproducing observed spatial and temporal variability in cloud properties, including top and base height, droplet concentration, water content, rain rate, optical depth (COD) and liquid water path (LWP). Together, these help to quantify confidence in WRF-Chem's modeled aerosol-cloud interactions, especially in the activation parameterization, while identifying structural and parametric uncertainties including: irreversibility in rain wet removal; overestimation of marine DMS and sea salt emissions, and accelerated aqueous sulfate conversion. Our findings suggest that WRF-Chem simulates marine cloud-aerosol interactions at a level sufficient for applications in forecasting weather and air quality and studying aerosol climate forcing, and may do so with the reliability required for policy analysis.« less
Rizzo, Francesca; Nespoli, Ester; Abaei, Alireza; Bar-Gad, Izhar; Deelchand, Dinesh K.; Fegert, Jörg; Rasche, Volker; Hengerer, Bastian; Boeckers, Tobias M.
2018-01-01
Tourette’s syndrome (TS) is a neurodevelopmental disorder characterized primarily by motor and vocal tics. Comorbidities such as attention deficit and hyperactivity disorder (ADHD) are observed in over 50% of TS patients. We applied aripiprazole in a juvenile rat model that displays motor tics and hyperactivity. We additionally assessed the amount of ultrasonic vocalizations (USVs) as an indicator for the presence of vocal tics and evaluated the changes in the striatal neurometabolism using in vivo proton magnetic resonance spectroscopy (1H-MRS) at 11.7T. Thirty-one juvenile spontaneously hypertensive rats (SHRs) underwent bicuculline striatal microinjection and treatment with either aripiprazole or vehicle. Control groups were sham operated and sham injected. Behavior, USVs, and striatal neurochemical profile were analyzed at early, middle, and late adolescence (postnatal days 35 to 50). Bicuculline microinjections in the dorsolateral striatum induced motor tics in SHR juvenile rats. Acute aripiprazole administration selectively reduced both tic frequency and latency, whereas stereotypies, USVs, and hyperactivity remained unaltered. The striatal neurochemical profile was only moderately altered after tic-induction and was not affected by systemic drug treatment. When applied to a young rat model that provides high degrees of construct, face, and predictive validity for TS and comorbid ADHD, aripiprazole selectively reduces motor tics, revealing that tics and stereotypies are distinct phenomena in line with clinical treatment of patients. Finally, our 1H-MRS results suggest a critical revision of the striatal role in the hypothesized cortico-striatal dysregulation in TS pathophysiology. PMID:29487562
Holt, Marla M; Noren, Dawn P; Dunkin, Robin C; Williams, Terrie M
2015-06-01
Many animals produce louder, longer or more repetitious vocalizations to compensate for increases in environmental noise. Biological costs of increased vocal effort in response to noise, including energetic costs, remain empirically undefined in many taxa, particularly in marine mammals that rely on sound for fundamental biological functions in increasingly noisy habitats. For this investigation, we tested the hypothesis that an increase in vocal effort would result in an energetic cost to the signaler by experimentally measuring oxygen consumption during rest and a 2 min vocal period in dolphins that were trained to vary vocal loudness across trials. Vocal effort was quantified as the total acoustic energy of sounds produced. Metabolic rates during the vocal period were, on average, 1.2 and 1.5 times resting metabolic rate (RMR) in dolphin A and B, respectively. As vocal effort increased, we found that there was a significant increase in metabolic rate over RMR during the 2 min following sound production in both dolphins, and in total oxygen consumption (metabolic cost of sound production plus recovery costs) in the dolphin that showed a wider range of vocal effort across trials. Increases in vocal effort, as a consequence of increases in vocal amplitude, repetition rate and/or duration, are consistent with behavioral responses to noise in free-ranging animals. Here, we empirically demonstrate for the first time in a marine mammal, that these vocal modifications can have an energetic impact at the individual level and, importantly, these data provide a mechanistic foundation for evaluating biological consequences of vocal modification in noise-polluted habitats. © 2015. Published by The Company of Biologists Ltd.
Endoscopic laterofixation in bilateral vocal cords paralysis in children.
Lidia, Zawadzka-Glos; Magdalena, Frackiewicz; Mieczyslaw, Chmielik
2010-06-01
Vocal cords paralysis is the second most frequent cause of laryngeal stridor in children. Symptoms of congenital vocal cords paralysis can occur shortly after birth or later. Vocal cords paralysis can be unilateral or bilateral. Symptoms of unilateral paralysis include hoarse weeping or stridor during a deep inhalation. In children unilateral vocal cords paralysis often retreats spontaneously or can be completely compensated. Children with bilateral vocal cords paralysis present mainly breathing disorders while phonation is normal. Symptoms are different, starting from complete occlusion of respiratory tracts and ending on small symptoms connected with the lack of effort tolerance. When symptoms are severe, patients from this group require a tracheotomy. The lack of restoration of normal function of vocal cords or lack of complete compensation and maintenance of symptoms are an indication for surgical treatment. The aim of this study is to present results of the treatment of bilateral vocal cords paralysis in children using the endoscopic method of laterofixation of vocal cords. In the Pediatric ENT Department between 1998 and 2009 sixty four children with dyspnoea and/or phonation disorders caused by vocal cords paralysis were treated. In ten cases laterofixation of vocal cords was performed, in most cases with good result. In this article the authors present the method of endoscopic laterofixation and achieved results. Endoscopic laterofixation of vocal cords in children is a safe and an easy method of surgical treatment of bilateral vocal cords paralysis. This method can be used as a first and often as a one stage treatment of vocal cords paralysis. In some cases this procedure is insufficient and has to be completed with other methods. Copyright (c) 2010 Elsevier Ireland Ltd. All rights reserved.
Female Presence and Estrous State Influence Mouse Ultrasonic Courtship Vocalizations
Hanson, Jessica L.; Hurley, Laura M.
2012-01-01
The laboratory mouse is an emerging model for context-dependent vocal signaling and reception. Mouse ultrasonic vocalizations are robustly produced in social contexts. In adults, male vocalization during courtship has become a model of interest for signal-receiver interactions. These vocalizations can be grouped into syllable types that are consistently produced by different subspecies and strains of mice. Vocalizations are unique to individuals, vary across development, and depend on social housing conditions. The behavioral significance of different syllable types, including the contexts in which different vocalizations are made and the responses listeners have to different types of vocalizations, is not well understood. We examined the effect of female presence and estrous state on male vocalizations by exploring the use of syllable types and the parameters of syllables during courtship. We also explored correlations between vocalizations and other behaviors. These experimental manipulations produced four main findings: 1) vocalizations varied among males, 2) the production of USVs and an increase in the use of a specific syllable type were temporally related to mounting behavior, 3) the frequency (kHz), bandwidth, and duration of syllables produced by males were influenced by the estrous phase of female partners, and 4) syllable types changed when females were removed. These findings show that mouse ultrasonic courtship vocalizations are sensitive to changes in female phase and presence, further demonstrating the context-sensitivity of these calls. PMID:22815817
[Varices of the vocal cord: report of 21 cases].
Li, Jin-rang; Sun, Jian-jun
2006-04-01
To study the diagnosis and treatment of varices of the vocal cord. The clinical data of 21 cases with varix of vocal cord were analyzed. All the patients presented hoarseness. There were 15 female and 6 male cases with their ages ranged from 23 to 68 years (median 44 years old). The varix was found on the right vocal cord in 12 cases, on the left vocal cord in 9 cases. Isolated varix existed on the vocal cord in 10 cases, varix with vocal cord polyps or nodules in 10 cases, varix with vocal cord paralysis in 1 case. All the patients were diagnosed under the laryngovideoscopy. The lesions appeared on the superior surface of the vocal cord. Varices manifested as abnormally dilated capillary running in the anterior to posterior direction in 6 cases, as clusters of capillary in 3 cases, as a dot or small sheet or short line of capillary in 12 cases. The varices were disappeared in 2 of 8 cases with vocal cord varices and polyps after removed the polyps. The varices of others patients had no change after following up for more than 6 months, but one patient happened hemorrhage of the contralateral vocal cord. Varices are most commonly seen in female. Laryngovideoscopy is the key in determining the vocal fold varices. Management of patients with a varix includes medical therapy, speech therapy, and occasionally surgical vaporization.
Hertegård, S; Dahlqvist, A; Goodyer, E
2006-07-01
The scarring model resulted in significant damage and elevated viscoelasticity of the lamina propria. Hyaluronan preparations may alter viscoelasticity in scarred rabbit vocal folds. Vocal fold scarring results in stiffness of the lamina propria and severe voice problems. The aims of this study were to examine the degree of scarring achieved in the experiment and to measure the viscoelastic properties after injection of hyaluronan in rabbit vocal folds. Twenty-two vocal folds from 15 New Zealand rabbits were scarred, 8 vocal folds were controls. After 8 weeks 12 of the scarred vocal folds received injections with 2 types of cross-linked hyaluronan products and 10 scarred folds were injected with saline. After 11 more weeks the animals were sacrificed. After dissection, 15 vocal folds were frozen for viscoelastic measurements, whereas 14 vocal folds were prepared and stained. Measurements were made of the lamina propria thickness. Viscoelasticity was measured on intact vocal folds with a linear skin rheometer (LSR) adapted to laryngeal measurements. Measurements on the digitized slides showed a thickened lamina propria in the scarred samples as compared with the normal vocal folds (p<0.05). The viscoelastic analysis showed a tendency to stiffening of the scarred vocal folds as compared with the normal controls (p=0.05). There was large variation in stiffness between the two injected hyaluronan products.
Vocalization Subsystem Responses to a Temporarily Induced Unilateral Vocal Fold Paralysis
ERIC Educational Resources Information Center
Croake, Daniel J.; Andreatta, Richard D.; Stemple, Joseph C.
2018-01-01
Purpose: The purpose of this study is to quantify the interactions of the 3 vocalization subsystems of respiration, phonation, and resonance before, during, and after a perturbation to the larynx (temporarily induced unilateral vocal fold paralysis) in 10 vocally healthy participants. Using dynamic systems theory as a guide, we hypothesized that…
NASA Astrophysics Data System (ADS)
Saidi, Hiba; Erath, Byron D.
2015-11-01
The vocal folds play a major role in human communication by initiating voiced sound production. During voiced speech, the vocal folds are set into sustained vibrations. Synthetic self-oscillating vocal fold models are regularly employed to gain insight into flow-structure interactions governing the phonation process. Commonly, a fixed boundary condition is applied to the lateral, anterior, and posterior sides of the synthetic vocal fold models. However, physiological observations reveal the presence of adipose tissue on the lateral surface between the thyroid cartilage and the vocal folds. The goal of this study is to investigate the influence of including this substrate layer of adipose tissue on the dynamics of phonation. For a more realistic representation of the human vocal folds, synthetic multi-layer vocal fold models have been fabricated and tested while including a soft lateral layer representative of adipose tissue. Phonation parameters have been collected and are compared to those of the standard vocal fold models. Results show that vocal fold kinematics are affected by adding the adipose tissue layer as a new boundary condition.
Yamauchi, Akihito; Yokonishi, Hisayuki; Imagawa, Hiroshi; Sakakibara, Ken-Ichi; Nito, Takaharu; Tayama, Niro
2016-11-01
The goal of this work was to objectively elucidate the vibratory characteristics of vocal fold paralysis (VFP) using high-speed digital imaging (HSDI). HSDI was performed in 29 vocally healthy subjects (12 women and 17 men) and in 107 patients with VFP (40 women and 67 men). Then, the HSDI data were evaluated by visual-perceptual rating, single-line kymography, multiline kymography, laryngotopography, and glottal area waveform analysis. Patients with VFP compared with vocally healthy subjects revealed more frequent incomplete glottal closure, greater asymmetry in amplitude, mucosal wave, frequency, and phase, as well as larger open quotient, smaller speed index, larger maximal and minimal glottal area, and smaller glottal area difference. Paralyzed vocal folds in VFP revealed reduced mucosal wave than nonparalyzed vocal folds in VFP or in intact vocal folds in vocally healthy subjects. HSDI was effective in documenting the characteristics of vocal fold vibrations in patients with VFP and in exploring the vibratory disturbance for estimating the severity of dysphonia. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Aghajan, Yasmin; Yoon, Janet M; Crawford, John Ross
2017-04-24
Severe neuropathy is a known adverse effect of vincristine in patients with Charcot-Marie-Tooth disease (CMT). We present the case of a 16-year-old girl with anaplastic medulloblastoma treated with gross total resection and high-dose craniospinal radiation with adjuvant vincristine chemotherapy who developed acute-onset severe quadriplegia and vocal cord paralysis. Vincristine and radiation therapy were discontinued. Although her neuropathy slowly improved over several weeks, she developed metastatic extraneural medulloblastoma and died 5 months after diagnosis. Subsequent genetic testing revealed previously asymptomatic and undiagnosed CMT1A. Our case highlights the importance of early recognition of acute vincristine neurotoxicity that should raise suspicion of an underlying hereditary neuropathy. © BMJ Publishing Group Ltd (unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Deciphering the complexity of acute inflammation using mathematical models.
Vodovotz, Yoram
2006-01-01
Various stresses elicit an acute, complex inflammatory response, leading to healing but sometimes also to organ dysfunction and death. We constructed both equation-based models (EBM) and agent-based models (ABM) of various degrees of granularity--which encompass the dynamics of relevant cells, cytokines, and the resulting global tissue dysfunction--in order to begin to unravel these inflammatory interactions. The EBMs describe and predict various features of septic shock and trauma/hemorrhage (including the response to anthrax, preconditioning phenomena, and irreversible hemorrhage) and were used to simulate anti-inflammatory strategies in clinical trials. The ABMs that describe the interrelationship between inflammation and wound healing yielded insights into intestinal healing in necrotizing enterocolitis, vocal fold healing during phonotrauma, and skin healing in the setting of diabetic foot ulcers. Modeling may help in understanding the complex interactions among the components of inflammation and response to stress, and therefore aid in the development of novel therapies and diagnostics.
Nelson, Erik J; Wu, Jennifer Y
2017-01-17
BACKGROUND Postoperative conversion disorder is rare and has been reported. The diagnosis is usually made after all major organic causes have been ruled out. CASE REPORT We describe a case of a 13-year-old female who presented in the post-anesthesia care unit with acute-onset inspiratory stridor and unresponsiveness to verbal or painful stimuli after receiving a general anesthetic for upper endoscopy. Later in the post-anesthesia care unit, she presented with acute-onset right hemiplegia and sensory loss. She was first evaluated for causes of her stridor and unresponsiveness. The evaluation revealed paradoxical vocal cord movement, and all laboratory test values were normal. For her hemiplegia and sensory loss, she was evaluated for stroke with head MRI and CT scans, which were normal. CONCLUSIONS After extensive workup and consideration of multiple etiologies for her presenting signs and symptoms, the most likely diagnosis was conversion disorder.
Early experience shapes vocal neural coding and perception in songbirds
Woolley, Sarah M. N.
2012-01-01
Songbirds, like humans, are highly accomplished vocal learners. The many parallels between speech and birdsong and conserved features of mammalian and avian auditory systems have led to the emergence of the songbird as a model system for studying the perceptual mechanisms of vocal communication. Laboratory research on songbirds allows the careful control of early life experience and high-resolution analysis of brain function during vocal learning, production and perception. Here, I review what songbird studies have revealed about the role of early experience in the development of vocal behavior, auditory perception and the processing of learned vocalizations by auditory neurons. The findings of these studies suggest general principles for how exposure to vocalizations during development and into adulthood influences the perception of learned vocal signals. PMID:22711657
Interstitial protein alterations in rabbit vocal fold with scar.
Thibeault, Susan L; Bless, Diane M; Gray, Steven D
2003-09-01
Fibrous and interstitial proteins compose the extracellular matrix of the vocal fold lamina propria and account for its biomechanic properties. Vocal fold scarring is characterized by altered biomechanical properties, which create dysphonia. Although alterations of the fibrous proteins have been confirmed in the rabbit vocal fold scar, interstitial proteins, which are known to be important in wound repair, have not been investigated to date. Using a rabbit model, interstitial proteins decorin, fibromodulin, and fibronectin were examined immunohistologically, two months postinduction of vocal fold scar by means of forcep biopsy. Significantly decreased decorin and fibromodulin with significantly increased fibronectin characterized scarred vocal fold tissue. The implications of altered interstitial proteins levels and their affect on the fibrous proteins will be discussed in relation to increased vocal fold stiffness and viscosity, which characterizes vocal fold scar.
The forgotten cause of stridor in the emergency department.
Ng, Tian-Tee
2017-01-01
Paradoxical Vocal Fold Movement Disorder is where the larynx exhibits paradoxical vocal cords closure during respiration, creating partial airway obstruction. Causes of vocal fold movement disorder are multifactorial, and patients describe tightness of throat, difficulty getting air in, have stridor, and do not respond to inhalers. We propose using transnasal laryngoscopy examination, which will show narrowing of vocal cords on inspiration, and The Pittsburgh Vocal Cord Dysfunction Index with a cutoff score of ≥4 to distinguish vocal fold movement disorder from asthma and other causes of stridor. Management of paradoxical vocal fold movement disorder involves a combination of pharmacological, psychological, psychiatric, and speech training. Paradoxical vocal fold movement disorder is a very treatable cause of stridor, so long as it is identified and other organic causes are excluded.
Evolutionary Origins for Social Vocalization in a Vertebrate Hindbrain–Spinal Compartment
Bass, Andrew H.; Gilland, Edwin H.; Baker, Robert
2008-01-01
The macroevolutionary events leading to neural innovations for social communication, such as vocalization, are essentially unexplored. Many fish vocalize during female courtship and territorial defense, as do amphibians, birds, and mammals. Here, we map the neural circuitry for vocalization in larval fish and show that the vocal network develops in a segment-like region across the most caudal hindbrain and rostral spinal cord. Taxonomic analysis demonstrates a highly conserved pattern between fish and all major lineages of vocal tetrapods. We propose that the vocal basis for acoustic communication among vertebrates evolved from an ancestrally shared developmental compartment already present in the early fishes. PMID:18635807
Non-song vocalizations of pygmy blue whales in Geographe Bay, Western Australia.
Recalde-Salas, A; Salgado Kent, C P; Parsons, M J G; Marley, S A; McCauley, R D
2014-05-01
Non-song vocalizations of migrating pygmy blue whales (Balaenoptera musculus brevicauda) in Western Australia are described. Simultaneous land-based visual observations and underwater acoustic recordings detected 27 groups in Geographe Bay, WA over 2011 to 2012. Six different vocalizations were recorded that were not repeated in a pattern or in association with song, and thus were identified as non-song vocalizations. Five of these were not previously described for this population. Their acoustic characteristics and context are presented. Given that 56% of groups vocalized, 86% of which produced non-song vocalizations and 14% song units, the inclusion of non-song vocalizations in passive-acoustic monitoring is proposed.
Kaneko, Mami; Kishimoto, Yo; Suzuki, Ryo; Kawai, Yoshitaka; Tateya, Ichiro; Hirano, Shigeru
2017-05-01
Professional voice users, such as singers and teachers, are at greater risk of developing vocal fold injury from excessive use of voice; thus, protection of the vocal fold is essential. One of the most important factors that aggravates injury is the production of reactive oxygen species at the wound site. The purpose of the current study was to assess the effect of astaxanthin, a strong antioxidant, on the protection of the vocal fold from injury and inflammation due to vocal loading. This study is an institutional review board-approved human clinical trial. Ten male subjects underwent a 60-minute vocal loading session and received vocal assessments prior to, immediately after, and 30 minutes postvocal loading (AST(-) status). All subjects were then prescribed 24 mg/day of astaxanthin for 28 days, after which they received the same vocal task and assessments (AST(+) status). Phonatory parameters were compared between both groups. Aerodynamic assessment, acoustic analysis, and GRBAS scale (grade, roughness, breathiness, asthenia, and strain) were significantly worse in the AST(-) status immediately after vocal loading, but improved by 30 minutes after loading. In contrast, none of the phonatory parameters in the AST(+) status were statistically worse, even when measured immediately after vocal loading. No allergic responses or adverse effects were observed after administration of astaxanthin. The current results suggest that astaxanthin can protect the vocal fold from injury and inflammation caused by vocal loading possibly through the regulation of oxidative stress. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Ghassemi, Marzyeh; Van Stan, Jarrad H; Mehta, Daryush D; Zañartu, Matías; Cheyne, Harold A; Hillman, Robert E; Guttag, John V
2014-06-01
Voice disorders are medical conditions that often result from vocal abuse/misuse which is referred to generically as vocal hyperfunction. Standard voice assessment approaches cannot accurately determine the actual nature, prevalence, and pathological impact of hyperfunctional vocal behaviors because such behaviors can vary greatly across the course of an individual's typical day and may not be clearly demonstrated during a brief clinical encounter. Thus, it would be clinically valuable to develop noninvasive ambulatory measures that can reliably differentiate vocal hyperfunction from normal patterns of vocal behavior. As an initial step toward this goal we used an accelerometer taped to the neck surface to provide a continuous, noninvasive acceleration signal designed to capture some aspects of vocal behavior related to vocal cord nodules, a common manifestation of vocal hyperfunction. We gathered data from 12 female adult patients diagnosed with vocal fold nodules and 12 control speakers matched for age and occupation. We derived features from weeklong neck-surface acceleration recordings by using distributions of sound pressure level and fundamental frequency over 5-min windows of the acceleration signal and normalized these features so that intersubject comparisons were meaningful. We then used supervised machine learning to show that the two groups exhibit distinct vocal behaviors that can be detected using the acceleration signal. We were able to correctly classify 22 of the 24 subjects, suggesting that in the future measures of the acceleration signal could be used to detect patients with the types of aberrant vocal behaviors that are associated with hyperfunctional voice disorders.
ERIC Educational Resources Information Center
Kopecková, Romana; Marecka, Marta; Wrembel, Magdalena; Gut, Ulrike
2016-01-01
This study examines the interactions between three vocalic subsystems of multilingual speakers and explores the role language status might play in explaining variability across them. Eight 14-year-olds raised in Germany, who had learnt English at school for 6 years and Polish for 1 year, participated in the study. They were divided into three…
ERIC Educational Resources Information Center
Shport, Irina A.
2018-01-01
Purpose: The goal of this study was to test whether fronting and lengthening of lax vowels influence the perception of femininity in listeners whose dialect is characterized as already having relatively fronted and long lax vowels in male and female speech. Method: Sixteen English words containing the /? ? ? ?/ vowels were produced by a male…
Overcoming the Effects of Variation in Infant Speech Segmentation: Influences of Word Familiarity
ERIC Educational Resources Information Center
Singh, Leher; Nestor, Sarah S.; Bortfeld, Heather
2008-01-01
Previous studies have shown that 7.5-month-olds can track and encode words in fluent speech, but they fail to equate instances of a word that contrast in talker gender, vocal affect, and fundamental frequency. By 10.5 months, they succeed at generalizing across such variability, marking a clear transition period during which infants' word…
Impact of call center work in subjective voice symptoms and complaints--an analytic study.
Rechenberg, Leila; Goulart, Bárbara Niegia Garcia de; Roithmann, Renato
2011-12-01
To estimate the prevalence of vocal symptoms, occupational risk factors, associated symptoms and their impact on the professional activity of the telemarketers. Cross-section analytical study with 124 telemarketers and 109 administrative workers (control group) selected from a random sample stratified by gender. The subjects answered an anonymous self-administered questionnaire involving issues related to the presence of vocal symptoms, potential risk factors for dysphonia, and vocal impact of symptoms in professional activity. The presence of one or more voice symptoms that occurred daily or weekly was considered positive for the presence of vocal symptoms. The prevalence of vocal symptoms was found in 33% of telemarketers and in 21% of the control group, indicating an association between vocal symptoms and the activity of the telemarketer. When adjusted for confounders, this association remained in the sense of risk. In telemarketers, the sensation of dry air, ambient noise, and lack of vocal rest were the most frequently reported complaints reported by those presenting vocal symptoms. Almost 70% of telemarketers with vocal symptoms reported that these symptoms interfere with their professional activity. The rate of absenteeism by vocal symptoms in this group was 29%. Vocal symptoms are common in most telemarketers when compared to their peer controls, and significantly affect their job performance.
Wingstrand, Vibe Lindeblad; Jensen, David H.; Bork, Kristian; Sebbesen, Lars; Balle, Jesper; Fischer-Nielsen, Anne; von Buchwald, Christian
2016-01-01
Objectives Therapy with mesenchymal stem cells exhibits potential for the development of novel interventions for many diseases and injuries. The use of mesenchymal stem cells in regenerative therapy for vocal fold scarring exhibited promising results to reduce stiffness and enhance the biomechanical properties of injured vocal folds. This study evaluated the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring. Data Sources PubMed, Embase, the Cochrane Library and Google Scholar were searched. Methods Controlled studies that assessed the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring were included. Primary outcomes were viscoelastic properties and mucosal wave amplitude. Results Seven preclinical animal studies (n = 152 single vocal folds) were eligible for inclusion. Evaluation of viscoelastic parameters revealed a decreased dynamic viscosity (η’) and elastic modulus (G’), i.e., decreased resistance and stiffness, in scarred vocal folds treated with mesenchymal stem cells compared to non-treated scarred vocal folds. Mucosal wave amplitude was increased in scarred vocal folds treated with mesenchymal stem cells vs. non-treated scarred vocal folds. Conclusion The results from these studies suggest an increased regenerative effect of therapy with mesenchymal stem cells for scarred vocal folds and are encouraging for further clinical studies. PMID:27631373
Wingstrand, Vibe Lindeblad; Grønhøj Larsen, Christian; Jensen, David H; Bork, Kristian; Sebbesen, Lars; Balle, Jesper; Fischer-Nielsen, Anne; von Buchwald, Christian
2016-01-01
Therapy with mesenchymal stem cells exhibits potential for the development of novel interventions for many diseases and injuries. The use of mesenchymal stem cells in regenerative therapy for vocal fold scarring exhibited promising results to reduce stiffness and enhance the biomechanical properties of injured vocal folds. This study evaluated the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring. PubMed, Embase, the Cochrane Library and Google Scholar were searched. Controlled studies that assessed the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring were included. Primary outcomes were viscoelastic properties and mucosal wave amplitude. Seven preclinical animal studies (n = 152 single vocal folds) were eligible for inclusion. Evaluation of viscoelastic parameters revealed a decreased dynamic viscosity (η') and elastic modulus (G'), i.e., decreased resistance and stiffness, in scarred vocal folds treated with mesenchymal stem cells compared to non-treated scarred vocal folds. Mucosal wave amplitude was increased in scarred vocal folds treated with mesenchymal stem cells vs. non-treated scarred vocal folds. The results from these studies suggest an increased regenerative effect of therapy with mesenchymal stem cells for scarred vocal folds and are encouraging for further clinical studies.
Modulation of voice related to tremor and vibrato
NASA Astrophysics Data System (ADS)
Lester, Rosemary Anne
Modulation of voice is a result of physiologic oscillation within one or more components of the vocal system including the breathing apparatus (i.e., pressure supply), the larynx (i.e. sound source), and the vocal tract (i.e., sound filter). These oscillations may be caused by pathological tremor associated with neurological disorders like essential tremor or by volitional production of vibrato in singers. Because the acoustical characteristics of voice modulation specific to each component of the vocal system and the effect of these characteristics on perception are not well-understood, it is difficult to assess individuals with vocal tremor and to determine the most effective interventions for reducing the perceptual severity of the disorder. The purpose of the present studies was to determine how the acoustical characteristics associated with laryngeal-based vocal tremor affect the perception of the magnitude of voice modulation, and to determine if adjustments could be made to the voice source and vocal tract filter to alter the acoustic output and reduce the perception of modulation. This research was carried out using both a computational model of speech production and trained singers producing vibrato to simulate laryngeal-based vocal tremor with different voice source characteristics (i.e., vocal fold length and degree of vocal fold adduction) and different vocal tract filter characteristics (i.e., vowel shapes). It was expected that, by making adjustments to the voice source and vocal tract filter that reduce the amplitude of the higher harmonics, the perception of magnitude of voice modulation would be reduced. The results of this study revealed that listeners' perception of the magnitude of modulation of voice was affected by the degree of vocal fold adduction and the vocal tract shape with the computational model, but only by the vocal quality (corresponding to the degree of vocal fold adduction) with the female singer. Based on regression analyses, listeners' judgments were predicted by modulation information in both low and high frequency bands. The findings from these studies indicate that production of a breathy vocal quality might be a useful compensatory strategy for reducing the perceptual severity of modulation of voice for individuals with tremor affecting the larynx.
Psychosocial Intervention for Young Children With Chronic Tics
2018-06-18
Tourette's Syndrome; Tourette's Disorder; Tourette's Disease; Tourette Disorder; Tourette Disease; Tic Disorder, Combined Vocal and Multiple Motor; Multiple Motor and Vocal Tic Disorder, Combined; Gilles de La Tourette's Disease; Gilles de la Tourette Syndrome; Gilles De La Tourette's Syndrome; Combined Vocal and Multiple Motor Tic Disorder; Combined Multiple Motor and Vocal Tic Disorder; Chronic Motor and Vocal Tic Disorder
A Rat Excised Larynx Model of Vocal Fold Scar
ERIC Educational Resources Information Center
Welham, Nathan V.; Montequin, Douglas W.; Tateya, Ichiro; Tateya, Tomoko; Choi, Seong Hee; Bless, Diane M.
2009-01-01
Purpose: To develop and evaluate a rat excised larynx model for the measurement of acoustic, aerodynamic, and vocal fold vibratory changes resulting from vocal fold scar. Method: Twenty-four 4-month-old male Sprague-Dawley rats were assigned to 1 of 4 experimental groups: chronic vocal fold scar, chronic vocal fold scar treated with 100-ng basic…
LANGUAGE DEVELOPMENT. The developmental dynamics of marmoset monkey vocal production.
Takahashi, D Y; Fenley, A R; Teramoto, Y; Narayanan, D Z; Borjon, J I; Holmes, P; Ghazanfar, A A
2015-08-14
Human vocal development occurs through two parallel interactive processes that transform infant cries into more mature vocalizations, such as cooing sounds and babbling. First, natural categories of sounds change as the vocal apparatus matures. Second, parental vocal feedback sensitizes infants to certain features of those sounds, and the sounds are modified accordingly. Paradoxically, our closest living ancestors, nonhuman primates, are thought to undergo few or no production-related acoustic changes during development, and any such changes are thought to be impervious to social feedback. Using early and dense sampling, quantitative tracking of acoustic changes, and biomechanical modeling, we showed that vocalizations in infant marmoset monkeys undergo dramatic changes that cannot be solely attributed to simple consequences of growth. Using parental interaction experiments, we found that contingent parental feedback influences the rate of vocal development. These findings overturn decades-old ideas about primate vocalizations and show that marmoset monkeys are a compelling model system for early vocal development in humans. Copyright © 2015, American Association for the Advancement of Science.
Carroll, Thomas; Nix, John; Hunter, Eric; Titze, Ingo; Abaza, Mona
2016-01-01
Objectives To evaluate vocal fatigue by using objective and subjective measurements of dose recorded by the National Center for Voice and Speech (NCVS) Dosimeter™ (Dosimeter). Study Design and Setting Seven subjects completed a two-week study period. The Dosimeter recorded vocal load, soft phonation tasks and subjective soft voice ratings. Three vocal doses (time, distance, and cycle) were measured in classical singers' larynges during an intensive practice period. Results Spikes in vocal load are reflected as harsher subjective ratings on the same day as well as 24–72 hours later. When at least 48 hours of vocal rest occurred before a vocal load, improved subjective evaluations were seen after the load. Conclusions The NCVS Dosimeter appears to be an effective tool for data collection on prolonged use of the voice. Significance This is the first multi-day study comparing objective and subjective data on vocal fatigue in a group of professional singers. PMID:17011424
Goldstein, Michael H.; Schwade, Jennifer A.; Bornstein, Marc H.
2014-01-01
The early noncry vocalizations of infants are salient social signals. Caregivers spontaneously respond to 30-50% of these sounds, and their responsiveness to infants' prelinguistic noncry vocalizations facilitates the development of phonology and speech. Have infants learned that their vocalizations influence the behavior of social partners? If infants have learned the contingency between their vocalizing and the social responses of others, they should show an extinction burst when the contingency is removed, increasing their rate of noncry vocalizing then decreasing. Thirty-eight 5-month-olds were tested in the still-face paradigm, during which they engaged in a 2-min still-face interaction with an unfamiliar adult. When the adult assumed a still face, infants showed an extinction burst. This pattern of infant vocalizations suggests that 5-month-olds have learned the social efficacy of their vocalizations on caregivers' behavior. Furthermore, the magnitude of 5-month infants' extinction bursts predicted their language comprehension at 13 months. PMID:19489893
Tchernichovski, Ofer; Marcus, Gary
2014-01-01
Studies of vocal learning in songbirds typically focus on the acquisition of sensory templates for song imitation and on the consequent process of matching song production to templates. However, functional vocal development also requires the capacity to adaptively diverge from sensory templates, and to flexibly assemble vocal units. Examples of adaptive divergence include the corrective imitation of abnormal songs, and the decreased tendency to copy overabundant syllables. Such frequency-dependent effects might mirror tradeoffs between the assimilation of group identity (culture) while establishing individual and flexibly expressive songs. Intriguingly, although the requirements for vocal plasticity vary across songbirds, and more so between birdsong and language, the capacity to flexibly assemble vocal sounds develops in a similar, stepwise manner across species. Therefore, universal features of vocal learning go well beyond the capacity to imitate. PMID:25005823
Nonlinear dynamic mechanism of vocal tremor from voice analysis and model simulations
NASA Astrophysics Data System (ADS)
Zhang, Yu; Jiang, Jack J.
2008-09-01
Nonlinear dynamic analysis and model simulations are used to study the nonlinear dynamic characteristics of vocal folds with vocal tremor, which can typically be characterized by low-frequency modulation and aperiodicity. Tremor voices from patients with disorders such as paresis, Parkinson's disease, hyperfunction, and adductor spasmodic dysphonia show low-dimensional characteristics, differing from random noise. Correlation dimension analysis statistically distinguishes tremor voices from normal voices. Furthermore, a nonlinear tremor model is proposed to study the vibrations of the vocal folds with vocal tremor. Fractal dimensions and positive Lyapunov exponents demonstrate the evidence of chaos in the tremor model, where amplitude and frequency play important roles in governing vocal fold dynamics. Nonlinear dynamic voice analysis and vocal fold modeling may provide a useful set of tools for understanding the dynamic mechanism of vocal tremor in patients with laryngeal diseases.
Elie, Julie E.; Theunissen, Frédéric E.
2018-01-01
Although a universal code for the acoustic features of animal vocal communication calls may not exist, the thorough analysis of the distinctive acoustical features of vocalization categories is important not only to decipher the acoustical code for a specific species but also to understand the evolution of communication signals and the mechanisms used to produce and understand them. Here, we recorded more than 8,000 examples of almost all the vocalizations of the domesticated zebra finch, Taeniopygia guttata: vocalizations produced to establish contact, to form and maintain pair bonds, to sound an alarm, to communicate distress or to advertise hunger or aggressive intents. We characterized each vocalization type using complete representations that avoided any a priori assumptions on the acoustic code, as well as classical bioacoustics measures that could provide more intuitive interpretations. We then used these acoustical features to rigorously determine the potential information-bearing acoustical features for each vocalization type using both a novel regularized classifier and an unsupervised clustering algorithm. Vocalization categories are discriminated by the shape of their frequency spectrum and by their pitch saliency (noisy to tonal vocalizations) but not particularly by their fundamental frequency. Notably, the spectral shape of zebra finch vocalizations contains peaks or formants that vary systematically across categories and that would be generated by active control of both the vocal organ (source) and the upper vocal tract (filter). PMID:26581377
Elias-Costa, Agustin J; Montesinos, Rachel; Grant, Taran; Faivovich, Julián
2017-11-01
Anuran vocal sacs are elastic chambers that recycle exhaled air during vocalizations and are present in males of most species of frogs. Most knowledge of the diversity of vocal sacs relates to external morphology; detailed information on internal anatomy is available for few groups of frogs. Frogs of the family Hylodidae, which is endemic to the Atlantic Forest of Brazil and adjacent Argentina and Paraguay, have three patterns of vocal sac morphology-that is, single, subgular; paired, lateral; and absent. The submandibular musculature and structure of the vocal sac mucosa (the internal wall of the vocal sac) of exemplar species of this family and relatives were studied. In contrast to previous accounts, we found that all species of Crossodactylus and Hylodes possess paired, lateral vocal sacs, with the internal mucosa of each sac being separate from the contralateral one. Unlike all other frogs for which data are available, the mucosa of the vocal sacs in these genera is not supported externally by the mm. intermandibularis and interhyoideus. Rather, the vocal sac mucosa projects through the musculature and is free in the submandibular lymphatic sac. The presence of paired, lateral vocal sacs, the internal separation of the sac mucosae, and their projection through the m. interhyoideus are synapomorphies of the family. Furthermore, the specific configuration of the m. interhyoideus allows asymmetric inflation of paired vocal sacs, a feature only reported in species of these diurnal, stream-dwelling frogs. © 2017 Wiley Periodicals, Inc.
Oral breathing challenge in participants with vocal attrition.
Sivasankar, Mahalakshmi; Fisher, Kimberly V
2003-12-01
Vocal folds undergo osmotic challenge by mouth breathing during singing, exercising, and loud speaking. Just 15 min of obligatory oral breathing, to dry the vocal folds, increases phonation threshold pressure (Pth) and expiratory vocal effort in healthy speakers (M. Sivasankar & K. Fisher, 2002). We questioned whether oral breathing is more detrimental to phonation in healthy participants with a history of temporary vocal attrition. The effects of a 15-min oral or nasal breathing challenge on Pth and perceived expiratory vocal effort were compared for participants reporting symptoms of vocal attrition (N = 18, ages 19-38 years) and normal controls (N = 20, ages 19-33 years). Post-challenge-prechallenge differences in Pth (deltaPth) and effort (deltaEffort) revealed that oral breathing, but not nasal breathing, increased Pth (p < .001 ) and effort (p < .001) at low, comfortable, and high pitch. deltaPth was significantly greater in participants with vocal attrition than in normal controls (p < .001). Nasal breathing reduced Pth for all controls but not for all participants reporting vocal attrition. deltaPth was significantly and linearly correlated with deltaEffort (rvocal attrition = .81, p < .001; rcontrol = .84, p < .001). We speculate that the greater increases in Pth in participants reporting vocal attrition may result from delayed or inadequate compensatory response to superficial laryngeal dehydration. Obligatory oral breathing may place voice users at risk for exacerbating vocal attrition. That sol layer depletion by obligatory oral breathing increased Pth and vocal effort provides support for the role of superficial hydration in maintaining ease of phonation.
Vocal coordination and vocal imitation: a role for mirror neurons?
Newman, John D
2014-04-01
Some birds and mammals have vocal communication systems in which coordination between individuals is important. Examples would include duetting or antiphonal calling in some birds and mammals, rapid exchanges of the same vocalization, and vocal exchanges between paired individuals and other nearby pairs. Mirror neurons may play a role in such systems but become functional only after experience.
Ohta, Noriyuki; Mori, Takahiko
2007-11-01
Vocal cord paralysis is one of the frequently encountered complications after aortic surgery. However, reports of vocal cord paralysis after aortic surgery have been limited. In a retrospective cohort study of vocal cord paralysis after aortic surgery at a general hospital, we sought factors related to its development after aortic surgery to the descending thoracic aorta via left posterolateral thoracotomy. We reviewed data for a total of 69 patients who, between 1989 and 1995, underwent aortic surgery to the descending thoracic aorta. We assessed factors associated with the development of vocal cord paralysis and postoperative complications. Postoperative vocal cord paralysis appeared in 19 patients. Multiple logistic regression analysis revealed two risk factors for vocal cord paralysis: chronic dilatation of the aorta at the left subclavian artery (odds ratio = 8.67) and anastomosis proximal to the left subclavian artery (odds ratio = 17.7). The duration of mechanical ventilation was significantly prolonged for patients with vocal cord paralysis. Certain surgical factors associated with left subclavian artery increase the risk of vocal cord paralysis after surgery on the descending thoracic aorta. Vocal cord paralysis after aortic surgery did not increase aspiration pneumonia but was associated with pulmonary complications.
Tyack, Peter L
2008-08-01
The classic evidence for vocal production learning involves imitation of novel, often anthropogenic sounds. Among mammals, this has been reported for dolphins, elephants, harbor seals, and humans. A broader taxonomic distribution has been reported for vocal convergence, where the acoustic properties of calls from different individuals converge when they are housed together in captivity or form social bonds in the wild. Vocal convergence has been demonstrated for animals as diverse as songbirds, parakeets, hummingbirds, bats, elephants, cetaceans, and primates. For most species, call convergence is thought to reflect a group-distinctive identifier, with shared calls reflecting and strengthening social bonds. A ubiquitous function for vocal production learning that is starting to receive attention involves modifying signals to improve communication in a noisy channel. Pooling data on vocal imitation, vocal convergence, and compensation for noise suggests a wider taxonomic distribution of vocal production learning among mammals than has been generally appreciated. The wide taxonomic distribution of this evidence for vocal production learning suggests that perhaps more of the neural underpinnings for vocal production learning are in place in mammals than is usually recognized. (c) 2008 APA, all rights reserved
Assessing and treating vocal stereotypy in children with autism.
Ahearn, William H; Clark, Kathy M; MacDonald, Rebecca P F; Chung, Bo In
2007-01-01
Previous research implies that stereotypic behavior tends to be maintained by the sensory consequences produced by engaging in the response. Few investigations, however, have focused on vocal stereotypy. The current study examined the noncommunicative vocalizations of 4 children with an autism spectrum disorder. First, functional analyses were conducted in an attempt to identify the function of each child's behavior. For each of the participants, it was found that vocal stereotypy was likely not maintained by the social consequences. Following assessment, response interruption and redirection (RIRD) was implemented in an ABAB design to determine whether vocal stereotypy could be successfully redirected. RIRD involved a teacher issuing a series of vocal demands the child readily complied with during regular academic programming. Vocal demands were presented contingent on the occurrence of vocal stereotypy and were continuously presented until the child complied with three consecutively issued demands without emitting vocal stereotypy. For each child, RIRD produced levels of vocal stereotypy substantially lower than those observed in baseline. For 3 of the children, an increase in appropriate communication was also observed. The children's teachers were trained to implement RIRD. Brief follow-up probes and anecdotal information implied that the treatment had a positive impact in the natural environment.
Assessing and Treating Vocal Stereotypy in children with Autism
Ahearn, William H; Clark, Kathy M; MacDonald, Rebecca P.F; In Chung, Bo
2007-01-01
Previous research implies that stereotypic behavior tends to be maintained by the sensory consequences produced by engaging in the response. Few investigations, however, have focused on vocal stereotypy. The current study examined the noncommunicative vocalizations of 4 children with an autism spectrum disorder. First, functional analyses were conducted in an attempt to identify the function of each child's behavior. For each of the participants, it was found that vocal stereotypy was likely not maintained by the social consequences. Following assessment, response interruption and redirection (RIRD) was implemented in an ABAB design to determine whether vocal stereotypy could be successfully redirected. RIRD involved a teacher issuing a series of vocal demands the child readily complied with during regular academic programming. Vocal demands were presented contingent on the occurrence of vocal stereotypy and were continuously presented until the child complied with three consecutively issued demands without emitting vocal stereotypy. For each child, RIRD produced levels of vocal stereotypy substantially lower than those observed in baseline. For 3 of the children, an increase in appropriate communication was also observed. The children's teachers were trained to implement RIRD. Brief follow-up probes and anecdotal information implied that the treatment had a positive impact in the natural environment. PMID:17624067
Comprehensive Outcome Researches of Intralesional Steroid Injection on Benign Vocal Fold Lesions.
Wang, Chi-Te; Lai, Mei-Shu; Hsiao, Tzu-Yu
2015-09-01
This study investigated multidimensional treatment outcomes, including prognostic factors and side effects of vocal fold steroid injection (VFSI). We recruited 126 consecutive patients, including patients with 49 nodules, 47 polyps, and 30 mucus retention cysts. All the patients received VFSI under local anesthesia in the office settings. Treatment outcomes were evaluated 1 and 2 months after the procedure, including endoscopic evaluation, perceptual voice quality (GRB scores), acoustic analysis, and 10-item Voice Handicap Index (VHI-10). More than 80% of the patients reported subjective improvements after VFSI. Objective measurements revealed significant improvements from baseline in most of the outcome parameters (P<0.05). Higher occupational vocal demands and fibrotic vocal nodules were significantly associated with poorer clinical responses as measured by the VHI-10 and GRB scores, respectively. For vocal polyps, dysphonia for more than 12 months were significantly associated with higher postoperative VHI-10 scores, whereas patients with laryngopharyngeal reflux (LPR) showed significantly poor postoperative voice quality as measured by GRB scores. Side effects after VFSI included hematoma (27%), triamcinolone deposits (4%), and vocal atrophy (1%), which resolved spontaneously within 1-2 months. Presentation with vocal fold ectasias/varicosities and higher vocal demands were significantly correlated with postoperative vocal hematoma. This study demonstrated significant improvements after VFSI in vocal nodules, polyps, and cysts. Occupational vocal demand and subtypes of vocal nodules are closely related to the treatment outcomes after VFSI, whereas symptom duration and LPR were significant prognostic factors for VFSI treatment outcomes in vocal polyps. Side effects after receiving VFSI were mostly self-limited without sequel, whereas the incidence rates might be varied by the injection approach and the timing for postoperative follow-up. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Riede, Tobias; Li, Zhiheng; Tokuda, Isao T; Farmer, Colleen G
2015-04-01
Sauropsid vocalization is mediated by the syrinx in birds and the larynx in extant reptiles; but whereas avian vocal production has received much attention, the vocal mechanism of basal reptilians is poorly understood. The American alligator (Alligator mississippiensis) displays a large vocal repertoire during mating and in parent-offspring interactions. Although vocal outputs of these behaviors have received some attention, the underlying mechanism of sound production remains speculative. Here, we investigate the laryngeal anatomy of juvenile and adult animals by macroscopic and histological methods. Observations of the cartilaginous framework and associated muscles largely corroborate earlier findings, but one muscle, the cricoarytenoideus, exhibits a heretofore unknown extrinsic insertion that has important implications for effective regulation of vocal fold length and tension. Histological investigation of the larynx revealed a layered vocal fold morphology. The thick lamina propria consists of non-homogenous extracellular matrix containing collagen fibers that are tightly packed below the epithelium but loosely organized deep inside the vocal fold. We found few elastic fibers but comparatively high proportions of hyaluronan. Similar organizational complexity is also seen in mammalian vocal folds and the labia of the avian syrinx: convergent morphologies that suggest analogous mechanisms for sound production. In tensile tests, alligator vocal folds demonstrated a linear stress-strain behavior in the low strain region and nonlinear stress responses at strains larger than 15%, which is similar to mammalian vocal fold tissue. We have integrated morphological and physiological data in a two-mass vocal fold model, providing a systematic description of the possible acoustic space that could be available to an alligator larynx. Mapping actual call production onto possible acoustic space validates the model's predictions. © 2015. Published by The Company of Biologists Ltd.
Precise Motor Control Enables Rapid Flexibility in Vocal Behavior of Marmoset Monkeys.
Pomberger, Thomas; Risueno-Segovia, Cristina; Löschner, Julia; Hage, Steffen R
2018-03-05
Investigating the evolution of human speech is difficult and controversial because human speech surpasses nonhuman primate vocal communication in scope and flexibility [1-3]. Monkey vocalizations have been assumed to be largely innate, highly affective, and stereotyped for over 50 years [4, 5]. Recently, this perception has dramatically changed. Current studies have revealed distinct learning mechanisms during vocal development [6-8] and vocal flexibility, allowing monkeys to cognitively control when [9, 10], where [11], and what to vocalize [10, 12, 13]. However, specific call features (e.g., duration, frequency) remain surprisingly robust and stable in adult monkeys, resulting in rather stereotyped and discrete call patterns [14]. Additionally, monkeys seem to be unable to modulate their acoustic call structure under reinforced conditions beyond natural constraints [15, 16]. Behavioral experiments have shown that monkeys can stop sequences of calls immediately after acoustic perturbation but cannot interrupt ongoing vocalizations, suggesting that calls consist of single impartible pulses [17, 18]. Using acoustic perturbation triggered by the vocal behavior itself and quantitative measures of resulting vocal adjustments, we show that marmoset monkeys are capable of producing calls with durations beyond the natural boundaries of their repertoire by interrupting ongoing vocalizations rapidly after perturbation onset. Our results indicate that marmosets are capable of interrupting vocalizations only at periodic time points throughout calls, further supported by the occurrence of periodically segmented phees. These ideas overturn decades-old concepts on primate vocal pattern generation, indicating that vocalizations do not consist of one discrete call pattern but are built of many sequentially uttered units, like human speech. Copyright © 2018 The Author(s). Published by Elsevier Ltd.. All rights reserved.
Horita, Haruhito; Kobayashi, Masahiko; Liu, Wan-chun; Oka, Kotaro; Jarvis, Erich D.; Wada, Kazuhiro
2012-01-01
Mechanisms for the evolution of convergent behavioral traits are largely unknown. Vocal learning is one such trait that evolved multiple times and is necessary in humans for the acquisition of spoken language. Among birds, vocal learning is evolved in songbirds, parrots, and hummingbirds. Each time similar forebrain song nuclei specialized for vocal learning and production have evolved. This finding led to the hypothesis that the behavioral and neuroanatomical convergences for vocal learning could be associated with molecular convergence. We previously found that the neural activity-induced gene dual specificity phosphatase 1 (dusp1) was up-regulated in non-vocal circuits, specifically in sensory-input neurons of the thalamus and telencephalon; however, dusp1 was not up-regulated in higher order sensory neurons or motor circuits. Here we show that song motor nuclei are an exception to this pattern. The song nuclei of species from all known vocal learning avian lineages showed motor-driven up-regulation of dusp1 expression induced by singing. There was no detectable motor-driven dusp1 expression throughout the rest of the forebrain after non-vocal motor performance. This pattern contrasts with expression of the commonly studied activity-induced gene egr1, which shows motor-driven expression in song nuclei induced by singing, but also motor-driven expression in adjacent brain regions after non-vocal motor behaviors. In the vocal non-learning avian species, we found no detectable vocalizing-driven dusp1 expression in the forebrain. These findings suggest that independent evolutions of neural systems for vocal learning were accompanied by selection for specialized motor-driven expression of the dusp1 gene in those circuits. This specialized expression of dusp1 could potentially lead to differential regulation of dusp1-modulated molecular cascades in vocal learning circuits. PMID:22876306
The Distribution and Severity of Tremor in Speech Structures of Persons with Vocal Tremor.
Hemmerich, Abby L; Finnegan, Eileen M; Hoffman, Henry T
2017-05-01
Vocal tremor may be associated with cyclic oscillations in the pulmonary, laryngeal, velopharyngeal, or oral regions. This study aimed to correlate the overall severity of vocal tremor with the distribution and severity of tremor in structures involved. Endoscopic and clinical examinations were completed on 20 adults with vocal tremor and two age-matched controls during sustained phonation. Two judges rated the severity of vocal tremor and the severity of tremor affecting each of 13 structures. Participants with mild vocal tremor typically presented with tremor in three laryngeal structures, moderate vocal tremor in five structures (laryngeal and another region), and severe vocal tremor in eight structures affecting all regions. The severity of tremor was lowest (mean = 1.2 out of 3) in persons with mild vocal tremor and greater in persons with moderate (mean = 1.5) and severe vocal tremor (mean = 1.4). Laryngeal structures were most frequently (95%) and severely (1.7 out of 3) affected, followed by velopharynx (40% occurrence, 1.3 severity), pulmonary (40% occurrence, 1.1 severity), and oral (40% occurrence, 1.0 severity) regions. Regression analyses indicated tremor severity of the supraglottic structures, and vertical laryngeal movement contributed most to vocal tremor severity during sustained phonation (r = 0.77, F = 16.17, P < 0.0001). A strong positive correlation (r = 0.72) was found between the Tremor Index and the severity of the vocal tremor during sustained phonation. It is useful to obtain a wide endoscopic view of the larynx to visualize tremor, which is rarely isolated to the true vocal folds alone. Published by Elsevier Inc.
Current Understanding and Future Directions for Vocal Fold Mechanobiology
Li, Nicole Y.K.; Heris, Hossein K.; Mongeau, Luc
2013-01-01
The vocal folds, which are located in the larynx, are the main organ of voice production for human communication. The vocal folds are under continuous biomechanical stress similar to other mechanically active organs, such as the heart, lungs, tendons and muscles. During speech and singing, the vocal folds oscillate at frequencies ranging from 20 Hz to 3 kHz with amplitudes of a few millimeters. The biomechanical stress associated with accumulated phonation is believed to alter vocal fold cell activity and tissue structure in many ways. Excessive phonatory stress can damage tissue structure and induce a cell-mediated inflammatory response, resulting in a pathological vocal fold lesion. On the other hand, phonatory stress is one major factor in the maturation of the vocal folds into a specialized tri-layer structure. One specific form of vocal fold oscillation, which involves low impact and large amplitude excursion, is prescribed therapeutically for patients with mild vocal fold injuries. Although biomechanical forces affect vocal fold physiology and pathology, there is little understanding of how mechanical forces regulate these processes at the cellular and molecular level. Research into vocal fold mechanobiology has burgeoned over the past several years. Vocal fold bioreactors are being developed in several laboratories to provide a biomimic environment that allows the systematic manipulation of physical and biological factors on the cells of interest in vitro. Computer models have been used to simulate the integrated response of cells and proteins as a function of phonation stress. The purpose of this paper is to review current research on the mechanobiology of the vocal folds as it relates to growth, pathogenesis and treatment as well as to propose specific research directions that will advance our understanding of this subject. PMID:24812638
Vocal cysts: clinical, endoscopic, and surgical aspects.
Martins, Regina Helena Garcia; Santana, Marcela Ferreira; Tavares, Elaine Lara Mendes
2011-01-01
Vocal cysts are benign laryngeal lesions, which affect children and adults. They can be classified as epidermic or mucous-retention cyst. The objective was to study the clinical, endoscopic, and surgical aspects of vocal cysts. We reviewed the medical charts of 72 patients with vocal cysts, considering age, gender, occupation, time of vocal symptoms, nasosinusal and gastroesophageal symptoms, vocal abuse, tabagism, alcoholism, associated lesions, treatment, and histological details. Of the 72 cases, 46 were adults (36 females and 10 male) and 26 were children (eight girls and 18 boys). As far as occupation is concerned, there was a higher incidence of students and teachers. All the patients had symptoms of chronic hoarseness. Nasosinusal (27.77%) and gastroesophageal (32%) symptoms were not relevant. Vocal abuse was reported by 45.83%, smoking by 18%, and alcoholism by 8.4% of the patients. Unilateral cysts were seen in 93% of the cases, 22 patients had associated lesions, such as bridge, sulcus vocalis, and microweb. Surgical treatment was performed in 46 cases. Histological analysis of the epidermic cysts revealed a cavity with caseous content, covered by stratified squamous epithelium, often keratinized. Mucous cysts presented mucous content, and the walls were coated by a cylindrical ciliated epithelium. Vocal cysts are benign vocal fold lesions that affect children and adults, being often associated with vocal overuse, which frequently affects people who use their voices professionally. Vocal symptoms are chronic in course, often times since childhood, and the treatment of choice is surgical removal. A careful examination of the vocal folds is necessary during surgery, because other laryngeal lesions may be associated with vocal cysts. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Low frequency dove coos vary across noise gradients in an urbanized environment.
Guo, Fengyi; Bonebrake, Timothy C; Dingle, Caroline
2016-08-01
Urbanization poses a challenge to bird communication due to signal masking by ambient noise and reflective surfaces that lead to signal degradation. Bird species (especially oscines) have been shown to alter their singing behaviour to increase signal efficiency in highly urbanized environments. However, few studies on the effects of noise on song structure have included birds with low frequency vocal signals which may be especially vulnerable to noise pollution due to significant frequency overlap of their signals with traffic noise. We compared the perch coos of spotted doves (Streptopelia chinensis), a species with very low frequency vocalizations, in different background noise levels across urban and peri-urban areas in Hong Kong. We documented a 10% upward shift in the minimum frequency of coos of spotted doves across the noise gradient (a relatively small but significant shift), and a reduced maximum frequency in urban habitats with a higher density of built up area. Hong Kong doves had significantly higher minimum and maximum frequencies than doves from throughout their range (from mostly rural sites). Our results indicate that urban species with extremely low sound frequencies such as doves can alter their vocalizations in response to variable urban acoustic environments. Copyright © 2016 Elsevier B.V. All rights reserved.
Estis, Julie M; Dean-Claytor, Ashli; Moore, Robert E; Rowell, Thomas L
2011-03-01
The effects of musical interference and noise on pitch-matching accuracy were examined. Vocal training was explored as a factor influencing pitch-matching accuracy, and the relationship between pitch matching and pitch discrimination was examined. Twenty trained singers (TS) and 20 untrained individuals (UT) vocally matched tones in six conditions (immediate, four types of chords, noise). Fundamental frequencies were calculated, compared with the frequency of the target tone, and converted to semitone difference scores. A pitch discrimination task was also completed. TS showed significantly better pitch matching than UT across all conditions. Individual performances for UT were highly variable. Therefore, untrained participants were divided into two groups: 10 untrained accurate and 10 untrained inaccurate. Comparison of TS with untrained accurate individuals revealed significant differences between groups and across conditions. Compared with immediate vocal matching of target tones, pitch-matching accuracy was significantly reduced, given musical chord and noise interference unless the target tone was presented in the musical chord. A direct relationship between pitch matching and pitch discrimination was revealed. Across pitch-matching conditions, TS were consistently more accurate than UT. Pitch-matching accuracy diminished when auditory interference consisted of chords that did not contain the target tone and noise. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Effects of noise levels and call types on the source levels of killer whale calls.
Holt, Marla M; Noren, Dawn P; Emmons, Candice K
2011-11-01
Accurate parameter estimates relevant to the vocal behavior of marine mammals are needed to assess potential effects of anthropogenic sound exposure including how masking noise reduces the active space of sounds used for communication. Information about how these animals modify their vocal behavior in response to noise exposure is also needed for such assessment. Prior studies have reported variations in the source levels of killer whale sounds, and a more recent study reported that killer whales compensate for vessel masking noise by increasing their call amplitude. The objectives of the current study were to investigate the source levels of a variety of call types in southern resident killer whales while also considering background noise level as a likely factor related to call source level variability. The source levels of 763 discrete calls along with corresponding background noise were measured over three summer field seasons in the waters surrounding the San Juan Islands, WA. Both noise level and call type were significant factors on call source levels (1-40 kHz band, range of 135.0-175.7 dB(rms) re 1 [micro sign]Pa at 1 m). These factors should be considered in models that predict how anthropogenic masking noise reduces vocal communication space in marine mammals.
Changes in the neural control of a complex motor sequence during learning
Otchy, Timothy M.; Goldberg, Jesse H.; Aronov, Dmitriy; Fee, Michale S.
2011-01-01
The acquisition of complex motor sequences often proceeds through trial-and-error learning, requiring the deliberate exploration of motor actions and the concomitant evaluation of the resulting performance. Songbirds learn their song in this manner, producing highly variable vocalizations as juveniles. As the song improves, vocal variability is gradually reduced until it is all but eliminated in adult birds. In the present study we examine how the motor program underlying such a complex motor behavior evolves during learning by recording from the robust nucleus of the arcopallium (RA), a motor cortex analog brain region. In young birds, neurons in RA exhibited highly variable firing patterns that throughout development became more precise, sparse, and bursty. We further explored how the developing motor program in RA is shaped by its two main inputs: LMAN, the output nucleus of a basal ganglia-forebrain circuit, and HVC, a premotor nucleus. Pharmacological inactivation of LMAN during singing made the song-aligned firing patterns of RA neurons adultlike in their stereotypy without dramatically affecting the spike statistics or the overall firing patterns. Removing the input from HVC, on the other hand, resulted in a complete loss of stereotypy of both the song and the underlying motor program. Thus our results show that a basal ganglia-forebrain circuit drives motor exploration required for trial-and-error learning by adding variability to the developing motor program. As learning proceeds and the motor circuits mature, the relative contribution of LMAN is reduced, allowing the premotor input from HVC to drive an increasingly stereotyped song. PMID:21543758
Vocal Imitations of Non-Vocal Sounds
Houix, Olivier; Voisin, Frédéric; Misdariis, Nicolas; Susini, Patrick
2016-01-01
Imitative behaviors are widespread in humans, in particular whenever two persons communicate and interact. Several tokens of spoken languages (onomatopoeias, ideophones, and phonesthemes) also display different degrees of iconicity between the sound of a word and what it refers to. Thus, it probably comes at no surprise that human speakers use a lot of imitative vocalizations and gestures when they communicate about sounds, as sounds are notably difficult to describe. What is more surprising is that vocal imitations of non-vocal everyday sounds (e.g. the sound of a car passing by) are in practice very effective: listeners identify sounds better with vocal imitations than with verbal descriptions, despite the fact that vocal imitations are inaccurate reproductions of a sound created by a particular mechanical system (e.g. a car driving by) through a different system (the voice apparatus). The present study investigated the semantic representations evoked by vocal imitations of sounds by experimentally quantifying how well listeners could match sounds to category labels. The experiment used three different types of sounds: recordings of easily identifiable sounds (sounds of human actions and manufactured products), human vocal imitations, and computational “auditory sketches” (created by algorithmic computations). The results show that performance with the best vocal imitations was similar to the best auditory sketches for most categories of sounds, and even to the referent sounds themselves in some cases. More detailed analyses showed that the acoustic distance between a vocal imitation and a referent sound is not sufficient to account for such performance. Analyses suggested that instead of trying to reproduce the referent sound as accurately as vocally possible, vocal imitations focus on a few important features, which depend on each particular sound category. These results offer perspectives for understanding how human listeners store and access long-term sound representations, and sets the stage for the development of human-computer interfaces based on vocalizations. PMID:27992480
Arriaga, Gustavo; Zhou, Eric P.; Jarvis, Erich D.
2012-01-01
Humans and song-learning birds communicate acoustically using learned vocalizations. The characteristic features of this social communication behavior include vocal control by forebrain motor areas, a direct cortical projection to brainstem vocal motor neurons, and dependence on auditory feedback to develop and maintain learned vocalizations. These features have so far not been found in closely related primate and avian species that do not learn vocalizations. Male mice produce courtship ultrasonic vocalizations with acoustic features similar to songs of song-learning birds. However, it is assumed that mice lack a forebrain system for vocal modification and that their ultrasonic vocalizations are innate. Here we investigated the mouse song system and discovered that it includes a motor cortex region active during singing, that projects directly to brainstem vocal motor neurons and is necessary for keeping song more stereotyped and on pitch. We also discovered that male mice depend on auditory feedback to maintain some ultrasonic song features, and that sub-strains with differences in their songs can match each other's pitch when cross-housed under competitive social conditions. We conclude that male mice have some limited vocal modification abilities with at least some neuroanatomical features thought to be unique to humans and song-learning birds. To explain our findings, we propose a continuum hypothesis of vocal learning. PMID:23071596
Dopaminergic Contributions to Vocal Learning
Hoffmann, Lukas A.; Saravanan, Varun; Wood, Alynda N.; He, Li
2016-01-01
Although the brain relies on auditory information to calibrate vocal behavior, the neural substrates of vocal learning remain unclear. Here we demonstrate that lesions of the dopaminergic inputs to a basal ganglia nucleus in a songbird species (Bengalese finches, Lonchura striata var. domestica) greatly reduced the magnitude of vocal learning driven by disruptive auditory feedback in a negative reinforcement task. These lesions produced no measureable effects on the quality of vocal performance or the amount of song produced. Our results suggest that dopaminergic inputs to the basal ganglia selectively mediate reinforcement-driven vocal plasticity. In contrast, dopaminergic lesions produced no measurable effects on the birds' ability to restore song acoustics to baseline following the cessation of reinforcement training, suggesting that different forms of vocal plasticity may use different neural mechanisms. SIGNIFICANCE STATEMENT During skill learning, the brain relies on sensory feedback to improve motor performance. However, the neural basis of sensorimotor learning is poorly understood. Here, we investigate the role of the neurotransmitter dopamine in regulating vocal learning in the Bengalese finch, a songbird with an extremely precise singing behavior that can nevertheless be reshaped dramatically by auditory feedback. Our findings show that reduction of dopamine inputs to a region of the songbird basal ganglia greatly impairs vocal learning but has no detectable effect on vocal performance. These results suggest a specific role for dopamine in regulating vocal plasticity. PMID:26888928
Conversational Entrainment of Vocal Fry in Young Adult Female American English Speakers.
Borrie, Stephanie A; Delfino, Christine R
2017-07-01
Conversational entrainment, the natural tendency for people to modify their behaviors to more closely match their communication partner, is examined as one possible mechanism modulating the prevalence of vocal fry in the speech of young American women engaged in spoken dialogue. Twenty young adult female American English speakers engaged in two spoken dialogue tasks-one with a young adult female American English conversational partner who exhibited substantial vocal fry and one with a young adult female American English conversational partner who exhibited quantifiably less vocal fry. Dialogues were analyzed for proportion of vocal fry, by speaker, and two measures of communicative success (efficiency and enjoyment). Participants employed significantly more vocal fry when conversing with the partner who exhibited substantial vocal fry than when conversing with the partner who exhibited quantifiably less vocal fry. Further, greater similarity between communication partners in their use of vocal fry tracked with higher scores of communicative efficiency and communicative enjoyment. Conversational entrainment offers a mechanistic framework that may be used to explain, to some degree, the frequency with which vocal fry is employed by young American women engaged in spoken dialogue. Further, young American women who modulated their vocal patterns during dialogue to match those of their conversational partner gained more efficiency and enjoyment from their interactions, demonstrating the cognitive and social benefits of entrainment. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Mouse vocal communication system: are ultrasounds learned or innate?
Arriaga, Gustavo; Jarvis, Erich D.
2013-01-01
Mouse ultrasonic vocalizations (USVs) are often used as behavioral readouts of internal states, to measure effects of social and pharmacological manipulations, and for behavioral phenotyping of mouse models for neuropsychiatric and neurodegenerative disorders. However, little is known about the neurobiological mechanisms of rodent USV production. Here we discuss the available data to assess whether male mouse song behavior and the supporting brain circuits resemble those of known vocal non-learning or vocal learning species. Recent neurobiology studies have demonstrated that the mouse USV brain system includes motor cortex and striatal regions, and that the vocal motor cortex sends a direct sparse projection to the brainstem vocal motor nucleus ambiguous, a projection thought be unique to humans among mammals. Recent behavioral studies have reported opposing conclusions on mouse vocal plasticity, including vocal ontogeny changes in USVs over early development that might not be explained by innate maturation processes, evidence for and against a role for auditory feedback in developing and maintaining normal mouse USVs, and evidence for and against limited vocal imitation of song pitch. To reconcile these findings, we suggest that the trait of vocal learning may not be dichotomous but encompass a broad set of behavioral and neural traits we call the continuum hypothesis, and that mice possess some of the traits associated with a capacity for limited vocal learning. PMID:23295209
Assessing Vocal Development in Infants and Toddlers
Nathani, Suneeti; Ertmer, David J.
2012-01-01
The purpose of this study was to examine changes in prelinguistic vocal productions during the first 20 months of life. Vocalizations were classified into 23 mutually exclusive and exhaustive types, and grouped into five ascending levels using the Stark Assessment of Early Vocal Development-Revised (SAEVD-R). Data from 30 typically developing infants, aged 0–20 months, show that older infants attained higher developmental levels on the SAEVD-R than younger infants. Infants 0–2, 3–5, and 6–8 months of age primarily produced vocalizations from Levels 1 (Reflexive), 2 (Control of Phonation), and 3 (Expansion). Infants 9–20 months of age also produced vocalizations from Level 4 (Basic Canonical Syllables). Only infants from 16–20 months of age produced Level 5 (Advanced Forms) vocalizations in significant quantities. The outcomes indicate that the SAEVD-R is a valuable instrument for evaluating prelinguistic vocal development. PMID:16728333
Phonetogram changes for trained singers over a nine-month period of vocal training.
LeBorgne, Wendy DeLeo; Weinrich, Barbara D
2002-03-01
Professional vocalists encounter demands requiring voluntary control of phonation, while utilizing a considerable range of frequency and intensity. These quantifiable acoustic events can be measured and represented in a phonetogram. Previous research has compared the phonetograms of trained and untrained voices and found significant differences between these groups. This study was designed to assess the effects of vocal training for singers over a period of nine months. Phonetogram contour changes were examined, with the primary focus on expansion of frequency range and/or intensity control. Twenty-one first-year, master's level, vocal music students, who were engaged in an intensive vocal performance curriculum, participated in this study. Following nine months of vocal training, significant differences were revealed in the subjects' mean frequency range and minimum vocal intensity across frequency levels. There was no significant difference for the mean maximum vocal intensity across frequency levels following vocal training.
Identification of a motor to auditory pathway important for vocal learning
Roberts, Todd F.; Hisey, Erin; Tanaka, Masashi; Kearney, Matthew; Chattree, Gaurav; Yang, Cindy F.; Shah, Nirao M.; Mooney, Richard
2017-01-01
Summary Learning to vocalize depends on the ability to adaptively modify the temporal and spectral features of vocal elements. Neurons that convey motor-related signals to the auditory system are theorized to facilitate vocal learning, but the identity and function of such neurons remain unknown. Here we identify a previously unknown neuron type in the songbird brain that transmits vocal motor signals to the auditory cortex. Genetically ablating these neurons in juveniles disrupted their ability to imitate features of an adult tutor’s song. Ablating these neurons in adults had little effect on previously learned songs, but interfered with their ability to adaptively modify the duration of vocal elements and largely prevented the degradation of song’s temporal features normally caused by deafening. These findings identify a motor to auditory circuit essential to vocal imitation and to the adaptive modification of vocal timing. PMID:28504672
ERIC Educational Resources Information Center
Hackworth, Rhonda S.
2007-01-01
This study examined the effects of vocal hygiene and behavior modification instruction on self-reported behaviors of music teachers. Subjects (N = 76) reported daily behaviors for eight weeks: water consumption, warm-up, talking over music/noise, vocal rest, nonverbal commands, and vocal problems. Subjects were in experimental group 1 or 2, or the…
Choi, Jeong-Seok; Kim, Nahn Ju; Klemuk, Sarah; Jang, Yun Ho; Park, In Suh; Ahn, Kyung Hyun; Lim, Jae-Yol; Kim, Young-Mo
2012-09-01
To compare the rheological characteristics of structurally different hyaluronic acid (HA)-based biomaterials that are presently used for phonosurgery and to investigate their influence on the viscoelastic properties of vocal folds after implantation in an in vivo rabbit model. In vitro and in vivo rheometric investigation. Experimental laboratory, Inha and Seoul National Universities. Viscoelastic shear properties of 3 HA-based biomaterials (Rofilan, Restylane, and Reviderm) were measured with a strain-controlled rheometer. These biomaterials were injected into the deep layers of rabbit vocal folds, and viscoelastic moduli of the injected vocal folds were determined 2 months after the injection. The vocal fold specimens were observed using a light microscope and a transmission electron microscope. All HA-based biomaterials showed similar levels of shear viscosity, which were slightly higher than that of human vocal folds reported in previous studies. Compared with noninjected control vocal folds, there were no significant differences in the magnitudes of both elastic shear modulus (G') and viscous modulus (G") of injected vocal folds among all of the materials. Light microscopic images showed that all materials were observed in the deep layers of vocal folds and electron scanning images revealed that injected HA particles were homogeneously distributed in regions of collagenous fibers. HA-based biomaterials could preserve the viscoelastic properties of the vocal folds, when they were injected into vocal folds in an in vivo rabbit model. However, further studies on the influence of the biomaterials on the viscoelasticity of human vocal folds in ECM surroundings are still needed.
Voice Use Among Music Theory Teachers: A Voice Dosimetry and Self-Assessment Study.
Schiller, Isabel S; Morsomme, Dominique; Remacle, Angélique
2017-07-25
This study aimed (1) to investigate music theory teachers' professional and extra-professional vocal loading and background noise exposure, (2) to determine the correlation between vocal loading and background noise, and (3) to determine the correlation between vocal loading and self-evaluation data. Using voice dosimetry, 13 music theory teachers were monitored for one workweek. The parameters analyzed were voice sound pressure level (SPL), fundamental frequency (F0), phonation time, vocal loading index (VLI), and noise SPL. Spearman correlation was used to correlate vocal loading parameters (voice SPL, F0, and phonation time) and noise SPL. Each day, the subjects self-assessed their voice using visual analog scales. VLI and self-evaluation data were correlated using Spearman correlation. Vocal loading parameters and noise SPL were significantly higher in the professional than in the extra-professional environment. Voice SPL, phonation time, and female subjects' F0 correlated positively with noise SPL. VLI correlated with self-assessed voice quality, vocal fatigue, and amount of singing and speaking voice produced. Teaching music theory is a profession with high vocal demands. More background noise is associated with increased vocal loading and may indirectly increase the risk for voice disorders. Correlations between VLI and self-assessments suggest that these teachers are well aware of their vocal demands and feel their effect on voice quality and vocal fatigue. Visual analog scales seem to represent a useful tool for subjective vocal loading assessment and associated symptoms in these professional voice users. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
The Interaction of Surface Hydration and Vocal Loading on Voice Measures.
Fujiki, Robert Brinton; Chapleau, Abigail; Sundarrajan, Anusha; McKenna, Victoria; Sivasankar, M Preeti
2017-03-01
Vocal loading tasks provide insight regarding the mechanisms underlying healthy laryngeal function. Determining the manner in which the larynx can most efficiently be loaded is a complex task. The goal of this study was to determine if vocal loading could be achieved in 30 minutes by altering phonatory mode. Owing to the fact that surface hydration facilitates efficient vocal fold oscillation, the effects of environmental humidity on vocal loading were also examined. This study also investigated whether the detrimental effects of vocal loading could be attenuated by increasing environmental humidity. Sixteen vocally healthy adults (8 men, 8 women) completed a 30-minute vocal loading task in low and moderate humidity. The order of humidities was counterbalanced across subjects. The vocal loading task consisted of reading with elevated pitch and pressed vocal quality and low pitch and pressed and/or raspy vocal quality in the presence of 65 dB ambient, multi-talker babble noise. Significant effects were observed for (1) cepstral peak prominence on soft sustained phonation at 10th and 80th pitches, (2) perceived phonatory effort, and (3) perceived tiredness ratings. No loading effects were observed for cepstral peak prominence on the rainbow passage, although fundamental frequency on the rainbow passage increased post loading. No main effect was observed for humidity. Following a 30-minute vocal loading task involving altering laryngeal vibratory mode in combination with increased volume. Also, moderate environmental humidity did not significantly attenuate the negative effects of loading. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Convergent Differential Regulation of Parvalbumin in the Brains of Vocal Learners
Hara, Erina; Rivas, Miriam V.; Ward, James M.; Okanoya, Kazuo; Jarvis, Erich D.
2012-01-01
Spoken language and learned song are complex communication behaviors found in only a few species, including humans and three groups of distantly related birds – songbirds, parrots, and hummingbirds. Despite their large phylogenetic distances, these vocal learners show convergent behaviors and associated brain pathways for vocal communication. However, it is not clear whether this behavioral and anatomical convergence is associated with molecular convergence. Here we used oligo microarrays to screen for genes differentially regulated in brain nuclei necessary for producing learned vocalizations relative to adjacent brain areas that control other behaviors in avian vocal learners versus vocal non-learners. A top candidate gene in our screen was a calcium-binding protein, parvalbumin (PV). In situ hybridization verification revealed that PV was expressed significantly higher throughout the song motor pathway, including brainstem vocal motor neurons relative to the surrounding brain regions of all distantly related avian vocal learners. This differential expression was specific to PV and vocal learners, as it was not found in avian vocal non-learners nor for control genes in learners and non-learners. Similar to the vocal learning birds, higher PV up-regulation was found in the brainstem tongue motor neurons used for speech production in humans relative to a non-human primate, macaques. These results suggest repeated convergent evolution of differential PV up-regulation in the brains of vocal learners separated by more than 65–300 million years from a common ancestor and that the specialized behaviors of learned song and speech may require extra calcium buffering and signaling. PMID:22238614
Gender differences affecting vocal health of women in vocally demanding careers
Hunter, Eric J.; Smith, Marshall E.; Tanner, Kristine
2012-01-01
Studies suggest that occupational voice users have a greater incidence of vocal issues than the general population. Women have been found to experience vocal health problems more frequently than men, regardless of their occupation. Traditionally, it has been assumed that differences in the laryngeal system are the cause of this disproportion. Nevertheless, it is valuable to identify other potential gender distinctions which may make women more vulnerable to voice disorders. A search of the literature was conducted for gender-specific characteristics which might impact the vocal health of women. This search can be used by healthcare practitioners to help female patients avoid serious vocal health injuries, as well as to better treat women who already suffer from such vocal health issues. PMID:21722077
Evolution of Courtship Songs in Xenopus : Vocal Pattern Generation and Sound Production.
Leininger, Elizabeth C; Kelley, Darcy B
2015-01-01
The extant species of African clawed frogs (Xenopus and Silurana) provide an opportunity to link the evolution of vocal characters to changes in the responsible cellular and molecular mechanisms. In this review, we integrate several robust lines of research: evolutionary trajectories of Xenopus vocalizations, cellular and circuit-level mechanisms of vocalization in selected Xenopus model species, and Xenopus evolutionary history and speciation mechanisms. Integrating recent findings allows us to generate and test specific hypotheses about the evolution of Xenopus vocal circuits. We propose that reduced vocal sex differences in some Xenopus species result from species-specific losses of sexually differentiated neural and neuromuscular features. Modification of sex-hormone-regulated developmental mechanisms is a strong candidate mechanism for reduced vocal sex differences.
Relationships between CSID and vocal fold vibratory function
NASA Astrophysics Data System (ADS)
Cooke, Melissa L.
High correlations have been reported between the acoustic-based Cepstral/Spectral Index of Dysphonia (CSID) and perceptual judgments of dysphonia. This study explores whether CSID provides additional insight and explains more of the variance in HSV-based properties of vocal fold vibratory function than has been reported for other acoustic measures. Using the Analysis of Dysphonia in Speech and Voice (ADSV) program, CSID and its component variables were correlated with HSV-based measures of glottal cycle aperiodicity and glottal area for 20 subjects who underwent phonomicrosurgery. Results indicate CSID is only marginally correlated with glottal cycle aperiodicity in pre- and post-surgical conditions and does not correlate as highly as the cepstral peak prominence alone. Additionally, results reveal higher correlations when examining within-subject change from pre-surgical to post-surgical assessments rather than correlating measures across subjects. Future directions are discussed that aim at improving our understanding of the relationships between acoustic parameters and underlying phonatory function.
Abnormal laughter-like vocalisations replacing speech in primary progressive aphasia
Rohrer, Jonathan D.; Warren, Jason D.; Rossor, Martin N.
2009-01-01
We describe ten patients with a clinical diagnosis of primary progressive aphasia (PPA) (pathologically confirmed in three cases) who developed abnormal laughter-like vocalisations in the context of progressive speech output impairment leading to mutism. Failure of speech output was accompanied by increasing frequency of the abnormal vocalisations until ultimately they constituted the patient's only extended utterance. The laughter-like vocalisations did not show contextual sensitivity but occurred as an automatic vocal output that replaced speech. Acoustic analysis of the vocalisations in two patients revealed abnormal motor features including variable note duration and inter-note interval, loss of temporal symmetry of laugh notes and loss of the normal decrescendo. Abnormal laughter-like vocalisations may be a hallmark of a subgroup in the PPA spectrum with impaired control and production of nonverbal vocal behaviour due to disruption of fronto-temporal networks mediating vocalisation. PMID:19435636
Abnormal laughter-like vocalisations replacing speech in primary progressive aphasia.
Rohrer, Jonathan D; Warren, Jason D; Rossor, Martin N
2009-09-15
We describe ten patients with a clinical diagnosis of primary progressive aphasia (PPA) (pathologically confirmed in three cases) who developed abnormal laughter-like vocalisations in the context of progressive speech output impairment leading to mutism. Failure of speech output was accompanied by increasing frequency of the abnormal vocalisations until ultimately they constituted the patient's only extended utterance. The laughter-like vocalisations did not show contextual sensitivity but occurred as an automatic vocal output that replaced speech. Acoustic analysis of the vocalisations in two patients revealed abnormal motor features including variable note duration and inter-note interval, loss of temporal symmetry of laugh notes and loss of the normal decrescendo. Abnormal laughter-like vocalisations may be a hallmark of a subgroup in the PPA spectrum with impaired control and production of nonverbal vocal behaviour due to disruption of fronto-temporal networks mediating vocalisation.
Cultural relativity in perceiving emotion from vocalizations.
Gendron, Maria; Roberson, Debi; van der Vyver, Jacoba Marieta; Barrett, Lisa Feldman
2014-04-01
A central question in the study of human behavior is whether certain emotions, such as anger, fear, and sadness, are recognized in nonverbal cues across cultures. We predicted and found that in a concept-free experimental task, participants from an isolated cultural context (the Himba ethnic group from northwestern Namibia) did not freely label Western vocalizations with expected emotion terms. Responses indicate that Himba participants perceived more basic affective properties of valence (positivity or negativity) and to some extent arousal (high or low activation). In a second, concept-embedded task, we manipulated whether the target and foil on a given trial matched in both valence and arousal, neither valence nor arousal, valence only, or arousal only. Himba participants achieved above-chance accuracy only when foils differed from targets in valence only. Our results indicate that the voice can reliably convey affective meaning across cultures, but that perceptions of emotion from the voice are culturally variable.
Cultural relativity in perceiving emotion from vocalizations
Gendron, Maria; Roberson, Debi; van der Vyver, Jacoba Marietta; Barrett, Lisa Feldman
2014-01-01
A central question in the study of human behavior is whether or not certain emotions, such as anger, fear and sadness, are recognized across cultures in non-verbal cues. We predicted and found that in a concept-free experimental task, participants from an isolated cultural context (the Himba ethnic group from Northwest Namibia) do not freely label Western vocalizations with expected emotion terms. Responses indicated Himba participants perceived more basic affective properties of valence (positivity or negativity) and to some extent arousal (high or low activation). In a second concept-embedded task, we manipulated whether a given trial could be solved using only affective content or discrete emotion content based on the foil choice. Above chance accuracy in Himba participants occurred only when foils differed from targets in valence, indicating that the voice can reliably convey affective meaning across cultures, but that perceptions of emotion from the voice are culturally variable. PMID:24501109
Speech and Voice Response to a Levodopa Challenge in Late-Stage Parkinson's Disease.
Fabbri, Margherita; Guimarães, Isabel; Cardoso, Rita; Coelho, Miguel; Guedes, Leonor Correia; Rosa, Mario M; Godinho, Catarina; Abreu, Daisy; Gonçalves, Nilza; Antonini, Angelo; Ferreira, Joaquim J
2017-01-01
Parkinson's disease (PD) patients are affected by hypokinetic dysarthria, characterized by hypophonia and dysprosody, which worsens with disease progression. Levodopa's (l-dopa) effect on quality of speech is inconclusive; no data are currently available for late-stage PD (LSPD). To assess the modifications of speech and voice in LSPD following an acute l-dopa challenge. LSPD patients [Schwab and England score <50/Hoehn and Yahr stage >3 (MED ON)] performed several vocal tasks before and after an acute l-dopa challenge. The following was assessed: respiratory support for speech, voice quality, stability and variability, speech rate, and motor performance (MDS-UPDRS-III). All voice samples were recorded and analyzed by a speech and language therapist blinded to patients' therapeutic condition using Praat 5.1 software. 24/27 (14 men) LSPD patients succeeded in performing voice tasks. Median age and disease duration of patients were 79 [IQR: 71.5-81.7] and 14.5 [IQR: 11-15.7] years, respectively. In MED OFF, respiratory breath support and pitch break time of LSPD patients were worse than the normative values of non-parkinsonian. A correlation was found between disease duration and voice quality ( R = 0.51; p = 0.013) and speech rate ( R = -0.55; p = 0.008). l-Dopa significantly improved MDS-UPDRS-III score (20%), with no effect on speech as assessed by clinical rating scales and automated analysis. Speech is severely affected in LSPD. Although l-dopa had some effect on motor performance, including axial signs, speech and voice did not improve. The applicability and efficacy of non-pharmacological treatment for speech impairment should be considered for speech disorder management in PD.
Novakovic, Daniel; D'Elia, Joanna; Branski, Ryan C; Blitzer, Andrew
2014-06-01
Vocal fold hematoma is traditionally managed with a period of voice rest, in the order of weeks, to allow natural resolution. This study is designed to examine the efficacy and safety of a number of hemoglobin-avid (vascular) lasers when used in the setting of acute vocal fold hematoma. Venous blood drawn from 4 white rabbits was used to create an array of subepithelial hematomas in the buccal cavities of each animal. Laser energy from I of 3 different lasers (532-nm pulsed potassium titanyl phosphate [KTP], 532-nm diode KTP, and 940-nm diode laser) was applied to each of the test hematomas at varying energy levels. Hematoma sites were photographed at days 0, 1, 5, 7, 9, and 12. Two animals were sacrificed on day 7 and the remainder on day 12. Histological evaluation of collateral tissue damage and residual hematoma was performed on biopsy specimens. Macroscopic and microscopic ulceration at laser-treated sites was mostly resolved by day 7. Inflammatory cell infiltrate was present in laser-treated and hematoma-only sites. Laser-treated samples showed alterations in vascularity. Hemoglobin-avid lasers may be beneficial in accelerating subepithelial hematoma resolution with a favorable tissue damage profile.
Lee, Shao-Hsuan; Hsiao, Tzu-Yu; Lee, Guo-She
2015-06-01
Sustained vocalizations of vowels [a], [i], and syllable [mə] were collected in twenty normal-hearing individuals. On vocalizations, five conditions of different audio-vocal feedback were introduced separately to the speakers including no masking, wearing supra-aural headphones only, speech-noise masking, high-pass noise masking, and broad-band-noise masking. Power spectral analysis of vocal fundamental frequency (F0) was used to evaluate the modulations of F0 and linear-predictive-coding was used to acquire first two formants. The results showed that while the formant frequencies were not significantly shifted, low-frequency modulations (<3 Hz) of F0 significantly increased with reduced audio-vocal feedback across speech sounds and were significantly correlated with auditory awareness of speakers' own voices. For sustained speech production, the motor speech controls on F0 may depend on a feedback mechanism while articulation should rely more on a feedforward mechanism. Power spectral analysis of F0 might be applied to evaluate audio-vocal control for various hearing and neurological disorders in the future. Copyright © 2015 Elsevier B.V. All rights reserved.
Animal models of speech and vocal communication deficits associated with psychiatric disorders
Konopka, Genevieve; Roberts, Todd F.
2015-01-01
Disruptions in speech, language and vocal communication are hallmarks of several neuropsychiatric disorders, most notably autism spectrum disorders. Historically, the use of animal models to dissect molecular pathways and connect them to behavioral endophenotypes in cognitive disorders has proven to be an effective approach for developing and testing disease-relevant therapeutics. The unique aspects of human language when compared to vocal behaviors in other animals make such an approach potentially more challenging. However, the study of vocal learning in species with analogous brain circuits to humans may provide entry points for understanding this human-specific phenotype and diseases. Here, we review animal models of vocal learning and vocal communication, and specifically link phenotypes of psychiatric disorders to relevant model systems. Evolutionary constraints in the organization of neural circuits and synaptic plasticity result in similarities in the brain mechanisms for vocal learning and vocal communication. Comparative approaches and careful consideration of the behavioral limitations among different animal models can provide critical avenues for dissecting the molecular pathways underlying cognitive disorders that disrupt speech, language and vocal communication. PMID:26232298
Intelligibility of emotional speech in younger and older adults.
Dupuis, Kate; Pichora-Fuller, M Kathleen
2014-01-01
Little is known about the influence of vocal emotions on speech understanding. Word recognition accuracy for stimuli spoken to portray seven emotions (anger, disgust, fear, sadness, neutral, happiness, and pleasant surprise) was tested in younger and older listeners. Emotions were presented in either mixed (heterogeneous emotions mixed in a list) or blocked (homogeneous emotion blocked in a list) conditions. Three main hypotheses were tested. First, vocal emotion affects word recognition accuracy; specifically, portrayals of fear enhance word recognition accuracy because listeners orient to threatening information and/or distinctive acoustical cues such as high pitch mean and variation. Second, older listeners recognize words less accurately than younger listeners, but the effects of different emotions on intelligibility are similar across age groups. Third, blocking emotions in list results in better word recognition accuracy, especially for older listeners, and reduces the effect of emotion on intelligibility because as listeners develop expectations about vocal emotion, the allocation of processing resources can shift from emotional to lexical processing. Emotion was the within-subjects variable: all participants heard speech stimuli consisting of a carrier phrase followed by a target word spoken by either a younger or an older talker, with an equal number of stimuli portraying each of seven vocal emotions. The speech was presented in multi-talker babble at signal to noise ratios adjusted for each talker and each listener age group. Listener age (younger, older), condition (mixed, blocked), and talker (younger, older) were the main between-subjects variables. Fifty-six students (Mage= 18.3 years) were recruited from an undergraduate psychology course; 56 older adults (Mage= 72.3 years) were recruited from a volunteer pool. All participants had clinically normal pure-tone audiometric thresholds at frequencies ≤3000 Hz. There were significant main effects of emotion, listener age group, and condition on the accuracy of word recognition in noise. Stimuli spoken in a fearful voice were the most intelligible, while those spoken in a sad voice were the least intelligible. Overall, word recognition accuracy was poorer for older than younger adults, but there was no main effect of talker, and the pattern of the effects of different emotions on intelligibility did not differ significantly across age groups. Acoustical analyses helped elucidate the effect of emotion and some intertalker differences. Finally, all participants performed better when emotions were blocked. For both groups, performance improved over repeated presentations of each emotion in both blocked and mixed conditions. These results are the first to demonstrate a relationship between vocal emotion and word recognition accuracy in noise for younger and older listeners. In particular, the enhancement of intelligibility by emotion is greatest for words spoken to portray fear and presented heterogeneously with other emotions. Fear may have a specialized role in orienting attention to words heard in noise. This finding may be an auditory counterpart to the enhanced detection of threat information in visual displays. The effect of vocal emotion on word recognition accuracy is preserved in older listeners with good audiograms and both age groups benefit from blocking and the repetition of emotions.
An Investigation of Extinction-Induced Vocalizations
ERIC Educational Resources Information Center
Valentino, Amber L.; Shillingsburg, M. Alice; Call, Nathan A.; Burton, Britney; Bowen, Crystal N.
2011-01-01
Children with autism have significant communication delays. Although some children develop vocalizations through shaping and differential reinforcement, others rarely exhibit vocalizations, and alternative methods are targeted in intervention. However, vocal language often remains a goal for caregivers and clinicians. Thus, strategies to increase…
ERIC Educational Resources Information Center
Meerschman, Iris; Van Lierde, Kristiane; Peeters, Karen; Meersman, Eline; Claeys, Sofie; D'haeseleer, Evelien
2017-01-01
Purpose: The purpose of this study was to determine the short-term effect of 2 semi-occluded vocal tract training programs, "resonant voice training using nasal consonants" versus "straw phonation," on the vocal quality of vocally healthy future occupational voice users. Method: A multigroup pretest--posttest randomized control…
Clinical practice: vocal nodules in dysphonic children.
Martins, Regina Helena Garcia; Branco, Anete; Tavares, Elaine Lara Mendes; Gramuglia, Andrea Cristina Jóia
2013-09-01
Common among children, vocal symptoms are a cause of concern for parents who seek elucidation of their diagnosis and treatment. Vocal nodules are the major cause of dysphonias in children and are related to vocal abuse. We conducted a literature review considering clinical, physiopathological, epidemiological, and histological aspects of vocal nodules, as well as diagnostic methods, highlighting the main studies addressing this issue. The controversial points of treatments were also discussed.
Mechanical properties of the vocal fold. Stress-strain studies.
Haji, T; Mori, K; Omori, K; Isshiki, N
1992-01-01
The viscoelasticity of the vocal and ventricular folds was experimentally assessed by analyzing the stress-strain relationships obtained using a newly developed measuring system. The degree of stiffness of the mid-membranous portion of the vocal fold was less than that near the anterior commissure or the vocal process. The ventricular fold was much less stiff and significantly more viscous than the vocal fold. At the membranous portion of the vocal fold, the degree of stiffness was less and that of viscosity greater at 2 mm above and below the free margin than at the free margin itself.
Fuentes-López, Eduardo; Fuente, Adrian; Contreras, Karem V
2017-12-18
The aim of this study is to determine possible associations between vocal hygiene habits and self-reported vocal symptoms in telemarketers. A cross-sectional study that included 79 operators from call centres in Chile was carried out. Their vocal hygiene habits and self-reported symptoms were investigated using a validated and reliable questionnaire created for the purposes of this study. Forty-five percent of telemarketers reported having one or more vocal symptoms. Among them, 16.46% reported that their voices tense up when talking and 10.13% needed to clear their throat to make their voices clearer. Five percent mentioned that they always talk without taking a break and 40.51% reported using their voices in noisy environments. The number of working hours per day and inadequate vocal hygiene habits were associated with the presence of self-reported symptoms. Additionally, an interaction between the use of the voice in noisy environments and not taking breaks during the day was observed. Finally, the frequency of inadequate vocal hygiene habits was associated with the number of symptoms reported. Using the voice in noisy environments and talking without taking breaks were both associated with the presence of specific vocal symptoms. This study provides some evidence about the interaction between these two inadequate vocal hygiene habits that potentiates vocal symptoms.
Julias, Margaret; Riede, Tobias; Cook, Douglas
2014-01-01
Objectives Collagen fiber content and orientation affect the viscoelastic properties of the vocal folds, determining oscillation characteristics during speech and other vocalization. The investigation and reconstruction of the collagen network in vocal folds remains a challenge, because the collagen network requires at least micron-scale resolution. In this study, we used polarized light microscopy to investigate the distribution and alignment of collagen fibers within the vocal folds. Methods Data were collected in sections of human and rhesus monkey (Macaca mulatta) vocal folds cut at 3 different angles and stained with picrosirius red. Results Statistically significant differences were found between different section angles, implying that more than one section angle is required to capture the network’s complexity. In the human vocal folds, the collagen fiber distribution continuously varied across the lamina propria (medial to lateral). Distinct differences in birefringence distribution were observed between the species. For the human vocal folds, high birefringence was observed near the thyroarytenoid muscle and near the epithelium. However, in the rhesus monkey vocal folds, high birefringence was observed near the epithelium, and lower birefringence was seen near the thyroarytenoid muscle. Conclusions The differences between the collagen networks in human and rhesus monkey vocal folds provide a morphological basis for differences in viscoelastic properties between species. PMID:23534129
Imaizumi, Mitsuyoshi; Li-Jessen, Nicole Y K; Sato, Yuka; Yang, David T; Thibeault, Susan L
2017-04-01
One prospective treatment option for vocal fold scarring is regeneration with an engineered scaffold containing induced pluripotent stem cells (iPS). In the present study, we investigated the feasibility of utilizing an injectable hyaluronic acid (HA) scaffold encapsulated with human-iPS cell (hiPS) for regeneration of vocal folds. Thirty athymic nude rats underwent unilateral vocal fold injury. Contralateral vocal folds served as uninjured controls. Hyaluronic acid hydrogel scaffold, HA hydrogel scaffold containing hiPS, and HA hydrogel scaffold containing hiPS with epidermal growth factor (EGF) were injected in both vocal folds immediately after surgery. One and 2 weeks after injection, larynges were excised for histology, immunohistochemistry, and fluorescence in situ hybridization (FISH). Presence of HA hydrogel was confirmed in vocal folds 1 and 2 weeks post injection. The FISH analysis confirmed the presence and viability of hiPS in the injected vocal folds. Histological results demonstrated that vocal folds injected with HA hydrogel scaffold containing EGF demonstrated less fibrosis than those with HA hydrogel only. Human-iPS survived in injured rat vocal folds. The HA hydrogel with hiPS and EGF ameliorated the fibrotic response. Additional work is necessary to optimize hiPS differentiation and further confirm the safety of hiPS for clinical applications.
Makiyama, Kiyoshi; Yoshihashi, Hidetaka; Mogitate, Manabu; Kida, Akinori
2005-04-01
To determine the role of the adjustment of expiratory effort in the control of vocal intensity. An intensity-loading test was performed by using the airway interruption method. Three groups of subjects were used: a control group thought to resemble normal vocal fold closure, a group of patients with Reinke's edema thought to represent increased mass at the level of the vocal folds, and a group with vocal fold paralysis that was thought to represent a group with lack of adequate vocal fold closure. In the control group, expiratory lung pressure and airway resistance slightly increased. In the patients with Reinke's edema, expiratory lung pressure, and airway resistance significantly increased. In this group, the voice intensity was controlled by laryngeal adjustment, but a greater expiratory effort was needed because of a greater increase in glottal resistance. In the patients with vocal cord paralysis, airway resistance did not increase even with a high-intensity voice. Vocal intensity was controlled by expiratory effort. If there is sufficient ability for laryngeal adjustment, vocal intensity is controlled primarily by laryngeal adjustment and by expiratory adjustment in response to increased glottal resistance. However, vocal intensity is controlled by expiratory effort when laryngeal adjustment ability is poor.
Mothers' tone of voice depends on the nature of infants' transgressions.
Dahl, Audun; Sherlock, Briana R; Campos, Joseph J; Theunissen, Frédéric E
2014-08-01
Emotional vocal signals are important ways of communicating norms to young infants. The second year is a period of increase in various forms of child transgressions, but also a period when infants have limited linguistic abilities. Two studies investigated the hypothesis that mothers respond with different vocal emotional tones to 3 types of child transgressions: moral (harming others), prudential (harming oneself), and pragmatic (creating inconvenience, e.g., by spilling) transgressions. We used a combination of naturalistic observation (Study 1) and experimental manipulation (Study 2) to record, code, and analyze maternal vocal responses to child transgressions. Both studies showed that mothers were more likely to use intense, angry vocalizations in response to moral transgressions, fearful vocalizations in response to prudential transgressions, comforting vocalizations in response to pragmatic and prudential transgressions, and (in Study 2) playful vocalizations in response to pragmatic transgressions. Study 1 showed that this differential use of vocal tone is used systematically in everyday life. Study 2 allowed us to standardize the context of the maternal intervention and perform additional acoustical analyses. A combination of principal component analysis and linear discriminant analysis applied to pitch and intensity data provided quantitative measures of the differences in vocal responses. These differentiated vocal responses are likely contributors to children's acquisition of norms from early in life.
Discriminating Simulated Vocal Tremor Source Using Amplitude Modulation Spectra
Carbonell, Kathy M.; Lester, Rosemary A.; Story, Brad H.; Lotto, Andrew J.
2014-01-01
Objectives/Hypothesis Sources of vocal tremor are difficult to categorize perceptually and acoustically. This paper describes a preliminary attempt to discriminate vocal tremor sources through the use of spectral measures of the amplitude envelope. The hypothesis is that different vocal tremor sources are associated with distinct patterns of acoustic amplitude modulations. Study Design Statistical categorization methods (discriminant function analysis) were used to discriminate signals from simulated vocal tremor with different sources using only acoustic measures derived from the amplitude envelopes. Methods Simulations of vocal tremor were created by modulating parameters of a vocal fold model corresponding to oscillations of respiratory driving pressure (respiratory tremor), degree of vocal fold adduction (adductory tremor) and fundamental frequency of vocal fold vibration (F0 tremor). The acoustic measures were based on spectral analyses of the amplitude envelope computed across the entire signal and within select frequency bands. Results The signals could be categorized (with accuracy well above chance) in terms of the simulated tremor source using only measures of the amplitude envelope spectrum even when multiple sources of tremor were included. Conclusions These results supply initial support for an amplitude-envelope based approach to identify the source of vocal tremor and provide further evidence for the rich information about talker characteristics present in the temporal structure of the amplitude envelope. PMID:25532813
The Effect of Classroom Capacity on Vocal Fatigue as Quantified by the Vocal Fatigue Index.
Banks, Russell E; Bottalico, Pasquale; Hunter, Eric J
2017-01-01
Previous research has concluded that teachers are at a higher-than-normal risk for voice issues that can cause occupational limitations. While some risk factors have been identified, there are still many unknowns. A survey was distributed electronically with 506 female teacher respondents. The survey included questions to quantify three aspects of vocal fatigue as captured by the Vocal Fatigue Index (VFI): (1) general tiredness of voice (performance), (2) physical discomfort associated with voicing (pain), and (3) improvement of symptoms with rest (recovery). The effect of classroom capacity on US teachers' self-reported experience of vocal fatigue was analyzed. The results indicated that a classroom's capacity significantly affected teachers' reported amounts of vocal fatigue, while a teacher's age also appeared to significantly affect the reported amount of vocal fatigue. A quadratic rather than linear effect was seen, with the largest age effect occurring at around 40-45 years in all three factors of the VFI. Further factors which may affect vocal fatigue must be explored in future research. By understanding what increases the risk for vocal fatigue, educators and school administrators can take precautions to mitigate the occupational risk of short- and long-term vocal health issues in school teachers. © 2017 S. Karger AG, Basel.
Gómez, Pablo; Schützenberger, Anne; Kniesburges, Stefan; Bohr, Christopher; Döllinger, Michael
2018-06-01
This study presents a framework for a direct comparison of experimental vocal fold dynamics data to a numerical two-mass-model (2MM) by solving the corresponding inverse problem of which parameters lead to similar model behavior. The introduced 2MM features improvements such as a variable stiffness and a modified collision force. A set of physiologically sensible degrees of freedom is presented, and three optimization algorithms are compared on synthetic vocal fold trajectories. Finally, a total of 288 high-speed video recordings of six excised porcine larynges were optimized to validate the proposed framework. Particular focus lay on the subglottal pressure, as the experimental subglottal pressure is directly comparable to the model subglottal pressure. Fundamental frequency, amplitude and objective function values were also investigated. The employed 2MM is able to replicate the behavior of the porcine vocal folds very well. The model trajectories' fundamental frequency matches the one of the experimental trajectories in [Formula: see text] of the recordings. The relative error of the model trajectory amplitudes is on average [Formula: see text]. The experiments feature a mean subglottal pressure of 10.16 (SD [Formula: see text]) [Formula: see text]; in the model, it was on average 7.61 (SD [Formula: see text]) [Formula: see text]. A tendency of the model to underestimate the subglottal pressure is found, but the model is capable of inferring trends in the subglottal pressure. The average absolute error between the subglottal pressure in the model and the experiment is 2.90 (SD [Formula: see text]) [Formula: see text] or [Formula: see text]. A detailed analysis of the factors affecting the accuracy in matching the subglottal pressure is presented.
Volodin, Ilya A; Matrosova, Vera A; Frey, Roland; Kozhevnikova, Julia D; Isaeva, Inna L; Volodina, Elena V
2018-06-11
Non-hibernating pikas collect winter food reserves and store them in hay piles. Individualization of alarm calls might allow discrimination between colony members and conspecifics trying to steal food items from a colony pile. We investigated vocal posture, vocal tract length, and individual acoustic variation of alarm calls, emitted by wild-living Altai pikas Ochotona alpina toward a researcher. Recording started when a pika started calling and lasted as long as possible. The alarm call series of 442 individual callers from different colonies consisted of discrete short (0.073-0.157 s), high-frequency (7.31-15.46 kHz), and frequency-modulated calls separated by irregular intervals. Analysis of 442 discrete calls, the second of each series, revealed that 44.34% calls lacked nonlinear phenomena, in 7.02% nonlinear phenomena covered less than half of call duration, and in 48.64% nonlinear phenomena covered more than half of call duration. Peak frequencies varied among individuals but always fitted one of three maxima corresponding to the vocal tract resonance frequencies (formants) calculated for an estimated 45-mm oral vocal tract. Discriminant analysis using variables of 8 calls per series of 36 different callers, each from a different colony, correctly assigned over 90% of the calls to individuals. Consequently, Altai pika alarm calls are individualistic and nonlinear phenomena might further increase this acoustic individualization. Additionally, video analysis revealed a call-synchronous, very fast (0.13-0.23 s) folding, depression, and subsequent re-expansion of the pinna confirming an earlier report of this behavior that apparently contributes to protecting the hearing apparatus from damage by the self-generated high-intensity alarm calls.
Kojima, Tsuyoshi; Valenzuela, Carla V.; Novaleski, Carolyn K.; Van Deusen, Mark; Mitchell, Joshua R.; Garrett, C. Gaelyn; Sivasankar, M. Preeti; Rousseau, Bernard
2014-01-01
Objective To investigate the effects of increasing time and magnitude doses of vibration exposure on transcription of the vocal fold's junctional proteins, structural alterations, and functional tissue outcomes. Study Design Animal study. Methods 100 New Zealand White breeder rabbits were studied. Dependent variables were measured in response to increasing time doses (30, 60, or 120 minutes) and magnitude doses (control, modal intensity, and raised intensity) of vibration exposure. Messenger RNA expression of occludin, zonula occluden-1 (ZO-1), E-cadherin, β-catenin, interleukin 1β (IL-1β), cyclooxygenase-2 (COX-2), transforming growth factor β-1 (TGFβ1), and fibronectin were measured. Tissue structural alterations were assessed using transmission electron microscopy (TEM). Transepithelial resistance was used to measure functional tissue outcomes. Results Occludin gene expression was downregulated in vocal folds exposed to 120 minute time doses of raised intensity phonation, relative to control, and modal intensity phonation. ZO-1 gene expression was upregulated following a 120 minute time dose of modal intensity phonation, compared to control, and downregulated after a 120 minute time dose of raised intensity phonation, compared to modal intensity phonation. E-cadherin gene expression was downregulated after a120 minute time dose of raised intensity phonation, compared to control and modal intensity phonation. TEM revealed extensive desquamation of the stratified squamous epithelial cells with increasing time and magnitude doses of vibration exposure. A general observation of lower transepithelial resistance measures was made in tissues exposed to raised intensity phonation, compared to all other groups. Conclusions This study provides evidence of vocal fold tissue responses to varying time and magnitude doses of vibration exposure. Level of Evidence N/A PMID:25073715
Pulse register phonation in Diana monkey alarm calls
NASA Astrophysics Data System (ADS)
Riede, Tobias; Zuberbühler, Klaus
2003-05-01
The adult male Diana monkeys (Cercopithecus diana) produce predator-specific alarm calls in response to two of their predators, the crowned eagles and the leopards. The acoustic structure of these alarm calls is remarkable for a number of theoretical and empirical reasons. First, although pulsed phonation has been described in a variety of mammalian vocalizations, very little is known about the underlying production mechanism. Second, Diana monkey alarm calls are based almost exclusively on this vocal production mechanism to an extent that has never been documented in mammalian vocal behavior. Finally, the Diana monkeys' pulsed phonation strongly resembles the pulse register in human speech, where fundamental frequency is mainly controlled by subglottal pressure. Here, we report the results of a detailed acoustic analysis to investigate the production mechanism of Diana monkey alarm calls. Within calls, we found a positive correlation between the fundamental frequency and the pulse amplitude, suggesting that both humans and monkeys control fundamental frequency by subglottal pressure. While in humans pulsed phonation is usually considered pathological or artificial, male Diana monkeys rely exclusively on pulsed phonation, suggesting a functional adaptation. Moreover, we were unable to document any nonlinear phenomena, despite the fact that they occur frequently in the vocal repertoire of humans and nonhumans, further suggesting that the very robust Diana monkey pulse production mechanism has evolved for a particular functional purpose. We discuss the implications of these findings for the structural evolution of Diana monkey alarm calls and suggest that the restricted variability in fundamental frequency and robustness of the source signal gave rise to the formant patterns observed in Diana monkey alarm calls, used to convey predator information.
Tanner, Kristine; Fujiki, Robert B; Dromey, Christopher; Merrill, Ray M; Robb, Whitney; Kendall, Katherine A; Hopkin, J Arden; Channell, Ron W; Sivasankar, M Preeti
2016-11-01
This study examined the effects of a laryngeal desiccation challenge and nebulized isotonic saline on voice production in young, healthy male singers and nonsingers. This is a prospective, double-blind, within-subjects experimental design. Participants included 10 male university-trained singers and 10 age-matched nonsingers (mean age, 21.8 years; range, 18-26 years) who underwent a 30-minute oral breathing laryngeal desiccation challenge using medical grade dry air (<1% relative humidity) on two occasions in consecutive weeks. After the challenge, participants received either 3 mL or 9 mL of nebulized isotonic saline (0.9% Na + Cl - ); order of administration was counterbalanced. Phonation threshold pressure (PTP), the cepstral spectral index of dysphonia (CSID) for sustained vowels and connected speech, and self-perceived vocal effort, mouth dryness, and throat dryness were measured at each recording (baseline, after challenge, and at 5, 35, and 65 minutes after treatment). Self-perceived effort and dryness measures increased (worsened) after desiccation challenge and decreased (improved) after nebulized treatment (P < 0.05). No consistent changes were observed for PTP or CSID over time. Overall, singers demonstrated significantly lower vocal effort and CSID as compared with nonsingers. Young, vocally healthy men may not experience physiologic changes in voice production associated with laryngeal desiccation and nebulized saline treatments; however, self-reported increases in vocal effort which are associated with dryness symptoms might improve with nebulized treatments. Future hydration research should consider age and sex variables. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Collagen Content Limits Optical Coherence Tomography Image Depth in Porcine Vocal Fold Tissue.
Garcia, Jordan A; Benboujja, Fouzi; Beaudette, Kathy; Rogers, Derek; Maurer, Rie; Boudoux, Caroline; Hartnick, Christopher J
2016-11-01
Vocal fold scarring, a condition defined by increased collagen content, is challenging to treat without a method of noninvasively assessing vocal fold structure in vivo. The goal of this study was to observe the effects of vocal fold collagen content on optical coherence tomography imaging to develop a quantifiable marker of disease. Excised specimen study. Massachusetts Eye and Ear Infirmary. Porcine vocal folds were injected with collagenase to remove collagen from the lamina propria. Optical coherence tomography imaging was performed preinjection and at 0, 45, 90, and 180 minutes postinjection. Mean pixel intensity (or image brightness) was extracted from images of collagenase- and control-treated hemilarynges. Texture analysis of the lamina propria at each injection site was performed to extract image contrast. Two-factor repeated measure analysis of variance and t tests were used to determine statistical significance. Picrosirius red staining was performed to confirm collagenase activity. Mean pixel intensity was higher at injection sites of collagenase-treated vocal folds than control vocal folds (P < .0001). Fold change in image contrast was significantly increased in collagenase-treated vocal folds than control vocal folds (P = .002). Picrosirius red staining in control specimens revealed collagen fibrils most prominent in the subepithelium and above the thyroarytenoid muscle. Specimens treated with collagenase exhibited a loss of these structures. Collagen removal from vocal fold tissue increases image brightness of underlying structures. This inverse relationship may be useful in treating vocal fold scarring in patients. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
Vocal Hyperfunction in Parents of Children With Attention Deficit Hyperactivity Disorder.
Teresa, Garcia-Real; Díaz-Román, Tomás M
2016-05-01
The objective of this study was to evaluate the presence of habits and symptoms of vocal hyperfunction in the parents of children with attention deficit hyperactivity disorder (ADHD). Parents of 24 children with ADHD and 30 children of a control group completed a specific questionnaire to detect the hyperfunctional use of the voice (excessive talking, excessive loudness, talking too fast, and shouting), hoarseness, vocal fatigue, mental and physical fatigue, and the degree of parental concern for the vocal health of their child. Parents of children with ADHD spoke more often, faster, and stronger than the parents of the control group; in addition, they also used a louder volume than they usually used when they spoke to their children. The parents manifested more vocal, mental, and physical fatigue than the parents of the control group. There was a significant correlation between the "concern" for the vocal health of their children with respect to vocal symptoms of the children, the habits of vocal hyperfunctioning, and the symptoms suffered by the parents. These results suggest that the parents of children with ADHD change their vocal attitude when communicating with their children. Most likely, the increased concern of parents with ADHD children and their respective level of stress lead to hyperfunctional vocal usage. This subsequently leads to symptoms of vocal, physical, and mental fatigue at the end of the day. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Is laughter a better vocal change detector than a growl?
Pinheiro, Ana P; Barros, Carla; Vasconcelos, Margarida; Obermeier, Christian; Kotz, Sonja A
2017-07-01
The capacity to predict what should happen next and to minimize any discrepancy between an expected and an actual sensory input (prediction error) is a central aspect of perception. Particularly in vocal communication, the effective prediction of an auditory input that informs the listener about the emotionality of a speaker is critical. What is currently unknown is how the perceived valence of an emotional vocalization affects the capacity to predict and detect a change in the auditory input. This question was probed in a combined event-related potential (ERP) and time-frequency analysis approach. Specifically, we examined the brain response to standards (Repetition Positivity) and to deviants (Mismatch Negativity - MMN), as well as the anticipatory response to the vocal sounds (pre-stimulus beta oscillatory power). Short neutral, happy (laughter), and angry (growls) vocalizations were presented both as standard and deviant stimuli in a passive oddball listening task while participants watched a silent movie and were instructed to ignore the vocalizations. MMN amplitude was increased for happy compared to neutral and angry vocalizations. The Repetition Positivity was enhanced for happy standard vocalizations. Induced pre-stimulus upper beta power was increased for happy vocalizations, and predicted the modulation of the standard Repetition Positivity. These findings indicate enhanced sensory prediction for positive vocalizations such as laughter. Together, the results suggest that positive vocalizations are more effective predictors in social communication than angry and neutral ones, possibly due to their high social significance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Lin, Ya; Yamashita, Masaru; Zhang, Jingxian; Ling, Changying; Welham, Nathan V
2009-10-01
Disruption of the vocal fold extracellular matrix (ECM) can induce a profound and refractory dysphonia. Pulsed dye laser (PDL) irradiation has shown early promise as a treatment modality for disordered ECM in patients with chronic vocal fold scar; however, there are limited data addressing the mechanism by which this laser energy might induce cellular and extracellular changes in vocal fold tissues. In this study, we examined the inflammatory and ECM modulating effects of PDL irradiation on normal vocal fold tissues and cultured vocal fold fibroblasts (VFFs). We evaluated the effects of 585 nm PDL irradiation on inflammatory cytokine and collagen/collagenase gene transcription in normal rat vocal folds in vivo (3-168 hours following delivery of approximately 39.46 J/cm(2) fluence) and VFFs in vitro (3-72 hours following delivery of 4.82 or 9.64 J/cm(2) fluence). We also examined morphological vocal fold tissue changes 3 hours, 1 week, and 1 month post-irradiation. PDL irradiation altered inflammatory cytokine and procollagen/collagenase expression at the transcript level, both in vitro and in vivo. Additionally, PDL irradiation induced an inflammatory repair process in vivo that was completed by 1 month with preservation of normal tissue morphology. PDL irradiation can modulate ECM turnover in phenotypically normal vocal folds. Additional work is required to determine if these findings extend to disordered ECM, such as is seen in vocal fold scar. Lasers Surg. Med. 41:585-594, 2009. (c) 2009 Wiley-Liss, Inc.
Nocturnal "humming" vocalizations: adding a piece to the puzzle of giraffe vocal communication.
Baotic, Anton; Sicks, Florian; Stoeger, Angela S
2015-09-09
Recent research reveals that giraffes (Giraffa camelopardalis sp.) exhibit a socially structured, fission-fusion system. In other species possessing this kind of society, information exchange is important and vocal communication is usually well developed. But is this true for giraffes? Giraffes are known to produce sounds, but there is no evidence that they use vocalizations for communication. Reports on giraffe vocalizations are mainly anecdotal and the missing acoustic descriptions make it difficult to establish a call nomenclature. Despite inconclusive evidence to date, it is widely assumed that giraffes produce infrasonic vocalizations similar to elephants. In order to initiate a more detailed investigation of the vocal communication in giraffes, we collected data of captive individuals during day and night. We particularly focussed on detecting tonal, infrasonic or sustained vocalizations. We collected over 947 h of audio material in three European zoos and quantified the spectral and temporal components of acoustic signals to obtain an accurate set of acoustic parameters. Besides the known burst, snorts and grunts, we detected harmonic, sustained and frequency-modulated "humming" vocalizations during night recordings. None of the recorded vocalizations were within the infrasonic range. These results show that giraffes do produce vocalizations, which, based on their acoustic structure, might have the potential to function as communicative signals to convey information about the physical and motivational attributes of the caller. The data further reveal that the assumption of infrasonic communication in giraffes needs to be considered with caution and requires further investigations in future studies.
Vocal activity of lesser galagos (Galago spp.) at zoos.
Schneiderová, Irena; Zouhar, Jan; Štefanská, Lucie; Bolfíková, Barbora Černá; Lhota, Stanislav; Brandl, Pavel
2016-01-01
Almost nothing is known about the natural vocal behavior of lesser galagos living in zoos. This is perhaps because they are usually kept in nocturnal exhibits separated from the visitors by a transparent and acoustically insulating glass barrier. The aim of the present study was therefore to fill this gap in knowledge of the vocal behavior of lesser galagos from zoos. This knowledge might be beneficial because the vocalizations of these small primates can be used for species determination. We performed a 10-day-long acoustic monitoring of vocal activity in each of seven various groups of Galago senegalensis and G. moholi living at four zoos. We quantitatively evaluated the occurrence of four loud vocalization types present in both species, including the most species-specific advertisement call. We found that qualitative as well as quantitative differences exist in the vocal behavior of the studied groups. We confirmed that the observed vocalization types can be collected from lesser galagos living at zoos, and the success can be increased by selecting larger and more diverse groups. We found two distinct patterns of diel vocal activity in the most vocally active groups. G. senegalensis groups were most vocally active at the beginning and at the end of their activity period, whereas one G. moholi group showed an opposite pattern. The latter is surprising, as it is generally accepted that lesser galagos emit advertisement calls especially at dawn and dusk, i.e., at the beginning and at the end of their diel activity. © 2016 Wiley Periodicals, Inc.
[Causes of vocal cord dyscinesia and its original factors after endotracheal intubation].
Sun, Anke; Zhang, Tiezheng; Liu, Wenyuan; Tang, Weiwei; Guo, Xiaohong
2012-03-01
To research the causes of postintubation vocal cord dyskinesia and its contributing factors. The causes of vocal cord dyskinesia were confirmed by laryngoscope, three-dimensional spiral CT, stroboscope, and the analysis of therapy. The factors relevant to the causes of vocal cord dyskinesia were analysed based on the following elements: (1) the anatomic or pathological condition of patients or the technical skills of anesthetists. (2) emaciated or obese body and neck. (3) the age of patients. (4) the duration of endotracheal tube retention. (5) the types of operations. (6) anesthesia procedure. Among 135 patients, 128 cases (94.81%) manifested arytenoid dislocation, 7 cases (5.19%) vocal cord paralysis. The study showed that the vocal cord dyskinesia associated with anatomic or pathological condition of patients and technical skills of anesthetists (with intubation difficulty) accounted for 76.30%. The patients with relative emaciated body or neck accounted for 90.62% in cases without intubation difficulty. Age had no significant analytical relationship with vocal cord dyskinesia. Prolonged intubation (endotracheal tube retention over 12 hours) was accounted for only 17.64%. The incidence of vocal cord dyskinesia was nearly 0.5% in patients underwent cardio-thoracic surgery, accounting for 59.26% of all the patients. There are two major causes of vocal cord dyskinesia: arytenoid dislocation and vocal cord paralysis, and the rate of vocal cord dyskinesia could be reduced by the improvement of technical skill of anesthetists and/or sufficient attention to the intubation condition of patients.
Experimental analysis of the characteristics of artificial vocal folds.
Misun, Vojtech; Svancara, Pavel; Vasek, Martin
2011-05-01
Specialized literature presents a number of models describing the function of the vocal folds. In most of those models, an emphasis is placed on the air flowing through the glottis and, further, on the effect of the parameters of the air alone (its mass, speed, and so forth). The article focuses on the constructional definition of artificial vocal folds and their experimental analysis. The analysis is conducted for voiced source voice phonation and for the changing mean value of the subglottal pressure. The article further deals with the analysis of the pressure of the airflow through the vocal folds, which is cut (separated) into individual pulses by the vibrating vocal folds. The analysis results show that air pulse characteristics are relevant to voice generation, as they are produced by the flowing air and vibrating vocal folds. A number of artificial vocal folds have been constructed to date, and the aforementioned view of their phonation is confirmed by their analysis. The experiments have confirmed that man is able to consciously affect only two parameters of the source voice, that is, its fundamental frequency and voice intensity. The main forces acting on the vocal folds during phonation are as follows: subglottal air pressure and elastic and inertia forces of the vocal folds' structure. The correctness of the function of the artificial vocal folds is documented by the experimental verification of the spectra of several types of artificial vocal folds. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Alternative models developed for estimating acute systemic toxicity are generally evaluated using in vivo LD50 values. However, in vivo acute systemic toxicity studies can produce variable results, even when conducted according to accepted test guidelines. This variability can ma...
Alternative models developed for estimating acute systemic toxicity are generally evaluated using in vivo LD50 values. However, in vivo acute systemic toxicity studies can produce variable results, even when conducted according to accepted test guidelines. This variability can ma...
Computational model for vocal tract dynamics in a suboscine bird.
Assaneo, M F; Trevisan, M A
2010-09-01
In a recent work, active use of the vocal tract has been reported for singing oscines. The reconfiguration of the vocal tract during song serves to match its resonances to the syringeal fundamental frequency, demonstrating a precise coordination of the two main pieces of the avian vocal system for songbirds characterized by tonal songs. In this work we investigated the Great Kiskadee (Pitangus sulfuratus), a suboscine bird whose calls display a rich harmonic content. Using a recently developed mathematical model for the syrinx and a mobile vocal tract, we set up a computational model that provides a plausible reconstruction of the vocal tract movement using a few spectral features taken from the utterances. Moreover, synthetic calls were generated using the articulated vocal tract that accounts for all the acoustical features observed experimentally.
Further evaluation of response interruption and redirection as treatment for stereotypy.
Ahrens, Erin N; Lerman, Dorothea C; Kodak, Tiffany; Worsdell, April S; Keegan, Courtney
2011-01-01
The effects of 2 forms of response interruption and redirection (RIRD)-motor RIRD and vocal RIRD-were examined with 4 boys with autism to evaluate further the effects of this intervention and its potential underlying mechanisms. In Experiment 1, the effects of motor RIRD and vocal RIRD on vocal stereotypy and appropriate vocalizations were compared for 2 participants. In Experiment 2, the effects of both RIRD procedures on both vocal and motor stereotypy and appropriate vocalizations were compared with 2 additional participants. Results suggested that RIRD was effective regardless of the procedural variation or topography of stereotypy and that vocal RIRD functioned as a punisher. This mechanism was further explored with 1 participant by manipulating the schedule of RIRD in Experiment 3. Results were consistent with the punishment interpretation.
FURTHER EVALUATION OF RESPONSE INTERRUPTION AND REDIRECTION AS TREATMENT FOR STEREOTYPY
Ahrens, Erin N; Lerman, Dorothea C; Kodak, Tiffany; Worsdell, April S; Keegan, Courtney
2011-01-01
The effects of 2 forms of response interruption and redirection (RIRD)—motor RIRD and vocal RIRD—were examined with 4 boys with autism to evaluate further the effects of this intervention and its potential underlying mechanisms. In Experiment 1, the effects of motor RIRD and vocal RIRD on vocal stereotypy and appropriate vocalizations were compared for 2 participants. In Experiment 2, the effects of both RIRD procedures on both vocal and motor stereotypy and appropriate vocalizations were compared with 2 additional participants. Results suggested that RIRD was effective regardless of the procedural variation or topography of stereotypy and that vocal RIRD functioned as a punisher. This mechanism was further explored with 1 participant by manipulating the schedule of RIRD in Experiment 3. Results were consistent with the punishment interpretation. PMID:21541130
Influence of climate variability on acute myocardial infarction mortality in Havana, 2001-2012.
Rivero, Alina; Bolufé, Javier; Ortiz, Paulo L; Rodríguez, Yunisleydi; Reyes, María C
2015-04-01
Death from acute myocardial infarction is due to many factors; influences on risk to the individual include habits, lifestyle and behavior, as well as weather, climate and other environmental components. Changing climate patterns make it especially important to understand how climatic variability may influence acute myocardial infarction mortality. Describe the relationship between climate variability and acute myocardial infarction mortality during the period 2001-2012 in Havana. An ecological time-series study was conducted. The universe comprised 23,744 deaths from acute myocardial infarction (ICD-10: I21-I22) in Havana residents from 2001 to 2012. Climate variability and seasonal anomalies were described using the Bultó-1 bioclimatic index (comprising variables of temperature, humidity, precipitation, and atmospheric pressure), along with series analysis to determine different seasonal-to-interannual climate variation signals. The role played by climate variables in acute myocardial infarction mortality was determined using factor analysis. The Mann-Kendall and Pettitt statistical tests were used for trend analysis with a significance level of 5%. The strong association between climate variability conditions described using the Bultó-1 bioclimatic index and acute myocardial infarctions accounts for the marked seasonal pattern in AMI mortality. The highest mortality rate occurred during the dry season, i.e., the winter months in Cuba (November-April), with peak numbers in January, December and March. The lowest mortality coincided with the rainy season, i.e., the summer months (May-October). A downward trend in total number of deaths can be seen starting with the change point in April 2009. Climate variability is inversely associated with an increase in acute myocardial infarction mortality as is shown by the Bultó-1 index. This inverse relationship accounts for acute myocardial infarction mortality's seasonal pattern.
Vocal cord paralysis after aortic arch surgery: predictors and clinical outcome.
Ohta, Noriyuki; Kuratani, Toru; Hagihira, Satoshi; Kazumi, Ken-Ichiro; Kaneko, Mitsunori; Mori, Takahiko
2006-04-01
This study is retrospective cohort study of data on vocal cord paralysis after aortic arch surgery collected during 14 years at a general hospital. We investigated factors in the development of vocal cord paralysis after aortic arch surgery and the effect of vocal cord paralysis on clinical course and outcome. We reviewed data for 182 patients who underwent aortic arch surgery for aortic arch aneurysm and aortic dissection between 1989 and 2003, of whom 58 patients had proximal aortic repair, 62 had distal arch repair, and 62 had total arch repair. We assessed factors associated with the development of vocal cord paralysis and examined in detail the clinical outcome of patients with vocal cord paralysis. Postoperative vocal cord paralysis occurred in 40 patients. Multiple logistic regression analysis revealed the following risk factors with odds ratios (OR) for vocal cord paralysis: extension of procedures into distal arch (OR, 17.0), chronic dilatation of the aorta at the left subclavian artery (OR, 9.14), and total arch repair (OR, 4.24). Adoption of open-style stent-grafts reduced the incidence of vocal cord paralysis (OR, 0.031). The postoperative occurrence of vocal cord paralysis itself emerges as an independent predictor of pulmonary complications (OR, 4.12) and leads to a longer duration of hospital stay. The risk of vocal cord paralysis after aortic arch surgery depends on surgical factors, such as aneurysmal involvement of the distal arch, or the application of newer, less invasive surgical procedures. Vocal cord paralysis after aortic arch surgery itself, under aggressive postoperative respiratory management, did not increase aspiration pneumonia but was associated with postoperative complications leading to higher hospital mortality and prolonged hospitalization.
Automated Assessment of Child Vocalization Development Using LENA
ERIC Educational Resources Information Center
Richards, Jeffrey A.; Xu, Dongxin; Gilkerson, Jill; Yapanel, Umit; Gray, Sharmistha; Paul, Terrance
2017-01-01
Purpose: To produce a novel, efficient measure of children's expressive vocal development on the basis of automatic vocalization assessment (AVA), child vocalizations were automatically identified and extracted from audio recordings using Language Environment Analysis (LENA) System technology. Method: Assessment was based on full-day audio…
Vocal education for the professional voice user and singer.
Murry, T; Rosen, C A
2000-10-01
Providing education on voice-related anatomy, physiology, and vocal hygiene information is the responsibility of every voice care professional. This article discusses the importance of a vocal education program for singers and professional voice users. An outline of a vocal education lecture is provided.
Contextual influences on children's use of vocal affect cues during referential interpretation.
Berman, Jared M J; Graham, Susan A; Chambers, Craig G
2013-01-01
In three experiments, we investigated 5-year-olds' sensitivity to speaker vocal affect during referential interpretation in cases where the indeterminacy is or is not resolved by speech information. In Experiment 1, analyses of eye gaze patterns and pointing behaviours indicated that 5-year-olds used vocal affect cues at the point where an ambiguous description was encountered. In Experiments 2 and 3, we used unambiguous situations to investigate how the referential context influences the ability to use affect cues earlier in the utterance. Here, we found a differential use of speaker vocal affect whereby 5-year-olds' referential hypotheses were influenced by negative vocal affect cues in advance of the noun, but not by positive affect cues. Together, our findings reveal how 5-year-olds use a speaker's vocal affect to identify potential referents in different contextual situations and also suggest that children may be more attuned to negative vocal affect than positive vocal affect, particularly early in an utterance.
Automated Assessment of Child Vocalization Development Using LENA.
Richards, Jeffrey A; Xu, Dongxin; Gilkerson, Jill; Yapanel, Umit; Gray, Sharmistha; Paul, Terrance
2017-07-12
To produce a novel, efficient measure of children's expressive vocal development on the basis of automatic vocalization assessment (AVA), child vocalizations were automatically identified and extracted from audio recordings using Language Environment Analysis (LENA) System technology. Assessment was based on full-day audio recordings collected in a child's unrestricted, natural language environment. AVA estimates were derived using automatic speech recognition modeling techniques to categorize and quantify the sounds in child vocalizations (e.g., protophones and phonemes). These were expressed as phone and biphone frequencies, reduced to principal components, and inputted to age-based multiple linear regression models to predict independently collected criterion-expressive language scores. From these models, we generated vocal development AVA estimates as age-standardized scores and development age estimates. AVA estimates demonstrated strong statistical reliability and validity when compared with standard criterion expressive language assessments. Automated analysis of child vocalizations extracted from full-day recordings in natural settings offers a novel and efficient means to assess children's expressive vocal development. More research remains to identify specific mechanisms of operation.
Zhang, Zhaoyan
2015-01-01
Maintaining a small glottal opening across a large range of voice conditions is critical to normal voice production. This study investigated the effectiveness of vocal fold approximation and stiffening in regulating glottal opening and airflow during phonation, using a three-dimensional numerical model of phonation. The results showed that with increasing subglottal pressure the vocal folds were gradually pushed open, leading to increased mean glottal opening and flow rate. A small glottal opening and a mean glottal flow rate typical of human phonation can be maintained against increasing subglottal pressure by proportionally increasing the degree of vocal fold approximation for low to medium subglottal pressures and vocal fold stiffening at high subglottal pressures. Although sound intensity was primarily determined by the subglottal pressure, the results suggest that, to maintain small glottal opening as the sound intensity increases, one has to simultaneously tighten vocal fold approximation and/or stiffen the vocal folds, resulting in increased glottal resistance, vocal efficiency, and fundamental frequency. PMID:25698022
Contingent imitation increases verbal interaction in children with autism spectrum disorders.
Ishizuka, Yuka; Yamamoto, Jun-Ichi
2016-11-01
Several studies have suggested that contingent adult imitation increase nonverbal communication, such as attention and proximity to adults, in children with autism spectrum disorders. However, few studies have shown the effect of contingent imitation on verbal communication. This study examined whether children with autism were able to promote verbal interaction such as vocal imitation, vocalization, and vocal turn-taking via contingent imitation. We used an alternating treatment design composed of the conditions of contingent imitation and control for six children with autism (aged 33-63 months). For contingent imitation condition, adults imitated children's vocalization immediately. For control condition, adults did not imitate but gave a vocal response immediately. Results showed that in contingent imitation condition, all children increased the number of vocal imitations and vocal turn-takings compared with control condition. The number of vocalizations increased in both condition for all children. Overall, it is suggested that all children promote verbal interaction via contingent imitation. © The Author(s) 2016.
Influence of the ventricular folds on a voice source with specified vocal fold motion1
McGowan, Richard S.; Howe, Michael S.
2010-01-01
The unsteady drag on the vocal folds is the major source of sound during voiced speech. The drag force is caused by vortex shedding from the vocal folds. The influence of the ventricular folds (i.e., the “false” vocal folds that protrude into the vocal tract a short distance downstream of the glottis) on the drag and the voice source are examined in this paper by means of a theoretical model involving vortex sheets in a two-dimensional geometry. The effect of the ventricular folds on the output acoustic pressure is found to be small when the movement of the vocal folds is prescribed. It is argued that the effect remains small when fluid-structure interactions account for vocal fold movement. These conclusions can be justified mathematically when the characteristic time scale for change in the velocity of the glottal jet is large compared to the time it takes for a vortex disturbance to be convected through the vocal fold and ventricular fold region. PMID:20329852
Pasch, Bret; Abbasi, Mustafa Z; Wilson, Macey; Zhao, Daniel; Searle, Jeremy B; Webster, Michael S; Rice, Aaron N
2016-04-01
Nutritional stress can have lasting impacts on the development of traits involved in vocal production. Cross-fostering experiments are often used to examine the propensity for vocal learning in a variety of taxa, but few studies assess the influence of malnourishment that can occur as a byproduct of this technique. In this study, we reciprocally cross-fostered sister taxa of voluble grasshopper mice (genus Onychomys) to explore their propensity for vocal learning. Vocalizations of Onychomys leucogaster did not differ between control and cross-fostered animals, but cross-fostered Onychomys arenicola produced vocalizations that were higher in frequency in a direction away from tutors. These same animals exhibited a transient reduction in body mass early in development, indicative of malnutrition. Our findings simultaneously refute vocal learning and support the developmental stress hypothesis to highlight the importance of early ontogeny on the production of vocalizations later in life. Copyright © 2016 Elsevier Inc. All rights reserved.
Further evaluation of methods to identify matched stimulation.
Rapp, John T
2007-01-01
The effects of preferred stimulation on the vocal stereotypy of 2 individuals were evaluated in two experiments. The results of Experiment 1 showed that (a) the vocal stereotypy of both participants persisted in the absence of social consequences, (b) 1 participant manipulated toys that did and did not produce auditory stimulation, but only sound-producing toys decreased his vocal stereotypy, and (c) only noncontingent music decreased vocal stereotypy for the other participant, but sterotypy paradoxically increased when toys were presented with music. Using a three-component multiple schedule, the results of Experiment 2 showed that the vocal stereotypy of both participants remained below preintervention levels following the removal of auditory stimulation and that 1 participant's vocal stereotypy increased following the removal of contingent reprimands. These patterns suggest that auditory stimulation functioned as an abolishing operation for vocal stereotypy and reprimands functioned as an establishing operation for vocal stereotypy. Together, the two experiments provide a method for identifying alternative stimulation that may substitute for automatically reinforced behavior.
Surgery of adult bilateral vocal fold paralysis in adduction: history and trends.
Sapundzhiev, Nikolay; Lichtenberger, György; Eckel, Hans Edmund; Friedrich, Gerhard; Zenev, Ivan; Toohill, Robert J; Werner, Jochen Alfred
2008-12-01
Bilateral vocal fold paralysis (BVFP) in adduction is characterised by inspiratory dyspnea, due to the paramedian position of the vocal folds with narrowing of the airway at the glottic level. The condition is often life threatening and therefore requires surgical intervention to prevent acute asphyxiation or pulmonary consequences of chronic airway obstruction. Aside from corticosteroid administration and intubation, which are only temporary measures, the standard approach for improving respiration is to perform a tracheotomy. Over the past century, a vast majority of surgical interventions have been developed and applied to restore the patency of the airway and achieve decannulation. Surgeons can generally choose for every individual patient from various well-established treatment options, which have a predictable outcome. An overview of the surgical techniques for laryngeal airway enlargement in BVFP is presented. Included are operative techniques, which have found application in clinical practice, and only to a small extent in purely anatomic or animal studies. The focus is on two major groups of interventions--for temporary and for definitive glottic enlargement. The major types of interventions include the following: (1) resection of anatomical structures; (2) retailoring and displacing the existing structures, with minimal tissue removal; (3) displacing existing structures, without tissue resection; (4) restoration or substitution of the missing innervation of the laryngeal musculature. The single interventions of these four major types have always followed the development of the medical equipment and anaesthesia. At the beginning of the twentieth century, when medicine was unable to counteract surgical infection, endoscopic or extramucosal surgical techniques were dominant. In the 1950s, the microscopic endoscopic laryngeal surgery boomed. At the end of the twentieth century many of the classical endoscopic operations were performed either with the help of surgical lasers alone, or in combination with other interventions.
Acute Worsening of Tics on Varenicline.
Mittal, Shivam Om; Klassen, Bryan T; Hassan, Anhar; Bower, James H; Coon, Elizabeth A
The aim of this study was to report worsening of Tourette syndrome (TS) in 2 patients treated with varenicline. Abnormal dopaminergic signaling is likely involved in the pathophysiology of TS. Varenicline is a partial α4β2 nicotinic acetylcholine agonist that enhances dopamine release. Therefore, the use of varenicline may influence tics in patients with TS. We analyzed and described 2 case studies on patients with significant worsening of tics after treatment with varenicline. Patient 1 had motor tics in childhood, which completely resolved by the age of 20 years. At the age of 25 years, he started varenicline and stopped smoking. Within 2 weeks, he developed motor followed by vocal tics that persisted despite stopping varenicline and restarting smoking. The tics were complex, medically refractory, and caused severe disability at work and school (Yale Global Tic Severity Scale score, 86). Patient 2 developed motor and vocal tics in adolescence that persisted into her 20s and caused significant disability in association with psychiatric comorbidities. At the age of 31 years, she started varenicline to quit smoking, which led to a marked increase in tic frequency and severity. Varenicline was discontinued after 3 weeks with improvement to baseline tic severity (Yale Global Tic Severity Scale score, 94). Ultimately, both patients successfully underwent deep brain stimulation to bilateral centromedian/parafascicular complex thalamic nuclei for medically refractory TS. We report 2 patients with motor and/or vocal tics that had severe worsening of tics after varenicline use. This may be due to varenicline-induced increased striatal dopamine in conjunction with nicotine cessation, influencing dopamine receptor sensitivity in TS. Providers should be cautious in prescribing varenicline to patients with TS.
Voice disorders in residual paracoccidioidomycosis in upper airways and digestive tract.
da Costa, Ananda Dutra; Vargas, Amanda Pereira; Lucena, Marcia Mendonça; Ruas, Ana Cristina Nunes; Braga, Fernanda da Silva Santos; Bom-Braga, Mateus Pereira; Bom-Braga, Frederico Pereira; do Valle, Antonio Carlos Francesconi; Igreja, Ricardo Pereira; Valete-Rosalino, Cláudia Maria
Paracoccidioidomycosis (PCM) is a systemic mycosis of acute and chronic evolution, caused by species belonging to the genus Paracoccidioides. It is considered the most prevalent systemic endemic mycosis in Latin America, with cases in the tropical and subtropical regions. Residual PCM refers to the fibrotic scar sequelae resulting from the disease treatment which, when associated with collagen accumulation, leads to functional and anatomic alterations in the organs. The aim of this study was to evaluate the vocal function of patients with residual PCM in upper airways and digestive tract. We performed a cross-sectional study in 2010 in a cohort of 21 patients with residual PCM in upper airways and digestive tract. The average age was 49.48±9.1 years, and only two (9.5%) patients were female. The study was performed in the 1-113 month-period (median 27) after the end of drug treatment. Five (23.8%) patients had alterations in the larynx as a sequela of the disease. However, all patients had vocal changes in vocal auditory perceptual analysis by GRBASI scale. The computerized acoustic analysis using the software Vox Metria, showed that 11 patients (52.4%) presented alterations in jitter, 15 (71.4%) in shimmer, 8 (38.1%) in F0, 4 (19%) in glottal to noise excitation (GNE), 7 (33.3%) in the presence of noise and 12 (57.1%) in the presence of vibratory irregularity. The great frequency of alterations in residual PCM suggests that the patients in such phase could benefit from a multidisciplinary treatment, offering them integral monitoring of the disease, including speech rehabilitation after the PCM is healed. Copyright © 2017 Asociación Española de Micología. Publicado por Elsevier España, S.L.U. All rights reserved.
Shear properties of vocal fold mucosal tissues and their effect on vocal fold oscillation
NASA Astrophysics Data System (ADS)
Chan, Roger Wai Kai
Viscoelastic shear properties of vocal fold mucosal tissues and phonosurgical biomaterials were measured with a parallel-plate rotational rheometer. Elastic, viscous and damping properties were quantified as a function of frequency (0.01 Hz to 15 Hz) for human vocal fold mucosal tissues (N = 15), implantable biomaterials commonly used in the treatment of vocal fold paralysis (Teflon, gelatin, and collagen) (the non-mucosal group), and biomaterials currently or potentially useful in the treatment of vocal fold mucosal defects (adipose tissue or fat, hyaluronic acid, and fibronectin) (the mucosal group). It was found that intersubject differences as large as an order of magnitude were often observed for the shear properties of vocal fold mucosal tissues, part of which may be age- and gender-related. Shear properties of the non-mucosal group biomaterials were often much higher than those of the mucosal group biomaterials, which were relatively close to the shear properties of mucosal tissues. Viscoelastic and rheological modeling showed that shear properties of human vocal fold mucosa may be described by a quasi-linear viscoelastic theory and a statistical network theory, based upon which extrapolations to audio frequencies were possible. A theory of small-amplitude vocal fold oscillation was revisited to describe the effects of tissue shear properties on vocal fold oscillation and phonation threshold pressure, a measure of the 'ease' of phonation and an objective indication of vocal function. It was found that phonation threshold pressure is directly related to the viscous shear modulus or the 'effective damping modulus', a concept proposed to quantify the effective amount of damping in vocal fold oscillation. The mucosal group biomaterials were incorporated into the artificial vocal fold mucosa of a physical model in order to empirically assess their effects on phonation threshold pressure. Results showed that higher threshold pressures were consistently observed for higher concentrations of hyaluronic acid and for hyaluronic acid mixed with fibronectin, in correlation with their differences in viscous shear modulus and effective damping modulus. Implications for phonosurgery were discussed in terms of the choice of optimal biomaterials for the surgical management of vocal fold mucosal defects and lamina propria deficiencies.
Neural Correlates of Vocal Production and Motor Control in Human Heschl's Gyrus
Oya, Hiroyuki; Nourski, Kirill V.; Kawasaki, Hiroto; Larson, Charles R.; Brugge, John F.; Howard, Matthew A.; Greenlee, Jeremy D.W.
2016-01-01
The present study investigated how pitch frequency, a perceptually relevant aspect of periodicity in natural human vocalizations, is encoded in Heschl's gyrus (HG), and how this information may be used to influence vocal pitch motor control. We recorded local field potentials from multicontact depth electrodes implanted in HG of 14 neurosurgical epilepsy patients as they vocalized vowel sounds and received brief (200 ms) pitch perturbations at 100 Cents in their auditory feedback. Event-related band power responses to vocalizations showed sustained frequency following responses that tracked voice fundamental frequency (F0) and were significantly enhanced in posteromedial HG during speaking compared with when subjects listened to the playback of their own voice. In addition to frequency following responses, a transient response component within the high gamma frequency band (75–150 Hz) was identified. When this response followed the onset of vocalization, the magnitude of the response was the same for the speaking and playback conditions. In contrast, when this response followed a pitch shift, its magnitude was significantly enhanced during speaking compared with playback. We also observed that, in anterolateral HG, the power of high gamma responses to pitch shifts correlated with the magnitude of compensatory vocal responses. These findings demonstrate a functional parcellation of HG with neural activity that encodes pitch in natural human voice, distinguishes between self-generated and passively heard vocalizations, detects discrepancies between the intended and heard vocalization, and contains information about the resulting behavioral vocal compensations in response to auditory feedback pitch perturbations. SIGNIFICANCE STATEMENT The present study is a significant contribution to our understanding of sensor-motor mechanisms of vocal production and motor control. The findings demonstrate distinct functional parcellation of core and noncore areas within human auditory cortex on Heschl's gyrus that process natural human vocalizations and pitch perturbations in the auditory feedback. In addition, our data provide evidence for distinct roles of high gamma neural oscillations and frequency following responses for processing periodicity in human vocalizations during vocal production and motor control. PMID:26888939
Differential short-term memorisation for vocal and instrumental rhythms.
Klyn, Niall A M; Will, Udo; Cheong, Yong-Jeon; Allen, Erin T
2016-07-01
This study explores differential processing of vocal and instrumental rhythms in short-term memory with three decision (same/different judgments) and one reproduction experiment. In the first experiment, memory performance declined for delayed versus immediate recall, with accuracy for the two rhythms being affected differently: Musicians performed better than non-musicians on clapstick but not on vocal rhythms, and musicians were better on vocal rhythms in the same than in the different condition. Results for the second experiment showed that concurrent sub-vocal articulation and finger-tapping differentially affected the two rhythms and same/different decisions, but produced no evidence for articulatory loop involvement in delayed decision tasks. In a third experiment, which tested rhythm reproduction, concurrent sub-vocal articulation decreased memory performance, with a stronger deleterious effect on the reproduction of vocal than of clapstick rhythms. This suggests that the articulatory loop may only be involved in delayed reproduction not in decision tasks. The fourth experiment tested whether differences between filled and empty rhythms (continuous vs. discontinuous sounds) can explain the different memorisation of vocal and clapstick rhythms. Though significant differences were found for empty and filled instrumental rhythms, the differences between vocal and clapstick can only be explained by considering additional voice specific features.
Vocal Features of Song and Speech: Insights from Schoenberg's Pierrot Lunaire.
Merrill, Julia; Larrouy-Maestri, Pauline
2017-01-01
Similarities and differences between speech and song are often examined. However, the perceptual definition of these two types of vocalization is challenging. Indeed, the prototypical characteristics of speech or song support top-down processes, which influence listeners' perception of acoustic information. In order to examine vocal features associated with speaking and singing, we propose an innovative approach designed to facilitate bottom-up mechanisms in perceiving vocalizations by using material situated between speech and song: Speechsong. 25 participants were asked to evaluate 20 performances of a speechsong composition by Arnold Schoenberg, "Pierrot lunaire" op. 21 from 1912, evaluating 20 features of vocal-articulatory expression. Raters provided reliable judgments concerning the vocal features used by the performers and did not show strong appeal or specific expectations in reference to Schoenberg's piece. By examining the relationship between the vocal features and the impression of song or speech, the results confirm the importance of pitch (height, contour, range), but also point to the relevance of register, timbre, tension and faucal distance. Besides highlighting vocal features associated with speech and song, this study supports the relevance of the present approach of focusing on a theoretical middle category in order to better understand vocal expression in song and speech.
Effects of speech style, room acoustics, and vocal fatigue on vocal effort
Bottalico, Pasquale; Graetzer, Simone; Hunter, Eric J.
2016-01-01
Vocal effort is a physiological measure that accounts for changes in voice production as vocal loading increases. It has been quantified in terms of sound pressure level (SPL). This study investigates how vocal effort is affected by speaking style, room acoustics, and short-term vocal fatigue. Twenty subjects were recorded while reading a text at normal and loud volumes in anechoic, semi-reverberant, and reverberant rooms in the presence of classroom babble noise. The acoustics in each environment were modified by creating a strong first reflection in the talker position. After each task, the subjects answered questions addressing their perception of the vocal effort, comfort, control, and clarity of their own voice. Variation in SPL for each subject was measured per task. It was found that SPL and self-reported effort increased in the loud style and decreased when the reflective panels were present and when reverberation time increased. Self-reported comfort and control decreased in the loud style, while self-reported clarity increased when panels were present. The lowest magnitude of vocal fatigue was experienced in the semi-reverberant room. The results indicate that early reflections may be used to reduce vocal effort without modifying reverberation time. PMID:27250179
Integrating perspectives on vocal performance and consistency
Sakata, Jon T.; Vehrencamp, Sandra L.
2012-01-01
SUMMARY Recent experiments in divergent fields of birdsong have revealed that vocal performance is important for reproductive success and under active control by distinct neural circuits. Vocal consistency, the degree to which the spectral properties (e.g. dominant or fundamental frequency) of song elements are produced consistently from rendition to rendition, has been highlighted as a biologically important aspect of vocal performance. Here, we synthesize functional, developmental and mechanistic (neurophysiological) perspectives to generate an integrated understanding of this facet of vocal performance. Behavioral studies in the field and laboratory have found that vocal consistency is affected by social context, season and development, and, moreover, positively correlated with reproductive success. Mechanistic investigations have revealed a contribution of forebrain and basal ganglia circuits and sex steroid hormones to the control of vocal consistency. Across behavioral, developmental and mechanistic studies, a convergent theme regarding the importance of vocal practice in juvenile and adult songbirds emerges, providing a basis for linking these levels of analysis. By understanding vocal consistency at these levels, we gain an appreciation for the various dimensions of song control and plasticity and argue that genes regulating the function of basal ganglia circuits and sex steroid hormones could be sculpted by sexual selection. PMID:22189763