vocalization aid system: Topics by Science.gov

Sample records for vocalization aid system

Towards a computer-aided diagnosis system for vocal cord diseases.

PubMed

Verikas, A; Gelzinis, A; Bacauskiene, M; Uloza, V

2006-01-01

The objective of this work is to investigate a possibility of creating a computer-aided decision support system for an automated analysis of vocal cord images aiming to categorize diseases of vocal cords. The problem is treated as a pattern recognition task. To obtain a concise and informative representation of a vocal cord image, colour, texture, and geometrical features are used. The representation is further analyzed by a pattern classifier categorizing the image into healthy, diffuse, and nodular classes. The approach developed was tested on 785 vocal cord images collected at the Department of Otolaryngology, Kaunas University of Medicine, Lithuania. A correct classification rate of over 87% was obtained when categorizing a set of unseen images into the aforementioned three classes. Bearing in mind the high similarity of the decision classes, the results obtained are rather encouraging and the developed tools could be very helpful for assuring objective analysis of the images of laryngeal diseases.
Comprehensive and Development of Communication Aids as Aids to the Education of the Non-Vocal Severely Handicapped. Final Report. September 1, 1974 to March 1, 1976.

ERIC Educational Resources Information Center

Wisconsin Univ., Madison. Trace Center.

The report details the results of a research and development program on communication aids for the education of the non-vocal severely handicapped. Much of the information in the report was prepared so that it could stand alone, and is in the form of individual papers, reports, and a sourcebook. The documents are divided into three sections: (1)…
The vocal repertoire of Tibetan macaques (Macaca thibetana): A quantitative classification.

PubMed

Bernstein, Sofia K; Sheeran, Lori K; Wagner, R Steven; Li, Jin-Hua; Koda, Hiroki

2016-09-01

Vocal repertoires are basic and essential components for describing vocal communication in animals. Studying the entire suite of vocal signals aids investigations on the variation of acoustic structure across social contexts, comparisons on the complexity of communication systems across taxa, and in exploration of the evolutionary origins of species-specific vocalizations. Here, we describe the vocal repertoire of the largest species in the macaque genus, Macaca thibetana. We extracted thirty acoustic parameters from call recordings. Post hoc validation through quantitative analyses of the a priori repertoire classified eleven call types: coo, squawk, squeal, noisy scream, growl, bark, compound squeak, leap coo, weeping, modulated tonal scream, and pant. In comparison to the rest of the genus, Tibetan macaques uttered a wider array of vocalizations in the context of copulations. Previous reports did not include modulated tonal screams and pants during harassment of copulatory dyads. Furthermore, in comparison to the rest of the genus, Tibetan macaque females emit acoustically distinct copulation calls. The vocal repertoire of Tibetan macaques contributes to the literature on the emergence of species-specific calls in the genus Macaca with potential insights from social, reproductive, and ecological comparisons across species. Am. J. Primatol. 78:937-949, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Say It to Play It.

ERIC Educational Resources Information Center

Jarvis, William C.

1980-01-01

Author discusses the importance of vocalization in the development of basic musicianship. He cites studies demonstrating that vocal teaching strategies, such as singing tonal patterns, aids music reading, memory, and instrumental performance. (SJL)
21 CFR 874.1325 - Electroglottograph.

Code of Federal Regulations, 2014 CFR

2014-04-01

... the electrical impedance of the larynx to aid in assessing the degree of closure of the vocal cords, confirm larygeal diagnosis, aid behavioral treatment of voice disorders, and aid research concerning the...
21 CFR 874.1325 - Electroglottograph.

Code of Federal Regulations, 2012 CFR

2012-04-01

... the electrical impedance of the larynx to aid in assessing the degree of closure of the vocal cords, confirm larygeal diagnosis, aid behavioral treatment of voice disorders, and aid research concerning the...
21 CFR 874.1325 - Electroglottograph.

Code of Federal Regulations, 2011 CFR

2011-04-01

... the electrical impedance of the larynx to aid in assessing the degree of closure of the vocal cords, confirm larygeal diagnosis, aid behavioral treatment of voice disorders, and aid research concerning the...
21 CFR 874.1325 - Electroglottograph.

Code of Federal Regulations, 2013 CFR

2013-04-01

... the electrical impedance of the larynx to aid in assessing the degree of closure of the vocal cords, confirm larygeal diagnosis, aid behavioral treatment of voice disorders, and aid research concerning the...
21 CFR 874.1325 - Electroglottograph.

Code of Federal Regulations, 2010 CFR

2010-04-01

... the electrical impedance of the larynx to aid in assessing the degree of closure of the vocal cords, confirm larygeal diagnosis, aid behavioral treatment of voice disorders, and aid research concerning the...
Universal mechanisms of sound production and control in birds and mammals

PubMed Central

Elemans, C.P.H; Rasmussen, J.H.; Herbst, C.T.; Düring, D.N.; Zollinger, S.A.; Brumm, H.; Srivastava, K.; Svane, N.; Ding, M.; Larsen, O.N.; Sober, S.J.; Švec, J.G.

2015-01-01

As animals vocalize, their vocal organ transforms motor commands into vocalizations for social communication. In birds, the physical mechanisms by which vocalizations are produced and controlled remain unresolved because of the extreme difficulty in obtaining in vivo measurements. Here, we introduce an ex vivo preparation of the avian vocal organ that allows simultaneous high-speed imaging, muscle stimulation and kinematic and acoustic analyses to reveal the mechanisms of vocal production in birds across a wide range of taxa. Remarkably, we show that all species tested employ the myoelastic-aerodynamic (MEAD) mechanism, the same mechanism used to produce human speech. Furthermore, we show substantial redundancy in the control of key vocal parameters ex vivo, suggesting that in vivo vocalizations may also not be specified by unique motor commands. We propose that such motor redundancy can aid vocal learning and is common to MEAD sound production across birds and mammals, including humans. PMID:26612008
Universal mechanisms of sound production and control in birds and mammals.

PubMed

Elemans, C P H; Rasmussen, J H; Herbst, C T; Düring, D N; Zollinger, S A; Brumm, H; Srivastava, K; Svane, N; Ding, M; Larsen, O N; Sober, S J; Švec, J G

2015-11-27

As animals vocalize, their vocal organ transforms motor commands into vocalizations for social communication. In birds, the physical mechanisms by which vocalizations are produced and controlled remain unresolved because of the extreme difficulty in obtaining in vivo measurements. Here, we introduce an ex vivo preparation of the avian vocal organ that allows simultaneous high-speed imaging, muscle stimulation and kinematic and acoustic analyses to reveal the mechanisms of vocal production in birds across a wide range of taxa. Remarkably, we show that all species tested employ the myoelastic-aerodynamic (MEAD) mechanism, the same mechanism used to produce human speech. Furthermore, we show substantial redundancy in the control of key vocal parameters ex vivo, suggesting that in vivo vocalizations may also not be specified by unique motor commands. We propose that such motor redundancy can aid vocal learning and is common to MEAD sound production across birds and mammals, including humans.
A Kinect-Based Sign Language Hand Gesture Recognition System for Hearing- and Speech-Impaired: A Pilot Study of Pakistani Sign Language.

PubMed

Halim, Zahid; Abbas, Ghulam

2015-01-01

Sign language provides hearing and speech impaired individuals with an interface to communicate with other members of the society. Unfortunately, sign language is not understood by most of the common people. For this, a gadget based on image processing and pattern recognition can provide with a vital aid for detecting and translating sign language into a vocal language. This work presents a system for detecting and understanding the sign language gestures by a custom built software tool and later translating the gesture into a vocal language. For the purpose of recognizing a particular gesture, the system employs a Dynamic Time Warping (DTW) algorithm and an off-the-shelf software tool is employed for vocal language generation. Microsoft(®) Kinect is the primary tool used to capture video stream of a user. The proposed method is capable of successfully detecting gestures stored in the dictionary with an accuracy of 91%. The proposed system has the ability to define and add custom made gestures. Based on an experiment in which 10 individuals with impairments used the system to communicate with 5 people with no disability, 87% agreed that the system was useful.
Identifying stimuli that alter immediate and subsequent levels of vocal stereotypy: a further analysis of functionally matched stimulation.

PubMed

Lanovaz, Marc J; Fletcher, Sarah E; Rapp, John T

2009-09-01

We used a three-component multiple-schedule with a brief reversal design to evaluate the effects of structurally unmatched and matched stimuli on immediate and subsequent vocal stereotypy that was displayed by three children with autism spectrum disorders. For 2 of the 3 participants, access to matched stimuli, unmatched stimuli, and music decreased immediate levels of vocal stereotypy; however, with the exception of matched stimuli for one participant, none of the stimuli produced a clear abolishing operation for subsequent vocal stereotypy. That is, vocal stereotypy typically increased to baseline levels shortly after alternative stimulation was removed. Detection of motivating operations for each participant's vocal stereotypy was aided by the analysis of component distributions. The results are discussed in terms of immediate and subsequent effects of preferred stimuli on automatically reinforced problem behavior.
Compairing Picture Exchange and Voice Output Communication Aids in Young Children with Autism

ERIC Educational Resources Information Center

Lorah, Elizabeth R.

2012-01-01

The Center for Disease Control estimates that one in 88 births result in a diagnosis of autism (CDC, 2012). Of those individuals diagnosed with autism approximately 25-61% fail to develop vocal output capabilities (Weitxz, Dexter, & Moore, 1997). The use of Augmentative and Alternative Communication (AAC) systems, such as Picture Exchange (PE)…
Superpixel-based segmentation of glottal area from videolaryngoscopy images

NASA Astrophysics Data System (ADS)

Turkmen, H. Irem; Albayrak, Abdulkadir; Karsligil, M. Elif; Kocak, Ismail

2017-11-01

Segmentation of the glottal area with high accuracy is one of the major challenges for the development of systems for computer-aided diagnosis of vocal-fold disorders. We propose a hybrid model combining conventional methods with a superpixel-based segmentation approach. We first employed a superpixel algorithm to reveal the glottal area by eliminating the local variances of pixels caused by bleedings, blood vessels, and light reflections from mucosa. Then, the glottal area was detected by exploiting a seeded region-growing algorithm in a fully automatic manner. The experiments were conducted on videolaryngoscopy images obtained from both patients having pathologic vocal folds as well as healthy subjects. Finally, the proposed hybrid approach was compared with conventional region-growing and active-contour model-based glottal area segmentation algorithms. The performance of the proposed method was evaluated in terms of segmentation accuracy and elapsed time. The F-measure, true negative rate, and dice coefficients of the hybrid method were calculated as 82%, 93%, and 82%, respectively, which are superior to the state-of-art glottal-area segmentation methods. The proposed hybrid model achieved high success rates and robustness, making it suitable for developing a computer-aided diagnosis system that can be used in clinical routines.
Weight-Bearing MR Imaging as an Option in the Study of Gravitational Effects on the Vocal Tract of Untrained Subjects in Singing Phonation

PubMed Central

Traser, Louisa; Burdumy, Michael; Richter, Bernhard; Vicari, Marco; Echternach, Matthias

2014-01-01

Magnetic Resonance Imaging (MRI) of subjects in a supine position can be used to evaluate the configuration of the vocal tract during phonation. However, studies of speech phonation have shown that gravity can affect vocal tract shape and bias measurements. This is one of the reasons that MRI studies of singing phonation have used professionally trained singers as subjects, because they are generally considered to be less affected by the supine body position and environmental distractions. A study of untrained singers might not only contribute to the understanding of intuitive singing function and aid the evaluation of potential hazards for vocal health, but also provide insights into the effect of the supine position on singers in general. In the present study, an open configuration 0.25 T MRI system with a rotatable examination bed was used to study the effect of body position in 20 vocally untrained subjects. The subjects were asked to sing sustained tones in both supine and upright body positions on different pitches and in different register conditions. Morphometric measurements were taken from the acquired images of a sagittal slice depicting the vocal tract. The analysis concerning the vocal tract configuration in the two body positions revealed differences in 5 out of 10 measured articulatory parameters. In the upright position the jaw was less protruded, the uvula was elongated, the larynx more tilted and the tongue was positioned more to the front of the mouth than in the supine position. The findings presented are in agreement with several studies on gravitational effects in speech phonation, but contrast with the results of a previous study on professional singers of our group where only minor differences between upright and supine body posture were observed. The present study demonstrates that imaging of the vocal tract using weight-bearing MR imaging is a feasible tool for the study of sustained phonation in singing for vocally untrained subjects. PMID:25379885
Weight-bearing MR imaging as an option in the study of gravitational effects on the vocal tract of untrained subjects in singing phonation.

PubMed

Traser, Louisa; Burdumy, Michael; Richter, Bernhard; Vicari, Marco; Echternach, Matthias

2014-01-01

Magnetic Resonance Imaging (MRI) of subjects in a supine position can be used to evaluate the configuration of the vocal tract during phonation. However, studies of speech phonation have shown that gravity can affect vocal tract shape and bias measurements. This is one of the reasons that MRI studies of singing phonation have used professionally trained singers as subjects, because they are generally considered to be less affected by the supine body position and environmental distractions. A study of untrained singers might not only contribute to the understanding of intuitive singing function and aid the evaluation of potential hazards for vocal health, but also provide insights into the effect of the supine position on singers in general. In the present study, an open configuration 0.25 T MRI system with a rotatable examination bed was used to study the effect of body position in 20 vocally untrained subjects. The subjects were asked to sing sustained tones in both supine and upright body positions on different pitches and in different register conditions. Morphometric measurements were taken from the acquired images of a sagittal slice depicting the vocal tract. The analysis concerning the vocal tract configuration in the two body positions revealed differences in 5 out of 10 measured articulatory parameters. In the upright position the jaw was less protruded, the uvula was elongated, the larynx more tilted and the tongue was positioned more to the front of the mouth than in the supine position. The findings presented are in agreement with several studies on gravitational effects in speech phonation, but contrast with the results of a previous study on professional singers of our group where only minor differences between upright and supine body posture were observed. The present study demonstrates that imaging of the vocal tract using weight-bearing MR imaging is a feasible tool for the study of sustained phonation in singing for vocally untrained subjects.
Training Aids for Basic Combat Skills: A Procedure for Training-Aid Development

DTIC Science & Technology

2011-02-01

aids is a constant in training and education. Researchers in fields as varied as disability education, business, firefighting, vocal performance...the aids should (a) address tasks with which many Soldiers have difficulty mastering, (b) address tasks that are critical to basic combat training...candidates because of other practical considerations such as low cost, potential impact to critical IET tasks, etc
Computer-aided technique for automatic determination of the relationship between transglottal pressure change and voice fundamental frequency.

PubMed

Deguchi, Shinji; Kawashima, Kazutaka; Washio, Seiichi

2008-12-01

The effect of artificially altered transglottal pressures on the voice fundamental frequency (F0) is known to be associated with vocal fold stiffness. Its measurement, though useful as a potential diagnostic tool for noncontact assessment of vocal fold stiffness, often requires manual and painstaking determination of an unstable F0 of voice. Here, we provide a computer-aided technique that enables one to carry out the determination easily and accurately. Human subjects vocalized in accordance with a series of reference sounds from a speaker controlled by a computer. Transglottal pressures were altered by means of a valve embedded in a mouthpiece. Time-varying vocal F0 was extracted, without manual procedures, from a specific range of the voice spectrum determined on the basis of the controlled reference sounds. The validity of the proposed technique was assessed for 11 healthy subjects. Fluctuating voice F0 was tracked automatically during experiments, providing the relationship between transglottal pressure change and F0 on the computer. The proposed technique overcomes the difficulty in automatic determination of the voice F0, which tends to be transient both in normal voice and in some types of pathological voice.
Gender and vocal production mode discrimination using the high frequencies for speech and singing

PubMed Central

Monson, Brian B.; Lotto, Andrew J.; Story, Brad H.

2014-01-01

Humans routinely produce acoustical energy at frequencies above 6 kHz during vocalization, but this frequency range is often not represented in communication devices and speech perception research. Recent advancements toward high-definition (HD) voice and extended bandwidth hearing aids have increased the interest in the high frequencies. The potential perceptual information provided by high-frequency energy (HFE) is not well characterized. We found that humans can accomplish tasks of gender discrimination and vocal production mode discrimination (speech vs. singing) when presented with acoustic stimuli containing only HFE at both amplified and normal levels. Performance in these tasks was robust in the presence of low-frequency masking noise. No substantial learning effect was observed. Listeners also were able to identify the sung and spoken text (excerpts from “The Star-Spangled Banner”) with very few exposures. These results add to the increasing evidence that the high frequencies provide at least redundant information about the vocal signal, suggesting that its representation in communication devices (e.g., cell phones, hearing aids, and cochlear implants) and speech/voice synthesizers could improve these devices and benefit normal-hearing and hearing-impaired listeners. PMID:25400613

Intraoperative laryngeal electromyography in children with vocal fold immobility: results of a multicenter longitudinal study.

PubMed

Maturo, Stephen C; Braun, Nicole; Brown, David J; Chong, Peter Siao Tick; Kerschner, Joseph E; Hartnick, Christopher J

2011-12-01

To determine whether laryngeal electromyography (LEMG) can predict recurrent laryngeal nerve function return in children and whether LEMG can aid in the management of vocal fold immobility (VFI). Prospective case series. Tertiary pediatric aerodigestive centers. Twenty-five children aged 14 days to 7 years at the time of first LEMG (mean age, 21.4 months) with VFI who underwent flexible fiberoptic laryngeal examination, intraoperative LEMG of the thyroarytenoid muscles, and 12-month follow-up. To compare results of LEMG with flexible fiberoptic laryngeal examination in children with vocal fold paresis and to determine if LEMG can predict vocal fold return. In children who had a patent ductus arteriosus ligation, the LEMG data suggest that if there is no activity 6 months after injury, then the nerve is unlikely to regain function. In 3 of 3 children with central causes of VFI, normal LEMG findings predicted return of nerve function 2 to 7 months before vocal fold movement on fiberoptic examination. Finally, in 3 of 3 children with idiopathic VFI, LEMG predicted return within 2 to 14 months of vocal folds with normal findings. Intraoperative LEMG is a safe, easy-to-use method for determining the likelihood of recurrent laryngeal nerve function return in children who have undergone patent ductus arteriosus ligation, in children with centrally correctable lesions, and in children with idiopathic VFI. More work is needed in the area of pediatric LEMG, but it is possible that LEMG data can be used to aid in management strategies and provide families with more information to make better informed decisions regarding their child's care.
Monitoring of piglets' open field activity and choice behaviour during the replay of maternal vocalization: a comparison between Observer and PID technique.

PubMed

Puppe, B; Schön, P C; Wendland, K

1999-07-01

The paper presents a new system for the automatic monitoring of open field activity and choice behaviour of medium-sized animals. Passive infrared motion detectors (PID) were linked on-line via a digital I/O interface to a personal computer provided with self-developed analysis software based on LabVIEW (PID technique). The set up was used for testing 18 one-week-old piglets (Sus scrofa) for their approach to their mother's nursing vocalization replayed through loudspeakers. The results were validated by comparison with a conventional Observer technique, a computer-aided direct observation. In most of the cases, no differences were seen between the Observer and PID technique regarding the percentage of stay in previously defined open field segments, the locomotor open field activity, and the choice behaviour. The results revealed that piglets are clearly attracted by their mother's nursing vocalization. The monitoring system presented in this study is thus suitable for detailed behavioural investigations of individual acoustic recognition. In general, the PID technique is a useful tool for research into the behaviour of individual animals in a restricted open field which does not rely on subjective analysis by a human observer.
A Verbal Guidance System for Severe Disabled People

NASA Astrophysics Data System (ADS)

Redjati, Abdelghani; Bousbia-Salah, Mounir

2008-06-01

The recent development in rehabilitation technology allows to significantly broaden the range of possible applications that support handicapped people in their daily lives. This paper presents a moral and physical support for the disabled. It consists in the development of a verbal guidance system based on a speech recognition development kit `VD364'. This aid is intended to control a wheelchair and a manipulator arm for people with severe disabilities and who can speak. The study and design, conducted in the framework of this contribution have enabled an adaptation for a possible application and maximum exploitation of words that can be generated by a vocal module. The problem addressed is to allow a manipulator arm to compensate mechanically arm movements to give the handicapped satisfaction of his needs (for instance, drinking a glass of water). The objective is then to put forward a vocal command system that allows the arm to move in a well determined area to accomplish tasks that must be given by the user in addition to the displacement of the wheelchair.
Objective vocal quality in children using cochlear implants: a multiparameter approach.

PubMed

Baudonck, Nele; D'haeseleer, Evelien; Dhooge, Ingeborg; Van Lierde, Kristiane

2011-11-01

The purpose of this study was to determine the objective vocal quality in 36 prelingually deaf children using cochlear implant (CI) with a mean age of 9 years. An additional purpose was to compare the objective vocal quality of these 36 CI users with 25 age-matched children with prelingual severe hearing loss using conventional hearing aids (HAs) and 25 normal hearing (NH) children. The design for this cross-sectional study was a multigroup posttest-only design. The objective vocal quality was measured by means of the dysphonia severity index (DSI). Moreover, perceptual voice assessment using the GRBASI scale was performed. CI children have a vocal quality by means of the DSI of +1.8, corresponding with a DSI% of 68%, indicating a borderline vocal quality situated 2% above the limit of normality. The voice was perceptually characterized by the presence of a very slight grade of hoarseness, roughness, strained phonation, and higher pitch and intensity levels. No significant objective vocal quality differences were measured between the voices of the CI children, HA users, and NH children. According to the results, one aspect of the vocal approach in children with CI and using HAs must be focused on the improvement of the strained vocal characteristic and the use of a lower pitch and intensity level. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Iconicity can ground the creation of vocal symbols.

PubMed

Perlman, Marcus; Dale, Rick; Lupyan, Gary

2015-08-01

Studies of gestural communication systems find that they originate from spontaneously created iconic gestures. Yet, we know little about how people create vocal communication systems, and many have suggested that vocalizations do not afford iconicity beyond trivial instances of onomatopoeia. It is unknown whether people can generate vocal communication systems through a process of iconic creation similar to gestural systems. Here, we examine the creation and development of a rudimentary vocal symbol system in a laboratory setting. Pairs of participants generated novel vocalizations for 18 different meanings in an iterative 'vocal' charades communication game. The communicators quickly converged on stable vocalizations, and naive listeners could correctly infer their meanings in subsequent playback experiments. People's ability to guess the meanings of these novel vocalizations was predicted by how close the vocalization was to an iconic 'meaning template' we derived from the production data. These results strongly suggest that the meaningfulness of these vocalizations derived from iconicity. Our findings illuminate a mechanism by which iconicity can ground the creation of vocal symbols, analogous to the function of iconicity in gestural communication systems.
Actor vocal training for the habilitation of speech in adolescent users of cochlear implants.

PubMed

Holt, Colleen M; Dowell, Richard C

2011-01-01

This study examined changes to speech production in adolescents with hearing impairment following a period of actor vocal training. In addition to vocal parameters, the study also investigated changes to psychosocial factors such as confidence, self-esteem, and anxiety. The group were adolescent users of cochlear implants (mean age at commencement of training 15.9 years), with approximately half of the group wearing a hearing aid in the contralateral ear. The mean age of implantation of the group was 7.6 years and the participants displayed a range of speech production abilities. Evaluation of posttraining outcomes was performed via a combination of perceptual and acoustic analyses. Significant posttraining changes to vocal parameters included increased pitch range and variability and decreased speaking rate. From a psychosocial perspective, posttraining stress levels were significantly lowered. This study suggested that actor vocal training may benefit young people with hearing impairment, both in the way in which they use their voices and in the way in which they view themselves.
Iconicity can ground the creation of vocal symbols

PubMed Central

Perlman, Marcus; Dale, Rick; Lupyan, Gary

2015-01-01

Studies of gestural communication systems find that they originate from spontaneously created iconic gestures. Yet, we know little about how people create vocal communication systems, and many have suggested that vocalizations do not afford iconicity beyond trivial instances of onomatopoeia. It is unknown whether people can generate vocal communication systems through a process of iconic creation similar to gestural systems. Here, we examine the creation and development of a rudimentary vocal symbol system in a laboratory setting. Pairs of participants generated novel vocalizations for 18 different meanings in an iterative ‘vocal’ charades communication game. The communicators quickly converged on stable vocalizations, and naive listeners could correctly infer their meanings in subsequent playback experiments. People's ability to guess the meanings of these novel vocalizations was predicted by how close the vocalization was to an iconic ‘meaning template’ we derived from the production data. These results strongly suggest that the meaningfulness of these vocalizations derived from iconicity. Our findings illuminate a mechanism by which iconicity can ground the creation of vocal symbols, analogous to the function of iconicity in gestural communication systems. PMID:26361547
[The effect of laryngoscopic surgery combined with nasal endoscopic system for the treatment of vocal cords benign lesions].

PubMed

Wang, Weian; Lu, Rong

2013-06-01

To investigate the effect of laryngoscopic surgery combined with nasal endoscopic system for the treatment of vocal cords benign lesions. Fifty-two patients admitted to our department with vocal cords benign lesions (including vocal polyps, vocal nodules, vocal cord cyst) underwent laryngoscopic surgery combined with nasal endoscopic system. All patients were treated successfully once and for all without any significant postoperative complication. The laryngoscopic surgery combined with nasal endoscopic system is a safe, minimally invasive and simple method for the treatment of benign lesions of vocal cords.
Vocal learning in elephants: neural bases and adaptive context

PubMed Central

Stoeger, Angela S; Manger, Paul

2014-01-01

In the last decade clear evidence has accumulated that elephants are capable of vocal production learning. Examples of vocal imitation are documented in African (Loxodonta africana) and Asian (Elephas maximus) elephants, but little is known about the function of vocal learning within the natural communication systems of either species. We are also just starting to identify the neural basis of elephant vocalizations. The African elephant diencephalon and brainstem possess specializations related to aspects of neural information processing in the motor system (affecting the timing and learning of trunk movements) and the auditory and vocalization system. Comparative interdisciplinary (from behavioral to neuroanatomical) studies are strongly warranted to increase our understanding of both vocal learning and vocal behavior in elephants. PMID:25062469
Audio-vocal interaction in single neurons of the monkey ventrolateral prefrontal cortex.

PubMed

Hage, Steffen R; Nieder, Andreas

2015-05-06

Complex audio-vocal integration systems depend on a strong interconnection between the auditory and the vocal motor system. To gain cognitive control over audio-vocal interaction during vocal motor control, the PFC needs to be involved. Neurons in the ventrolateral PFC (VLPFC) have been shown to separately encode the sensory perceptions and motor production of vocalizations. It is unknown, however, whether single neurons in the PFC reflect audio-vocal interactions. We therefore recorded single-unit activity in the VLPFC of rhesus monkeys (Macaca mulatta) while they produced vocalizations on command or passively listened to monkey calls. We found that 12% of randomly selected neurons in VLPFC modulated their discharge rate in response to acoustic stimulation with species-specific calls. Almost three-fourths of these auditory neurons showed an additional modulation of their discharge rates either before and/or during the monkeys' motor production of vocalization. Based on these audio-vocal interactions, the VLPFC might be well positioned to combine higher order auditory processing with cognitive control of the vocal motor output. Such audio-vocal integration processes in the VLPFC might constitute a precursor for the evolution of complex learned audio-vocal integration systems, ultimately giving rise to human speech. Copyright © 2015 the authors 0270-6474/15/357030-11$15.00/0.
Vocal coordination and vocal imitation: a role for mirror neurons?

PubMed

Newman, John D

2014-04-01

Some birds and mammals have vocal communication systems in which coordination between individuals is important. Examples would include duetting or antiphonal calling in some birds and mammals, rapid exchanges of the same vocalization, and vocal exchanges between paired individuals and other nearby pairs. Mirror neurons may play a role in such systems but become functional only after experience.
Female mice ultrasonically interact with males during courtship displays

PubMed Central

Neunuebel, Joshua P; Taylor, Adam L; Arthur, Ben J; Egnor, SE Roian

2015-01-01

During courtship males attract females with elaborate behaviors. In mice, these displays include ultrasonic vocalizations. Ultrasonic courtship vocalizations were previously attributed to the courting male, despite evidence that both sexes produce virtually indistinguishable vocalizations. Because of this similarity, and the difficulty of assigning vocalizations to individuals, the vocal contribution of each individual during courtship is unknown. To address this question, we developed a microphone array system to localize vocalizations from socially interacting, individual adult mice. With this system, we show that female mice vocally interact with males during courtship. Males and females jointly increased their vocalization rates during chases. Furthermore, a female's participation in these vocal interactions may function as a signal that indicates a state of increased receptivity. Our results reveal a novel form of vocal communication during mouse courtship, and lay the groundwork for a mechanistic dissection of communication during social behavior. DOI: http://dx.doi.org/10.7554/eLife.06203.001 PMID:26020291
Exploring the anatomical encoding of voice with a mathematical model of the vocal system.

PubMed

Assaneo, M Florencia; Sitt, Jacobo; Varoquaux, Gael; Sigman, Mariano; Cohen, Laurent; Trevisan, Marcos A

2016-11-01

The faculty of language depends on the interplay between the production and perception of speech sounds. A relevant open question is whether the dimensions that organize voice perception in the brain are acoustical or depend on properties of the vocal system that produced it. One of the main empirical difficulties in answering this question is to generate sounds that vary along a continuum according to the anatomical properties the vocal apparatus that produced them. Here we use a mathematical model that offers the unique possibility of synthesizing vocal sounds by controlling a small set of anatomically based parameters. In a first stage the quality of the synthetic voice was evaluated. Using specific time traces for sub-glottal pressure and tension of the vocal folds, the synthetic voices generated perceptual responses, which are indistinguishable from those of real speech. The synthesizer was then used to investigate how the auditory cortex responds to the perception of voice depending on the anatomy of the vocal apparatus. Our fMRI results show that sounds are perceived as human vocalizations when produced by a vocal system that follows a simple relationship between the size of the vocal folds and the vocal tract. We found that these anatomical parameters encode the perceptual vocal identity (male, female, child) and show that the brain areas that respond to human speech also encode vocal identity. On the basis of these results, we propose that this low-dimensional model of the vocal system is capable of generating realistic voices and represents a novel tool to explore the voice perception with a precise control of the anatomical variables that generate speech. Furthermore, the model provides an explanation of how auditory cortices encode voices in terms of the anatomical parameters of the vocal system. Copyright © 2016 Elsevier Inc. All rights reserved.
Early experience shapes vocal neural coding and perception in songbirds

PubMed Central

Woolley, Sarah M. N.

2012-01-01

Songbirds, like humans, are highly accomplished vocal learners. The many parallels between speech and birdsong and conserved features of mammalian and avian auditory systems have led to the emergence of the songbird as a model system for studying the perceptual mechanisms of vocal communication. Laboratory research on songbirds allows the careful control of early life experience and high-resolution analysis of brain function during vocal learning, production and perception. Here, I review what songbird studies have revealed about the role of early experience in the development of vocal behavior, auditory perception and the processing of learned vocalizations by auditory neurons. The findings of these studies suggest general principles for how exposure to vocalizations during development and into adulthood influences the perception of learned vocal signals. PMID:22711657
Vocal development in a Waddington landscape

PubMed Central

Teramoto, Yayoi; Takahashi, Daniel Y; Holmes, Philip; Ghazanfar, Asif A

2017-01-01

Vocal development is the adaptive coordination of the vocal apparatus, muscles, the nervous system, and social interaction. Here, we use a quantitative framework based on optimal control theory and Waddington’s landscape metaphor to provide an integrated view of this process. With a biomechanical model of the marmoset monkey vocal apparatus and behavioral developmental data, we show that only the combination of the developing vocal tract, vocal apparatus muscles and nervous system can fully account for the patterns of vocal development. Together, these elements influence the shape of the monkeys’ vocal developmental landscape, tilting, rotating or shifting it in different ways. We can thus use this framework to make quantitative predictions regarding how interfering factors or experimental perturbations can change the landscape within a species, or to explain comparative differences in vocal development across species DOI: http://dx.doi.org/10.7554/eLife.20782.001 PMID:28092262
Multilevel Analysis in Analyzing Speech Data

ERIC Educational Resources Information Center

Guddattu, Vasudeva; Krishna, Y.

2011-01-01

The speech produced by human vocal tract is a complex acoustic signal, with diverse applications in phonetics, speech synthesis, automatic speech recognition, speaker identification, communication aids, speech pathology, speech perception, machine translation, hearing research, rehabilitation and assessment of communication disorders and many…
Vocal Parameters and Self-Perception in Individuals With Adductor Spasmodic Dysphonia.

PubMed

Rojas, Gleidy Vannesa E; Ricz, Hilton; Tumas, Vitor; Rodrigues, Guilherme R; Toscano, Patrícia; Aguiar-Ricz, Lílian

2017-05-01

The study aimed to compare and correlate perceptual-auditory analysis of vocal parameters and self-perception in individuals with adductor spasmodic dysphonia before and after the application of botulinum toxin. This is a prospective cohort study. Sixteen individuals with a diagnosis of adductor spasmodic dysphonia were submitted to the application of botulinum toxin in the thyroarytenoid muscle, to the recording of a voice signal, and to the Voice Handicap Index (VHI) questionnaire before the application and at two time points after application. Two judges performed a perceptual-auditory analysis of eight vocal parameters with the aid of the Praat software for the visualization of narrow band spectrography, pitch, and intensity contour. Comparison of the vocal parameters before toxin application and on the first return revealed a reduction of oscillation intensity (P = 0.002), voice breaks (P = 0.002), and vocal tremor (P = 0.002). The same parameters increased on the second return. The degree of severity, strained-strangled voice, roughness, breathiness, and asthenia was unchanged. The total score and the emotional domain score of the VHI were reduced on the first return. There was a moderate correlation between the degree of voice severity and the total VHI score before application and on the second return, and a weak correlation on the first return. Perceptual-auditory analysis and self-perception proved to be efficient in the recognition of vocal changes and of the vocal impact on individuals with adductor spasmodic dysphonia under treatment with botulinum toxin, permitting the quantitation of changes along time. Copyright © 2017. Published by Elsevier Inc.
Peripheral Mechanisms for Vocal Production in Birds--Differences and Similarities to Human Speech and Singing

ERIC Educational Resources Information Center

Riede, Tobias; Goller, Franz

2010-01-01

Song production in songbirds is a model system for studying learned vocal behavior. As in humans, bird phonation involves three main motor systems (respiration, vocal organ and vocal tract). The avian respiratory mechanism uses pressure regulation in air sacs to ventilate a rigid lung. In songbirds sound is generated with two independently…
Neuroendocrine control of seasonal plasticity in the auditory and vocal systems of fish

PubMed Central

Forlano, Paul M.; Sisneros, Joseph A.; Rohmann, Kevin N.; Bass, Andrew H.

2014-01-01

Seasonal changes in reproductive-related vocal behavior are widespread among fishes. This review highlights recent studies of the vocal plainfin midshipman fish, Porichthys notatus, a neuroethological model system used for the past two decades to explore neural and endocrine mechanisms of vocal-acoustic social behaviors shared with tetrapods. Integrative approaches combining behavior, neurophysiology, neuropharmacology, neuroanatomy, and gene expression methodologies have taken advantage of simple, stereotyped and easily quantifiable behaviors controlled by discrete neural networks in this model system to enable discoveries such as the first demonstration of adaptive seasonal plasticity in the auditory periphery of a vertebrate as well as rapid steroid and neuropeptide effects on vocal physiology and behavior. This simple model system has now revealed cellular and molecular mechanisms underlying seasonal and steroid-driven auditory and vocal plasticity in the vertebrate brain. PMID:25168757
Breathing and Vocal Control: The Respiratory System as both a Driver and Target of Telencephalic Vocal Motor Circuits in Songbirds

PubMed Central

Schmidt, Marc F.; McLean, Judith; Goller, Franz

2011-01-01

The production of vocalizations is intimately linked to the respiratory system. Despite our understanding of neural circuits that generate normal respiratory patterns, very little is understood regarding how these ponto-medullary circuits become engaged during vocal production. Songbirds offer a potentially powerful model system for addressing this relationship. Songs dramatically alter the respiratory pattern in ways that are often highly predictable and songbirds have a specialized telencephalic vocal motor circuit that provides massive innervation to a brainstem respiratory network that shares many similarities with its mammalian counterpart. In this review, we highlight interactions between the song motor circuit and the respiratory system, describing how both systems likely interact to produce the complex respiratory patterns that are observed during vocalization. We also discuss how the respiratory system, through its bilateral bottom-up projections to thalamus, might play a key role in sending precisely timed signals that synchronize premotor activity in both hemispheres. PMID:21984733

Of Mice, Birds, and Men: The Mouse Ultrasonic Song System Has Some Features Similar to Humans and Song-Learning Birds

PubMed Central

Arriaga, Gustavo; Zhou, Eric P.; Jarvis, Erich D.

2012-01-01

Humans and song-learning birds communicate acoustically using learned vocalizations. The characteristic features of this social communication behavior include vocal control by forebrain motor areas, a direct cortical projection to brainstem vocal motor neurons, and dependence on auditory feedback to develop and maintain learned vocalizations. These features have so far not been found in closely related primate and avian species that do not learn vocalizations. Male mice produce courtship ultrasonic vocalizations with acoustic features similar to songs of song-learning birds. However, it is assumed that mice lack a forebrain system for vocal modification and that their ultrasonic vocalizations are innate. Here we investigated the mouse song system and discovered that it includes a motor cortex region active during singing, that projects directly to brainstem vocal motor neurons and is necessary for keeping song more stereotyped and on pitch. We also discovered that male mice depend on auditory feedback to maintain some ultrasonic song features, and that sub-strains with differences in their songs can match each other's pitch when cross-housed under competitive social conditions. We conclude that male mice have some limited vocal modification abilities with at least some neuroanatomical features thought to be unique to humans and song-learning birds. To explain our findings, we propose a continuum hypothesis of vocal learning. PMID:23071596
A Computational Study of Vocal Fold Dehydration During Phonation.

PubMed

Wu, Liang; Zhang, Zhaoyan

2017-12-01

While vocal fold dehydration is often considered an important factor contributing to vocal fatigue, it still remains unclear whether vocal fold vibration alone is able to induce severe dehydration that has a noticeable effect on phonation and perceived vocal effort. A three-dimensional model was developed to investigate vocal fold systemic dehydration and surface dehydration during phonation. Based on the linear poroelastic theory, the model considered water resupply from blood vessels through the lateral boundary, water movement within the vocal folds, water exchange between the vocal folds and the surface liquid layer through the epithelium, and surface fluid accumulation and discharge to the glottal airway. Parametric studies were conducted to investigate water loss within the vocal folds and from the surface after a 5-min sustained phonation under different permeability and vibration conditions. The results showed that the dehydration generally increased with increasing vibration amplitude, increasing epithelial permeability, and reduced water resupply. With adequate water resupply, a large-amplitude vibration can induce an overall systemic dehydration as high as 3%. The distribution of water loss within the vocal folds was non-uniform, and a local dehydration higher than 5% was observed even under conditions of a low overall systemic dehydration (<1%). Such high level of water loss may severely affect tissue properties, muscular functions, and phonations characteristics. In contrast, water loss of the surface liquid layer was generally an order of magnitude higher than water loss inside the vocal folds, indicating that the surface dehydration level is likely not a good indicator of the systemic dehydration.
A theoretical study of F0-F1 interaction with application to resonant speaking and singing voice.

PubMed

Titze, Ingo R

2004-09-01

An interactive source-filter system, consisting of a three-mass body-cover model of the vocal folds and a wave reflection model of the vocal tract, was used to test the dependence of vocal fold vibration on the vocal tract. The degree of interaction is governed by the epilarynx tube, which raises the vocal tract impedance to match the impedance of the glottis. The key component of the impedance is inertive reactance. Whenever there is inertive reactance, the vocal tract assists the vocal folds in vibration. The amplitude of vibration and the glottal flow can more than double, and the oral radiated power can increase up to 10 dB. As F0 approaches F1, the first formant frequency, the interactive source-filter system loses its advantage (because inertive reactance changes to compliant reactance) and the noninteractive system produces greater vocal output. Thus, from a voice training and control standpoint, there may be reasons to operate the system in either interactive and noninteractive modes. The harmonics 2F0 and 3F0 can also benefit from being positioned slightly below F1.
Electrophysiologic monitoring characteristics of the recurrent laryngeal nerve preoperatively paralyzed or invaded with malignancy.

PubMed

Kamani, Dipti; Darr, E Ashlie; Randolph, Gregory W

2013-11-01

To elucidate electrophysiologic responses of the recurrent laryngeal nerves that were preoperatively paralyzed or invaded by malignancy and to use this information as an added functional parameter for intraoperative management of recurrent laryngeal nerves with malignant invasion. Case series with chart review. Academic, tertiary care center. All consecutive neck surgeries with nerve monitoring performed by senior author (GWR) between December 1995 and January 2007 were reviewed after obtaining Institutional Review Board approval from Massachusetts Eye and Ear Infirmary Human Subjects Committee and the Partners Human Research Committee. Electrophysiologic parameters in all cases with preoperative vocal cord paralysis/paresis, and the recurrent laryngeal nerve invasion by cancer, were studied. Of the 1138 surgeries performed, 25 patients (2.1%) had preoperative vocal cord dysfunction. In patients with preoperative vocal cord dysfunction, recognizable recurrent laryngeal nerve electrophysiologic activity was preserved in over 50% of cases. Malignant invasion of the recurrent laryngeal nerve was found in 22 patients (1.9%). Neural invasion of the recurrent laryngeal nerve was associated with preoperative vocal cord paralysis in only 50% of these patients. In nerves invaded by malignancy, 60% maintained recognizable electrophysiologic activity, which was more commonly present and robust when vocal cord function was preserved. Knowledge of electrophysiologic intraoperative neural monitoring provides additional functional information and, along with preoperative vocal cord function information, aids in constructing decision algorithms regarding intraoperative management of the recurrent laryngeal nerve, in prognosticating postoperative outcomes, and in patient counseling regarding postoperative expectations.
Fluid-Structure Interactions as Flow Propagates Tangentially Over a Flexible Plate with Application to Voiced Speech Production

NASA Astrophysics Data System (ADS)

Westervelt, Andrea; Erath, Byron

2013-11-01

Voiced speech is produced by fluid-structure interactions that drive vocal fold motion. Viscous flow features influence the pressure in the gap between the vocal folds (i.e. glottis), thereby altering vocal fold dynamics and the sound that is produced. During the closing phases of the phonatory cycle, vortices form as a result of flow separation as air passes through the divergent glottis. It is hypothesized that the reduced pressure within a vortex core will alter the pressure distribution along the vocal fold surface, thereby aiding in vocal fold closure. The objective of this study is to determine the impact of intraglottal vortices on the fluid-structure interactions of voiced speech by investigating how the dynamics of a flexible plate are influenced by a vortex ring passing tangentially over it. A flexible plate, which models the medial vocal fold surface, is placed in a water-filled tank and positioned parallel to the exit of a vortex generator. The physical parameters of plate stiffness and vortex circulation are scaled with physiological values. As vortices propagate over the plate, particle image velocimetry measurements are captured to analyze the energy exchange between the fluid and flexible plate. The investigations are performed over a range of vortex formation numbers, and lateral displacements of the plate from the centerline of the vortex trajectory. Observations show plate oscillations with displacements directly correlated with the vortex core location.
Vocalization Subsystem Responses to a Temporarily Induced Unilateral Vocal Fold Paralysis

ERIC Educational Resources Information Center

Croake, Daniel J.; Andreatta, Richard D.; Stemple, Joseph C.

2018-01-01

Purpose: The purpose of this study is to quantify the interactions of the 3 vocalization subsystems of respiration, phonation, and resonance before, during, and after a perturbation to the larynx (temporarily induced unilateral vocal fold paralysis) in 10 vocally healthy participants. Using dynamic systems theory as a guide, we hypothesized that…
[Temperament of children with vocal fold nodules].

PubMed

Wei, Youhua; Wang, Zhinan; Xu, Zhongqiang; Chen, Ping; Hao, Lili

2009-11-01

To examine the temperament of children with vocal fold nodules. To compare the temperament dimension and temperamental types of 42 children with vocal fold nodules with 46 vocally normal children, using Chinese children's Temperament Problem Screening system (CCTPSs). The children with vocal fold nodules differed significantly from the comparison group in their temperament dimension's adaptability, intensity of reaction, mood value, persistency and temperamental types. There are more difficult and slow-to-warm-up children in patients with vocal fold nodules than vocally normal children.
[Persistent Bilateral Vocal Cord Paralysis after General Anesthesia in a Patient with Multiple System Atrophy: A Case Report].

PubMed

Konishi, Hanako; Mizota, Toshiyuki; Fukuda, Kazuhiko

2015-06-01

We report a case of persistent bilateral vocal cord paralysis which developed after spine surgery under general anesthesia in a patient with multiple system atrophy. A 64-year-old woman was scheduled to receive spinal fusion surgery for kyphoscoliosis. She did not have apparent symptoms of vocal cord paralysis such as hoarseness before surgery. The surgery was performed smoothly under general anesthesia with endotracheal intubation. However, immediately after extubation, the patient developed severe upper airway obstruction and was re-intubated. Fiberoptic laryngoscopy revealed bilateral vocal cord abductor paralysis. Vocal cord paralysis did not improve and she received tracheotomy on the 12th day after surgery. She also showed symptoms of autonomic nervous system dysfunction and cerebellar ataxia, and was diagnosed as multiple system atrophy on postoperative day 64. We discuss differential diagnosis of persistent vocal cord paralysis after general anesthesia, and anesthetic management of a patient with multiple system atrophy.
Interactive Voice Technology: Variations in the Vocal Utterances of Speakers Performing a Stress-Inducing Task,

DTIC Science & Technology

1983-08-16

34. " .. ,,,,.-j.Aid-is.. ;,,i . -i.t . "’" ’, V ,1 5- 4. 3- kHz 2-’ r 1 r s ’.:’ BOGEY 5D 0 S BOGEY 12D Figure 10. Spectrograms of two versions of the word...MF5852801B 0001 Reviewed by Approved and Released by Ashton Graybiel, M.D. Captain W. M. Houk , MC, USN Chief Scientific Advisor Commanding Officer 16 August...incorporating knowledge about these changes into speech recognition systems. i A J- I. . S , .4, ... ..’-° -- -iii l - - .- - i- . .. " •- - i ,f , i
Using image processing technology and mathematical algorithm in the automatic selection of vocal cord opening and closing images from the larynx endoscopy video.

PubMed

Kuo, Chung-Feng Jeffrey; Chu, Yueng-Hsiang; Wang, Po-Chun; Lai, Chun-Yu; Chu, Wen-Lin; Leu, Yi-Shing; Wang, Hsing-Won

2013-12-01

The human larynx is an important organ for voice production and respiratory mechanisms. The vocal cord is approximated for voice production and open for breathing. The videolaryngoscope is widely used for vocal cord examination. At present, physicians usually diagnose vocal cord diseases by manually selecting the image of the vocal cord opening to the largest extent (abduction), thus maximally exposing the vocal cord lesion. On the other hand, the severity of diseases such as vocal palsy, atrophic vocal cord is largely dependent on the vocal cord closing to the smallest extent (adduction). Therefore, diseases can be assessed by the image of the vocal cord opening to the largest extent, and the seriousness of breathy voice is closely correlated to the gap between vocal cords when closing to the smallest extent. The aim of the study was to design an automatic vocal cord image selection system to improve the conventional selection process by physicians and enhance diagnosis efficiency. Also, due to the unwanted fuzzy images resulting from examination process caused by human factors as well as the non-vocal cord images, texture analysis is added in this study to measure image entropy to establish a screening and elimination system to effectively enhance the accuracy of selecting the image of the vocal cord closing to the smallest extent. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Core and Shell Song Systems Unique to the Parrot Brain

PubMed Central

Chakraborty, Mukta; Walløe, Solveig; Nedergaard, Signe; Fridel, Emma E.; Dabelsteen, Torben; Pakkenberg, Bente; Bertelsen, Mads F.; Dorrestein, Gerry M.; Brauth, Steven E.; Durand, Sarah E.; Jarvis, Erich D.

2015-01-01

The ability to imitate complex sounds is rare, and among birds has been found only in parrots, songbirds, and hummingbirds. Parrots exhibit the most advanced vocal mimicry among non-human animals. A few studies have noted differences in connectivity, brain position and shape in the vocal learning systems of parrots relative to songbirds and hummingbirds. However, only one parrot species, the budgerigar, has been examined and no differences in the presence of song system structures were found with other avian vocal learners. Motivated by questions of whether there are important differences in the vocal systems of parrots relative to other vocal learners, we used specialized constitutive gene expression, singing-driven gene expression, and neural connectivity tracing experiments to further characterize the song system of budgerigars and/or other parrots. We found that the parrot brain uniquely contains a song system within a song system. The parrot “core” song system is similar to the song systems of songbirds and hummingbirds, whereas the “shell” song system is unique to parrots. The core with only rudimentary shell regions were found in the New Zealand kea, representing one of the only living species at a basal divergence with all other parrots, implying that parrots evolved vocal learning systems at least 29 million years ago. Relative size differences in the core and shell regions occur among species, which we suggest could be related to species differences in vocal and cognitive abilities. PMID:26107173
A robotic voice simulator and the interactive training for hearing-impaired people.

PubMed

Sawada, Hideyuki; Kitani, Mitsuki; Hayashi, Yasumori

2008-01-01

A talking and singing robot which adaptively learns the vocalization skill by means of an auditory feedback learning algorithm is being developed. The robot consists of motor-controlled vocal organs such as vocal cords, a vocal tract and a nasal cavity to generate a natural voice imitating a human vocalization. In this study, the robot is applied to the training system of speech articulation for the hearing-impaired, because the robot is able to reproduce their vocalization and to teach them how it is to be improved to generate clear speech. The paper briefly introduces the mechanical construction of the robot and how it autonomously acquires the vocalization skill in the auditory feedback learning by listening to human speech. Then the training system is described, together with the evaluation of the speech training by auditory impaired people.
A Brain for Speech. Evolutionary Continuity in Primate and Human Auditory-Vocal Processing

PubMed Central

Aboitiz, Francisco

2018-01-01

In this review article, I propose a continuous evolution from the auditory-vocal apparatus and its mechanisms of neural control in non-human primates, to the peripheral organs and the neural control of human speech. Although there is an overall conservatism both in peripheral systems and in central neural circuits, a few changes were critical for the expansion of vocal plasticity and the elaboration of proto-speech in early humans. Two of the most relevant changes were the acquisition of direct cortical control of the vocal fold musculature and the consolidation of an auditory-vocal articulatory circuit, encompassing auditory areas in the temporoparietal junction and prefrontal and motor areas in the frontal cortex. This articulatory loop, also referred to as the phonological loop, enhanced vocal working memory capacity, enabling early humans to learn increasingly complex utterances. The auditory-vocal circuit became progressively coupled to multimodal systems conveying information about objects and events, which gradually led to the acquisition of modern speech. Gestural communication accompanies the development of vocal communication since very early in human evolution, and although both systems co-evolved tightly in the beginning, at some point speech became the main channel of communication. PMID:29636657
Cooperative vocal control in marmoset monkeys via vocal feedback

PubMed Central

Choi, Jung Yoon; Takahashi, Daniel Y.

2015-01-01

Humans adjust speech amplitude as a function of distance from a listener; we do so in a manner that would compensate for such distance. This ability is presumed to be the product of high-level sociocognitive skills. Nonhuman primates are thought to lack such socially related flexibility in vocal production. Using predictions from a simple arousal-based model whereby vocal feedback from a conspecific modulates the drive to produce a vocalization, we tested whether another primate exhibits this type of cooperative vocal control. We conducted a playback experiment with marmoset monkeys and simulated “far-away” and “nearby” conspecifics using contact calls that differed in sound intensity. We found that marmoset monkeys increased the amplitude of their contact calls and produced such calls with shorter response latencies toward more distant conspecifics. The same was not true in response to changing levels of background noise. To account for how simulated conspecific distance can change both the amplitude and timing of vocal responses, we developed a model that incorporates dynamic interactions between the auditory system and limbic “drive” systems. Overall, our data show that, like humans, marmoset monkeys cooperatively control the acoustics of their vocalizations according to changes in listener distance, increasing the likelihood that a conspecific will hear their call. However, we propose that such cooperative vocal control is a system property that does not necessitate any particularly advanced sociocognitive skill. At least in marmosets, this vocal control can be parsimoniously explained by the regulation of arousal states across two interacting individuals via vocal feedback. PMID:25925323
Automated Assessment of Child Vocalization Development Using LENA

ERIC Educational Resources Information Center

Richards, Jeffrey A.; Xu, Dongxin; Gilkerson, Jill; Yapanel, Umit; Gray, Sharmistha; Paul, Terrance

2017-01-01

Purpose: To produce a novel, efficient measure of children's expressive vocal development on the basis of automatic vocalization assessment (AVA), child vocalizations were automatically identified and extracted from audio recordings using Language Environment Analysis (LENA) System technology. Method: Assessment was based on full-day audio…
Proton density-weighted laryngeal magnetic resonance imaging in systemically dehydrated rats.

PubMed

Oleson, Steven; Lu, Kun-Han; Liu, Zhongming; Durkes, Abigail C; Sivasankar, M Preeti

2018-06-01

Dehydrated vocal folds are inefficient sound generators. Although systemic dehydration of the body is believed to induce vocal fold dehydration, this causative relationship has not been demonstrated in vivo. Here we investigate the feasibility of using in vivo proton density (PD)-weighted magnetic resonance imaging (MRI) to demonstrate hydration changes in vocal fold tissue following systemic dehydration in rats. Animal study. Sprague-Dawley rats (n = 10) were imaged at baseline and following a 10% reduction in body weight secondary to withholding water. In vivo, high-field (7 T), PD-weighted MRI was used to successfully resolve vocal fold and salivary gland tissue structures. Normalized signal intensities within the vocal fold decreased postdehydration by an average of 11.38% ± 3.95% (mean ± standard error of the mean [SEM], P = .0098) as compared to predehydration levels. The salivary glands experienced a similar decrease in normalized signal intensity by an average of 10.74% ± 4.14% (mean ± SEM, P = .0195) following dehydration. The correlation coefficient (percent change from dehydration) between vocal folds and salivary glands was 0.7145 (P = .0202). Ten percent systemic dehydration induced vocal fold dehydration as assessed by PD-weighted MRI. Changes in the hydration state of vocal fold tissue were highly correlated with that of the salivary glands in dehydrated rats in vivo. These preliminary findings demonstrate the feasibility of using PD-weighted MRI to quantify hydration states of the vocal folds and lay the foundation for further studies that explore more routine and realistic magnitudes of systemic dehydration and rehydration. NA. Laryngoscope, 128:E222-E227, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
Effects of Systemic Hydration on Vocal Acoustics of 18- to 35-Year-Old Females

ERIC Educational Resources Information Center

Franca, Maria Claudia; Simpson, Kenneth O.

2012-01-01

The influence of body hydration and vocal acoustics was investigated in this study. Effects of two levels of hydration on objective measures of vocal acoustics were explored. In an attempt to reduce variability in the degree of systemic hydration and to induce a state of systemic dehydration, participants were instructed to refrain from ingestion…
Mouse vocal communication system: are ultrasounds learned or innate?

PubMed Central

Arriaga, Gustavo; Jarvis, Erich D.

2013-01-01

Mouse ultrasonic vocalizations (USVs) are often used as behavioral readouts of internal states, to measure effects of social and pharmacological manipulations, and for behavioral phenotyping of mouse models for neuropsychiatric and neurodegenerative disorders. However, little is known about the neurobiological mechanisms of rodent USV production. Here we discuss the available data to assess whether male mouse song behavior and the supporting brain circuits resemble those of known vocal non-learning or vocal learning species. Recent neurobiology studies have demonstrated that the mouse USV brain system includes motor cortex and striatal regions, and that the vocal motor cortex sends a direct sparse projection to the brainstem vocal motor nucleus ambiguous, a projection thought be unique to humans among mammals. Recent behavioral studies have reported opposing conclusions on mouse vocal plasticity, including vocal ontogeny changes in USVs over early development that might not be explained by innate maturation processes, evidence for and against a role for auditory feedback in developing and maintaining normal mouse USVs, and evidence for and against limited vocal imitation of song pitch. To reconcile these findings, we suggest that the trait of vocal learning may not be dichotomous but encompass a broad set of behavioral and neural traits we call the continuum hypothesis, and that mice possess some of the traits associated with a capacity for limited vocal learning. PMID:23295209
Determination of West Indian manatee vocalization levels and rate

NASA Astrophysics Data System (ADS)

Phillips, Richard; Niezrecki, Christopher; Beusse, Diedrich

2004-05-01

The West Indian manatee (Trichechus manatus latirostris) has become endangered partly because of a growing number of collisions with boats. A system to warn boaters of the presence of manatees, based upon the vocalizations of manatees, could potentially reduce these boat collisions. The feasibility of this warning system would depend mainly upon two factors: the rate at which manatees vocalize and the distance in which the manatees can be detected. The research presented in this paper verifies that the average vocalization rate of the West Indian manatee is approximately one to two times per 5-min period. Several different manatee vocalization recordings were broadcast to the manatees and their response was observed. It was found that during the broadcast periods, the vocalization rates for the manatees increased substantially when compared with the average vocalization rates during nonbroadcast periods. An array of four hydrophones was used while recording the manatees. This allowed for position estimation techniques to be used to determine the location of the vocalizing manatee. Knowing the position of the manatee, the source level was determined and it was found that the mean source level of the manatee vocalizations is approximately 112 dB (re:1 Pa) @ 1 m.
Determination of West Indian manatee vocalization levels and rate

NASA Astrophysics Data System (ADS)

Phillips, Richard; Niezrecki, Christopher; Beusse, Diedrich O.

2004-01-01

The West Indian manatee (Trichechus manatus latirostris) has become endangered partly because of a growing number of collisions with boats. A system to warn boaters of the presence of manatees, based upon the vocalizations of manatees, could potentially reduce these boat collisions. The feasibility of this warning system would depend mainly upon two factors: the rate at which manatees vocalize and the distance in which the manatees can be detected. The research presented in this paper verifies that the average vocalization rate of the West Indian manatee is approximately one to two times per 5-min period. Several different manatee vocalization recordings were broadcast to the manatees and their response was observed. It was found that during the broadcast periods, the vocalization rates for the manatees increased substantially when compared with the average vocalization rates during nonbroadcast periods. An array of four hydrophones was used while recording the manatees. This allowed for position estimation techniques to be used to determine the location of the vocalizing manatee. Knowing the position of the manatee, the source level was determined and it was found that the mean source level of the manatee vocalizations is approximately 112 dB (re 1 μPa) @ 1 m.

Acoustic properties of vocal singing in prelingually-deafened children with cochlear implants or hearing aids.

PubMed

Mao, Yitao; Zhang, Mengchao; Nutter, Heather; Zhang, Yijing; Zhou, Qixin; Liu, Qiaoyun; Wu, Weijing; Xie, Dinghua; Xu, Li

2013-11-01

The purpose of the present study was to investigate vocal singing performance of hearing-impaired children with cochlear implants (CI) and hearing aids (HA) as well as to evaluate the relationship between demographic factors of those hearing-impaired children and their singing ability. Thirty-seven prelingually-deafened children with CIs and 31 prelingually-deafened children with HAs, and 37 normal-hearing (NH) children participated in the study. The fundamental frequencies (F0) of each note in the recorded songs were extracted and the duration of each sung note was measured. Five metrics were used to evaluate the pitch-related and rhythm-based aspects of singing accuracy. Children with CIs and HAs showed significantly poorer performance in either the pitch-based assessments or the rhythm-based measure than the NH children. No significant differences were seen between the CI and HA groups in all of these measures except for the mean deviation of the pitch intervals. For both hearing-impaired groups, length of device use was significantly correlated with singing accuracy. There is a marked deficit in vocal singing ability either in pitch or rhythm accuracy in a majority of prelingually-deafened children who have received CIs or fitted with HAs. Although an increased length of device use might facilitate singing performance to some extent, the chance for the hearing-impaired children fitted with either HAs or CIs to reach high proficiency in singing is quite slim. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Sound duration as a perceptual cue influencing vocal behavior of male bullfrogs

NASA Astrophysics Data System (ADS)

Simmons, Andrea M.

2002-05-01

Female frogs of several species use the temporal cue of sound duration to aid in mate choice. Little is known, however, about the sensitivity of male frogs to this cue. Male bullfrogs emit a complex advertisement call to attract females for mating, and to announce territory occupation to other males. In two experiments, the sensitivity of vocalizing male bullfrogs to field playbacks of advertisement calls differing in duration was examined. The number and latency of evoked vocal responses to the stimuli was used as a measure of perception. Males responded with fewer calls, at longer latencies, to stimuli shorter in duration than the standard signal (with a duration at the mean value for the species). Males preferred stimuli longer in duration than the standard signal, responding with more calls at shorter latencies. They did not, however, significantly lengthen their own calls in response to playbacks of long duration signals. This preference for ``supernormal'' stimuli may be an important factor mediating the evolution of communication signals. [Work supported by NIH.
A wavelet-based approach for a continuous analysis of phonovibrograms.

PubMed

Unger, Jakob; Meyer, Tobias; Doellinger, Michael; Hecker, Dietmar J; Schick, Bernhard; Lohscheller, Joerg

2012-01-01

Recently, endoscopic high-speed laryngoscopy has been established for commercial use and constitutes a state-of-the-art technique to examine vocal fold dynamics. Despite overcoming many limitations of commonly applied stroboscopy it has not gained widespread clinical application, yet. A major drawback is a missing methodology of extracting valuable features to support visual assessment or computer-aided diagnosis. In this paper a compact and descriptive feature set is presented. The feature extraction routines are based on two-dimensional color graphs called phonovibrograms (PVG). These graphs contain the full spatio-temporal pattern of vocal fold dynamics and are therefore suited to derive features that comprehensively describe the vibration pattern of vocal folds. Within our approach, clinically relevant features such as glottal closure type, symmetry and periodicity are quantified in a set of 10 descriptive features. The suitability for classification tasks is shown using a clinical data set comprising 50 healthy and 50 paralytic subjects. A classification accuracy of 93.2% has been achieved.
Limiting parental feedback disrupts vocal development in marmoset monkeys

PubMed Central

Gultekin, Yasemin B.; Hage, Steffen R.

2017-01-01

Vocalizations of human infants undergo dramatic changes across the first year by becoming increasingly mature and speech-like. Human vocal development is partially dependent on learning by imitation through social feedback between infants and caregivers. Recent studies revealed similar developmental processes being influenced by parental feedback in marmoset monkeys for apparently innate vocalizations. Marmosets produce infant-specific vocalizations that disappear after the first postnatal months. However, it is yet unclear whether parental feedback is an obligate requirement for proper vocal development. Using quantitative measures to compare call parameters and vocal sequence structure we show that, in contrast to normally raised marmosets, marmosets that were separated from parents after the third postnatal month still produced infant-specific vocal behaviour at subadult stages. These findings suggest a significant role of social feedback on primate vocal development until the subadult stages and further show that marmoset monkeys are a compelling model system for early human vocal development. PMID:28090084
Peripheral mechanisms for vocal production in birds - differences and similarities to human speech and singing.

PubMed

Riede, Tobias; Goller, Franz

2010-10-01

Song production in songbirds is a model system for studying learned vocal behavior. As in humans, bird phonation involves three main motor systems (respiration, vocal organ and vocal tract). The avian respiratory mechanism uses pressure regulation in air sacs to ventilate a rigid lung. In songbirds sound is generated with two independently controlled sound sources, which reside in a uniquely avian vocal organ, the syrinx. However, the physical sound generation mechanism in the syrinx shows strong analogies to that in the human larynx, such that both can be characterized as myoelastic-aerodynamic sound sources. Similarities include active adduction and abduction, oscillating tissue masses which modulate flow rate through the organ and a layered structure of the oscillating tissue masses giving rise to complex viscoelastic properties. Differences in the functional morphology of the sound producing system between birds and humans require specific motor control patterns. The songbird vocal apparatus is adapted for high speed, suggesting that temporal patterns and fast modulation of sound features are important in acoustic communication. Rapid respiratory patterns determine the coarse temporal structure of song and maintain gas exchange even during very long songs. The respiratory system also contributes to the fine control of airflow. Muscular control of the vocal organ regulates airflow and acoustic features. The upper vocal tract of birds filters the sounds generated in the syrinx, and filter properties are actively adjusted. Nonlinear source-filter interactions may also play a role. The unique morphology and biomechanical system for sound production in birds presents an interesting model for exploring parallels in control mechanisms that give rise to highly convergent physical patterns of sound generation. More comparative work should provide a rich source for our understanding of the evolution of complex sound producing systems. Copyright © 2009 Elsevier Inc. All rights reserved.
Specialized Motor-Driven dusp1 Expression in the Song Systems of Multiple Lineages of Vocal Learning Birds

PubMed Central

Horita, Haruhito; Kobayashi, Masahiko; Liu, Wan-chun; Oka, Kotaro; Jarvis, Erich D.; Wada, Kazuhiro

2012-01-01

Mechanisms for the evolution of convergent behavioral traits are largely unknown. Vocal learning is one such trait that evolved multiple times and is necessary in humans for the acquisition of spoken language. Among birds, vocal learning is evolved in songbirds, parrots, and hummingbirds. Each time similar forebrain song nuclei specialized for vocal learning and production have evolved. This finding led to the hypothesis that the behavioral and neuroanatomical convergences for vocal learning could be associated with molecular convergence. We previously found that the neural activity-induced gene dual specificity phosphatase 1 (dusp1) was up-regulated in non-vocal circuits, specifically in sensory-input neurons of the thalamus and telencephalon; however, dusp1 was not up-regulated in higher order sensory neurons or motor circuits. Here we show that song motor nuclei are an exception to this pattern. The song nuclei of species from all known vocal learning avian lineages showed motor-driven up-regulation of dusp1 expression induced by singing. There was no detectable motor-driven dusp1 expression throughout the rest of the forebrain after non-vocal motor performance. This pattern contrasts with expression of the commonly studied activity-induced gene egr1, which shows motor-driven expression in song nuclei induced by singing, but also motor-driven expression in adjacent brain regions after non-vocal motor behaviors. In the vocal non-learning avian species, we found no detectable vocalizing-driven dusp1 expression in the forebrain. These findings suggest that independent evolutions of neural systems for vocal learning were accompanied by selection for specialized motor-driven expression of the dusp1 gene in those circuits. This specialized expression of dusp1 could potentially lead to differential regulation of dusp1-modulated molecular cascades in vocal learning circuits. PMID:22876306
High channel count microphone array accurately and precisely localizes ultrasonic signals from freely-moving mice.

PubMed

Warren, Megan R; Sangiamo, Daniel T; Neunuebel, Joshua P

2018-03-01

An integral component in the assessment of vocal behavior in groups of freely interacting animals is the ability to determine which animal is producing each vocal signal. This process is facilitated by using microphone arrays with multiple channels. Here, we made important refinements to a state-of-the-art microphone array based system used to localize vocal signals produced by freely interacting laboratory mice. Key changes to the system included increasing the number of microphones as well as refining the methodology for localizing and assigning vocal signals to individual mice. We systematically demonstrate that the improvements in the methodology for localizing mouse vocal signals led to an increase in the number of signals detected as well as the number of signals accurately assigned to an animal. These changes facilitated the acquisition of larger and more comprehensive data sets that better represent the vocal activity within an experiment. Furthermore, this system will allow more thorough analyses of the role that vocal signals play in social communication. We expect that such advances will broaden our understanding of social communication deficits in mouse models of neurological disorders. Copyright © 2018 Elsevier B.V. All rights reserved.
Rehabilitation of a patient with complete mandibulectomy and partial glossectomy.

PubMed

Meyerson, M D; Johnson, B H; Weitzman, R S

1980-05-01

Following a number of radiologic and surgical procedures for the treatment of oral cancer, a patient with severe facial disfigurement and alteration of the vocal tract acquired acceptable speech. Consultation among referring physicians and speech pathologists can aid such a patient by facilitating the rehabilitative process through improvement of communicative skills.
Coarticulation in Early Vocalizations by Children with Hearing Loss: A Locus Perspective

ERIC Educational Resources Information Center

Morrison, Helen Mccaffrey

2012-01-01

Locus equations derived from productions by three children with hearing loss revealed sensory and motor influences on anticipatory coarticulation. Participants who received auditory access to speech via hearing aids and cochlear implants at different ages (5-39 months) were recorded at approximately 6 and 12 months after hearing technology…
Application of motion analysis in the study of the effect of botulinum toxin to rat vocal folds

NASA Astrophysics Data System (ADS)

Saadah, Abdul K.; Galatsanos, Nikolas P.; Inagi, K.; Bless, D.

1997-05-01

In the past we have proposed a system that measures the deformations of the vocal folds from videostroboscopic images of the larynx, in that system: (1) we extract the boundaries of the vocal folds, (2) we register elastically the vocal fold boundaries in successive frames. This yields the displacement vector field (DVF) between adjacent frames, and (3) we fit using a least-squares approach an affine transformation model to succinctly describe the deformations between adjacent frames. In this paper, we present as an example of the capabilities of this system, an initial study of the deformation changes in rat vocal folds pre and post injection with Botulinum toxin. For this application the generated DVF was segmented into right DVF and left DVF and the deformation of each segment is studied separately.
[Can music therapy for patients with neurological disorders?].

PubMed

Myskja, Audun

2004-12-16

Recent developments in brain research and in the field of music therapy have led to the development of music-based methods specifically aimed at relieving symptoms of Parkinson's disease and other neurologic disorders. Rhythmic auditory stimulation uses external rhythmic auditory cues from song, music or metronome to aid patients improving their walking functioning and has been shown to be effective both within sessions and as a result of training over time. Melodic intonation therapy and related vocal techniques can improve expressive dysphasia and aid rehabilitation of neurologic disorders, particularly Parkinson's disease, stroke and developmental disorders.
Mechanical properties of the vocal fold. Stress-strain studies.

PubMed

Haji, T; Mori, K; Omori, K; Isshiki, N

1992-01-01

The viscoelasticity of the vocal and ventricular folds was experimentally assessed by analyzing the stress-strain relationships obtained using a newly developed measuring system. The degree of stiffness of the mid-membranous portion of the vocal fold was less than that near the anterior commissure or the vocal process. The ventricular fold was much less stiff and significantly more viscous than the vocal fold. At the membranous portion of the vocal fold, the degree of stiffness was less and that of viscosity greater at 2 mm above and below the free margin than at the free margin itself.
The stabilized, wavelet-Mellin transform for analyzing the size and shape information of vocalized sounds

NASA Astrophysics Data System (ADS)

Irino, Toshio; Patterson, Roy

2005-04-01

We hear vowels produced by men, women, and children as approximately the same although there is considerable variability in glottal pulse rate and vocal tract length. At the same time, we can identify the speaker group. Recent experiments show that it is possible to identify vowels even when the glottal pulse rate and vocal tract length are condensed or expanded beyond the range of natural vocalization. This suggests that the auditory system has an automatic process to segregate information about shape and size of the vocal tract. Recently we proposed that the auditory system uses some form of Stabilized, Wavelet-Mellin Transform (SWMT) to analyze scale information in bio-acoustic sounds as a general framework for auditory processing from cochlea to cortex. This talk explains the theoretical background of the model and how the vocal information is normalized in the representation. [Work supported by GASR(B)(2) No. 15300061, JSPS.
Cross-cultural and cross-ecotype production of a killer whale 'excitement' call suggests universality.

PubMed

Rehn, Nicola; Filatova, Olga A; Durban, John W; Foote, Andrew D

2011-01-01

Facial and vocal expressions of emotion have been found in a number of social mammal species and are thought to have evolved to aid social communication. There has been much debate about whether such signals are culturally inherited or are truly biologically innate. Evidence for the innateness of such signals can come from cross-cultural studies. Previous studies have identified a vocalisation (the V4 or 'excitement' call) associated with high arousal behaviours in a population of killer whales in British Columbia, Canada. In this study, we compared recordings from three different socially and reproductively isolated ecotypes of killer whales, including five vocal clans of one ecotype, each clan having discrete culturally transmitted vocal traditions. The V4 call was found in recordings of each ecotype and each vocal clan. Nine independent observers reproduced our classification of the V4 call from each population with high inter-observer agreement. Our results suggest the V4 call may be universal in Pacific killer whale populations and that transmission of this call is independent of cultural tradition or ecotype. We argue that such universality is more consistent with an innate vocalisation than one acquired through social learning and may be linked to its apparent function of motivational expression.
Cross-cultural and cross-ecotype production of a killer whale `excitement' call suggests universality

NASA Astrophysics Data System (ADS)

Rehn, Nicola; Filatova, Olga A.; Durban, John W.; Foote, Andrew D.

2011-01-01

Facial and vocal expressions of emotion have been found in a number of social mammal species and are thought to have evolved to aid social communication. There has been much debate about whether such signals are culturally inherited or are truly biologically innate. Evidence for the innateness of such signals can come from cross-cultural studies. Previous studies have identified a vocalisation (the V4 or `excitement' call) associated with high arousal behaviours in a population of killer whales in British Columbia, Canada. In this study, we compared recordings from three different socially and reproductively isolated ecotypes of killer whales, including five vocal clans of one ecotype, each clan having discrete culturally transmitted vocal traditions. The V4 call was found in recordings of each ecotype and each vocal clan. Nine independent observers reproduced our classification of the V4 call from each population with high inter-observer agreement. Our results suggest the V4 call may be universal in Pacific killer whale populations and that transmission of this call is independent of cultural tradition or ecotype. We argue that such universality is more consistent with an innate vocalisation than one acquired through social learning and may be linked to its apparent function of motivational expression.
Vocal Imitations of Non-Vocal Sounds

PubMed Central

Houix, Olivier; Voisin, Frédéric; Misdariis, Nicolas; Susini, Patrick

2016-01-01

Imitative behaviors are widespread in humans, in particular whenever two persons communicate and interact. Several tokens of spoken languages (onomatopoeias, ideophones, and phonesthemes) also display different degrees of iconicity between the sound of a word and what it refers to. Thus, it probably comes at no surprise that human speakers use a lot of imitative vocalizations and gestures when they communicate about sounds, as sounds are notably difficult to describe. What is more surprising is that vocal imitations of non-vocal everyday sounds (e.g. the sound of a car passing by) are in practice very effective: listeners identify sounds better with vocal imitations than with verbal descriptions, despite the fact that vocal imitations are inaccurate reproductions of a sound created by a particular mechanical system (e.g. a car driving by) through a different system (the voice apparatus). The present study investigated the semantic representations evoked by vocal imitations of sounds by experimentally quantifying how well listeners could match sounds to category labels. The experiment used three different types of sounds: recordings of easily identifiable sounds (sounds of human actions and manufactured products), human vocal imitations, and computational “auditory sketches” (created by algorithmic computations). The results show that performance with the best vocal imitations was similar to the best auditory sketches for most categories of sounds, and even to the referent sounds themselves in some cases. More detailed analyses showed that the acoustic distance between a vocal imitation and a referent sound is not sufficient to account for such performance. Analyses suggested that instead of trying to reproduce the referent sound as accurately as vocally possible, vocal imitations focus on a few important features, which depend on each particular sound category. These results offer perspectives for understanding how human listeners store and access long-term sound representations, and sets the stage for the development of human-computer interfaces based on vocalizations. PMID:27992480
Modulation of voice related to tremor and vibrato

NASA Astrophysics Data System (ADS)

Lester, Rosemary Anne

Modulation of voice is a result of physiologic oscillation within one or more components of the vocal system including the breathing apparatus (i.e., pressure supply), the larynx (i.e. sound source), and the vocal tract (i.e., sound filter). These oscillations may be caused by pathological tremor associated with neurological disorders like essential tremor or by volitional production of vibrato in singers. Because the acoustical characteristics of voice modulation specific to each component of the vocal system and the effect of these characteristics on perception are not well-understood, it is difficult to assess individuals with vocal tremor and to determine the most effective interventions for reducing the perceptual severity of the disorder. The purpose of the present studies was to determine how the acoustical characteristics associated with laryngeal-based vocal tremor affect the perception of the magnitude of voice modulation, and to determine if adjustments could be made to the voice source and vocal tract filter to alter the acoustic output and reduce the perception of modulation. This research was carried out using both a computational model of speech production and trained singers producing vibrato to simulate laryngeal-based vocal tremor with different voice source characteristics (i.e., vocal fold length and degree of vocal fold adduction) and different vocal tract filter characteristics (i.e., vowel shapes). It was expected that, by making adjustments to the voice source and vocal tract filter that reduce the amplitude of the higher harmonics, the perception of magnitude of voice modulation would be reduced. The results of this study revealed that listeners' perception of the magnitude of modulation of voice was affected by the degree of vocal fold adduction and the vocal tract shape with the computational model, but only by the vocal quality (corresponding to the degree of vocal fold adduction) with the female singer. Based on regression analyses, listeners' judgments were predicted by modulation information in both low and high frequency bands. The findings from these studies indicate that production of a breathy vocal quality might be a useful compensatory strategy for reducing the perceptual severity of modulation of voice for individuals with tremor affecting the larynx.
Central pattern generators for social vocalization: Androgen-dependent neurophysiological mechanisms

PubMed Central

Bass, Andrew H.; Remage-Healey, Luke

2008-01-01

Historically, most studies of vertebrate central pattern generators (CPGs) have focused on mechanisms for locomotion and respiration. Here, we highlight new results for ectothermic vertebrates, namely teleost fish and amphibians, showing how androgenic steroids can influence the temporal patterning of CPGs for social vocalization. Investigations of vocalizing teleosts show how androgens can rapidly (within minutes) modulate the neurophysiological output of the vocal CPG (fictive vocalizations that mimic the temporal properties of natural vocalizations) inclusive of their divergent actions between species, as well as intraspecific differences between male reproductive morphs. Studies of anuran amphibians (frogs) demonstrate that long-term steroid treatments (wks) can masculinize the fictive vocalizations of females, inclusive of its sensitivity to rapid modulation by serotonin. Given the conserved organization of vocal control systems across vertebrate groups, the vocal CPGs of fish and amphibians provide tractable models for identifying androgen-dependent events that are fundamental to the mechanisms of vocal motor patterning. These basic mechanisms can also inform our understanding of the more complex CPGs for vocalization, and social behaviors in general, that have evolved among birds and mammals. PMID:18262186
How to bootstrap a human communication system.

PubMed

Fay, Nicolas; Arbib, Michael; Garrod, Simon

2013-01-01

How might a human communication system be bootstrapped in the absence of conventional language? We argue that motivated signs play an important role (i.e., signs that are linked to meaning by structural resemblance or by natural association). An experimental study is then reported in which participants try to communicate a range of pre-specified items to a partner using repeated non-linguistic vocalization, repeated gesture, or repeated non-linguistic vocalization plus gesture (but without using their existing language system). Gesture proved more effective (measured by communication success) and more efficient (measured by the time taken to communicate) than non-linguistic vocalization across a range of item categories (emotion, object, and action). Combining gesture and vocalization did not improve performance beyond gesture alone. We experimentally demonstrate that gesture is a more effective means of bootstrapping a human communication system. We argue that gesture outperforms non-linguistic vocalization because it lends itself more naturally to the production of motivated signs. © 2013 Cognitive Science Society, Inc.
Effects of Environmental Stimulation on Infant Vocalizations and Orofacial Dynamics at the Onset of Canonical Babbling

PubMed Central

Harold, Meredith Poore; Barlow, Steven M.

2012-01-01

The vocalizations and jaw kinematics of 30 infants aged 6–8 months were recorded using a Motion Analysis System and audiovisual technologies. This study represents the first attempt to determine the effect of play environment on infants’ rate of vocalization and jaw movement. Four play conditions were compared: watching videos, social contingent reinforcement and vocal modeling with an adult, playing alone with small toys, and playing alone with large toys. The fewest vocalizations and spontaneous movement were observed when infants were watching videos or interacting with an adult. Infants vocalized most when playing with large toys. The small toys, which naturally elicited gross motor movement (e.g., waving, banging, shaking), educed fewer vocalizations. This study was also the first to quantify the kinematics of vocalized and non-vocalized jaw movements of 6–8 month-old infants. Jaw kinematics did not differentiate infants who produced canonical syllables from those who did not. All infants produced many jaw movements without vocalization. However, during vocalization, infants were unlikely to move their jaw. This contradicts current theories that infant protophonic vocalizations are jaw dominant. Results of the current study can inform socio-linguistic and kinematic theories of canonical babbling. PMID:23261792

Gender differences affecting vocal health of women in vocally demanding careers

PubMed Central

Hunter, Eric J.; Smith, Marshall E.; Tanner, Kristine

2012-01-01

Studies suggest that occupational voice users have a greater incidence of vocal issues than the general population. Women have been found to experience vocal health problems more frequently than men, regardless of their occupation. Traditionally, it has been assumed that differences in the laryngeal system are the cause of this disproportion. Nevertheless, it is valuable to identify other potential gender distinctions which may make women more vulnerable to voice disorders. A search of the literature was conducted for gender-specific characteristics which might impact the vocal health of women. This search can be used by healthcare practitioners to help female patients avoid serious vocal health injuries, as well as to better treat women who already suffer from such vocal health issues. PMID:21722077
Short bouts of vocalization induce long lasting fast gamma oscillations in a sensorimotor nucleus

PubMed Central

Lewandowski, Brian; Schmidt, Marc

2011-01-01

Performance evaluation is a critical feature of motor learning. In the vocal system, it requires the integration of auditory feedback signals with vocal motor commands. The network activity that supports such integration is unknown, but it has been proposed that vocal performance evaluation occurs offline. Recording from NIf, a sensorimotor structure in the avian song system, we show that short bouts of singing in adult male zebra finches (Taeniopygia guttata) induce persistent increases in firing activity and coherent oscillations in the fast gamma range (90–150 Hz). Single units are strongly phase-locked to these oscillations, which can last up to 30 s, often outlasting vocal activity by an order of magnitude. In other systems, oscillations often are triggered by events or behavioral tasks but rarely outlast the event that triggered them by more than 1 second. The present observations are the longest reported gamma oscillations triggered by an isolated behavioral event. In mammals, gamma oscillations have been associated with memory consolidation and are hypothesized to facilitate communication between brain regions. We suggest that the timing and persistent nature of NIf’s fast gamma oscillations make them well suited to facilitate the integration of auditory and vocal motor traces associated with vocal performance evaluation. PMID:21957255
Three-dimensional optical reconstruction of vocal fold kinematics using high-speed video with a laser projection system

PubMed Central

Luegmair, Georg; Mehta, Daryush D.; Kobler, James B.; Döllinger, Michael

2015-01-01

Vocal fold kinematics and its interaction with aerodynamic characteristics play a primary role in acoustic sound production of the human voice. Investigating the temporal details of these kinematics using high-speed videoendoscopic imaging techniques has proven challenging in part due to the limitations of quantifying complex vocal fold vibratory behavior using only two spatial dimensions. Thus, we propose an optical method of reconstructing the superior vocal fold surface in three spatial dimensions using a high-speed video camera and laser projection system. Using stereo-triangulation principles, we extend the camera-laser projector method and present an efficient image processing workflow to generate the three-dimensional vocal fold surfaces during phonation captured at 4000 frames per second. Initial results are provided for airflow-driven vibration of an ex vivo vocal fold model in which at least 75% of visible laser points contributed to the reconstructed surface. The method captures the vertical motion of the vocal folds at a high accuracy to allow for the computation of three-dimensional mucosal wave features such as vibratory amplitude, velocity, and asymmetry. PMID:26087485
Multiple Coordination Patterns in Infant and Adult Vocalizations

PubMed Central

Abney, Drew H.; Warlaumont, Anne S.; Oller, D. Kimbrough; Wallot, Sebastian; Kello, Christopher T.

2017-01-01

The study of vocal coordination between infants and adults has led to important insights into the development of social, cognitive, emotional and linguistic abilities. We used an automatic system to identify vocalizations produced by infants and adults over the course of the day for fifteen infants studied longitudinally during the first two years of life. We measured three different types of vocal coordination: coincidence-based, rate-based, and cluster-based. Coincidence-based and rate-based coordination are established measures in the developmental literature. Cluster-based coordination is new and measures the strength of matching in the degree to which vocalization events occur in hierarchically nested clusters. We investigated whether various coordination patterns differ as a function of vocalization type, whether different coordination patterns provide unique information about the dynamics of vocal interaction, and how the various coordination patterns each relate to infant age. All vocal coordination patterns displayed greater coordination for infant speech-related vocalizations, adults adapted the hierarchical clustering of their vocalizations to match that of infants, and each of the three coordination patterns had unique associations with infant age. Altogether, our results indicate that vocal coordination between infants and adults is multifaceted, suggesting a complex relationship between vocal coordination and the development of vocal communication. PMID:29375276
Vocalization-Induced Enhancement of the Auditory Cortex Responsiveness during Voice F0 Feedback Perturbation

PubMed Central

Behroozmand, Roozbeh; Karvelis, Laura; Liu, Hanjun; Larson, Charles R.

2009-01-01

Objective The present study investigated whether self-vocalization enhances auditory neural responsiveness to voice pitch feedback perturbation and how this vocalization-induced neural modulation can be affected by the extent of the feedback deviation. Method Event related potentials (ERPs) were recorded in 15 subjects in response to +100, +200 and +500 cents pitch-shifted voice auditory feedback during active vocalization and passive listening to the playback of the self-produced vocalizations. Result The amplitude of the evoked P1 (latency: 73.51 ms) and P2 (latency: 199.55 ms) ERP components in response to feedback perturbation were significantly larger during vocalization than listening. The difference between P2 peak amplitudes during vocalization vs. listening was shown to be significantly larger for +100 than +500 cents stimulus. Conclusion Results indicate that the human auditory cortex is more responsive to voice F0 feedback perturbations during vocalization than passive listening. Greater vocalization-induced enhancement of the auditory responsiveness to smaller feedback perturbations may imply that the audio-vocal system detects and corrects for errors in vocal production that closely match the expected vocal output. Significance Findings of this study support previous suggestions regarding the enhanced auditory sensitivity to feedback alterations during self-vocalization, which may serve the purpose of feedback-based monitoring of one’s voice. PMID:19520602
Human Factors Engineering Bibliographic Series. Volume 2: 1960-1964 Literature

DTIC Science & Technology

1966-10-01

flutter discrimination, melodic and temporal) binaural vs. monaural equipment and methods (e.g., anechoic chambers, audiometric devices, communication...brightness, duration, timbre, vocality) stimulus mixtures (e.g., harmonics, beats , combination tones, modulations) thresholds training, nonverbal--see Training...scales and aids) Beats --see Audition (stimulus mixtures) Bells--see Auditory (displays, nonverbal) Belts, Harnesses, and other Restraining Devices--see
Assessing cow-calf welfare. Part 2: Risk factors for beef cow health and behavior and stockperson handling.

PubMed

Simon, G E; Hoar, B R; Tucker, C B

2016-08-01

Epidemiological studies can be used to identify risk factors for livestock welfare concerns but have not been conducted in the cow-calf sector for this purpose. The objectives of this study were to investigate the relationships of 1) herd-level management, facilities, and producer perspectives with cattle health and behavior and stockperson handling and 2) stockperson handling on cattle behavior at the individual cow level. Cow ( = 3,065) health and behavior and stockperson handling during a routine procedure (e.g., pregnancy checks) were observed on 30 California ranches. Management and producer perspectives were evaluated using an interview, and handling facility features were recorded at the chute. After predictors were screened for univariable associations, multivariable models were built for cattle health (i.e., thin body condition, lameness, abrasions, hairless patches, swelling, blind eyes, and dirtiness) and behavior (i.e., balking, vocalizing, stumbling and falling in the chute and while exiting the restraint, and running out of the restraint) and stockperson handling (i.e., electric prod use, moving aid use, tail twisting, and mis-catching cattle). When producers empathized more toward an animal's pain experience, there was a lower risk of swelling (odds ratio [OR] = 0.7) but a higher risk of lameness (OR = 1.3), which may indicate a lack of awareness of the latter. Training stockpersons using the Beef Quality Assurance program had a protective effect on cow cleanliness and mis-catching in the restraint (OR = 0.2 and OR = 0.5, respectively). Hydraulic chutes increased the risk of vocalizations (OR = 2.7), possibly because these systems can apply greater pressure to the sides of the animal than manual restraints. When a moving aid was used to move an individual cow, it increased the risk of her balking, but when hands, in particular, were used, the risk of balking decreased across the herd (OR = 34.1 and OR = 0.3, respectively). Likewise, individual cows were at a greater risk of balking, vocalizing, stumbling and falling in the chute, and stumbling and running at exit when they were touched with an electric prod (OR = 11.0, OR = 3.3, OR = 1.9, OR = 2.3, OR = 1.8, and OR = 1.7, respectively). Although the implications of using moving aids are unclear, reducing the use of electric prods could improve cattle handling. In conclusion, cattle handling was influenced by a number of facility and stockperson factors: personnel training, facility design, and electric prod use are key areas for future improvements.
Repertoire and classification of non-song calls in Southeast Alaskan humpback whales (Megaptera novaeangliae).

PubMed

Fournet, Michelle E; Szabo, Andy; Mellinger, David K

2015-01-01

On low-latitude breeding grounds, humpback whales produce complex and highly stereotyped songs as well as a range of non-song sounds associated with breeding behaviors. While on their Southeast Alaskan foraging grounds, humpback whales produce a range of previously unclassified non-song vocalizations. This study investigates the vocal repertoire of Southeast Alaskan humpback whales from a sample of 299 non-song vocalizations collected over a 3-month period on foraging grounds in Frederick Sound, Southeast Alaska. Three classification systems were used, including aural spectrogram analysis, statistical cluster analysis, and discriminant function analysis, to describe and classify vocalizations. A hierarchical acoustic structure was identified; vocalizations were classified into 16 individual call types nested within four vocal classes. The combined classification method shows promise for identifying variability in call stereotypy between vocal groupings and is recommended for future classification of broad vocal repertoires.
Identification of a motor to auditory pathway important for vocal learning

PubMed Central

Roberts, Todd F.; Hisey, Erin; Tanaka, Masashi; Kearney, Matthew; Chattree, Gaurav; Yang, Cindy F.; Shah, Nirao M.; Mooney, Richard

2017-01-01

Summary Learning to vocalize depends on the ability to adaptively modify the temporal and spectral features of vocal elements. Neurons that convey motor-related signals to the auditory system are theorized to facilitate vocal learning, but the identity and function of such neurons remain unknown. Here we identify a previously unknown neuron type in the songbird brain that transmits vocal motor signals to the auditory cortex. Genetically ablating these neurons in juveniles disrupted their ability to imitate features of an adult tutor’s song. Ablating these neurons in adults had little effect on previously learned songs, but interfered with their ability to adaptively modify the duration of vocal elements and largely prevented the degradation of song’s temporal features normally caused by deafening. These findings identify a motor to auditory circuit essential to vocal imitation and to the adaptive modification of vocal timing. PMID:28504672
LANGUAGE DEVELOPMENT. The developmental dynamics of marmoset monkey vocal production.

PubMed

Takahashi, D Y; Fenley, A R; Teramoto, Y; Narayanan, D Z; Borjon, J I; Holmes, P; Ghazanfar, A A

2015-08-14

Human vocal development occurs through two parallel interactive processes that transform infant cries into more mature vocalizations, such as cooing sounds and babbling. First, natural categories of sounds change as the vocal apparatus matures. Second, parental vocal feedback sensitizes infants to certain features of those sounds, and the sounds are modified accordingly. Paradoxically, our closest living ancestors, nonhuman primates, are thought to undergo few or no production-related acoustic changes during development, and any such changes are thought to be impervious to social feedback. Using early and dense sampling, quantitative tracking of acoustic changes, and biomechanical modeling, we showed that vocalizations in infant marmoset monkeys undergo dramatic changes that cannot be solely attributed to simple consequences of growth. Using parental interaction experiments, we found that contingent parental feedback influences the rate of vocal development. These findings overturn decades-old ideas about primate vocalizations and show that marmoset monkeys are a compelling model system for early vocal development in humans. Copyright © 2015, American Association for the Advancement of Science.
Computational model for vocal tract dynamics in a suboscine bird.

PubMed

Assaneo, M F; Trevisan, M A

2010-09-01

In a recent work, active use of the vocal tract has been reported for singing oscines. The reconfiguration of the vocal tract during song serves to match its resonances to the syringeal fundamental frequency, demonstrating a precise coordination of the two main pieces of the avian vocal system for songbirds characterized by tonal songs. In this work we investigated the Great Kiskadee (Pitangus sulfuratus), a suboscine bird whose calls display a rich harmonic content. Using a recently developed mathematical model for the syrinx and a mobile vocal tract, we set up a computational model that provides a plausible reconstruction of the vocal tract movement using a few spectral features taken from the utterances. Moreover, synthetic calls were generated using the articulated vocal tract that accounts for all the acoustical features observed experimentally.
Central Nervous System Control of Voice and Swallowing

PubMed Central

Ludlow, Christy L.

2015-01-01

This review of the central nervous control systems for voice and swallowing has suggested that the traditional concepts of a separation between cortical and limbic and brain stem control should be refined and more integrative. For voice production, a separation of the non-human vocalization system from the human learned voice production system has been posited based primarily on studies of non-human primates. However, recent humans studies of emotionally based vocalizations and human volitional voice production has shown more integration between these two systems than previously proposed. Recent human studies have shown that reflexive vocalization as well as learned voice production not involving speech, involve a common integrative system. On the other hand, recent studies of non-human primates have provided evidence of some cortical activity during vocalization and cortical changes with training during vocal behavior. For swallowing, evidence from the macaque and functional brain imaging in humans indicates that the control for the pharyngeal phase of swallowing is not primarily under brain stem mechanisms as previously proposed. Studies suggest that the initiation and patterning of swallowing for the pharyngeal phase is also under active cortical control for both spontaneous as well as volitional swallowing in awake humans and non-human primates. PMID:26241238
Information Theory Applied to Dolphin Whistle Vocalizations with Possible Application to SETI Signals

NASA Astrophysics Data System (ADS)

Doyle, Laurance R.; McCowan, Brenda; Hanser, Sean F.

2002-01-01

Information theory allows a quantification of the complexity of a given signaling system. We are applying information theory to dolphin whistle vocalizations, humpback whale songs, squirrel monkey chuck calls, and several other animal communication systems' in order to develop a quantitative and objective way to compare inter species communication systems' complexity. Once signaling units have been correctly classified the communication system must obey certain statistical distributions in order to contain complexity whether it is human languages, dolphin whistle vocalizations, or even a system of communication signals received from an extraterrestrial source.
Assessment of recurrent laryngeal nerve function during thyroid surgery

PubMed Central

Douglas, J; Smith, B; Dougherty, T; Ayshford, C

2014-01-01

Introduction There is disparity in the reported incidence of temporary and permanent recurrent laryngeal nerve (RLN) palsy following thyroidectomy. Much of the disparity is due to the method of assessing vocal cord function. We sought to identify the incidence and natural history of temporary and permanent vocal cord palsy following thyroid surgery. The authors wanted to establish whether intraoperative nerve monitoring and stimulation aids in prognosis when managing vocal cord palsy. Methods Prospective data on consecutive thyroid operations were collected. Intraoperative nerve monitoring and stimulation, using an endotracheal tube mounted device, was performed in all cases. Endoscopic examination of the larynx was performed on the first postoperative day and at three weeks. Results Data on 102 patients and 123 nerves were collated. Temporary and permanent RLN palsy rates were 6.1% and 1.7%. Most RLN palsies were identified on the first postoperative day with all recognised at the three-week review. No preoperative clinical risk factors were identified. Although dysphonia at the three-week follow-up visit was the only significant predictor of vocal cord palsy, only two-thirds of patients with cord palsies were dysphonic. Intraoperative nerve monitoring and stimulation did not predict outcome in terms of vocal cord function. Conclusions Temporary nerve palsy rates were consistent with other series where direct laryngoscopy is used to assess laryngeal function. Direct laryngoscopy is the only reliable measure of cord function, with intraoperative monitoring being neither a reliable predictor of cord function nor a predictor of eventual laryngeal function. The fact that all temporary palsies recovered within four months has implications for staged procedures. PMID:24780671
Assessment of recurrent laryngeal nerve function during thyroid surgery.

PubMed

Smith, J; Douglas, J; Smith, B; Dougherty, T; Ayshford, C

2014-03-01

There is disparity in the reported incidence of temporary and permanent recurrent laryngeal nerve (RLN) palsy following thyroidectomy. Much of the disparity is due to the method of assessing vocal cord function. We sought to identify the incidence and natural history of temporary and permanent vocal cord palsy following thyroid surgery. The authors wanted to establish whether intraoperative nerve monitoring and stimulation aids in prognosis when managing vocal cord palsy. Prospective data on consecutive thyroid operations were collected. Intraoperative nerve monitoring and stimulation, using an endotracheal tube mounted device, was performed in all cases. Endoscopic examination of the larynx was performed on the first postoperative day and at three weeks. Data on 102 patients and 123 nerves were collated. Temporary and permanent RLN palsy rates were 6.1% and 1.7%. Most RLN palsies were identified on the first postoperative day with all recognised at the three-week review. No preoperative clinical risk factors were identified. Although dysphonia at the three-week follow-up visit was the only significant predictor of vocal cord palsy, only two-thirds of patients with cord palsies were dysphonic. Intraoperative nerve monitoring and stimulation did not predict outcome in terms of vocal cord function. Temporary nerve palsy rates were consistent with other series where direct laryngoscopy is used to assess laryngeal function. Direct laryngoscopy is the only reliable measure of cord function, with intraoperative monitoring being neither a reliable predictor of cord function nor a predictor of eventual laryngeal function. The fact that all temporary palsies recovered within four months has implications for staged procedures.
Using statistical deformable models to reconstruct vocal tract shape from magnetic resonance images.

PubMed

Vasconcelos, M J M; Rua Ventura, S M; Freitas, D R S; Tavares, J M R S

2010-10-01

The mechanisms involved in speech production are complex and have thus been subject to growing attention by the scientific community. It has been demonstrated that magnetic resonance imaging (MRI) is a powerful means in the understanding of the morphology of the vocal tract. Over the last few years, statistical deformable models have been successfully used to identify and characterize bones and organs in medical images and point distribution models (PDMs) have gained particular relevance. In this work, the suitability of these models has been studied to characterize and further reconstruct the shape of the vocal tract in the articulation of Portuguese European (EP) speech sounds, one of the most spoken languages worldwide, with the aid of MR images. Therefore, a PDM has been built from a set of MR images acquired during the artificially sustained articulation of 25 EP speech sounds. Following this, the capacity of this statistical model to characterize the shape deformation of the vocal tract during the production of sounds was analysed. Next, the model was used to reconstruct five EP oral vowels and the EP fricative consonants. As far as a study on speech production is concerned, this study is considered to be the first approach to characterize and reconstruct the vocal tract shape from MR images by using PDMs. In addition, the findings achieved permit one to conclude that this modelling technique compels an enhanced understanding of the dynamic speech events involved in sustained articulations based on MRI, which are of particular interest for speech rehabilitation and simulation.
Effects of environmental stimulation on infant vocalizations and orofacial dynamics at the onset of canonical babbling.

PubMed

Harold, Meredith Poore; Barlow, Steven M

2013-02-01

The vocalizations and jaw kinematics of 30 infants aged 6-8 months were recorded using a Motion Analysis System and audiovisual technologies. This study represents the first attempt to determine the effect of play environment on infants' rate of vocalization and jaw movement. Four play conditions were compared: watching videos, social contingent reinforcement and vocal modeling with an adult, playing alone with small toys, and playing alone with large toys. The fewest vocalizations and spontaneous movement were observed when infants were watching videos or interacting with an adult. Infants vocalized most when playing with large toys. The small toys, which naturally elicited gross motor movement (e.g., waving, banging, shaking), educed fewer vocalizations. This study was also the first to quantify the kinematics of vocalized and non-vocalized jaw movements of 6-8 month-old infants. Jaw kinematics did not differentiate infants who produced canonical syllables from those who did not. All infants produced many jaw movements without vocalization. However, during vocalization, infants were unlikely to move their jaw. This contradicts current theories that infant protophonic vocalizations are jaw-dominant. Results of the current study can inform socio-linguistic and kinematic theories of canonical babbling. Copyright © 2012 Elsevier Inc. All rights reserved.
Distribution of androgen receptor mRNA expression in vocal, auditory, and neuroendocrine circuits in a teleost fish.

PubMed

Forlano, Paul M; Marchaterre, Margaret; Deitcher, David L; Bass, Andrew H

2010-02-15

Across all major vertebrate groups, androgen receptors (ARs) have been identified in neural circuits that shape reproductive-related behaviors, including vocalization. The vocal control network of teleost fishes presents an archetypal example of how a vertebrate nervous system produces social, context-dependent sounds. We cloned a partial cDNA of AR that was used to generate specific probes to localize AR expression throughout the central nervous system of the vocal plainfin midshipman fish (Porichthys notatus). In the forebrain, AR mRNA is abundant in proposed homologs of the mammalian striatum and amygdala, and in anterior and posterior parvocellular and magnocellular nuclei of the preoptic area, nucleus preglomerulosus, and posterior, ventral and anterior tuberal nuclei of the hypothalamus. Many of these nuclei are part of the known vocal and auditory circuitry in midshipman. The midbrain periaqueductal gray, an essential link between forebrain and hindbrain vocal circuitry, and the lateral line recipient nucleus medialis in the rostral hindbrain also express abundant AR mRNA. In the caudal hindbrain-spinal vocal circuit, high AR mRNA is found in the vocal prepacemaker nucleus and along the dorsal periphery of the vocal motor nucleus congruent with the known pattern of expression of aromatase-containing glial cells. Additionally, abundant AR mRNA expression is shown for the first time in the inner ear of a vertebrate. The distribution of AR mRNA strongly supports the role of androgens as modulators of behaviorally defined vocal, auditory, and neuroendocrine circuits in teleost fish and vertebrates in general. 2009 Wiley-Liss, Inc.
Detection of the Vibration Signal from Human Vocal Folds Using a 94-GHz Millimeter-Wave Radar

PubMed Central

Chen, Fuming; Li, Sheng; Zhang, Yang; Wang, Jianqi

2017-01-01

The detection of the vibration signal from human vocal folds provides essential information for studying human phonation and diagnosing voice disorders. Doppler radar technology has enabled the noncontact measurement of the human-vocal-fold vibration. However, existing systems must be placed in close proximity to the human throat and detailed information may be lost because of the low operating frequency. In this paper, a long-distance detection method, involving the use of a 94-GHz millimeter-wave radar sensor, is proposed for detecting the vibration signals from human vocal folds. An algorithm that combines empirical mode decomposition (EMD) and the auto-correlation function (ACF) method is proposed for detecting the signal. First, the EMD method is employed to suppress the noise of the radar-detected signal. Further, the ratio of the energy and entropy is used to detect voice activity in the radar-detected signal, following which, a short-time ACF is employed to extract the vibration signal of the human vocal folds from the processed signal. For validating the method and assessing the performance of the radar system, a vibration measurement sensor and microphone system are additionally employed for comparison. The experimental results obtained from the spectrograms, the vibration frequency of the vocal folds, and coherence analysis demonstrate that the proposed method can effectively detect the vibration of human vocal folds from a long detection distance. PMID:28282892
The Human Voice in Speech and Singing

NASA Astrophysics Data System (ADS)

Lindblom, Björn; Sundberg, Johan

This chapter speech describes various aspects of the human voice as a means of communication in speech and singing. From the point of view of function, vocal sounds can be regarded as the end result of a three stage process: (1) the compression of air in the respiratory system, which produces an exhalatory airstream, (2) the vibrating vocal folds' transformation of this air stream to an intermittent or pulsating air stream, which is a complex tone, referred to as the voice source, and (3) the filtering of this complex tone in the vocal tract resonator. The main function of the respiratory system is to generate an overpressure of air under the glottis, or a subglottal pressure. Section 16.1 describes different aspects of the respiratory system of significance to speech and singing, including lung volume ranges, subglottal pressures, and how this pressure is affected by the ever-varying recoil forces. The complex tone generated when the air stream from the lungs passes the vibrating vocal folds can be varied in at least three dimensions: fundamental frequency, amplitude and spectrum. Section 16.2 describes how these properties of the voice source are affected by the subglottal pressure, the length and stiffness of the vocal folds and how firmly the vocal folds are adducted. Section 16.3 gives an account of the vocal tract filter, how its form determines the frequencies of its resonances, and Sect. 16.4 gives an account for how these resonance frequencies or formants shape the vocal sounds by imposing spectrum peaks separated by spectrum valleys, and how the frequencies of these peaks determine vowel and voice qualities. The remaining sections of the chapter describe various aspects of the acoustic signals used for vocal communication in speech and singing. The syllable structure is discussed in Sect. 16.5, the closely related aspects of rhythmicity and timing in speech and singing is described in Sect. 16.6, and pitch and rhythm aspects in Sect. 16.7. The impressive control of all these acoustic characteristics of vocal signals is discussed in Sect. 16.8, while Sect. 16.9 considers expressive aspects of vocal communication.

The Human Voice in Speech and Singing

NASA Astrophysics Data System (ADS)

Lindblom, Björn; Sundberg, Johan

This chapter describes various aspects of the human voice as a means of communication in speech and singing. From the point of view of function, vocal sounds can be regarded as the end result of a three stage process: (1) the compression of air in the respiratory system, which produces an exhalatory airstream, (2) the vibrating vocal folds' transformation of this air stream to an intermittent or pulsating air stream, which is a complex tone, referred to as the voice source, and (3) the filtering of this complex tone in the vocal tract resonator. The main function of the respiratory system is to generate an overpressure of air under the glottis, or a subglottal pressure. Section 16.1 describes different aspects of the respiratory system of significance to speech and singing, including lung volume ranges, subglottal pressures, and how this pressure is affected by the ever-varying recoil forces. The complex tone generated when the air stream from the lungs passes the vibrating vocal folds can be varied in at least three dimensions: fundamental frequency, amplitude and spectrum. Section 16.2 describes how these properties of the voice source are affected by the subglottal pressure, the length and stiffness of the vocal folds and how firmly the vocal folds are adducted. Section 16.3 gives an account of the vocal tract filter, how its form determines the frequencies of its resonances, and Sect. 16.4 gives an account for how these resonance frequencies or formants shape the vocal sounds by imposing spectrum peaks separated by spectrum valleys, and how the frequencies of these peaks determine vowel and voice qualities. The remaining sections of the chapter describe various aspects of the acoustic signals used for vocal communication in speech and singing. The syllable structure is discussed in Sect. 16.5, the closely related aspects of rhythmicity and timing in speech and singing is described in Sect. 16.6, and pitch and rhythm aspects in Sect. 16.7. The impressive control of all these acoustic characteristics of vocal signals is discussed in Sect. 16.8, while Sect. 16.9 considers expressive aspects of vocal communication.
Visual classification of feral cat Felis silvestris catus vocalizations.

PubMed

Owens, Jessica L; Olsen, Mariana; Fontaine, Amy; Kloth, Christopher; Kershenbaum, Arik; Waller, Sara

2017-06-01

Cat vocal behavior, in particular, the vocal and social behavior of feral cats, is poorly understood, as are the differences between feral and fully domestic cats. The relationship between feral cat social and vocal behavior is important because of the markedly different ecology of feral and domestic cats, and enhanced comprehension of the repertoire and potential information content of feral cat calls can provide both better understanding of the domestication and socialization process, and improved welfare for feral cats undergoing adoption. Previous studies have used conflicting classification schemes for cat vocalizations, often relying on onomatopoeic or popular descriptions of call types (e.g., "miow"). We studied the vocalizations of 13 unaltered domestic cats that complied with our behavioral definition used to distinguish feral cats from domestic. A total of 71 acoustic units were extracted and visually analyzed for the construction of a hierarchical classification of vocal sounds, based on acoustic properties. We identified 3 major categories (tonal, pulse, and broadband) that further breakdown into 8 subcategories, and show a high degree of reliability when sounds are classified blindly by independent observers (Fleiss' Kappa K = 0.863). Due to the limited behavioral contexts in this study, additional subcategories of cat vocalizations may be identified in the future, but our hierarchical classification system allows for the addition of new categories and new subcategories as they are described. This study shows that cat vocalizations are diverse and complex, and provides an objective and reliable classification system that can be used in future studies.
Investigation into the response of the auditory and acoustic communications systems in the Beluga whale (Delphinapterus leucas) of the St. Lawrence River Estuary to noise, using vocal classification

NASA Astrophysics Data System (ADS)

Scheifele, Peter Martin

2003-06-01

Noise pollution has only recently become recognized as a potential danger to marine mammals in general, and to the Beluga Whale (Delphinapterus leucas) in particular. These small gregarious Odontocetes make extensive use of sound for social communication and pod cohesion. The St. Lawrence River Estuary is habitat to a small, critically endangered population of about 700 Beluga whales who congregate in four different sites in its upper estuary. The population is believed to be threatened by the stress of high-intensity, low frequency noise. One way to determine whether noise is having an effect on an animal's auditory ability might be to observe a natural and repeatable response of the auditory and vocal systems to varying noise levels. This can be accomplished by observing changes in animal vocalizations in response to auditory feedback. A response such as this observed in humans and some animals is known as the Lombard Vocal Response, which represents a reaction of the auditory system directly manifested by changes in vocalization level. In this research this population of Beluga Whales was tested to determine whether a vocalization-as-a-function-of-noise phenomenon existed by using Hidden Markhov "classified" vocalizations as targets for acoustical analyses. Correlation and regression analyses indicated that the phenomenon does exist and results of a human subjects experiment along with results from other animal species known to exhibit the response strongly implicate the Lombard Vocal Response in the Beluga.
Measurement of Lombard-like response in the beluga whale

NASA Astrophysics Data System (ADS)

Scheifele, Peter M.

2004-05-01

Noise pollution has become recognized as a potential danger to marine mammals in general, and to the St. Lawrence beluga (Delphinapterus leucas) in particular. One method to determine whether noise is having an effect on an animals auditory ability is to observe a natural and repeatable response of the auditory and vocal systems to varying noise levels. This can be accomplished by observing changes in animal vocalizations in response to auditory feedback. A response such as this observed in humans and some animals is known as the Lombard vocal response, which represents a reaction of the auditory system directly manifested by changes in vocalization level. This response is known in humans, songbirds, and some primates. In this research a population of belugas in the St. Lawrence River Estuary was tested to determine whether a vocalization-as-a-function-of-noise phenomenon existed by using hidden Markhov classified vocalizations as targets for acoustical analyses. Correlation and regression analyses of signals and noise indicated that the phenomenon does exist and results of a human subjects experiment along with results from other animal species known to exhibit the response strongly implicate the Lombard vocal response in the St. Lawrence population of beluga.
Auditory and audio-vocal responses of single neurons in the monkey ventral premotor cortex.

PubMed

Hage, Steffen R

2018-03-20

Monkey vocalization is a complex behavioral pattern, which is flexibly used in audio-vocal communication. A recently proposed dual neural network model suggests that cognitive control might be involved in this behavior, originating from a frontal cortical network in the prefrontal cortex and mediated via projections from the rostral portion of the ventral premotor cortex (PMvr) and motor cortex to the primary vocal motor network in the brainstem. For the rapid adjustment of vocal output to external acoustic events, strong interconnections between vocal motor and auditory sites are needed, which are present at cortical and subcortical levels. However, the role of the PMvr in audio-vocal integration processes remains unclear. In the present study, single neurons in the PMvr were recorded in rhesus monkeys (Macaca mulatta) while volitionally producing vocalizations in a visual detection task or passively listening to monkey vocalizations. Ten percent of randomly selected neurons in the PMvr modulated their discharge rate in response to acoustic stimulation with species-specific calls. More than four-fifths of these auditory neurons showed an additional modulation of their discharge rates either before and/or during the monkeys' motor production of the vocalization. Based on these audio-vocal interactions, the PMvr might be well positioned to mediate higher order auditory processing with cognitive control of the vocal motor output to the primary vocal motor network. Such audio-vocal integration processes in the premotor cortex might constitute a precursor for the evolution of complex learned audio-vocal integration systems, ultimately giving rise to human speech. Copyright © 2018 Elsevier B.V. All rights reserved.
Vocal emotion of humanoid robots: a study from brain mechanism.

PubMed

Wang, Youhui; Hu, Xiaohua; Dai, Weihui; Zhou, Jie; Kuo, Taitzong

2014-01-01

Driven by rapid ongoing advances in humanoid robot, increasing attention has been shifted into the issue of emotion intelligence of AI robots to facilitate the communication between man-machines and human beings, especially for the vocal emotion in interactive system of future humanoid robots. This paper explored the brain mechanism of vocal emotion by studying previous researches and developed an experiment to observe the brain response by fMRI, to analyze vocal emotion of human beings. Findings in this paper provided a new approach to design and evaluate the vocal emotion of humanoid robots based on brain mechanism of human beings.
Mastery, Enjoyment, Tradition and Innovation: A Reflective Practice Model for Instrumental and Vocal Teachers

ERIC Educational Resources Information Center

Parkinson, Tom

2016-01-01

This article offers a model to assist music teachers in reflecting on their teaching practice in relation to their aims and values. Initially developed as a workshop aid for use on a music education MA program, the model is intended to provoke critical engagement with two prominent tensions in music education: that between mastery and enjoyment,…
Animal models of speech and vocal communication deficits associated with psychiatric disorders

PubMed Central

Konopka, Genevieve; Roberts, Todd F.

2015-01-01

Disruptions in speech, language and vocal communication are hallmarks of several neuropsychiatric disorders, most notably autism spectrum disorders. Historically, the use of animal models to dissect molecular pathways and connect them to behavioral endophenotypes in cognitive disorders has proven to be an effective approach for developing and testing disease-relevant therapeutics. The unique aspects of human language when compared to vocal behaviors in other animals make such an approach potentially more challenging. However, the study of vocal learning in species with analogous brain circuits to humans may provide entry points for understanding this human-specific phenotype and diseases. Here, we review animal models of vocal learning and vocal communication, and specifically link phenotypes of psychiatric disorders to relevant model systems. Evolutionary constraints in the organization of neural circuits and synaptic plasticity result in similarities in the brain mechanisms for vocal learning and vocal communication. Comparative approaches and careful consideration of the behavioral limitations among different animal models can provide critical avenues for dissecting the molecular pathways underlying cognitive disorders that disrupt speech, language and vocal communication. PMID:26232298
Automated Assessment of Child Vocalization Development Using LENA.

PubMed

Richards, Jeffrey A; Xu, Dongxin; Gilkerson, Jill; Yapanel, Umit; Gray, Sharmistha; Paul, Terrance

2017-07-12

To produce a novel, efficient measure of children's expressive vocal development on the basis of automatic vocalization assessment (AVA), child vocalizations were automatically identified and extracted from audio recordings using Language Environment Analysis (LENA) System technology. Assessment was based on full-day audio recordings collected in a child's unrestricted, natural language environment. AVA estimates were derived using automatic speech recognition modeling techniques to categorize and quantify the sounds in child vocalizations (e.g., protophones and phonemes). These were expressed as phone and biphone frequencies, reduced to principal components, and inputted to age-based multiple linear regression models to predict independently collected criterion-expressive language scores. From these models, we generated vocal development AVA estimates as age-standardized scores and development age estimates. AVA estimates demonstrated strong statistical reliability and validity when compared with standard criterion expressive language assessments. Automated analysis of child vocalizations extracted from full-day recordings in natural settings offers a novel and efficient means to assess children's expressive vocal development. More research remains to identify specific mechanisms of operation.
Auditory Signal Processing in Communication: Perception and Performance of Vocal Sounds

PubMed Central

Prather, Jonathan F.

2013-01-01

Learning and maintaining the sounds we use in vocal communication require accurate perception of the sounds we hear performed by others and feedback-dependent imitation of those sounds to produce our own vocalizations. Understanding how the central nervous system integrates auditory and vocal-motor information to enable communication is a fundamental goal of systems neuroscience, and insights into the mechanisms of those processes will profoundly enhance clinical therapies for communication disorders. Gaining the high-resolution insight necessary to define the circuits and cellular mechanisms underlying human vocal communication is presently impractical. Songbirds are the best animal model of human speech, and this review highlights recent insights into the neural basis of auditory perception and feedback-dependent imitation in those animals. Neural correlates of song perception are present in auditory areas, and those correlates are preserved in the auditory responses of downstream neurons that are also active when the bird sings. Initial tests indicate that singing-related activity in those downstream neurons is associated with vocal-motor performance as opposed to the bird simply hearing itself sing. Therefore, action potentials related to auditory perception and action potentials related to vocal performance are co-localized in individual neurons. Conceptual models of song learning involve comparison of vocal commands and the associated auditory feedback to compute an error signal that is used to guide refinement of subsequent song performances, yet the sites of that comparison remain unknown. Convergence of sensory and motor activity onto individual neurons points to a possible mechanism through which auditory and vocal-motor signals may be linked to enable learning and maintenance of the sounds used in vocal communication. PMID:23827717
Adapted to Roar: Functional Morphology of Tiger and Lion Vocal Folds

PubMed Central

Klemuk, Sarah A.; Riede, Tobias; Walsh, Edward J.; Titze, Ingo R.

2011-01-01

Vocal production requires active control of the respiratory system, larynx and vocal tract. Vocal sounds in mammals are produced by flow-induced vocal fold oscillation, which requires vocal fold tissue that can sustain the mechanical stress during phonation. Our understanding of the relationship between morphology and vocal function of vocal folds is very limited. Here we tested the hypothesis that vocal fold morphology and viscoelastic properties allow a prediction of fundamental frequency range of sounds that can be produced, and minimal lung pressure necessary to initiate phonation. We tested the hypothesis in lions and tigers who are well-known for producing low frequency and very loud roaring sounds that expose vocal folds to large stresses. In histological sections, we found that the Panthera vocal fold lamina propria consists of a lateral region with adipocytes embedded in a network of collagen and elastin fibers and hyaluronan. There is also a medial region that contains only fibrous proteins and hyaluronan but no fat cells. Young's moduli range between 10 and 2000 kPa for strains up to 60%. Shear moduli ranged between 0.1 and 2 kPa and differed between layers. Biomechanical and morphological data were used to make predictions of fundamental frequency and subglottal pressure ranges. Such predictions agreed well with measurements from natural phonation and phonation of excised larynges, respectively. We assume that fat shapes Panthera vocal folds into an advantageous geometry for phonation and it protects vocal folds. Its primary function is probably not to increase vocal fold mass as suggested previously. The large square-shaped Panthera vocal fold eases phonation onset and thereby extends the dynamic range of the voice. PMID:22073246
Human mutant huntingtin disrupts vocal learning in transgenic songbirds.

PubMed

Liu, Wan-Chun; Kohn, Jessica; Szwed, Sarah K; Pariser, Eben; Sepe, Sharon; Haripal, Bhagwattie; Oshimori, Naoki; Marsala, Martin; Miyanohara, Atsushi; Lee, Ramee

2015-11-01

Speech and vocal impairments characterize many neurological disorders. However, the neurogenetic mechanisms of these disorders are not well understood, and current animal models do not have the necessary circuitry to recapitulate vocal learning deficits. We developed germline transgenic songbirds, zebra finches (Taneiopygia guttata) expressing human mutant huntingtin (mHTT), a protein responsible for the progressive deterioration of motor and cognitive function in Huntington's disease (HD). Although generally healthy, the mutant songbirds had severe vocal disorders, including poor vocal imitation, stuttering, and progressive syntax and syllable degradation. Their song abnormalities were associated with HD-related neuropathology and dysfunction of the cortical-basal ganglia (CBG) song circuit. These transgenics are, to the best of our knowledge, the first experimentally created, functional mutant songbirds. Their progressive and quantifiable vocal disorder, combined with circuit dysfunction in the CBG song system, offers a model for genetic manipulation and the development of therapeutic strategies for CBG-related vocal and motor disorders.
Knockout of Foxp2 disrupts vocal development in mice.

PubMed

Castellucci, Gregg A; McGinley, Matthew J; McCormick, David A

2016-03-16

The FOXP2 gene is important for the development of proper speech motor control in humans. However, the role of the gene in general vocal behavior in other mammals, including mice, is unclear. Here, we track the vocal development of Foxp2 heterozygous knockout (Foxp2+/-) mice and their wildtype (WT) littermates from juvenile to adult ages, and observe severe abnormalities in the courtship song of Foxp2+/- mice. In comparison to their WT littermates, Foxp2+/- mice vocalized less, produced shorter syllable sequences, and possessed an abnormal syllable inventory. In addition, Foxp2+/- song also exhibited irregular rhythmic structure, and its development did not follow the consistent trajectories observed in WT vocalizations. These results demonstrate that the Foxp2 gene is critical for normal vocal behavior in juvenile and adult mice, and that Foxp2 mutant mice may provide a tractable model system for the study of the gene's role in general vocal motor control.
Vocal repertoire of the social giant otter.

PubMed

Leuchtenberger, Caroline; Sousa-Lima, Renata; Duplaix, Nicole; Magnusson, William E; Mourão, Guilherme

2014-11-01

According to the "social intelligence hypothesis," species with complex social interactions have more sophisticated communication systems. Giant otters (Pteronura brasiliensis) live in groups with complex social interactions. It is likely that the vocal communication of giant otters is more sophisticated than previous studies suggest. The objectives of the current study were to describe the airborne vocal repertoire of giant otters in the Pantanal area of Brazil, to analyze call types within different behavioral contexts, and to correlate vocal complexity with level of sociability of mustelids to verify whether or not the result supports the social intelligence hypothesis. The behavior of nine giant otters groups was observed. Vocalizations recorded were acoustically and statistically analyzed to describe the species' repertoire. The repertoire was comprised by 15 sound types emitted in different behavioral contexts. The main behavioral contexts of each sound type were significantly associated with the acoustic variable ordination of different sound types. A strong correlation between vocal complexity and sociability was found for different species, suggesting that the communication systems observed in the family mustelidae support the social intelligence hypothesis.
Aeroelastic Model of Vocal-Fold Vibrating Element for Studying the Phonation Threshold

NASA Astrophysics Data System (ADS)

Horáček, J.; Švec, J. G.

2002-10-01

An original theoretical model for vibration onset of the vocal folds in the air-flow coming from the human subglottal tract is designed, which allows studying the influence of the physical properties of the vocal folds (e.g., geometrical shape, mass, viscosity) on their vibration characteristics (such as the natural frequencies, mode shapes of vibration and the thresholds of instability). The mathematical model of the vocal fold is designed as a simplified dynamic system of two degrees of freedom (rotation and translation) vibrating on an elastic foundation in the wall of a channel conveying air. An approximate unsteady one-dimensional flow theory for the inviscid incompressible fluid is presented for the phonatory air-flow. A generally defined shape of the vocal-fold surface is considered for expressing the unsteady aerodynamic forces in the glottis. The parameters of the mechanical part of the model, i.e., the mass, stiffness and damping matrices, are related to the geometry and material density of the vocal folds as well as to the fundamental natural frequency and damping known from experiments. The coupled numerical solution yields the vibration characteristics (natural frequencies, damping and mode shapes of vibration), including the instability thresholds of the aeroelastic system. The vibration characteristics obtained from the coupled numerical solution of the system appear to be in reasonable qualitative agreement with the physiological data and clinical observations. The model is particularly suitable for studying the phonation threshold, i.e., the onset of vibration of the vocal folds.
Visual classification of feral cat Felis silvestris catus vocalizations

PubMed Central

Owens, Jessica L.; Olsen, Mariana; Fontaine, Amy; Kloth, Christopher; Kershenbaum, Arik

2017-01-01

Abstract Cat vocal behavior, in particular, the vocal and social behavior of feral cats, is poorly understood, as are the differences between feral and fully domestic cats. The relationship between feral cat social and vocal behavior is important because of the markedly different ecology of feral and domestic cats, and enhanced comprehension of the repertoire and potential information content of feral cat calls can provide both better understanding of the domestication and socialization process, and improved welfare for feral cats undergoing adoption. Previous studies have used conflicting classification schemes for cat vocalizations, often relying on onomatopoeic or popular descriptions of call types (e.g., “miow”). We studied the vocalizations of 13 unaltered domestic cats that complied with our behavioral definition used to distinguish feral cats from domestic. A total of 71 acoustic units were extracted and visually analyzed for the construction of a hierarchical classification of vocal sounds, based on acoustic properties. We identified 3 major categories (tonal, pulse, and broadband) that further breakdown into 8 subcategories, and show a high degree of reliability when sounds are classified blindly by independent observers (Fleiss’ Kappa K = 0.863). Due to the limited behavioral contexts in this study, additional subcategories of cat vocalizations may be identified in the future, but our hierarchical classification system allows for the addition of new categories and new subcategories as they are described. This study shows that cat vocalizations are diverse and complex, and provides an objective and reliable classification system that can be used in future studies. PMID:29491992
Vocal Emotion of Humanoid Robots: A Study from Brain Mechanism

PubMed Central

Wang, Youhui; Hu, Xiaohua; Zhou, Jie; Kuo, Taitzong

2014-01-01

Driven by rapid ongoing advances in humanoid robot, increasing attention has been shifted into the issue of emotion intelligence of AI robots to facilitate the communication between man-machines and human beings, especially for the vocal emotion in interactive system of future humanoid robots. This paper explored the brain mechanism of vocal emotion by studying previous researches and developed an experiment to observe the brain response by fMRI, to analyze vocal emotion of human beings. Findings in this paper provided a new approach to design and evaluate the vocal emotion of humanoid robots based on brain mechanism of human beings. PMID:24587712
Acoustic signatures of sound source-tract coupling.

PubMed

Arneodo, Ezequiel M; Perl, Yonatan Sanz; Mindlin, Gabriel B

2011-04-01

Birdsong is a complex behavior, which results from the interaction between a nervous system and a biomechanical peripheral device. While much has been learned about how complex sounds are generated in the vocal organ, little has been learned about the signature on the vocalizations of the nonlinear effects introduced by the acoustic interactions between a sound source and the vocal tract. The variety of morphologies among bird species makes birdsong a most suitable model to study phenomena associated to the production of complex vocalizations. Inspired by the sound production mechanisms of songbirds, in this work we study a mathematical model of a vocal organ, in which a simple sound source interacts with a tract, leading to a delay differential equation. We explore the system numerically, and by taking it to the weakly nonlinear limit, we are able to examine its periodic solutions analytically. By these means we are able to explore the dynamics of oscillatory solutions of a sound source-tract coupled system, which are qualitatively different from those of a sound source-filter model of a vocal organ. Nonlinear features of the solutions are proposed as the underlying mechanisms of observed phenomena in birdsong, such as unilaterally produced "frequency jumps," enhancement of resonances, and the shift of the fundamental frequency observed in heliox experiments. ©2011 American Physical Society
Acoustic signatures of sound source-tract coupling

PubMed Central

Arneodo, Ezequiel M.; Perl, Yonatan Sanz; Mindlin, Gabriel B.

2014-01-01

Birdsong is a complex behavior, which results from the interaction between a nervous system and a biomechanical peripheral device. While much has been learned about how complex sounds are generated in the vocal organ, little has been learned about the signature on the vocalizations of the nonlinear effects introduced by the acoustic interactions between a sound source and the vocal tract. The variety of morphologies among bird species makes birdsong a most suitable model to study phenomena associated to the production of complex vocalizations. Inspired by the sound production mechanisms of songbirds, in this work we study a mathematical model of a vocal organ, in which a simple sound source interacts with a tract, leading to a delay differential equation. We explore the system numerically, and by taking it to the weakly nonlinear limit, we are able to examine its periodic solutions analytically. By these means we are able to explore the dynamics of oscillatory solutions of a sound source-tract coupled system, which are qualitatively different from those of a sound source-filter model of a vocal organ. Nonlinear features of the solutions are proposed as the underlying mechanisms of observed phenomena in birdsong, such as unilaterally produced “frequency jumps,” enhancement of resonances, and the shift of the fundamental frequency observed in heliox experiments. PMID:21599213
High-Resolution, Non-Invasive Imaging of Upper Vocal Tract Articulators Compatible with Human Brain Recordings

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bouchard, Kristofer E.; Conant, David F.; Anumanchipalli, Gopala K.

A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial-especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship acrossmore » speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics.« less

High-Resolution, Non-Invasive Imaging of Upper Vocal Tract Articulators Compatible with Human Brain Recordings

PubMed Central

Anumanchipalli, Gopala K.; Dichter, Benjamin; Chaisanguanthum, Kris S.; Johnson, Keith; Chang, Edward F.

2016-01-01

A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial—especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship across speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics. PMID:27019106
High-Resolution, Non-Invasive Imaging of Upper Vocal Tract Articulators Compatible with Human Brain Recordings

DOE PAGES

Bouchard, Kristofer E.; Conant, David F.; Anumanchipalli, Gopala K.; ...

2016-03-28

A complete neurobiological understanding of speech motor control requires determination of the relationship between simultaneously recorded neural activity and the kinematics of the lips, jaw, tongue, and larynx. Many speech articulators are internal to the vocal tract, and therefore simultaneously tracking the kinematics of all articulators is nontrivial-especially in the context of human electrophysiology recordings. Here, we describe a noninvasive, multi-modal imaging system to monitor vocal tract kinematics, demonstrate this system in six speakers during production of nine American English vowels, and provide new analysis of such data. Classification and regression analysis revealed considerable variability in the articulator-to-acoustic relationship acrossmore » speakers. Non-negative matrix factorization extracted basis sets capturing vocal tract shapes allowing for higher vowel classification accuracy than traditional methods. Statistical speech synthesis generated speech from vocal tract measurements, and we demonstrate perceptual identification. We demonstrate the capacity to predict lip kinematics from ventral sensorimotor cortical activity. These results demonstrate a multi-modal system to non-invasively monitor articulator kinematics during speech production, describe novel analytic methods for relating kinematic data to speech acoustics, and provide the first decoding of speech kinematics from electrocorticography. These advances will be critical for understanding the cortical basis of speech production and the creation of vocal prosthetics.« less
Indication of a Lombard vocal response in the St. Lawrence River beluga

NASA Astrophysics Data System (ADS)

Scheifele, P. M.; Andrew, S.; Cooper, R. A.; Darre, M.; Musiek, F. E.; Max, L.

2005-03-01

Noise pollution is recognized as a potential danger to marine mammals in general, and to the St. Lawrence beluga in particular. One method of determining the impacts of noise on an animal's communication is to observe a natural and repeatable response of the vocal system to variations in noise level. This is accomplished by observing intensity changes in animal vocalizations in response to environmental noise. One such response observed in humans, songbirds, and some primates is the Lombard vocal response. This response represents a vocal system reaction manifested by changes in vocalization level in direct response to changes in the noise field. In this research, a population of belugas in the St. Lawrence River Estuary was tested to determine whether a Lombard response existed by using hidden Markhov-classified vocalizations as targets for acoustical analyses. Correlation and regression analyses of signals and noise indicated that the phenomenon does exist. Further, results of human subjects experiments [Egan, J. J. (1966), Ph.D. dissertation; Scheifele, P. M. (2003), Ph.D. dissertation], along with previously reported data from other animal species, are similar to those exhibited by the belugas. Overall, findings suggest that typical noise levels in the St. Lawrence River Estuary have a detectable effect on the communication of the beluga. .
Vocal Fold Epithelial Barrier in Health and Injury A Research Review

PubMed Central

Levendoski, Elizabeth Erickson; Leydon, Ciara; Thibeault, Susan L.

2015-01-01

Purpose Vocal fold epithelium is composed of layers of individual epithelial cells joined by junctional complexes constituting a unique interface with the external environment. This barrier provides structural stability to the vocal folds and protects underlying connective tissue from injury while being nearly continuously exposed to potentially hazardous insults including environmental or systemic-based irritants such as pollutants and reflux, surgical procedures, and vibratory trauma. Small disruptions in the epithelial barrier may have a large impact on susceptibility to injury and overall vocal health. The purpose of this article is to provide a broad-based review of our current knowledge of the vocal fold epithelial barrier. Methods A comprehensive review of the literature was conducted. Details of the structure of the vocal fold epithelial barrier are presented and evaluated in the context of function in injury and pathology. The importance of the epithelial-associated vocal fold mucus barrier is also introduced. Results/Conclusions Information presented in this review is valuable for clinicians and researchers as it highlights the importance of this understudied portion of the vocal folds to overall vocal health and disease. Prevention and treatment of injury to the epithelial barrier is a significant area awaiting further investigation. PMID:24686981
Gestures, vocalizations, and memory in language origins.

PubMed

Aboitiz, Francisco

2012-01-01

THIS ARTICLE DISCUSSES THE POSSIBLE HOMOLOGIES BETWEEN THE HUMAN LANGUAGE NETWORKS AND COMPARABLE AUDITORY PROJECTION SYSTEMS IN THE MACAQUE BRAIN, IN AN ATTEMPT TO RECONCILE TWO EXISTING VIEWS ON LANGUAGE EVOLUTION: one that emphasizes hand control and gestures, and the other that emphasizes auditory-vocal mechanisms. The capacity for language is based on relatively well defined neural substrates whose rudiments have been traced in the non-human primate brain. At its core, this circuit constitutes an auditory-vocal sensorimotor circuit with two main components, a "ventral pathway" connecting anterior auditory regions with anterior ventrolateral prefrontal areas, and a "dorsal pathway" connecting auditory areas with parietal areas and with posterior ventrolateral prefrontal areas via the arcuate fasciculus and the superior longitudinal fasciculus. In humans, the dorsal circuit is especially important for phonological processing and phonological working memory, capacities that are critical for language acquisition and for complex syntax processing. In the macaque, the homolog of the dorsal circuit overlaps with an inferior parietal-premotor network for hand and gesture selection that is under voluntary control, while vocalizations are largely fixed and involuntary. The recruitment of the dorsal component for vocalization behavior in the human lineage, together with a direct cortical control of the subcortical vocalizing system, are proposed to represent a fundamental innovation in human evolution, generating an inflection point that permitted the explosion of vocal language and human communication. In this context, vocal communication and gesturing have a common history in primate communication.
Improvement of a Vocal Fold Imaging System

DOE Office of Scientific and Technical Information (OSTI.GOV)

Krauter, K. G.

Medical professionals can better serve their patients through continual update of their imaging tools. A wide range of pathologies and disease may afflict human vocal cords or, as they’re also known, vocal folds. These diseases can affect human speech hampering the ability of the patient to communicate. Vocal folds must be opened for breathing and the closed to produce speech. Currently methodologies to image markers of potential pathologies are difficult to use and often fail to detect early signs of disease. These current methodologies rely on a strobe light and slower frame rate camera in an attempt to obtain imagesmore » as the vocal folds travel over the full extent of their motion.« less
A portable high-speed camera system for vocal fold examinations.

PubMed

Hertegård, Stellan; Larsson, Hans

2014-11-01

In this article, we present a new portable low-cost system for high-speed examinations of the vocal folds. Analysis of glottal vibratory parameters from the high-speed recordings is compared with videostroboscopic recordings. The high-speed system is built around a Fastec 1 monochrome camera, which is used with newly developed software, High-Speed Studio (HSS). The HSS has options for video/image recording, contains a database, and has a set of analysis options. The Fastec/HSS system has been used clinically since 2011 in more than 2000 patient examinations and recordings. The Fastec 1 camera has sufficient time resolution (≥4000 frames/s) and light sensitivity (ISO 3200) to produce images for detailed analyses of parameters pertinent to vocal fold function. The camera can be used with both rigid and flexible endoscopes. The HSS software includes options for analyses of glottal vibrations, such as kymogram, phase asymmetry, glottal area variation, open and closed phase, and angle of vocal fold abduction. It can also be used for separate analysis of the left and vocal fold movements, including maximum speed during opening and closing, a parameter possibly related to vocal fold elasticity. A blinded analysis of 32 patients with various voice disorders examined with both the Fastec/HSS system and videostroboscopy showed that the high-speed recordings were significantly better for the analysis of glottal parameters (eg, mucosal wave and vibration asymmetry). The monochrome high-speed system can be used in daily clinical work within normal clinical time limits for patient examinations. A detailed analysis can be made of voice disorders and laryngeal pathology at a relatively low cost. Copyright © 2014 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Divergent expression of 11beta-hydroxysteroid dehydrogenase and 11beta-hydroxylase genes between male morphs in the central nervous system, sonic muscle and testis of a vocal fish.

PubMed

Arterbery, Adam S; Deitcher, David L; Bass, Andrew H

2010-05-15

The vocalizing midshipman fish, Porichthys notatus, has two male morphs that exhibit alternative mating tactics. Only territorial males acoustically court females with long duration (minutes to >1h) calls, whereas sneaker males attempt to steal fertilizations. During the breeding season, morph-specific tactics are paralleled by a divergence in relative testis and vocal muscle size, plasma levels of the androgen 11-ketotestosterone (11KT) and the glucocorticoid cortisol, and mRNA expression levels in the central nervous system (CNS) of the steroid-synthesizing enzyme aromatase (estrogen synthase). Here, we tested the hypothesis that the midshipman's two male morphs would further differ in the CNS, as well as in the testis and vocal muscle, in mRNA abundance for the enzymes 11beta-hydroxylase (11betaH) and 11beta-hydroxysteroid dehydrogenase (11betaHSD) that directly regulate both 11KT and cortisol synthesis. Quantitative real-time PCR demonstrated male morph-specific profiles for both enzymes. Territorial males had higher 11betaH and 11betaHSD mRNA levels in testis and vocal muscle. By contrast, sneaker males had the higher CNS expression, especially for 11betaHSD, in the region containing an expansive vocal pacemaker circuit that directly determines the temporal attributes of natural calls. We propose for territorial males that higher enzyme expression in testis underlies its greater plasma 11KT levels, which in vocal muscle provides both gluconeogenic and androgenic support for its long duration calling. We further propose for sneaker males that higher enzyme expression in the vocal CNS contributes to known cortisol-specific effects on its vocal physiology. Copyright 2010 Elsevier Inc. All rights reserved.
The VAMOS Ocean-Cloud-Atmosphere-Land Study Regional Experiment (VOCALS-REx): Goals, platforms, and field operations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wood, R.; Springston, S.; Mechoso, C. R.

2011-01-21

The VAMOS Ocean-Cloud-Atmosphere-Land Study Regional Experiment (VOCALS-REx) was an international field program designed to make observations of poorly understood but critical components of the coupled climate system of the southeast Pacific. This region is characterized by strong coastal upwelling, the coolest SSTs in the tropical belt, and is home to the largest subtropical stratocumulus deck on Earth. The field intensive phase of VOCALS-REx took place during October and November 2008 and constitutes a critical part of a broader CLIVAR program (VOCALS) designed to develop and promote scientific activities leading to improved understanding, model simulations, and predictions of the southeastern Pacificmore » (SEP) coupled ocean-atmosphere-land system, on diurnal to interannual timescales. The other major components of VOCALS are a modeling program with a model hierarchy ranging from the local to global scales, and a suite of extended observations from regular research cruises, instrumented moorings, and satellites. The two central themes of VOCALS-REx focus upon (a) links between aerosols, clouds and precipitation and their impacts on marine stratocumulus radiative properties, and (b) physical and chemical couplings between the upper ocean and the lower atmosphere, including the role that mesoscale ocean eddies play. A set of hypotheses designed to be tested with the combined field, monitoring and modeling work in VOCALS is presented here. A further goal of VOCALS-REx is to provide datasets for the evaluation and improvement of large-scale numerical models. VOCALS-REx involved five research aircraft, two ships and two surface sites in northern Chile. We describe the instrument payloads and key mission strategies for these platforms and give a summary of the missions conducted.« less
A Primary Role for Nucleus Accumbens and Related Limbic Network in Vocal Tics.

PubMed

McCairn, Kevin W; Nagai, Yuji; Hori, Yukiko; Ninomiya, Taihei; Kikuchi, Erika; Lee, Ju-Young; Suhara, Tetsuya; Iriki, Atsushi; Minamimoto, Takafumi; Takada, Masahiko; Isoda, Masaki; Matsumoto, Masayuki

2016-01-20

Inappropriate vocal expressions, e.g., vocal tics in Tourette syndrome, severely impact quality of life. Neural mechanisms underlying vocal tics remain unexplored because no established animal model representing the condition exists. We report that unilateral disinhibition of the nucleus accumbens (NAc) generates vocal tics in monkeys. Whole-brain PET imaging identified prominent, bilateral limbic cortico-subcortical activation. Local field potentials (LFPs) developed abnormal spikes in the NAc and the anterior cingulate cortex (ACC). Vocalization could occur without obvious LFP spikes, however, when phase-phase coupling of alpha oscillations were accentuated between the NAc, ACC, and the primary motor cortex. These findings contrasted with myoclonic motor tics induced by disinhibition of the dorsolateral putamen, where PET activity was confined to the ipsilateral sensorimotor system and LFP spikes always preceded motor tics. We propose that vocal tics emerge as a consequence of dysrhythmic alpha coupling between critical nodes in the limbic and motor networks. VIDEO ABSTRACT. Copyright © 2016 Elsevier Inc. All rights reserved.
Knockout of Foxp2 disrupts vocal development in mice

PubMed Central

Castellucci, Gregg A.; McGinley, Matthew J.; McCormick, David A.

2016-01-01

The FOXP2 gene is important for the development of proper speech motor control in humans. However, the role of the gene in general vocal behavior in other mammals, including mice, is unclear. Here, we track the vocal development of Foxp2 heterozygous knockout (Foxp2+/−) mice and their wildtype (WT) littermates from juvenile to adult ages, and observe severe abnormalities in the courtship song of Foxp2+/− mice. In comparison to their WT littermates, Foxp2+/− mice vocalized less, produced shorter syllable sequences, and possessed an abnormal syllable inventory. In addition, Foxp2+/− song also exhibited irregular rhythmic structure, and its development did not follow the consistent trajectories observed in WT vocalizations. These results demonstrate that the Foxp2 gene is critical for normal vocal behavior in juvenile and adult mice, and that Foxp2 mutant mice may provide a tractable model system for the study of the gene’s role in general vocal motor control. PMID:26980647
Tissue engineering therapies for the vocal fold lamina propria.

PubMed

Kutty, Jaishankar K; Webb, Ken

2009-09-01

The vocal folds are laryngeal connective tissues with complex matrix composition/organization that provide the viscoelastic mechanical properties required for voice production. Vocal fold injury results in alterations in tissue structure and corresponding changes in tissue biomechanics that reduce vocal quality. Recent work has begun to elucidate the biochemical changes underlying injury-induced pathology and to apply tissue engineering principles to the prevention and reversal of vocal fold scarring. Based on the extensive history of injectable biomaterials in laryngeal surgery, a major focus of regenerative therapies has been the development of novel scaffolds with controlled in vivo residence time and viscoelastic properties approximating the native tissue. Additional strategies have included cell transplantation and delivery of the antifibrotic cytokine hepatocyte growth factor, as well as investigation of the effects of the unique vocal fold vibratory microenvironment using in vitro dynamic culture systems. Recent achievements of significant reductions in fibrosis and improved recovery of native tissue viscoelasticity and vibratory/functional performance in animal models are rapidly moving vocal fold tissue engineering toward clinical application.
Simulation based estimation of dynamic mechanical properties for viscoelastic materials used for vocal fold models

NASA Astrophysics Data System (ADS)

Rupitsch, Stefan J.; Ilg, Jürgen; Sutor, Alexander; Lerch, Reinhard; Döllinger, Michael

2011-08-01

In order to obtain a deeper understanding of the human phonation process and the mechanisms generating sound, realistic setups are built up containing artificial vocal folds. Usually, these vocal folds consist of viscoelastic materials (e.g., polyurethane mixtures). Reliable simulation based studies on the setups require the mechanical properties of the utilized viscoelastic materials. The aim of this work is the identification of mechanical material parameters (Young's modulus, Poisson's ratio, and loss factor) for those materials. Therefore, we suggest a low-cost measurement setup, the so-called vibration transmission analyzer (VTA) enabling to analyze the transfer behavior of viscoelastic materials for propagating mechanical waves. With the aid of a mathematical Inverse Method, the material parameters are adjusted in a convenient way so that the simulation results coincide with the measurement results for the transfer behavior. Contrary to other works, we determine frequency dependent functions for the mechanical properties characterizing the viscoelastic material in the frequency range of human speech (100-250 Hz). The results for three different materials clearly show that the Poisson's ratio is close to 0.5 and that the Young's modulus increases with higher frequencies. For a frequency of 400 Hz, the Young's modulus of the investigated viscoelastic materials is approximately 80% higher than for the static case (0 Hz). We verify the identified mechanical properties with experiments on fabricated vocal fold models. Thereby, only small deviations between measurements and simulations occur.
Neural coding of syntactic structure in learned vocalizations in the songbird.

PubMed

Fujimoto, Hisataka; Hasegawa, Taku; Watanabe, Dai

2011-07-06

Although vocal signals including human languages are composed of a finite number of acoustic elements, complex and diverse vocal patterns can be created from combinations of these elements, linked together by syntactic rules. To enable such syntactic vocal behaviors, neural systems must extract the sequence patterns from auditory information and establish syntactic rules to generate motor commands for vocal organs. However, the neural basis of syntactic processing of learned vocal signals remains largely unknown. Here we report that the basal ganglia projecting premotor neurons (HVC(X) neurons) in Bengalese finches represent syntactic rules that generate variable song sequences. When vocalizing an alternative transition segment between song elements called syllables, sparse burst spikes of HVC(X) neurons code the identity of a specific syllable type or a specific transition direction among the alternative trajectories. When vocalizing a variable repetition sequence of the same syllable, HVC(X) neurons not only signal the initiation and termination of the repetition sequence but also indicate the progress and state-of-completeness of the repetition. These different types of syntactic information are frequently integrated within the activity of single HVC(X) neurons, suggesting that syntactic attributes of the individual neurons are not programmed as a basic cellular subtype in advance but acquired in the course of vocal learning and maturation. Furthermore, some auditory-vocal mirroring type HVC(X) neurons display transition selectivity in the auditory phase, much as they do in the vocal phase, suggesting that these songbirds may extract syntactic rules from auditory experience and apply them to form their own vocal behaviors.
Respiratory and Laryngeal Changes with Vocal Loading in Younger and Older Individuals

ERIC Educational Resources Information Center

Sundarrajan, Anusha; Huber, Jessica E.; Sivasankar, M. Preeti

2017-01-01

Purpose: The objective of the current study was to investigate the effects of age and vocal loading on the respiratory and laryngeal systems. Method: Fourteen younger (M = 20 years) and 13 older (M = 75 years) healthy individuals participated in a 40-min vocal loading challenge in the presence of 70-dB background noise. Respiratory kinematic and…
The vocal monotony of monogamy

NASA Astrophysics Data System (ADS)

Thomas, Jeanette

2003-04-01

There are four phocids in waters around Antarctica: Weddell, leopard, crabeater, and Ross seals. These four species provide a unique opportunity to examine underwater vocal behavior in species sharing the same ecosystem. Some species live in pack ice, others in factice, but all are restricted to the Antarctic or sub-Antarctic islands. All breed and produce vocalizations under water. Social systems range from polygyny in large breeding colonies, to serial monogamy, to solitary species. The type of mating system influences the number of underwater vocalizations in the repertoire, with monogamous seals producing only a single call, polygynous species producing up to 35 calls, and solitary species an intermediate number of about 10 calls. Breeding occurs during the austral spring and each species carves-out an acoustic niche for communicating, with species using different frequency ranges, temporal patterns, and amplitude changes to convey their species-specific calls and presumably reduce acoustic competition. Some species exhibit geographic variations in their vocalizations around the continent, which may reflect discrete breeding populations. Some seals become silent during a vulnerable time of predation by killer whales, perhaps to avoid detection. Overall, vocalizations of these seals exhibit adaptive characteristics that reflect the co-evolution among species in the same ecosystem.
Early development of turn-taking with parents shapes vocal acoustics in infant marmoset monkeys

PubMed Central

Takahashi, Daniel Y.; Fenley, Alicia R.; Ghazanfar, Asif A.

2016-01-01

In humans, vocal turn-taking is a ubiquitous form of social interaction. It is a communication system that exhibits the properties of a dynamical system: two individuals become coupled to each other via acoustic exchanges and mutually affect each other. Human turn-taking develops during the first year of life. We investigated the development of vocal turn-taking in infant marmoset monkeys, a New World species whose adult vocal behaviour exhibits the same universal features of human turn-taking. We find that marmoset infants undergo the same trajectory of change for vocal turn-taking as humans, and do so during the same life-history stage. Our data show that turn-taking by marmoset infants depends on the development of self-monitoring, and that contingent parental calls elicit more mature-sounding calls from infants. As in humans, there was no evidence that parental feedback affects the rate of turn-taking maturation. We conclude that vocal turn-taking by marmoset monkeys and humans is an instance of convergent evolution, possibly as a result of pressures on both species to adopt a cooperative breeding strategy and increase volubility. PMID:27069047
Nocturnal "humming" vocalizations: adding a piece to the puzzle of giraffe vocal communication.

PubMed

Baotic, Anton; Sicks, Florian; Stoeger, Angela S

2015-09-09

Recent research reveals that giraffes (Giraffa camelopardalis sp.) exhibit a socially structured, fission-fusion system. In other species possessing this kind of society, information exchange is important and vocal communication is usually well developed. But is this true for giraffes? Giraffes are known to produce sounds, but there is no evidence that they use vocalizations for communication. Reports on giraffe vocalizations are mainly anecdotal and the missing acoustic descriptions make it difficult to establish a call nomenclature. Despite inconclusive evidence to date, it is widely assumed that giraffes produce infrasonic vocalizations similar to elephants. In order to initiate a more detailed investigation of the vocal communication in giraffes, we collected data of captive individuals during day and night. We particularly focussed on detecting tonal, infrasonic or sustained vocalizations. We collected over 947 h of audio material in three European zoos and quantified the spectral and temporal components of acoustic signals to obtain an accurate set of acoustic parameters. Besides the known burst, snorts and grunts, we detected harmonic, sustained and frequency-modulated "humming" vocalizations during night recordings. None of the recorded vocalizations were within the infrasonic range. These results show that giraffes do produce vocalizations, which, based on their acoustic structure, might have the potential to function as communicative signals to convey information about the physical and motivational attributes of the caller. The data further reveal that the assumption of infrasonic communication in giraffes needs to be considered with caution and requires further investigations in future studies.
Classification for animal vocal fold surgery: resection margins impact histological outcomes of vocal fold injury.

PubMed

Imaizumi, Mitsuyoshi; Thibeault, Susan L; Leydon, Ciara

2014-11-01

Extent of vocal fold injury impacts the nature and timing of wound healing and voice outcomes. However, depth and extent of the lesion created to study wound healing in animal models vary across studies, likely contributing to different outcomes. Our goal was to create a surgery classification system to enable comparison of postoperative outcomes across animal vocal fold wound-healing studies. Prospective, controlled animal study. Rats underwent one of three types of unilateral vocal fold surgeries classified by depth and length of resection. The surgeries were: for subepithelial injury, resection of epithelium and superficial layer of the lamina propria at the midmembranous portion of the vocal fold; for transmucosal injury, resection of epithelium and lamina propria; and for transmuscular injury, resection of epithelium, lamina propria, and superficial portion of the vocalis muscle. Wound healing was evaluated histologically at various time points up to 35 days postinjury. Complete healing occurred by 14 days postsurgery for subepithelial injury, and by day 35 for transmucosal injury. Injury remained present at day 35 for transmuscular injury. Timing and completeness of healing varied by extent and depth of resection. Scarless healing occurred rapidly following subepithelial injury, whereas scarring was observed at 5 weeks after transmuscular injury. The proposed classification system may facilitate comparison of surgical outcomes across vocal fold wound-healing studies. N/A. © 2014 The American Laryngological, Rhinological and Otological Society, Inc.
The respiratory-vocal system of songbirds: anatomy, physiology, and neural control.

PubMed

Schmidt, Marc F; Martin Wild, J

2014-01-01

This wide-ranging review presents an overview of the respiratory-vocal system in songbirds, which are the only other vertebrate group known to display a degree of respiratory control during song rivalling that of humans during speech; this despite the fact that the peripheral components of both the respiratory and vocal systems differ substantially in the two groups. We first provide a brief description of these peripheral components in songbirds (lungs, air sacs and respiratory muscles, vocal organ (syrinx), upper vocal tract) and then proceed to a review of the organization of central respiratory-related neurons in the spinal cord and brainstem, the latter having an organization fundamentally similar to that of the ventral respiratory group of mammals. The second half of the review describes the nature of the motor commands generated in a specialized "cortical" song control circuit and how these might engage brainstem respiratory networks to shape the temporal structure of song. We also discuss a bilaterally projecting "respiratory-thalamic" pathway that links the respiratory system to "cortical" song control nuclei. This necessary pathway for song originates in the brainstem's primary inspiratory center and is hypothesized to play a vital role in synchronizing song motor commands both within and across hemispheres. © 2014 Elsevier B.V. All rights reserved.

The respiratory-vocal system of songbirds: Anatomy, physiology, and neural control

PubMed Central

Schmidt, Marc F.; Wild, J. Martin

2015-01-01

This wide-ranging review presents an overview of the respiratory-vocal system in songbirds, which are the only other vertebrate group known to display a degree of respiratory control during song rivalling that of humans during speech; this despite the fact that the peripheral components of both the respiratory and vocal systems differ substantially in the two groups. We first provide a brief description of these peripheral components in songbirds (lungs, air sacs and respiratory muscles, vocal organ (syrinx), upper vocal tract) and then proceed to a review of the organization of central respiratory-related neurons in the spinal cord and brainstem, the latter having an organization fundamentally similar to that of the ventral respiratory group of mammals. The second half of the review describes the nature of the motor commands generated in a specialized “cortical” song control circuit and how these might engage brainstem respiratory networks to shape the temporal structure of song. We also discuss a bilaterally projecting “respiratory-thalamic” pathway that links the respiratory system to “cortical” song control nuclei. This necessary pathway for song originates in the brainstem’s primary inspiratory center and is hypothesized to play a vital role in synchronizing song motor commands both within and across hemispheres. PMID:25194204
Representation of complex vocalizations in the Lusitanian toadfish auditory system: evidence of fine temporal, frequency and amplitude discrimination

PubMed Central

Vasconcelos, Raquel O.; Fonseca, Paulo J.; Amorim, M. Clara P.; Ladich, Friedrich

2011-01-01

Many fishes rely on their auditory skills to interpret crucial information about predators and prey, and to communicate intraspecifically. Few studies, however, have examined how complex natural sounds are perceived in fishes. We investigated the representation of conspecific mating and agonistic calls in the auditory system of the Lusitanian toadfish Halobatrachus didactylus, and analysed auditory responses to heterospecific signals from ecologically relevant species: a sympatric vocal fish (meagre Argyrosomus regius) and a potential predator (dolphin Tursiops truncatus). Using auditory evoked potential (AEP) recordings, we showed that both sexes can resolve fine features of conspecific calls. The toadfish auditory system was most sensitive to frequencies well represented in the conspecific vocalizations (namely the mating boatwhistle), and revealed a fine representation of duration and pulsed structure of agonistic and mating calls. Stimuli and corresponding AEP amplitudes were highly correlated, indicating an accurate encoding of amplitude modulation. Moreover, Lusitanian toadfish were able to detect T. truncatus foraging sounds and A. regius calls, although at higher amplitudes. We provide strong evidence that the auditory system of a vocal fish, lacking accessory hearing structures, is capable of resolving fine features of complex vocalizations that are probably important for intraspecific communication and other relevant stimuli from the auditory scene. PMID:20861044
In Vivo Measurement of Pediatric Vocal Fold Motion Using Structured Light Laser Projection

PubMed Central

Patel, Rita R.; Donohue, Kevin D.; Lau, Daniel; Unnikrishnan, Harikrishnan

2013-01-01

Summary Objective The aim of the study was to present the development of a miniature structured light laser projection endoscope and to quantify vocal fold length and vibratory features related to impact stress of the pediatric glottis using high-speed imaging. Study Design The custom-developed laser projection system consists of a green laser with a 4-mm diameter optics module at the tip of the endoscope, projecting 20 vertical laser lines on the glottis. Measurements of absolute phonatory vocal fold length, membranous vocal fold length, peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity were obtained in five children (6–9 years), two adult male and three adult female participants without voice disorders, and one child (10 years) with bilateral vocal fold nodules during modal phonation. Results Independent measurements made on the glottal length of a vocal fold phantom demonstrated a 0.13 mm bias error with a standard deviation of 0.23 mm, indicating adequate precision and accuracy for measuring vocal fold structures and displacement. First, in vivo measurements of amplitude-to-length ratio, peak closing velocity, and impact velocity during phonation in pediatric population and a child with vocal fold nodules are reported. Conclusion The proposed laser projection system can be used to obtain in vivo measurements of absolute length and vibratory features in children and adults. Children have large amplitude-to-length ratio compared with typically developing adults, whereas nodules result in larger peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity compared with typically developing children. PMID:23809569
Humans recognize emotional arousal in vocalizations across all classes of terrestrial vertebrates: evidence for acoustic universals.

PubMed

Filippi, Piera; Congdon, Jenna V; Hoang, John; Bowling, Daniel L; Reber, Stephan A; Pašukonis, Andrius; Hoeschele, Marisa; Ocklenburg, Sebastian; de Boer, Bart; Sturdy, Christopher B; Newen, Albert; Güntürkün, Onur

2017-07-26

Writing over a century ago, Darwin hypothesized that vocal expression of emotion dates back to our earliest terrestrial ancestors. If this hypothesis is true, we should expect to find cross-species acoustic universals in emotional vocalizations. Studies suggest that acoustic attributes of aroused vocalizations are shared across many mammalian species, and that humans can use these attributes to infer emotional content. But do these acoustic attributes extend to non-mammalian vertebrates? In this study, we asked human participants to judge the emotional content of vocalizations of nine vertebrate species representing three different biological classes-Amphibia, Reptilia (non-aves and aves) and Mammalia. We found that humans are able to identify higher levels of arousal in vocalizations across all species. This result was consistent across different language groups (English, German and Mandarin native speakers), suggesting that this ability is biologically rooted in humans. Our findings indicate that humans use multiple acoustic parameters to infer relative arousal in vocalizations for each species, but mainly rely on fundamental frequency and spectral centre of gravity to identify higher arousal vocalizations across species. These results suggest that fundamental mechanisms of vocal emotional expression are shared among vertebrates and could represent a homologous signalling system. © 2017 The Author(s).
Effects of Voice Harmonic Complexity on ERP Responses to Pitch-Shifted Auditory Feedback

PubMed Central

Behroozmand, Roozbeh; Korzyukov, Oleg; Larson, Charles R.

2011-01-01

Objective The present study investigated the neural mechanisms of voice pitch control for different levels of harmonic complexity in the auditory feedback. Methods Event-related potentials (ERPs) were recorded in response to +200 cents pitch perturbations in the auditory feedback of self-produced natural human vocalizations, complex and pure tone stimuli during active vocalization and passive listening conditions. Results During active vocal production, ERP amplitudes were largest in response to pitch shifts in the natural voice, moderately large for non-voice complex stimuli and smallest for the pure tones. However, during passive listening, neural responses were equally large for pitch shifts in voice and non-voice complex stimuli but still larger than that for pure tones. Conclusions These findings suggest that pitch change detection is facilitated for spectrally rich sounds such as natural human voice and non-voice complex stimuli compared with pure tones. Vocalization-induced increase in neural responses for voice feedback suggests that sensory processing of naturally-produced complex sounds such as human voice is enhanced by means of motor-driven mechanisms (e.g. efference copies) during vocal production. Significance This enhancement may enable the audio-vocal system to more effectively detect and correct for vocal errors in the feedback of natural human vocalizations to maintain an intended vocal output for speaking. PMID:21719346
Asymmetric spatiotemporal chaos induced by a polypoid mass in the excised larynx

PubMed Central

Zhang, Yu; Jiang, Jack J.

2008-01-01

In this paper, asymmetric spatiotemporal chaos induced by a polypoid mass simulating the laryngeal pathology of a vocal polyp is experimentally observed using high-speed imaging in an excised larynx. Spatiotemporal analysis reveals that the normal vocal folds show spatiotemporal correlation and symmetry. Normal vocal fold vibrations are dominated mainly by the first vibratory eigenmode. However, pathological vocal folds with a polypoid mass show broken symmetry and spatiotemporal irregularity. The spatial correlation is decreased. The pathological vocal folds spread vibratory energy across a large number of eigenmodes and induce asymmetric spatiotemporal chaos. High-order eigenmodes show complicated dynamics. Spatiotemporal analysis provides a valuable biomedical application for investigating the spatiotemporal chaotic dynamics of pathological vocal fold systems with a polypoid mass and may represent a valuable clinical tool for the detection of laryngeal mass lesion using high-speed imaging. PMID:19123612
Mapping the distribution of language related genes FoxP1, FoxP2, and CntnaP2 in the brains of vocal learning bat species.

PubMed

Rodenas-Cuadrado, Pedro M; Mengede, Janine; Baas, Laura; Devanna, Paolo; Schmid, Tobias A; Yartsev, Michael; Firzlaff, Uwe; Vernes, Sonja C

2018-06-01

Genes including FOXP2, FOXP1, and CNTNAP2, have been implicated in human speech and language phenotypes, pointing to a role in the development of normal language-related circuitry in the brain. Although speech and language are unique to humans a comparative approach is possible by addressing language-relevant traits in animal systems. One such trait, vocal learning, represents an essential component of human spoken language, and is shared by cetaceans, pinnipeds, elephants, some birds and bats. Given their vocal learning abilities, gregarious nature, and reliance on vocalizations for social communication and navigation, bats represent an intriguing mammalian system in which to explore language-relevant genes. We used immunohistochemistry to detail the distribution of FoxP2, FoxP1, and Cntnap2 proteins, accompanied by detailed cytoarchitectural histology in the brains of two vocal learning bat species; Phyllostomus discolor and Rousettus aegyptiacus. We show widespread expression of these genes, similar to what has been previously observed in other species, including humans. A striking difference was observed in the adult P. discolor bat, which showed low levels of FoxP2 expression in the cortex that contrasted with patterns found in rodents and nonhuman primates. We created an online, open-access database within which all data can be browsed, searched, and high resolution images viewed to single cell resolution. The data presented herein reveal regions of interest in the bat brain and provide new opportunities to address the role of these language-related genes in complex vocal-motor and vocal learning behaviors in a mammalian model system. © 2018 The Authors The Journal of Comparative Neurology Published by Wiley Periodicals, Inc.
Individual killer whale vocal variation during intra-group behavioral dynamics

NASA Astrophysics Data System (ADS)

Grebner, Dawn M.

The scientific goal of this dissertation was to carefully study the signal structure of killer whale communications and vocal complexity and link them to behavioral circumstances. The overall objective of this research sought to provide insight into killer whale call content and usage which may be conveying information to conspecifics in order to maintain group cohesion. Data were collected in the summers of 2006 and 2007 in Johnstone Strait, British Columbia. For both individuals and small groups, vocalizations were isolated using a triangular hydrophone array and the behavioral movement patterns were captured by a theodolite and video camera positioned on a cliff overlooking the hyrophone locations. This dissertation is divided into four analysis chapters. In Chapter 3, discriminant analysis was used to validate the four N04 call subtypes which were originally parsed due to variations in slope segments. The first two functions of the discriminant analysis explained 97% of the variability. Most of the variability for the N04 call was found in the front convex and the terminal portions of the call, while very little variability was found in the center region of the call. This research revealed that individual killer whales produced multiple subtypes of the N04 call. No correlations of behaviors to acoustic parameters obtained were found. The aim of the Chapter 4 was to determine if killer whale calling behavior varied prior to and after the animals had joined. Pulsed call rates were found to be greater pre- compared to post-joining events. Two-way vocal exchanges were more common occurring 74% of the time during pre-joining events. In Chapter 5, initiated and first response to calls varied between age/sex class groups when mothers were separated from an offspring. Solo mothers and calves initiated pulsed calls more often than they responded. Most of the no vocal responses were due to mothers who were foraging. Finally, observations of the frequency split in N04 calls discussed in Chapter 6 showed that the higher frequency component (HFC) was always associated with sideband 7 (SB7) of the lower frequency component (LFC). Insight into Northern Resident killer whale intra-group vocal dynamics would aid our understanding of vocal behaviors of many other marine mammal species that rely on vocal exchanges for prey capture, group movement or survival. This is the first study to focus on killer whale vocal content and usage as it pertains to intra-group dynamics for (1) mother and offspring separations and (2) for all individuals prior to joining events, as well as (3) individual usage in a diverging pulsed call. It is also the first time the N04 call has been parsed into subtypes.
Conserved mechanisms of vocalization coding in mammalian and songbird auditory midbrain.

PubMed

Woolley, Sarah M N; Portfors, Christine V

2013-11-01

The ubiquity of social vocalizations among animals provides the opportunity to identify conserved mechanisms of auditory processing that subserve communication. Identifying auditory coding properties that are shared across vocal communicators will provide insight into how human auditory processing leads to speech perception. Here, we compare auditory response properties and neural coding of social vocalizations in auditory midbrain neurons of mammalian and avian vocal communicators. The auditory midbrain is a nexus of auditory processing because it receives and integrates information from multiple parallel pathways and provides the ascending auditory input to the thalamus. The auditory midbrain is also the first region in the ascending auditory system where neurons show complex tuning properties that are correlated with the acoustics of social vocalizations. Single unit studies in mice, bats and zebra finches reveal shared principles of auditory coding including tonotopy, excitatory and inhibitory interactions that shape responses to vocal signals, nonlinear response properties that are important for auditory coding of social vocalizations and modulation tuning. Additionally, single neuron responses in the mouse and songbird midbrain are reliable, selective for specific syllables, and rely on spike timing for neural discrimination of distinct vocalizations. We propose that future research on auditory coding of vocalizations in mouse and songbird midbrain neurons adopt similar experimental and analytical approaches so that conserved principles of vocalization coding may be distinguished from those that are specialized for each species. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives". Copyright © 2013 Elsevier B.V. All rights reserved.
Determining the etiology of mild vocal fold hypomobility.

PubMed

Heman-Ackah, Yolanda D; Batory, Mark

2003-12-01

The prevalence of mild vocal fold hypomobility is unknown. In a study by Heman-Ackah et al, vocal fold hypomobility in a population of singing teachers was found to be associated more frequently with vocal complaints than was the presence of vocal fold masses. The etiology of mild vocal fold hypomobility has not been previously explored. In the present study, a retrospective chart review was performed of 134 patients who presented to a tertiary laryngology referral center over a 6-month period for evaluation of vocal complaints. Of the 134 patients, 61 (46%) were found to have mild vocal referring otolaryngologist. Imaging studies and laboratory tests to evaluate for structural, metabolic, and infectious causes of the decreased mobility had been ordered. Forty-nine patients completed the work-up. Of these, 41 out of 49 (84%) were found to have imaging or laboratory findings that could explain the hypomobility. Thyroid abnormalities were found to be associated with vocal fold hypomobility in 21 out of 49 (43%) of those with a complete evaluation. Other causes of vocal fold hypomobility included idiopathic (8 of 49, 16%), viral neuritis (5 of 49, 10%), central nervous system abnormality (4 of 49, 8%), neural tumor (3 of 49, 6%), joint dysfunction (3 of 49, 6%), iatrogenic nerve injury (2 of 49, 4%), myopathy (2 of 49, 4%), and noniatrogenic traumatic nerve injury (1 of 49, 2%), This study shows that unilateral vocal fold hypomobility often is associated with a physiologic process, and a complete investigation to determine the etiology is warranted in all cases.
Vocal Changes Following Thyroid Surgery: Prospective Study of Objective and Subjective Parameters.

PubMed

Delgado-Vargas, Beatriz; Lloris Romero-Salazar, Azucena; Cobeta, Ignacio

2017-10-19

Vocal changes are frequent following a surgical procedure to the thyroid gland. Even though they are a recognized morbidity, their bases are yet to be defined as well as their effect on vocal parameters. This study investigates the objective and subjective changes that occur after the surgery. This study is a prospective analysis of consecutive cases. This study was conducted in a single-center tertiary care facility. Patients programmed for any thyroid procedure in Hospital Universitario Ramón y Cajal were enrolled consecutively to perform the vocal analysis before and after the surgery from April 2014 to April 2016. Patients were divided according to the vocal fold motility, and their vocal and aerodynamic parameters were obtained by means of electroglottography and phonatory aerodynamic system. Patients filled in the 10-item Voice Handicap Index (VHI-10) questionnaire. Statistical analysis was performed comparing vocal and aerodynamic parameters and quality of life before and after the surgery. 218 patients met inclusion criteria and completed the protocol. A total of 86.6% of the sample showed no vocal motility impairment, whereas the rest of the patients showed a paresis or a paralysis. Maximum phonatory time and VHI-10 questionnaire showed a statistically significant difference (P < 0.05) between groups. No differences were assessed regarding other vocal parameters. Efforts are still needed to understand the groundings and magnitude of the vocal changes after a thyroid surgery. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
On the role of emerging voluntary control of vocalization in language evolution. Comment on "Towards a Computational Comparative Neuroprimatology: Framing the language-ready brain" by Michael A. Arbib

NASA Astrophysics Data System (ADS)

Coudé, Gino

2016-03-01

This comment will be focused on the role of monkey vocal control in the evolution of language. I will essentially reiterate the observations expressed in a commentary [1] about the book ;How the brain got language: the mirror system hypothesis;, written by Arbib [2]. I will hopefully clarify our suggestion that non-human primates vocal communication, in conjunction with gestures, could have had an active role in the emergence of the first voluntary forms of utterances that will later shape protospeech. This suggestion is mainly rooted in neurophysiological data about vocal control in monkey. I will very briefly summarize how neurophysiological data allowed us to suggest a possible role for monkey vocalization in language evolution. We conducted a study [3] in which we recorded from ventral premotor cortex (PMv) of macaques trained to emit vocalizations (i.e. coo-calls). The results showed that the rostro-lateral part of PMv contains neurons that fire during conditioned vocalization. The involvement of PMv in vocalization production was further supported by electrical microstimulation of the cortical sector where some of the vocalization neurons were found. Microstimulation elicited in some cases a combination of jaw, tongue and larynx movements. To us, the evolutionary implications of those results were obvious: a partial voluntary vocal control was already taking place in the primate PMv cortex some 25 million years ago.
Voice disorder in systemic lupus erythematosus

PubMed Central

de Macedo, Milena S. F. C.; da Silva Filho, Manoel

2017-01-01

Systemic lupus erythematosus (SLE) is a chronic disease characterized by progressive tissue damage. In recent decades, novel treatments have greatly extended the life span of SLE patients. This creates a high demand for identifying the overarching symptoms associated with SLE and developing therapies that improve their life quality under chronic care. We hypothesized that SLE patients would present dysphonic symptoms. Given that voice disorders can reduce life quality, identifying a potential SLE-related dysphonia could be relevant for the appraisal and management of this disease. We measured objective vocal parameters and perceived vocal quality with the GRBAS (Grade, Roughness, Breathiness, Asthenia, Strain) scale in SLE patients and compared them to matched healthy controls. SLE patients also filled a questionnaire reporting perceived vocal deficits. SLE patients had significantly lower vocal intensity and harmonics to noise ratio, as well as increased jitter and shimmer. All subjective parameters of the GRBAS scale were significantly abnormal in SLE patients. Additionally, the vast majority of SLE patients (29/36) reported at least one perceived vocal deficit, with the most prevalent deficits being vocal fatigue (19/36) and hoarseness (17/36). Self-reported voice deficits were highly correlated with altered GRBAS scores. Additionally, tissue damage scores in different organ systems correlated with dysphonic symptoms, suggesting that some features of SLE-related dysphonia are due to tissue damage. Our results show that a large fraction of SLE patients suffers from perceivable dysphonia and may benefit from voice therapy in order to improve quality of life. PMID:28414781
Communication modality sampling for a toddler with Angelman syndrome.

PubMed

Hyppa Martin, Jolene; Reichle, Joe; Dimian, Adele; Chen, Mo

2013-10-01

Vocal, gestural, and graphic communication modes were implemented concurrently with a toddler with Angelman syndrome to identify the most efficiently learned communication mode to emphasize in an initial augmentative communication system. Symbols representing preferred objects were introduced in vocal, gestural, and graphic communication modes using an alternating treatment single-subject experimental design. Conventionally accepted prompting strategies were used to teach symbols in each communication mode. Because the learner did not vocally imitate, vocal mode intervention focused on increasing vocal frequency as an initial step. When graphic and gestural mode performances were compared, the learner most accurately produced requests in graphic mode (percentage of nonoverlapping data = 96). Given the lack of success in prompting vocal productions, a comparison between vocal and the other two communication modes was not made. A growing body of evidence suggests that concurrent modality sampling is a promising low-inference, data-driven procedure that can be used to inform selection of a communication mode(s) for initial emphasis with young children. Concurrent modality sampling can guide clinical decisions regarding the allocation of treatment resources to promote success in building an initial communicative repertoire.
In Vivo measurement of pediatric vocal fold motion using structured light laser projection.

PubMed

Patel, Rita R; Donohue, Kevin D; Lau, Daniel; Unnikrishnan, Harikrishnan

2013-07-01

The aim of the study was to present the development of a miniature structured light laser projection endoscope and to quantify vocal fold length and vibratory features related to impact stress of the pediatric glottis using high-speed imaging. The custom-developed laser projection system consists of a green laser with a 4-mm diameter optics module at the tip of the endoscope, projecting 20 vertical laser lines on the glottis. Measurements of absolute phonatory vocal fold length, membranous vocal fold length, peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity were obtained in five children (6-9 years), two adult male and three adult female participants without voice disorders, and one child (10 years) with bilateral vocal fold nodules during modal phonation. Independent measurements made on the glottal length of a vocal fold phantom demonstrated a 0.13mm bias error with a standard deviation of 0.23mm, indicating adequate precision and accuracy for measuring vocal fold structures and displacement. First, in vivo measurements of amplitude-to-length ratio, peak closing velocity, and impact velocity during phonation in pediatric population and a child with vocal fold nodules are reported. The proposed laser projection system can be used to obtain in vivo measurements of absolute length and vibratory features in children and adults. Children have large amplitude-to-length ratio compared with typically developing adults, whereas nodules result in larger peak amplitude, amplitude-to-length ratio, average closing velocity, and impact velocity compared with typically developing children. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Effects of voice harmonic complexity on ERP responses to pitch-shifted auditory feedback.

PubMed

Behroozmand, Roozbeh; Korzyukov, Oleg; Larson, Charles R

2011-12-01

The present study investigated the neural mechanisms of voice pitch control for different levels of harmonic complexity in the auditory feedback. Event-related potentials (ERPs) were recorded in response to+200 cents pitch perturbations in the auditory feedback of self-produced natural human vocalizations, complex and pure tone stimuli during active vocalization and passive listening conditions. During active vocal production, ERP amplitudes were largest in response to pitch shifts in the natural voice, moderately large for non-voice complex stimuli and smallest for the pure tones. However, during passive listening, neural responses were equally large for pitch shifts in voice and non-voice complex stimuli but still larger than that for pure tones. These findings suggest that pitch change detection is facilitated for spectrally rich sounds such as natural human voice and non-voice complex stimuli compared with pure tones. Vocalization-induced increase in neural responses for voice feedback suggests that sensory processing of naturally-produced complex sounds such as human voice is enhanced by means of motor-driven mechanisms (e.g. efference copies) during vocal production. This enhancement may enable the audio-vocal system to more effectively detect and correct for vocal errors in the feedback of natural human vocalizations to maintain an intended vocal output for speaking. Copyright Â© 2011 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
A rare case of non-surgical vocal cord paralysis: Vocal cord hematoma.

PubMed

Arıkan, Akif Enes; Teksöz, Serkan; Bilgin, İsmail Ahmet; Tarhan, Özge; Özyeğin, Ateş

2017-01-01

Although vocal cord paralysis (VCP) following thyroidectomy is primarily associated with surgical trauma, it is not the sole etiology. Vocal cord paralysis following thyroidectomy can be caused by a vocal cord hematoma with an incidence of 1.4% due to direct injury during orotracheal intubation. In this article, we present a case of VCP caused by vocal cord hematoma. A 32-year-old male patient who has been receiving propylthiouracil treatment for toxic multinodular goiter since 10 years was admitted to our hospital to be operated because of persisting complaints. The patient was hospitalized for sutureless thyroidectomy after he became euthyroid. Preoperative fiberoptic laryngoscopy performed by the ear, nose, and throat department revealed bilaterally motile vocal folds and a completely open airway. Patient underwent sutureless total thyroidectomy with a vessel sealing device (Ligasure TM LF1212, Covidien, CO), and a minivac drainage system was placed in the thyroid lodge. On the morning of the first postoperative day, 50 mL of serosanguinous fluid was drained. The patient's voice was normal, and there was no ecchymosis. Postoperative fiberoptic laryngoscopy revealed a hematoma near the right vocal fold and paralysis of the right vocal fold; however, the airway was open. It should be kept in mind that VCP is not solely due to surgery but can also result from intubation, as observed in this case.
A rare case of non-surgical vocal cord paralysis: Vocal cord hematoma

PubMed Central

Arıkan, Akif Enes; Teksöz, Serkan; Bilgin, İsmail Ahmet; Tarhan, Özge; Özyeğin, Ateş

2017-01-01

Although vocal cord paralysis (VCP) following thyroidectomy is primarily associated with surgical trauma, it is not the sole etiology. Vocal cord paralysis following thyroidectomy can be caused by a vocal cord hematoma with an incidence of 1.4% due to direct injury during orotracheal intubation. In this article, we present a case of VCP caused by vocal cord hematoma. A 32-year-old male patient who has been receiving propylthiouracil treatment for toxic multinodular goiter since 10 years was admitted to our hospital to be operated because of persisting complaints. The patient was hospitalized for sutureless thyroidectomy after he became euthyroid. Preoperative fiberoptic laryngoscopy performed by the ear, nose, and throat department revealed bilaterally motile vocal folds and a completely open airway. Patient underwent sutureless total thyroidectomy with a vessel sealing device (LigasureTM LF1212, Covidien, CO), and a minivac drainage system was placed in the thyroid lodge. On the morning of the first postoperative day, 50 mL of serosanguinous fluid was drained. The patient’s voice was normal, and there was no ecchymosis. Postoperative fiberoptic laryngoscopy revealed a hematoma near the right vocal fold and paralysis of the right vocal fold; however, the airway was open. It should be kept in mind that VCP is not solely due to surgery but can also result from intubation, as observed in this case. PMID:29260141
Self-Organization: Complex Dynamical Systems in the Evolution of Speech

NASA Astrophysics Data System (ADS)

Oudeyer, Pierre-Yves

Human vocalization systems are characterized by complex structural properties. They are combinatorial, based on the systematic reuse of phonemes, and the set of repertoires in human languages is characterized by both strong statistical regularities—universals—and a great diversity. Besides, they are conventional codes culturally shared in each community of speakers. What are the origins of the forms of speech? What are the mechanisms that permitted their evolution in the course of phylogenesis and cultural evolution? How can a shared speech code be formed in a community of individuals? This chapter focuses on the way the concept of self-organization, and its interaction with natural selection, can throw light on these three questions. In particular, a computational model is presented which shows that a basic neural equipment for adaptive holistic vocal imitation, coupling directly motor and perceptual representations in the brain, can generate spontaneously shared combinatorial systems of vocalizations in a society of babbling individuals. Furthermore, we show how morphological and physiological innate constraints can interact with these self-organized mechanisms to account for both the formation of statistical regularities and diversity in vocalization systems.
Food for song: expression of c-Fos and ZENK in the zebra finch song nuclei during food aversion learning.

PubMed

Tokarev, Kirill; Tiunova, Anna; Scharff, Constance; Anokhin, Konstantin

2011-01-01

Specialized neural pathways, the song system, are required for acquiring, producing, and perceiving learned avian vocalizations. Birds that do not learn to produce their vocalizations lack telencephalic song system components. It is not known whether the song system forebrain regions are exclusively evolved for song or whether they also process information not related to song that might reflect their 'evolutionary history'. To address this question we monitored the induction of two immediate-early genes (IEGs) c-Fos and ZENK in various regions of the song system in zebra finches (Taeniopygia guttata) in response to an aversive food learning paradigm; this involves the association of a food item with a noxious stimulus that affects the oropharyngeal-esophageal cavity and tongue, causing subsequent avoidance of that food item. The motor response results in beak and head movements but not vocalizations. IEGs have been extensively used to map neuro-molecular correlates of song motor production and auditory processing. As previously reported, neurons in two pallial vocal motor regions, HVC and RA, expressed IEGs after singing. Surprisingly, c-Fos was induced equivalently also after food aversion learning in the absence of singing. The density of c-Fos positive neurons was significantly higher than that of birds in control conditions. This was not the case in two other pallial song nuclei important for vocal plasticity, LMAN and Area X, although singing did induce IEGs in these structures, as reported previously. Our results are consistent with the possibility that some of the song nuclei may participate in non-vocal learning and the populations of neurons involved in the two tasks show partial overlap. These findings underscore the previously advanced notion that the specialized forebrain pre-motor nuclei controlling song evolved from circuits involved in behaviors related to feeding.

The predictability of frequency-altered auditory feedback changes the weighting of feedback and feedforward input for speech motor control.

PubMed

Scheerer, Nichole E; Jones, Jeffery A

2014-12-01

Speech production requires the combined effort of a feedback control system driven by sensory feedback, and a feedforward control system driven by internal models. However, the factors that dictate the relative weighting of these feedback and feedforward control systems are unclear. In this event-related potential (ERP) study, participants produced vocalisations while being exposed to blocks of frequency-altered feedback (FAF) perturbations that were either predictable in magnitude (consistently either 50 or 100 cents) or unpredictable in magnitude (50- and 100-cent perturbations varying randomly within each vocalisation). Vocal and P1-N1-P2 ERP responses revealed decreases in the magnitude and trial-to-trial variability of vocal responses, smaller N1 amplitudes, and shorter vocal, P1 and N1 response latencies following predictable FAF perturbation magnitudes. In addition, vocal response magnitudes correlated with N1 amplitudes, vocal response latencies, and P2 latencies. This pattern of results suggests that after repeated exposure to predictable FAF perturbations, the contribution of the feedforward control system increases. Examination of the presentation order of the FAF perturbations revealed smaller compensatory responses, smaller P1 and P2 amplitudes, and shorter N1 latencies when the block of predictable 100-cent perturbations occurred prior to the block of predictable 50-cent perturbations. These results suggest that exposure to large perturbations modulates responses to subsequent perturbations of equal or smaller size. Similarly, exposure to a 100-cent perturbation prior to a 50-cent perturbation within a vocalisation decreased the magnitude of vocal and N1 responses, but increased P1 and P2 latencies. Thus, exposure to a single perturbation can affect responses to subsequent perturbations. © 2014 Federation of European Neuroscience Societies and John Wiley & Sons Ltd.
Electroglottographic parameterization of the effects of gender, vowel and phonatory registers on vocal fold vibratory patterns: an Indian perspective.

PubMed

Paul, Nilanjan; Kumar, Suman; Chatterjee, Indranil; Mukherjee, Biswarup

2011-01-01

In-depth study on laryngeal biomechanics and vocal fold vibratory patterns reveal that a single vibratory cycle can be divided into two major phases, the closed and open phase, which is subdivided into opening and closing phases. Studies reveal that the relative time course of abduction and adduction, which in turn is dependent on the relative relaxing and tensing of the vocal fold cover and body, to be the determining factor in production of a particular vocal register like the modal (or chest), falsetto, glottal fry registers. Studies further point out Electroglottography to be particularly suitable for the study of vocal vibratory patterns during register changes. However, to date, there has been limited study on quantitative parameterization of EGG wave form in vocal fry register. Moreover, contradictory findings abound in literature regarding effects of gender and vowel types on vocal vibratory patterns, especially during phonation at different registers. The present study endeavors to find out the effects of vowel and gender differences on the vocal fold vibratory patterns in different registers and how these would be reflected in standard EGG parameters of Contact Quotient (CQ) and Contact Index (CI), taking into consideration the Indian sociolinguistic context. Electroglottographic recordings of 10 young adults (5 males and 5 females) were taken while the subjects phonated the three vowels /a/,/i/,/u/ each in two vocal registers, modal and vocal fry. Obtained raw EGG were further normalized using the Derived EGG algorithm and theCQ and CI values were derived. Obtained data were subject to statistical analysis using the 3-way ANOVA with gender, vowel and vocal register as the three variables. Post-hoc Dunnett C multiple comparison analysis were also performed. Results reveal that CQ values are significantly higher in vocal fry than modal phonation for both males and females, indicating a relatively hyperconstricted vocal system during vocal fry. The males have significantly greater CQ values than females both at modal and vocal fry phonations which indicate that the males are predisposed to greater vocal fold constriction. Females demonstrated no significant increase in CI values in vocal fry state; and in some cases actually decrease in the CI values which suggest an inherently distinct vocal fold physiological adjustment from that in males. No vowel effects were found in any conditions. Perturbation values (CQP and CIP) are significantly more in vocal fry register than in modal register, and the increase was more in case of females than males. The findings give strong evidence to certain hypotheses in literature regarding effects of vowel, gender and phonatory register on vocal fold vibratory patterns.
A Joyful Noise: The Vocal Health of Worship Leaders and Contemporary Christian Singers.

PubMed

Neto, Leon; Meyer, David

2017-03-01

Contemporary commercial music (CCM) is a term that encompasses many styles of music. A growing subset of CCM is contemporary Christian music, a genre that has outpaced other popular styles such as Latin, jazz, and classical music. Contemporary Christian singers (CCSs) and worship leaders (WLs) are a subset of CCM musicians that face unique vocal demands and risks. They typically lack professional training and often perform in acoustically disadvantageous venues with substandard sound reinforcement systems. The vocal needs and risks of these singers are not well understood, and because of this, their training and care may be suboptimal. The aim of the present study was to investigate the vocal health of this growing population and their awareness of standard vocal hygiene principles. An online questionnaire was designed and administered to participants in the Americas, Europe, Australia, and Asia. A total of 614 participants responded to the questionnaire, which is made available in English, Portuguese, and Spanish. Many participants reported vocal symptoms such as vocal fatigue (n = 213; 34.7%), tickling or choking sensation (n = 149; 24.3%), loss of upper range (n = 172; 28%), and complete loss of voice (n = 25; 4.1%). One third of the participants (n = 210; 34%) indicated that they do not warm up their voices before performances and over half of the participants (n = 319; 52%) have no formal vocal training. Results suggest that this population demonstrates low awareness of vocal hygiene principles, frequently experience difficulty with their voices, and may face elevated risk of vocal pathology. Future studies of this population may confirm the vocal risks that our preliminary findings suggest. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
A new approach to geometrical measurements in an animal model of vocal fold scar.

PubMed

Jabbour, Noel; Krishna, Priya D; Osborne, James; Rosen, Clark A

2009-01-01

A standard method for quantifying the geometric properties of vocal folds has not been widely adopted. An ideal method of geometrical measurement should effectively quantify the dimensions of the medial vibratory portion of the vocal fold, should be easily performed, should yield consistent results, and should be readily available at little to no cost. We have developed a new approach for geometrical measurements to meet these goals. The objective of this study is to describe this new approach and to assess its effectiveness in a canine model of vocal fold scar. One hundred thirty-five mid-membranous coronal sections of vocal folds from 10 canines (five with unilateral surgical scarring) were examined by light microscopy; digital images were captured. ImageJ was used to measure a variety of described parameters. Comparison between scarred vocal folds and control vocal folds was made. At least 20% of the slides for each vocal fold were randomly selected (n=42) for repeat measurements of interrater and intrarater reliability. A statistically significant difference between scarred and control vocal folds was obtained for horizontal distance (P<0.001), vertical distance (P=0.005), area (P<0.001), mean optical density (OD) (P<0.001), and OD at defined points along the length of the vocal fold (P< or =0.009). Reliability calculations for intrarater and interrater measurements ranged from r=0.845 to r=0.994 and from r=0.734 to r=0.976, respectively. The proposed approach for geometrical measurements meets the intended objectives in a canine model of vocal fold scar. Future work is needed to apply this approach to other model systems.
Experimental and Theoretical Investigations of Phonation Threshold Pressure as a Function of Vocal Fold Elongation

PubMed Central

Tao, Chao; Regner, Michael F.; Zhang, Yu; Jiang, Jack J.

2014-01-01

Summary The relationship between the vocal fold elongation and the phonation threshold pressure (PTP) was experimentally and theoretically investigated. The PTP values of seventeen excised canine larynges with 0% to 15% bilateral vocal fold elongations in 5% elongation steps were measured using an excised larynx phonation system. It was found that twelve larynges exhibited a monotonic relationship between PTP and elongation; in these larynges, the 0% elongation condition had the lowest PTP. Five larynges exhibited a PTP minimum at 5% elongation. To provide a theoretical explanation of these phenomena, a two-mass model was modified to simulate vibration of the elongated vocal folds. Two pairs of longitudinal springs were used to represent the longitudinal elastin in the vocal folds. This model showed that when the vocal folds were elongated, the increased longitudinal tension would increase the PTP value and the increased vocal fold length would decrease the PTP value. The antagonistic effects contributed by these two factors were found to be able to cause either a monotonic or a non-monotonic relationship between PTP and elongation, which were consistent with experimental observations. Because PTP describes the ease of phonation, this study suggests that there may exist a nonzero optimal vocal fold elongation for the greatest ease for phonation in some larynges. PMID:25530744
The Neural Basis of Vocal Pitch Imitation in Humans.

PubMed

Belyk, Michel; Pfordresher, Peter Q; Liotti, Mario; Brown, Steven

2016-04-01

Vocal imitation is a phenotype that is unique to humans among all primate species, and so an understanding of its neural basis is critical in explaining the emergence of both speech and song in human evolution. Two principal neural models of vocal imitation have emerged from a consideration of nonhuman animals. One hypothesis suggests that putative mirror neurons in the inferior frontal gyrus pars opercularis of Broca's area may be important for imitation. An alternative hypothesis derived from the study of songbirds suggests that the corticostriate motor pathway performs sensorimotor processes that are specific to vocal imitation. Using fMRI with a sparse event-related sampling design, we investigated the neural basis of vocal imitation in humans by comparing imitative vocal production of pitch sequences with both nonimitative vocal production and pitch discrimination. The strongest difference between these tasks was found in the putamen bilaterally, providing a striking parallel to the role of the analogous region in songbirds. Other areas preferentially activated during imitation included the orofacial motor cortex, Rolandic operculum, and SMA, which together outline the corticostriate motor loop. No differences were seen in the inferior frontal gyrus. The corticostriate system thus appears to be the central pathway for vocal imitation in humans, as predicted from an analogy with songbirds.
Social Vocalizations of Big Brown Bats Vary with Behavioral Context

PubMed Central

Gadziola, Marie A.; Grimsley, Jasmine M. S.; Faure, Paul A.; Wenstrup, Jeffrey J.

2012-01-01

Bats are among the most gregarious and vocal mammals, with some species demonstrating a diverse repertoire of syllables under a variety of behavioral contexts. Despite extensive characterization of big brown bat (Eptesicus fuscus) biosonar signals, there have been no detailed studies of adult social vocalizations. We recorded and analyzed social vocalizations and associated behaviors of captive big brown bats under four behavioral contexts: low aggression, medium aggression, high aggression, and appeasement. Even limited to these contexts, big brown bats possess a rich repertoire of social vocalizations, with 18 distinct syllable types automatically classified using a spectrogram cross-correlation procedure. For each behavioral context, we describe vocalizations in terms of syllable acoustics, temporal emission patterns, and typical syllable sequences. Emotion-related acoustic cues are evident within the call structure by context-specific syllable types or variations in the temporal emission pattern. We designed a paradigm that could evoke aggressive vocalizations while monitoring heart rate as an objective measure of internal physiological state. Changes in the magnitude and duration of elevated heart rate scaled to the level of evoked aggression, confirming the behavioral state classifications assessed by vocalizations and behavioral displays. These results reveal a complex acoustic communication system among big brown bats in which acoustic cues and call structure signal the emotional state of a caller. PMID:22970247
Vocal Fry Use in Adult Female Speakers Exposed to Two Languages.

PubMed

Gibson, Todd A; Summers, Connie; Walls, Sydney

2017-07-01

Several studies have identified the widespread use of vocal fry among American women. Popular explanations for this phenomenon appeal to sociolinguistic purposes that likely take significant time for second language users to learn. The objective of this study was to determine if mere exposure to this vocal register, as opposed to nuanced sociolinguistic motivations, might explain its widespread use. This study used multigroup within- and between-subjects design. Fifty-eight women from one of three language background groups (functionally monolingual in English, functionally monolingual in Spanish, and Spanish-English bilinguals) living in El Paso, Texas, repeated a list of nonwords conforming to the sound rules of English and another list of nonwords conforming to the sound rules of Spanish. Perceptual analysis identified each episode of vocal fry. There were no statistically significant differences between groups in their frequency of vocal fry use despite large differences in their amount of English-language exposure. All groups produced more vocal fry when repeating English than when repeating Spanish nonwords. Because the human perceptual system encodes for vocal qualities even after minimal language experience, the widespread use of vocal fry among female residents in the United States likely is owing to mere exposure to English rather than nuanced sociolinguistic motivations. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Measurement of flow separation in a human vocal folds model

NASA Astrophysics Data System (ADS)

Šidlof, Petr; Doaré, Olivier; Cadot, Olivier; Chaigne, Antoine

2011-07-01

The paper provides experimental data on flow separation from a model of the human vocal folds. Data were measured on a four times scaled physical model, where one vocal fold was fixed and the other oscillated due to fluid-structure interaction. The vocal folds were fabricated from silicone rubber and placed on elastic support in the wall of a transparent wind tunnel. A PIV system was used to visualize the flow fields immediately downstream of the glottis and to measure the velocity fields. From the visualizations, the position of the flow separation point was evaluated using a semiautomatic procedure and plotted for different airflow velocities. The separation point position was quantified relative to the orifice width separately for the left and right vocal folds to account for flow asymmetry. The results indicate that the flow separation point remains close to the narrowest cross-section during most of the vocal fold vibration cycle, but moves significantly further downstream shortly prior to and after glottal closure.
Tashkeela: Novel corpus of Arabic vocalized texts, data for auto-diacritization systems.

PubMed

Zerrouki, Taha; Balla, Amar

2017-04-01

Arabic diacritics are often missed in Arabic scripts. This feature is a handicap for new learner to read َArabic, text to speech conversion systems, reading and semantic analysis of Arabic texts. The automatic diacritization systems are the best solution to handle this issue. But such automation needs resources as diactritized texts to train and evaluate such systems. In this paper, we describe our corpus of Arabic diacritized texts. This corpus is called Tashkeela. It can be used as a linguistic resource tool for natural language processing such as automatic diacritics systems, dis-ambiguity mechanism, features and data extraction. The corpus is freely available, it contains 75 million of fully vocalized words mainly 97 books from classical and modern Arabic language. The corpus is collected from manually vocalized texts using web crawling process.
The ambiguities of the 'partnership' between civil society and the state in Uganda's AIDS response during the 1990s and 2000s as demonstrated in the development of TASO.

PubMed

Grebe, Eduard

2016-01-01

This article critically investigates state-civil society relations in the Ugandan AIDS response by tracing the history of Uganda's 'multisectoral' and 'partnership' approaches, particularly as it pertains to The AIDS Support Organisation (TASO). It finds that the Ugandan government's reputation for good leadership on AIDS is more ambiguous than commonly supposed and that the much-vaunted 'partnership' approach has not enabled strong critical civil society voices to emerge or prevented the harmful impact of a socially conservative agenda. By the 1990s, TASO had become the most important provider of medical and psychosocial support services to HIV/AIDS patients, but was less effective in influencing policy or holding the state accountable (because the political context prevented a more activist stance). The effectiveness of civil society has been constrained by an authoritarian political culture and institutions that discourage vocal criticism. Despite these limitations, however, state-civil society partnership did contribute to the emergence of a relatively effective coalition for action against HIV/AIDS. Donors were essential in encouraging the emergence of this coalition.
Qualification of a Quantitative Laryngeal Imaging System Using Videostroboscopy and Videokymography

PubMed Central

Popolo, Peter S.; Titze, Ingo R.

2008-01-01

Objectives: We sought to determine whether full-cycle glottal width measurements could be obtained with a quantitative laryngeal imaging system using videostroboscopy, and whether glottal width and vocal fold length measurements were repeatable and reliable. Methods: Synthetic vocal folds were phonated on a laboratory bench, and dynamic images were obtained in repeated trials by use of videostroboscopy and videokymography (VKG) with an imaging system equipped with a 2-point laser projection device for measuring absolute dimensions. Video images were also obtained with an industrial videoscope system with a built-in laser measurement capability. Maximum glottal width and vocal fold length were compared among these 3 methods. Results: The average variation in maximum glottal width measurements between stroboscopic data and VKG data was 3.10%. The average variations in width measurements between the clinical system and the industrial system were 1.93% (stroboscopy) and 3.49% (VKG). The variations in vocal fold length were similarly small. The standard deviations across trials were 0.29 mm for width and 0.48 mm for length (stroboscopy), 0.18 mm for width (VKG), and 0.25 mm for width and 0.84 mm for length (industrial). Conclusions: For stable, periodic vibration, the full extent of the glottal width can be reliably measured with the quantitative videostroboscopy system. PMID:18646436
Memory-dependent adjustment of vocal response latencies in a territorial songbird.

PubMed

Geberzahn, Nicole; Hultsch, Henrike; Todt, Dietmar

2013-06-01

Vocal interactions in songbirds can be used as a model system to investigate the interplay of intrinsic singing programmes (e.g. influences from vocal memories) and external variables (e.g. social factors). When characterizing vocal interactions between territorial rivals two aspects are important: (1) the timing of songs in relation to the conspecific's singing and (2) the use of a song pattern that matches the rival's song. Responses in both domains can be used to address a territorial rival. This study is the first to investigate the relation of the timing of vocal responses to (1) the vocal memory of a responding subject and (2) the selection of the song pattern that the subject uses as a response. To this end, we conducted interactive playback experiments with adult nightingales (Luscinia megarhynchos) that had been hand-reared and tutored in the laboratory. We analysed the subjects' vocal response latencies towards broadcast playback stimuli that they either had in their own vocal repertoire (songs shared with playback) or that they had not heard before (unknown songs). Likewise, we compared vocal response latencies between responses that matched the stimulus song and those that did not. Our findings showed that the latency of singing in response to the playback was shorter for shared versus unknown song stimuli when subjects overlapped the playback stimuli with their own song. Moreover birds tended to overlap faster when vocally matching the stimulus song rather than when replying with a non-matching song type. We conclude that memory of song patterns influenced response latencies and discuss possible mechanisms. Copyright © 2012 Elsevier Ltd. All rights reserved.
Male goat vocalizations stimulate the estrous behavior and LH secretion in anestrous goats that have been previously exposed to bucks.

PubMed

Delgadillo, José Alberto; Vielma, Jesús; Hernandez, Horacio; Flores, José Alfredo; Duarte, Gerardo; Fernández, Ilda Graciela; Keller, Matthieu; Gelez, Hélène

2012-09-01

We investigated whether live vocalizations emitted by bucks interacting with anestrous females stimulate secretion of LH, estrous behavior and ovulation in anestrous goats. In experiment 1, bucks rendered sexually active by exposure to long days followed by natural photoperiod were exposed in a light-proof-building to five anestrous females. Buck vocalizations were reproduced through a microphone-amplifier-loudspeaker system to an open pen where one group of goats (n=6) was exposed for 10 days to these live vocalizations. Another group of females (n=6) was isolated from males and vocalizations. The proportion of goats displaying estrous behavior was significantly higher in females exposed to buck vocalizations than in females isolated from males. The proportion of goats that ovulated did not differ between the 2 groups (exposed to males versus isolated). In experiment 2, female goats that either had previous contact with males (n=7), or no previous contact with males (n=7) were exposed to live buck vocalizations, reproduced as described in experiment 1, for 5 days. The number and amplitude of LH pulses did not differ between groups before exposition to buck vocalizations. Five days of exposure to male vocalizations significantly increased LH pulsatility only in females that had previous contact with males, while LH pulse amplitude was not modified. We concluded that live buck vocalizations can stimulate estrous behavior and LH secretion in goats if they have had previous contact with bucks. Copyright © 2012 Elsevier Inc. All rights reserved.
Human Adipose Tissue Derived Extracellular Matrix and Methylcellulose Hydrogels Augments and Regenerates the Paralyzed Vocal Fold

PubMed Central

Kim, Eun Na; Sung, Myung Whun; Kwon, Tack-Kyun; Cho, Yong Woo; Kwon, Seong Keun

2016-01-01

Vocal fold paralysis results from various etiologies and can induce voice changes, swallowing complications, and issues with aspiration. Vocal fold paralysis is typically managed using injection laryngoplasty with fat or synthetic polymers. Injection with autologous fat has shown excellent biocompatibility. However, it has several disadvantages such as unpredictable resorption rate, morbidities associated with liposuction procedure which has to be done in operating room under general anesthesia. Human adipose-derived extracellular matrix (ECM) grafts have been reported to form new adipose tissue and have greater biostability than autologous fat graft. Here, we present an injectable hydrogel that is constructed from adipose tissue derived soluble extracellular matrix (sECM) and methylcellulose (MC) for use in vocal fold augmentation. Human sECM derived from adipose tissue was extracted using two major steps—ECM was isolated from human adipose tissue and was subsequently solubilized. Injectable sECM/MC hydrogels were prepared by blending of sECM and MC. Sustained vocal fold augmentation and symmetric vocal fold vibration were accomplished by the sECM/MC hydrogel in paralyzed vocal fold which were confirmed by laryngoscope, histology and a high-speed imaging system. There were increased number of collagen fibers and fatty granules at the injection site without significant inflammation or fibrosis. Overall, these results indicate that the sECM/MC hydrogel can enhance vocal function in paralyzed vocal folds without early resorption and has potential as a promising material for injection laryngoplasty for stable vocal fold augmentation which can overcome the shortcomings of autologous fat such as unpredictable duration and morbidity associated with the fat harvest. PMID:27768757
Simultaneous determination of neuromuscular blockade at the adducting and abducting laryngeal muscles using phonomyography.

PubMed

Hemmerling, Thomas M; Michaud, Guillaume; Trager, Guillaume; Donati, François

2004-06-01

Phonomyography (PMG) is a new method for measuring neuromuscular blockade (NMB) at the larynx. In this study, we used PMG to compare NMB at the posterior cricoarytenoid (PCA) and the lateral cricoarytenoid muscle (LCA) in humans. Twelve patients were included in this study. Endotracheal intubation was performed without aid of neuromuscular blocking drugs. One small condenser microphone was inserted beside the vocal cords into the muscular process at the base of the arytenoid cartilage to record acoustic responses of the LCA (vocal cord adduction), and a second microphone was placed behind the larynx to measure NMB of the PCA (vocal cord abduction). Stimulation of the recurrent laryngeal nerve was performed using superficial electrodes placed at the neck (midline between jugular notch and cricoid cartilage) using train-of-four (TOF) stimulation every 12 s. After supramaximal stimulation, mivacurium 0.1 mg/kg was injected and onset, peak effect, and offset of NMB measured and compared using t-test (P < 0.05). The data are presented as mean (SD). Peak effect, onset time, and early recovery to 25% of control twitch height were not significantly different between PCA and LCA at 86% (13) versus 78% (16), 2.3 min (0.45) versus 2.3 min (1.0), and 9.55 min (3.05) versus 8.5 min (4.7), respectively. However, recovery to 75%, 90% of control twitch height, and recovery to a TOF ratio of 0.8 were significantly longer at the PCA than at the LCA at 14 min (4) versus 11 min (5), 17 min (5) versus 11.8 min (5.6), and 17.5 min (5.6) versus 12.3 min (5.5), respectively. The authors conclude that recovery of NMB at the PCA takes longer than at the LCA in humans after mivacurium. After neuromuscular blockade in humans, the recovery of the ability to open the vocal cords takes longer than the ability to close the vocal cords.
Infant Vocal-Motor Coordination: Precursor to the Gesture-Speech System?

ERIC Educational Resources Information Center

Iverson, Jana M.; Fagan, Mary K.

2004-01-01

This study was designed to provide a general picture of infant vocal-motor coordination and test predictions generated by Iverson and Thelen's (1999) model of the development of the gesture-speech system. Forty-seven 6- to 9-month-old infants were videotaped with a primary caregiver during rattle and toy play. Results indicated an age-related…
ANALYSIS OF FLOW-STRUCTURE COUPLING IN A MECHANICAL MODEL OF THE VOCAL FOLDS AND THE SUBGLOTTAL SYSTEM.

PubMed

Howe, M S; McGowan, R S

2009-11-01

An analysis is made of the nonlinear interactions between flow in the subglottal vocal tract and glottis, sound waves in the subglottal system and a mechanical model of the vocal folds. The mean flow through the system is produced by a nominally steady contraction of the lungs, and mechanical experiments frequently involve a 'lung cavity' coupled to an experimental subglottal tube of arbitrary or ill-defined effective length L, on the basis that the actual value of L has little or no influence on excitation of the vocal folds. A simple, self-exciting single mass mathematical model of the vocal folds is used to investigate the sound generated within the subglottal domain and the unsteady volume flux from the glottis for experiments where it is required to suppress feedback of sound from the supraglottal vocal tract. In experiments where the assumed absorption of sound within the sponge-like interior of the lungs is small, the influence of changes in L can be very significant: when the subglottal tube behaves as an open-ended resonator (when L is as large as half the acoustic wavelength) there is predicted to be a mild increase in volume flux magnitude and a small change in waveform. However, the strong appearance of second harmonics of the acoustic field is predicted at intermediate lengths, when L is roughly one quarter of the acoustic wavelength. In cases of large lung damping, however, only modest changes in the volume flux are predicted to occur with variations in L.
Error-dependent modulation of speech-induced auditory suppression for pitch-shifted voice feedback.

PubMed

Behroozmand, Roozbeh; Larson, Charles R

2011-06-06

The motor-driven predictions about expected sensory feedback (efference copies) have been proposed to play an important role in recognition of sensory consequences of self-produced motor actions. In the auditory system, this effect was suggested to result in suppression of sensory neural responses to self-produced voices that are predicted by the efference copies during vocal production in comparison with passive listening to the playback of the identical self-vocalizations. In the present study, event-related potentials (ERPs) were recorded in response to upward pitch shift stimuli (PSS) with five different magnitudes (0, +50, +100, +200 and +400 cents) at voice onset during active vocal production and passive listening to the playback. Results indicated that the suppression of the N1 component during vocal production was largest for unaltered voice feedback (PSS: 0 cents), became smaller as the magnitude of PSS increased to 200 cents, and was almost completely eliminated in response to 400 cents stimuli. Findings of the present study suggest that the brain utilizes the motor predictions (efference copies) to determine the source of incoming stimuli and maximally suppresses the auditory responses to unaltered feedback of self-vocalizations. The reduction of suppression for 50, 100 and 200 cents and its elimination for 400 cents pitch-shifted voice auditory feedback support the idea that motor-driven suppression of voice feedback leads to distinctly different sensory neural processing of self vs. non-self vocalizations. This characteristic may enable the audio-vocal system to more effectively detect and correct for unexpected errors in the feedback of self-produced voice pitch compared with externally-generated sounds.
Popular song and lyrics synchronization and its application to music information retrieval

NASA Astrophysics Data System (ADS)

Chen, Kai; Gao, Sheng; Zhu, Yongwei; Sun, Qibin

2006-01-01

An automatic synchronization system of the popular song and its lyrics is presented in the paper. The system includes two main components: a) automatically detecting vocal/non-vocal in the audio signal and b) automatically aligning the acoustic signal of the song with its lyric using speech recognition techniques and positioning the boundaries of the lyrics in its acoustic realization at the multiple levels simultaneously (e.g. the word / syllable level and phrase level). The GMM models and a set of HMM-based acoustic model units are carefully designed and trained for the detection and alignment. To eliminate the severe mismatch due to the diversity of musical signal and sparse training data available, the unsupervised adaptation technique such as maximum likelihood linear regression (MLLR) is exploited for tailoring the models to the real environment, which improves robustness of the synchronization system. To further reduce the effect of the missed non-vocal music on alignment, a novel grammar net is build to direct the alignment. As we know, this is the first automatic synchronization system only based on the low-level acoustic feature such as MFCC. We evaluate the system on a Chinese song dataset collecting from 3 popular singers. We obtain 76.1% for the boundary accuracy at the syllable level (BAS) and 81.5% for the boundary accuracy at the phrase level (BAP) using fully automatic vocal/non-vocal detection and alignment. The synchronization system has many applications such as multi-modality (audio and textual) content-based popular song browsing and retrieval. Through the study, we would like to open up the discussion of some challenging problems when developing a robust synchronization system for largescale database.

Vocal acoustic analysis as a biometric indicator of information processing: implications for neurological and psychiatric disorders.

PubMed

Cohen, Alex S; Dinzeo, Thomas J; Donovan, Neila J; Brown, Caitlin E; Morrison, Sean C

2015-03-30

Vocal expression reflects an integral component of communication that varies considerably within individuals across contexts and is disrupted in a range of neurological and psychiatric disorders. There is reason to suspect that variability in vocal expression reflects, in part, the availability of "on-line" resources (e.g., working memory, attention). Thus, understanding vocal expression is a potentially important biometric index of information processing, not only across but within individuals over time. A first step in this line of research involves establishing a link between vocal expression and information processing systems in healthy adults. The present study employed a dual attention experimental task where participants provided natural speech while simultaneously engaged in a baseline, medium or high nonverbal processing-load task. Objective, automated, and computerized analysis was employed to measure vocal expression in 226 adults. Increased processing load resulted in longer pauses, fewer utterances, greater silence overall and less variability in frequency and intensity levels. These results provide compelling evidence of a link between information processing resources and vocal expression, and provide important information for the development of an automated, inexpensive and uninvasive biometric measure of information processing. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Geographical variation of St. Lucia Parrot flight vocalizations

USGS Publications Warehouse

Kleeman, Patrick M.; Gilardi, James D.

2005-01-01

Parrots are vocal learners and many species of parrots are capable of learning new calls, even as adults. This capability gives parrots the potential to develop communication systems that can vary dramatically over space. St. Lucia Parrot (Amazona versicolor) flight vocalizations were examined for geographic variation between four different sites on the island of St. Lucia. Spectrographic cross-correlation analysis of a commonly used flight vocalization, the p-chow call, demonstrated quantitative differences between sites. Additionally, the similarity of p-chows decreased as the distance between sites increased. Flight call repertoires also differed among sites; parrots at the Des Bottes and Quilesse sites each used one flight call unique to those sites, while parrots at the Barre de L'Isle site used a flight call that Quilesse parrots gave only while perched. It is unclear whether the vocal variation changed clinally with distance, or whether there were discrete dialect boundaries as in a congener, the Yellow-naped Parrot (Amazona auropalliata, Wright 1996). The geographical scale over which the St. Lucia Parrot's vocal variation occurred was dramatically smaller than that of the Yellow-naped Parrot. Similar patterns of fine-scale vocal variation may be more widespread among other parrot species in the Caribbean than previously documented.
Cortical representations of communication sounds.

PubMed

Heiser, Marc A; Cheung, Steven W

2008-10-01

This review summarizes recent research into cortical processing of vocalizations in animals and humans. There has been a resurgent interest in this topic accompanied by an increased number of studies using animal models with complex vocalizations and new methods in human brain imaging. Recent results from such studies are discussed. Experiments have begun to reveal the bilateral cortical fields involved in communication sound processing and the transformations of neural representations that occur among those fields. Advances have also been made in understanding the neuronal basis of interaction between developmental exposures and behavioral experiences with vocalization perception. Exposure to sounds during the developmental period produces large effects on brain responses, as do a variety of specific trained tasks in adults. Studies have also uncovered a neural link between the motor production of vocalizations and the representation of vocalizations in cortex. Parallel experiments in humans and animals are answering important questions about vocalization processing in the central nervous system. This dual approach promises to reveal microscopic, mesoscopic, and macroscopic principles of large-scale dynamic interactions between brain regions that underlie the complex phenomenon of vocalization perception. Such advances will yield a greater understanding of the causes, consequences, and treatment of disorders related to speech processing.
The role of the medial temporal limbic system in processing emotions in voice and music.

PubMed

Frühholz, Sascha; Trost, Wiebke; Grandjean, Didier

2014-12-01

Subcortical brain structures of the limbic system, such as the amygdala, are thought to decode the emotional value of sensory information. Recent neuroimaging studies, as well as lesion studies in patients, have shown that the amygdala is sensitive to emotions in voice and music. Similarly, the hippocampus, another part of the temporal limbic system (TLS), is responsive to vocal and musical emotions, but its specific roles in emotional processing from music and especially from voices have been largely neglected. Here we review recent research on vocal and musical emotions, and outline commonalities and differences in the neural processing of emotions in the TLS in terms of emotional valence, emotional intensity and arousal, as well as in terms of acoustic and structural features of voices and music. We summarize the findings in a neural framework including several subcortical and cortical functional pathways between the auditory system and the TLS. This framework proposes that some vocal expressions might already receive a fast emotional evaluation via a subcortical pathway to the amygdala, whereas cortical pathways to the TLS are thought to be equally used for vocal and musical emotions. While the amygdala might be specifically involved in a coarse decoding of the emotional value of voices and music, the hippocampus might process more complex vocal and musical emotions, and might have an important role especially for the decoding of musical emotions by providing memory-based and contextual associations. Copyright © 2014 Elsevier Ltd. All rights reserved.
Involvement of the avian song system in reproductive behaviour

PubMed Central

Wild, J. Martin; Botelho, João F.

2015-01-01

The song system of songbirds consists of an interconnected set of forebrain nuclei that has traditionally been regarded as dedicated to the learning and production of song. Here, however, we suggest that the song system could also influence muscles used in reproductive behaviour, such as the cloacal sphincter muscle. We show that the same medullary nucleus, retroambigualis (RAm), that projects upon spinal motoneurons innervating expiratory muscles (which provide the pressure head for vocalization) and upon vocal motoneurons for respiratory–vocal coordination also projects upon cloacal motoneurons. Furthermore, RAm neurons projecting to sacral spinal levels were shown to receive direct projections from nucleus robustus arcopallialis (RA) of the forebrain song system. Thus, by indicating a possible disynaptic relationship between RA and motoneurons innervating the reproductive organ, in both males and females, these results potentially extend the role of the song system to include consummatory as well as appetitive aspects of reproductive behaviour. PMID:26631245
Gelada vocal sequences follow Menzerath's linguistic law.

PubMed

Gustison, Morgan L; Semple, Stuart; Ferrer-I-Cancho, Ramon; Bergman, Thore J

2016-05-10

Identifying universal principles underpinning diverse natural systems is a key goal of the life sciences. A powerful approach in addressing this goal has been to test whether patterns consistent with linguistic laws are found in nonhuman animals. Menzerath's law is a linguistic law that states that, the larger the construct, the smaller the size of its constituents. Here, to our knowledge, we present the first evidence that Menzerath's law holds in the vocal communication of a nonhuman species. We show that, in vocal sequences of wild male geladas (Theropithecus gelada), construct size (sequence size in number of calls) is negatively correlated with constituent size (duration of calls). Call duration does not vary significantly with position in the sequence, but call sequence composition does change with sequence size and most call types are abbreviated in larger sequences. We also find that intercall intervals follow the same relationship with sequence size as do calls. Finally, we provide formal mathematical support for the idea that Menzerath's law reflects compression-the principle of minimizing the expected length of a code. Our findings suggest that a common principle underpins human and gelada vocal communication, highlighting the value of exploring the applicability of linguistic laws in vocal systems outside the realm of language.
[3D visualization and analysis of vocal fold dynamics].

PubMed

Bohr, C; Döllinger, M; Kniesburges, S; Traxdorf, M

2016-04-01

Visual investigation methods of the larynx mainly allow for the two-dimensional presentation of the three-dimensional structures of the vocal fold dynamics. The vertical component of the vocal fold dynamics is often neglected, yielding a loss of information. The latest studies show that the vertical dynamic components are in the range of the medio-lateral dynamics and play a significant role within the phonation process. This work presents a method for future 3D reconstruction and visualization of endoscopically recorded vocal fold dynamics. The setup contains a high-speed camera (HSC) and a laser projection system (LPS). The LPS projects a regular grid on the vocal fold surfaces and in combination with the HSC allows a three-dimensional reconstruction of the vocal fold surface. Hence, quantitative information on displacements and velocities can be provided. The applicability of the method is presented for one ex-vivo human larynx, one ex-vivo porcine larynx and one synthetic silicone larynx. The setup introduced allows the reconstruction of the entire visible vocal fold surfaces for each oscillation status. This enables a detailed analysis of the three dimensional dynamics (i. e. displacements, velocities, accelerations) of the vocal folds. The next goal is the miniaturization of the LPS to allow clinical in-vivo analysis in humans. We anticipate new insight on dependencies between 3D dynamic behavior and the quality of the acoustic outcome for healthy and disordered phonation.
[Clinical study on vocal cords spontaneous rehabilitation after CO2 laser surgery].

PubMed

Zhang, Qingxiang; Hu, Huiying; Sun, Guoyan; Yu, Zhenkun

2014-10-01

To study the spontaneous rehabilitation and phonation quality of vocal cords after different types of CO2 laser microsurgery. Surgical procedures based on Remacle system Type I, Type II, Type III, Type IV and Type V a respectively. Three hundred and fifteen cases with hoarseness based on strobe laryngoscopy results were prospectively assigned to different group according to vocal lesions apperence,vocal vibration and imaging of larynx CT/MRI. Each group holded 63 cases. The investigation included the vocal cords morphological features,the patients' subjective feelings and objective results of vocal cords. There are no severe complications for all patients in perioperative period. Vocal scar found in Type I ,1 case; Type II, 9 cases ;Type III, 47 cases; Type IV, 61 cases and Type Va 63 cases respectively after surgery. The difference of Vocal scar formation after surgery between surgical procedures are statistical significance (χ2 = 222.24, P < 0.05). Hoarseness improved after the surgery in 59 cases of Type I , 51 cases of Type II, 43 cases of Type III, 21 cases of Type IV and 17 cases of Type Va. There are statistically significance (χ2 = 89.46, P < 0.05) between different surgical procedures. The parameters of strobe laryngoscope: there are statistical significance on jitter between procedures (F 44.51, P < 0.05), but without difference within Type I and Type II (P > 0.05). This happened in shimmer parameter and the maximum phonation time (MPT) as jitter. There are no statistical significance between Type IV and Type Va on MPT (P > 0.05). Morphological and functional rehabilitation of vocal cord will be affected obviously when the body layer is injured. The depth and range of the CO2 laser microsurgery are the key factors affecting the vocal rehabilitation.
Two organizing principles of vocal production: Implications for nonhuman and human primates.

PubMed

Owren, Michael J; Amoss, R Toby; Rendall, Drew

2011-06-01

Vocal communication in nonhuman primates receives considerable research attention, with many investigators arguing for similarities between this calling and speech in humans. Data from development and neural organization show a central role of affect in monkey and ape sounds, however, suggesting that their calls are homologous to spontaneous human emotional vocalizations while having little relation to spoken language. Based on this evidence, we propose two principles that can be useful in evaluating the many and disparate empirical findings that bear on the nature of vocal production in nonhuman and human primates. One principle distinguishes production-first from reception-first vocal development, referring to the markedly different role of auditory-motor experience in each case. The second highlights a phenomenon dubbed dual neural pathways, specifically that when a species with an existing vocal system evolves a new functionally distinct vocalization capability, it occurs through emergence of a second parallel neural pathway rather than through expansion of the extant circuitry. With these principles as a backdrop, we review evidence of acoustic modification of calling associated with background noise, conditioning effects, audience composition, and vocal convergence and divergence in nonhuman primates. Although each kind of evidence has been interpreted to show flexible cognitively mediated control over vocal production, we suggest that most are more consistent with affectively grounded mechanisms. The lone exception is production of simple, novel sounds in great apes, which is argued to reveal at least some degree of volitional vocal control. If also present in early hominins, the cortically based circuitry surmised to be associated with these rudimentary capabilities likely also provided the substrate for later emergence of the neural pathway allowing volitional production in modern humans. © 2010 Wiley-Liss, Inc.
Individual vocal signatures in barn owl nestlings: does individual recognition have an adaptive role in sibling vocal competition?

PubMed

Dreiss, A N; Ruppli, C A; Roulin, A

2014-01-01

To compete over limited parental resources, young animals communicate with their parents and siblings by producing honest vocal signals of need. Components of begging calls that are sensitive to food deprivation may honestly signal need, whereas other components may be associated with individual-specific attributes that do not change with time such as identity, sex, absolute age and hierarchy. In a sib-sib communication system where barn owl (Tyto alba) nestlings vocally negotiate priority access to food resources, we show that calls have individual signatures that are used by nestlings to recognize which siblings are motivated to compete, even if most vocalization features vary with hunger level. Nestlings were more identifiable when food-deprived than food-satiated, suggesting that vocal identity is emphasized when the benefit of winning a vocal contest is higher. In broods where siblings interact iteratively, we speculate that individual-specific signature permits siblings to verify that the most vocal individual in the absence of parents is the one that indeed perceived the food brought by parents. Individual recognition may also allow nestlings to associate identity with individual-specific characteristics such as position in the within-brood dominance hierarchy. Calls indeed revealed age hierarchy and to a lower extent sex and absolute age. Using a cross-fostering experimental design, we show that most acoustic features were related to the nest of origin (but not the nest of rearing), suggesting a genetic or an early developmental effect on the ontogeny of vocal signatures. To conclude, our study suggests that sibling competition has promoted the evolution of vocal behaviours that signal not only hunger level but also intrinsic individual characteristics such as identity, family, sex and age. © 2013 The Authors. Journal of Evolutionary Biology © 2013 European Society For Evolutionary Biology.
A Bayesian Account of Vocal Adaptation to Pitch-Shifted Auditory Feedback

PubMed Central

Hahnloser, Richard H. R.

2017-01-01

Motor systems are highly adaptive. Both birds and humans compensate for synthetically induced shifts in the pitch (fundamental frequency) of auditory feedback stemming from their vocalizations. Pitch-shift compensation is partial in the sense that large shifts lead to smaller relative compensatory adjustments of vocal pitch than small shifts. Also, compensation is larger in subjects with high motor variability. To formulate a mechanistic description of these findings, we adapt a Bayesian model of error relevance. We assume that vocal-auditory feedback loops in the brain cope optimally with known sensory and motor variability. Based on measurements of motor variability, optimal compensatory responses in our model provide accurate fits to published experimental data. Optimal compensation correctly predicts sensory acuity, which has been estimated in psychophysical experiments as just-noticeable pitch differences. Our model extends the utility of Bayesian approaches to adaptive vocal behaviors. PMID:28135267
Vocal fold vibrations: high-speed imaging, kymography, and acoustic analysis: a preliminary report.

PubMed

Larsson, H; Hertegård, S; Lindestad, P A; Hammarberg, B

2000-12-01

To evaluate a new analysis system, High-Speed Tool Box (H. Larsson, custom-made program for image analysis, version 1.1, Department of Logopedics and Phoniatrics, Huddinge University Hospital, Huddinge, Sweden, 1998) for studying vocal fold vibrations using a high-speed camera and to relate findings from these analyses to sound characteristics. A Weinberger Speedcam + 500 system (Weinberger AG, Dietikon, Switzerland) was used with a frame rate of 1,904 frames per second. Images were stored and analyzed digitally. Analysis included automatic glottal edge detection and calculation of glottal area variations, as well as kymography. These signals were compared with acoustic waveforms using the Soundswell program (Hitech Development AB, Stockholm, Sweden). The High-Speed Tool Box was applied on two types of high-speed recordings: a diplophonic phonation and a tremor voice. Relations between glottal vibratory patterns and the sound waveform were analyzed. In the diplophonic phonation, the glottal area waveform, as well as the kymogram, showed a specific pattern of repetitive glottal closures, which was also seen in the acoustic waveform. In the tremor voice, fundamental frequency (F0) fluctuations in the acoustic waveform were reflected in slow variations in amplitude in the glottal area waveform. For studying details of mucosal movements during these kinds of abnormal vibrations, the glottal area waveform was particularly useful. Our results suggest that this combined high-speed acoustic-kymographic analysis package is a promising aid for separating and specifying different voice qualities such as diplophonia and voice tremor. Apart from clinical use, this finding should be of help for specification of the terminology of different voice qualities.
A real-time LPC-based vocal tract area display for voice development.

PubMed

Rossiter, D; Howard, D M; Downes, M

1994-12-01

This article reports the design and implementation of a graphical display that presents an approximation to vocal tract area in real time for voiced vowel articulation. The acoustic signal is digitally sampled by the system. From these data a set of reflection coefficients is derived using linear predictive coding. A matrix of area coefficients is then determined that approximates the vocal tract area of the user. From this information a graphical display is then generated. The complete cycle of analysis and display is repeated at approximately 20 times/s. Synchronised audio and visual sequences can be recorded and used as dynamic targets for articulatory development. Use of the system is illustrated by diagrams of system output for spoken cardinal vowels and for vowels sung in a trained and untrained style.
Sleep, offline processing, and vocal learning

PubMed Central

Margoliash, Daniel; Schmidt, Marc F

2009-01-01

The study of song learning and the neural song system has provided an important comparative model system for the study of speech and language acquisition. We describe some recent advances in the bird song system, focusing on the role of offline processing including sleep in processing sensory information and in guiding developmental song learning. These observations motivate a new model of the organization and role of the sensory memories in vocal learning. PMID:19906416
Effect of pneumotach on measurement of vocal function

NASA Astrophysics Data System (ADS)

Walters, Gage; McPhail, Michael; Krane, Michael

2017-11-01

Aerodynamic and acoustic measurements of vocal function were performed in a physical model of the human airway with and without a pneumotach (Rothenberg mask), used by clinicians to measure vocal volume flow. The purpose of these experiments was to assess whether the device alters acoustic and aerodynamic conditions sufficiently to change phonation behavior. The airway model, which mimics acoustic behavior of an adult human airway from trachea to mouth, consists of a 31.5cm long straight duct with a 2.54cm square cross section. Model vocal folds comprised of molded silicone rubber were set into vibration by introducing airflow from a compressed air source. Measurements included transglottal pressure difference, mean volume flow, vocal fold vibratory motion, and sound pressure measured at the mouth. The experiments show that while the pneumotach imparted measurable aerodynamic and acoustic loads on the system, measurement of mean glottal resistance was not affected. Acoustic pressure levels were attenuated, however, suggesting clinical acoustic measurements of vocal function need correction when performed in conjunction with a pneumotach Acknowledge support from NIH DC R01005642-11.
Wavelet based detection of manatee vocalizations

NASA Astrophysics Data System (ADS)

Gur, Berke M.; Niezrecki, Christopher

2005-04-01

The West Indian manatee (Trichechus manatus latirostris) has become endangered partly because of watercraft collisions in Florida's coastal waterways. Several boater warning systems, based upon manatee vocalizations, have been proposed to reduce the number of collisions. Three detection methods based on the Fourier transform (threshold, harmonic content and autocorrelation methods) were previously suggested and tested. In the last decade, the wavelet transform has emerged as an alternative to the Fourier transform and has been successfully applied in various fields of science and engineering including the acoustic detection of dolphin vocalizations. As of yet, no prior research has been conducted in analyzing manatee vocalizations using the wavelet transform. Within this study, the wavelet transform is used as an alternative to the Fourier transform in detecting manatee vocalizations. The wavelet coefficients are analyzed and tested against a specified criterion to determine the existence of a manatee call. The performance of the method presented is tested on the same data previously used in the prior studies, and the results are compared. Preliminary results indicate that using the wavelet transform as a signal processing technique to detect manatee vocalizations shows great promise.
Acoustic, respiratory kinematic and electromyographic effects of vocal training

NASA Astrophysics Data System (ADS)

Mendes, Ana Paula De Brito Garcia

The longitudinal effects of vocal training on the respiratory, phonatory and articulatory systems were investigated in this study. During four semesters, fourteen voice major students were recorded while speaking and singing. Acoustic, temporal, respiratory kinematic and electromyographic parameters were measured to determine changes in the three systems as a function of vocal training. Acoustic measures of the speaking voice included fundamental frequency, sound pressure level (SPL), percent jitter and shimmer, and harmonic-to-noise ratio. Temporal measures included duration of sentences, diphthongs and the closure durations of stop consonants. Acoustic measures of the singing voice included fundamental frequency and sound pressure level of the phonational range, vibrato pulses per second, vibrato amplitude variation and the presence of the singer's formant. Analysis of the data revealed that vocal training had a significant effect on the singing voice. Fundamental frequency and SPL of the 90% level and 90--10% of the phonational range increased significantly during four semesters of vocal training. Physiological data was collected from four subjects during three semesters of vocal training. Respiratory kinematic measures included lung volume, rib cage and abdominal excursions extracted from spoken sung samples. Descriptive statistics revealed that rib cage and abdominal excursions increased from the 1st to the 2nd semester and decrease from the 2nd to the 3rd semester of vocal training. Electromyographic measures of the pectoralis major, rectus abdominis and external obliques muscles revealed that burst duration means decreased from the 1st to the 2nd semester and increased from the 2nd to the 3rd semester. Peak amplitude means increased from the 1st to the 2nd and decreased from the 2nd to the 3rd semester of vocal training. Chest wall excursions and muscle force generation of the three muscles increased as the demanding level and the length of the phonatory tasks increased.
Acoustic characteristics used by Japanese macaques for individual discrimination.

PubMed

Furuyama, Takafumi; Kobayasi, Kohta I; Riquimaroux, Hiroshi

2017-10-01

The vocalizations of primates contain information about speaker individuality. Many primates, including humans, are able to distinguish conspecifics based solely on vocalizations. The purpose of this study was to investigate the acoustic characteristics used by Japanese macaques in individual vocal discrimination. Furthermore, we tested human subjects using monkey vocalizations to evaluate species specificity with respect to such discriminations. Two monkeys and five humans were trained to discriminate the coo calls of two unfamiliar monkeys. We created a stimulus continuum between the vocalizations of the two monkeys as a set of probe stimuli (whole morph). We also created two sets of continua in which only one acoustic parameter, fundamental frequency ( f 0 ) or vocal tract characteristic (VTC), was changed from the coo call of one monkey to that of another while the other acoustic feature remained the same ( f 0 morph and VTC morph, respectively). According to the results, the reaction times both of monkeys and humans were correlated with the morph proportion under the whole morph and f 0 morph conditions. The reaction time to the VTC morph was correlated with the morph proportion in both monkeys, whereas the reaction time in humans, on average, was not correlated with morph proportion. Japanese monkeys relied more consistently on VTC than did humans for discriminating monkey vocalizations. Our results support the idea that the auditory system of primates is specialized for processing conspecific vocalizations and suggest that VTC is a significant acoustic feature used by Japanese macaques to discriminate conspecific vocalizations. © 2017. Published by The Company of Biologists Ltd.
A meta-analysis of outcomes of hydration intervention on phonation threshold pressure.

PubMed

Leydon, Ciara; Wroblewski, Marcin; Eichorn, Naomi; Sivasankar, Mahalakshmi

2010-11-01

Vocal fold hydration is purported to promote optimal biomechanical characteristics of vocal fold mucosa, increase efficiency of vocal fold oscillation, and enhance voice quality. The purpose of this work was to determine the magnitude and consistency of the effect of vocal fold hydration on vocal fold function across published clinical studies. We completed a comprehensive meta-analysis of the effects of superficial and systemic vocal fold hydration on phonation threshold pressure (PTP), a measure of efficiency of voice production. We identified 34 studies that examined the effects of hydration on vocal function. Of these studies, 14 examined the effects of hydration on PTP. Nine of these articles met the criteria for inclusion in this analysis. We observed an average effect size of 0.33, indicating that, overall, hydration treatment demonstrated a tendency to reduce PTP. However, this decrease in phonatory effort did not reach significance at the 95% confidence level. The effects of hydration intervention varied considerably across studies (-0.19 to 3.96). We considered that two factors, pitch level of the task and vocal health of participants, may have contributed to this variability in findings. However, our analysis found that these factors could not account for differences in effect size. To understand the variability in outcomes across studies, the role of factors that may impact the effects of hydration, such as the amount, type, and duration of intervention, must be determined. Only then can we obtain data to guide best clinical practice for protecting and rehabilitating vocal function. Copyright © 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
A Chinese alligator in heliox: formant frequencies in a crocodilian

PubMed Central

Reber, Stephan A.; Nishimura, Takeshi; Janisch, Judith; Robertson, Mark; Fitch, W. Tecumseh

2015-01-01

ABSTRACT Crocodilians are among the most vocal non-avian reptiles. Adults of both sexes produce loud vocalizations known as ‘bellows’ year round, with the highest rate during the mating season. Although the specific function of these vocalizations remains unclear, they may advertise the caller's body size, because relative size differences strongly affect courtship and territorial behaviour in crocodilians. In mammals and birds, a common mechanism for producing honest acoustic signals of body size is via formant frequencies (vocal tract resonances). To our knowledge, formants have to date never been documented in any non-avian reptile, and formants do not seem to play a role in the vocalizations of anurans. We tested for formants in crocodilian vocalizations by using playbacks to induce a female Chinese alligator (Alligator sinensis) to bellow in an airtight chamber. During vocalizations, the animal inhaled either normal air or a helium/oxygen mixture (heliox) in which the velocity of sound is increased. Although heliox allows normal respiration, it alters the formant distribution of the sound spectrum. An acoustic analysis of the calls showed that the source signal components remained constant under both conditions, but an upward shift of high-energy frequency bands was observed in heliox. We conclude that these frequency bands represent formants. We suggest that crocodilian vocalizations could thus provide an acoustic indication of body size via formants. Because birds and crocodilians share a common ancestor with all dinosaurs, a better understanding of their vocal production systems may also provide insight into the communication of extinct Archosaurians. PMID:26246611

Mapping the distribution of language related genes FoxP1, FoxP2, and CntnaP2 in the brains of vocal learning bat species

PubMed Central

Rodenas‐Cuadrado, Pedro M.; Mengede, Janine; Baas, Laura; Devanna, Paolo; Schmid, Tobias A.; Yartsev, Michael; Firzlaff, Uwe

2018-01-01

Abstract Genes including FOXP2, FOXP1, and CNTNAP2, have been implicated in human speech and language phenotypes, pointing to a role in the development of normal language‐related circuitry in the brain. Although speech and language are unique to humans a comparative approach is possible by addressing language‐relevant traits in animal systems. One such trait, vocal learning, represents an essential component of human spoken language, and is shared by cetaceans, pinnipeds, elephants, some birds and bats. Given their vocal learning abilities, gregarious nature, and reliance on vocalizations for social communication and navigation, bats represent an intriguing mammalian system in which to explore language‐relevant genes. We used immunohistochemistry to detail the distribution of FoxP2, FoxP1, and Cntnap2 proteins, accompanied by detailed cytoarchitectural histology in the brains of two vocal learning bat species; Phyllostomus discolor and Rousettus aegyptiacus. We show widespread expression of these genes, similar to what has been previously observed in other species, including humans. A striking difference was observed in the adult P. discolor bat, which showed low levels of FoxP2 expression in the cortex that contrasted with patterns found in rodents and nonhuman primates. We created an online, open‐access database within which all data can be browsed, searched, and high resolution images viewed to single cell resolution. The data presented herein reveal regions of interest in the bat brain and provide new opportunities to address the role of these language‐related genes in complex vocal‐motor and vocal learning behaviors in a mammalian model system. PMID:29297931
From imitation to meaning: circuit plasticity and the acquisition of a conventionalized semantics

PubMed Central

García, Ricardo R.; Zamorano, Francisco; Aboitiz, Francisco

2014-01-01

The capacity for language is arguably the most remarkable innovation of the human brain. A relatively recent interpretation prescribes that part of the language-related circuits were co-opted from circuitry involved in hand control—the mirror neuron system (MNS), involved both in the perception and in the execution of voluntary grasping actions. A less radical view is that in early humans, communication was opportunistic and multimodal, using signs, vocalizations or whatever means available to transmit social information. However, one point that is not yet clear under either perspective is how learned communication acquired a semantic property thereby allowing us to name objects and eventually describe our surrounding environment. Here we suggest a scenario involving both manual gestures and learned vocalizations that led to the development of a primitive form of conventionalized reference. This proposal is based on comparative evidence gathered from other species and on neurolinguistic evidence in humans, which points to a crucial role for vocal learning in the early development of language. Firstly, the capacity to direct the attention of others to a common object may have been crucial for developing a consensual referential system. Pointing, which is a ritualized grasping gesture, may have been crucial to this end. Vocalizations also served to generate joint attention among conversants, especially when combined with gaze direction. Another contributing element was the development of pantomimic actions resembling events or animals. In conjunction with this mimicry, the development of plastic neural circuits that support complex, learned vocalizations was probably a significant factor in the evolution of conventionalized semantics in our species. Thus, vocal imitations of sounds, as in onomatopoeias (words whose sound resembles their meaning), are possibly supported by mirror system circuits, and may have been relevant in the acquisition of early meanings. PMID:25152726
Error-dependent modulation of speech-induced auditory suppression for pitch-shifted voice feedback

PubMed Central

2011-01-01

Background The motor-driven predictions about expected sensory feedback (efference copies) have been proposed to play an important role in recognition of sensory consequences of self-produced motor actions. In the auditory system, this effect was suggested to result in suppression of sensory neural responses to self-produced voices that are predicted by the efference copies during vocal production in comparison with passive listening to the playback of the identical self-vocalizations. In the present study, event-related potentials (ERPs) were recorded in response to upward pitch shift stimuli (PSS) with five different magnitudes (0, +50, +100, +200 and +400 cents) at voice onset during active vocal production and passive listening to the playback. Results Results indicated that the suppression of the N1 component during vocal production was largest for unaltered voice feedback (PSS: 0 cents), became smaller as the magnitude of PSS increased to 200 cents, and was almost completely eliminated in response to 400 cents stimuli. Conclusions Findings of the present study suggest that the brain utilizes the motor predictions (efference copies) to determine the source of incoming stimuli and maximally suppresses the auditory responses to unaltered feedback of self-vocalizations. The reduction of suppression for 50, 100 and 200 cents and its elimination for 400 cents pitch-shifted voice auditory feedback support the idea that motor-driven suppression of voice feedback leads to distinctly different sensory neural processing of self vs. non-self vocalizations. This characteristic may enable the audio-vocal system to more effectively detect and correct for unexpected errors in the feedback of self-produced voice pitch compared with externally-generated sounds. PMID:21645406
Animal vocal sequences: not the Markov chains we thought they were

PubMed Central

Kershenbaum, Arik; Bowles, Ann E.; Freeberg, Todd M.; Jin, Dezhe Z.; Lameira, Adriano R.; Bohn, Kirsten

2014-01-01

Many animals produce vocal sequences that appear complex. Most researchers assume that these sequences are well characterized as Markov chains (i.e. that the probability of a particular vocal element can be calculated from the history of only a finite number of preceding elements). However, this assumption has never been explicitly tested. Furthermore, it is unclear how language could evolve in a single step from a Markovian origin, as is frequently assumed, as no intermediate forms have been found between animal communication and human language. Here, we assess whether animal taxa produce vocal sequences that are better described by Markov chains, or by non-Markovian dynamics such as the ‘renewal process’ (RP), characterized by a strong tendency to repeat elements. We examined vocal sequences of seven taxa: Bengalese finches Lonchura striata domestica, Carolina chickadees Poecile carolinensis, free-tailed bats Tadarida brasiliensis, rock hyraxes Procavia capensis, pilot whales Globicephala macrorhynchus, killer whales Orcinus orca and orangutans Pongo spp. The vocal systems of most of these species are more consistent with a non-Markovian RP than with the Markovian models traditionally assumed. Our data suggest that non-Markovian vocal sequences may be more common than Markov sequences, which must be taken into account when evaluating alternative hypotheses for the evolution of signalling complexity, and perhaps human language origins. PMID:25143037
Communication Modality Sampling for a Toddler with Angelman Syndrome

ERIC Educational Resources Information Center

Martin, Jolene Hyppa; Reichle, Joe; Dimian, Adele; Chen, Mo

2013-01-01

Purpose: Vocal, gestural, and graphic communication modes were implemented concurrently with a toddler with Angelman syndrome to identify the most efficiently learned communication mode to emphasize in an initial augmentative communication system. Method: Symbols representing preferred objects were introduced in vocal, gestural, and graphic…
Analysis of human scream and its impact on text-independent speaker verification.

PubMed

Hansen, John H L; Nandwana, Mahesh Kumar; Shokouhi, Navid

2017-04-01

Scream is defined as sustained, high-energy vocalizations that lack phonological structure. Lack of phonological structure is how scream is identified from other forms of loud vocalization, such as "yell." This study investigates the acoustic aspects of screams and addresses those that are known to prevent standard speaker identification systems from recognizing the identity of screaming speakers. It is well established that speaker variability due to changes in vocal effort and Lombard effect contribute to degraded performance in automatic speech systems (i.e., speech recognition, speaker identification, diarization, etc.). However, previous research in the general area of speaker variability has concentrated on human speech production, whereas less is known about non-speech vocalizations. The UT-NonSpeech corpus is developed here to investigate speaker verification from scream samples. This study considers a detailed analysis in terms of fundamental frequency, spectral peak shift, frame energy distribution, and spectral tilt. It is shown that traditional speaker recognition based on the Gaussian mixture models-universal background model framework is unreliable when evaluated with screams.
Recording vocalizations with Bluetooth technology.

PubMed

Gaona-González, Andrés; Santillán-Doherty, Ana María; Arenas-Rosas, Rita Virginia; Muñoz-Delgado, Jairo; Aguillón-Pantaleón, Miguel Angel; Ordoñez-Gómez, José Domingo; Márquez-Arias, Alejandra

2011-06-01

We propose a method for capturing vocalizations that is designed to avoid some of the limiting factors found in traditional bioacoustical methods, such as the impossibility of obtaining continuous long-term registers or analyzing amplitude due to the continuous change of distance between the subject and the position of the recording system. Using Bluetooth technology, vocalizations are captured and transmitted wirelessly into a receiving system without affecting the quality of the signal. The recordings of the proposed system were compared to those obtained as a reference, which were based on the coding of the signal with the so-called pulse-code modulation technique in WAV audio format without any compressing process. The evaluation showed p < .05 for the measured quantitative and qualitative parameters. We also describe how the transmitting system is encapsulated and fixed on the animal and a way to video record a spider monkey's behavior simultaneously with the audio recordings.
Acoustic detection of manatee vocalizations

NASA Astrophysics Data System (ADS)

Niezrecki, Christopher; Phillips, Richard; Meyer, Michael; Beusse, Diedrich O.

2003-09-01

The West Indian manatee (trichechus manatus latirostris) has become endangered partly because of a growing number of collisions with boats. A system to warn boaters of the presence of manatees, that can signal to boaters that manatees are present in the immediate vicinity, could potentially reduce these boat collisions. In order to identify the presence of manatees, acoustic methods are employed. Within this paper, three different detection algorithms are used to detect the calls of the West Indian manatee. The detection systems are tested in the laboratory using simulated manatee vocalizations from an audio compact disk. The detection method that provides the best overall performance is able to correctly identify ~96% of the manatee vocalizations. However, the system also results in a false alarm rate of ~16%. The results of this work may ultimately lead to the development of a manatee warning system that can warn boaters of the presence of manatees.
Examining diseased states in a scaled-up vocal fold model using simultaneous temporally resolved DPIV and pressure measurements

NASA Astrophysics Data System (ADS)

Rogers, Dylan; Wei, Nathaniel; Ringenber, Hunter; Krane, Michael; Wei, Timothy

2017-11-01

This study builds on the parallel presentation of Ringenberg, et al. (APS-DFD 2017) involving simultaneous, temporally and spatially resolved flow and pressure measurements in a scaled-up vocal fold model. In this talk, data from experiments replicating characteristics of diseased vocal folds are presented. This begins with vocal folds that do not fully close and continues with asymmetric oscillations. Data are compared to symmetric, i.e. `healthy', oscillatory motions presented in the companion talk. Having pressure and flow data for individual as well as phase averaged oscillations for these diseased cases highlights the potential for aeroacoustic analysis in this complex system. Supported by NIH Grant No. 2R01 DC005642-11.
Effect of the losses in the vocal tract on determination of the area function.

PubMed

Gülmezoğlu, M Bilginer; Barkana, Atalay

2003-01-01

In this work, the cross-sectional areas of the vocal tract are determined for the lossy and lossless cases by using the pole-zero models obtained from the electrical equivalent circuit model of the vocal tract and the system identification method. The cross-sectional areas are used to compare the lossy and lossless cases. In the lossy case, the internal losses due to wall vibration, heat conduction, air friction and viscosity are considered, that is, the complex poles and zeros obtained from the models are used directly. Whereas, in the lossless case, only the imaginary parts of these poles and zeros are used. The vocal tract shapes obtained for the lossy case are close to the actual ones.
Food for Song: Expression of C-Fos and ZENK in the Zebra Finch Song Nuclei during Food Aversion Learning

PubMed Central

Tokarev, Kirill; Tiunova, Anna

2011-01-01

Background Specialized neural pathways, the song system, are required for acquiring, producing, and perceiving learned avian vocalizations. Birds that do not learn to produce their vocalizations lack telencephalic song system components. It is not known whether the song system forebrain regions are exclusively evolved for song or whether they also process information not related to song that might reflect their ‘evolutionary history’. Methodology/Principal Findings To address this question we monitored the induction of two immediate-early genes (IEGs) c-Fos and ZENK in various regions of the song system in zebra finches (Taeniopygia guttata) in response to an aversive food learning paradigm; this involves the association of a food item with a noxious stimulus that affects the oropharyngeal-esophageal cavity and tongue, causing subsequent avoidance of that food item. The motor response results in beak and head movements but not vocalizations. IEGs have been extensively used to map neuro-molecular correlates of song motor production and auditory processing. As previously reported, neurons in two pallial vocal motor regions, HVC and RA, expressed IEGs after singing. Surprisingly, c-Fos was induced equivalently also after food aversion learning in the absence of singing. The density of c-Fos positive neurons was significantly higher than that of birds in control conditions. This was not the case in two other pallial song nuclei important for vocal plasticity, LMAN and Area X, although singing did induce IEGs in these structures, as reported previously. Conclusions/Significance Our results are consistent with the possibility that some of the song nuclei may participate in non-vocal learning and the populations of neurons involved in the two tasks show partial overlap. These findings underscore the previously advanced notion that the specialized forebrain pre-motor nuclei controlling song evolved from circuits involved in behaviors related to feeding. PMID:21695176
Meaning in the avian auditory cortex: Neural representation of communication calls

PubMed Central

Elie, Julie E; Theunissen, Frédéric E

2014-01-01

Understanding how the brain extracts the behavioral meaning carried by specific vocalization types that can be emitted by various vocalizers and in different conditions is a central question in auditory research. This semantic categorization is a fundamental process required for acoustic communication and presupposes discriminative and invariance properties of the auditory system for conspecific vocalizations. Songbirds have been used extensively to study vocal learning, but the communicative function of all their vocalizations and their neural representation has yet to be examined. In our research, we first generated a library containing almost the entire zebra finch vocal repertoire and organized communication calls along 9 different categories based on their behavioral meaning. We then investigated the neural representations of these semantic categories in the primary and secondary auditory areas of 6 anesthetized zebra finches. To analyze how single units encode these call categories, we described neural responses in terms of their discrimination, selectivity and invariance properties. Quantitative measures for these neural properties were obtained using an optimal decoder based both on spike counts and spike patterns. Information theoretic metrics show that almost half of the single units encode semantic information. Neurons achieve higher discrimination of these semantic categories by being more selective and more invariant. These results demonstrate that computations necessary for semantic categorization of meaningful vocalizations are already present in the auditory cortex and emphasize the value of a neuro-ethological approach to understand vocal communication. PMID:25728175
Quantitative evaluation of phonetograms in the case of functional dysphonia.

PubMed

Airainer, R; Klingholz, F

1993-06-01

According to the laryngeal clinical findings, figures making up a scale were assigned to vocally trained and vocally untrained persons suffering from different types of functional dysphonia. The different types of dysphonia--from the manifested hypofunctional to the extreme hyperfunctional dysphonia--were classified by means of this scale. Besides, the subjects' phonetograms were measured and approximated by three ellipses, what rendered possible the definition of phonetogram parameters. The combining of selected phonetogram parameters to linear combinations served the purpose of a phonetographic evaluation. The linear combinations were to bring phonetographic and clinical evaluations into correspondence as accurately as possible. It was necessary to use different kinds of linear combinations for male and female singers and nonsingers. As a result of the reclassification of 71 and the new classification of 89 patients, it was possible to graduate the types of functional dysphonia by means of computer-aided phonetogram evaluation with a clinically acceptable error rate. This method proved to be an important supplement to the conventional diagnostics of functional dysphonia.
Mouse Vocal Communication System: Are Ultrasounds Learned or Innate?

ERIC Educational Resources Information Center

Arriaga, Gustavo; Jarvis, Erich D.

2013-01-01

Mouse ultrasonic vocalizations (USVs) are often used as behavioral readouts of internal states, to measure effects of social and pharmacological manipulations, and for behavioral phenotyping of mouse models for neuropsychiatric and neurodegenerative disorders. However, little is known about the neurobiological mechanisms of rodent USV production.…
Electrophysiological neural monitoring of the laryngeal nerves in thyroid surgery: review of the current literature

PubMed Central

Deniwar, Ahmed; Randolph, Gregory

2015-01-01

Recurrent laryngeal nerve (RLN) injury is one of the most common complications of thyroid surgery. RLN injury can cause vocal cord paralysis, affecting the patient’s voice and the quality of life. Injury of the external branch of the superior laryngeal nerve (EBSLN) can cause cricothyroid muscle denervation affecting high vocal tones. Thus, securing the laryngeal nerves in these surgeries is of utmost importance. Visual identification of the nerves has long been the standard method for this precaution. Intraoperative neuromonitoring (IONM) has been introduced as a novel technology to improve the protection of the laryngeal nerves and reduce the rate of RLN injury. The aim of this article is to provide a brief description of the technique and review the literature to illustrate the value of IONM. IONM can provide early identification of anatomical variations and unusual nerve routes, which carry a higher risk of injury if not detected. IONM helps in prognosticating postoperative nerve function. Moreover, by detecting nerve injury intraoperatively, it aids in staging bilateral surgeries to avoid bilateral vocal cord paralysis and tracheostomy. The article will discuss the value of continuous IONM (C-IOMN) that may prevent nerve injury by detecting EMG waveform changes indicating impending nerve injury. Herein, we are also discussing anatomy of laryngeal nerves and aspects of its injury. PMID:26425449
Infant vocalizations and the early diagnosis of severe hearing impairment.

PubMed

Eilers, R E; Oller, D K

1994-02-01

To determine whether late onset of canonical babbling could be used as a criterion to determine risk of hearing impairment, we obtained vocalization samples longitudinally from 94 infants with normal hearing and 37 infants with severe to profound hearing impairment. Parents were instructed to report the onset of canonical babbling (the production of well-formed syllables such as "da," "na," "bee," "yaya"). Verification that the infants were producing canonical syllables was collected in laboratory audio recordings. Infants with normal hearing produced canonical vocalizations before 11 months of age (range, 3 to 10 months; mode, 7 months); infants who were deaf failed to produce canonical syllables until 11 months of age or older, often well into the third year of life (range, 11 to 49 months; mode, 24 months). The correlation between age at onset of the canonical stage and age at auditory amplification was 0.68, indicating that early identification and fitting of hearing aids is of significant benefit to infants learning language. The fact that there is no overlap in the distribution of the onset of canonical babbling between infants with normal hearing and infants with hearing impairment means that the failure of otherwise healthy infants to produce canonical syllables before 11 months of age should be considered a serious risk factor for hearing impairment and, when observed, should result in immediate referral for audiologic evaluation.
Gelada vocal sequences follow Menzerath’s linguistic law

PubMed Central

Gustison, Morgan L.; Semple, Stuart; Ferrer-i-Cancho, Ramon; Bergman, Thore J.

2016-01-01

Identifying universal principles underpinning diverse natural systems is a key goal of the life sciences. A powerful approach in addressing this goal has been to test whether patterns consistent with linguistic laws are found in nonhuman animals. Menzerath’s law is a linguistic law that states that, the larger the construct, the smaller the size of its constituents. Here, to our knowledge, we present the first evidence that Menzerath’s law holds in the vocal communication of a nonhuman species. We show that, in vocal sequences of wild male geladas (Theropithecus gelada), construct size (sequence size in number of calls) is negatively correlated with constituent size (duration of calls). Call duration does not vary significantly with position in the sequence, but call sequence composition does change with sequence size and most call types are abbreviated in larger sequences. We also find that intercall intervals follow the same relationship with sequence size as do calls. Finally, we provide formal mathematical support for the idea that Menzerath’s law reflects compression—the principle of minimizing the expected length of a code. Our findings suggest that a common principle underpins human and gelada vocal communication, highlighting the value of exploring the applicability of linguistic laws in vocal systems outside the realm of language. PMID:27091968
Subglottal pressure and fundamental frequency control in contact calls of juvenile Alligator mississippiensis

PubMed Central

Riede, Tobias; Tokuda, Isao T.; Farmer, C. G.

2011-01-01

SUMMARY Vocalization is rare among non-avian reptiles, with the exception of the crocodilians, the sister taxon of birds. Crocodilians have a complex vocal repertoire. Their vocal and respiratory system is not well understood but appears to consist of a combination of features that are also found in the extremely vocal avian and mammalian taxa. Anatomical studies suggest that the alligator larynx is able to abduct and adduct the vocal folds, but not to elongate or shorten them, and is therefore lacking a key regulator of frequency, yet alligators can modulate fundamental frequency remarkably well. We investigated the morphological and physiological features of sound production in alligators. Vocal fold length scales isometrically across a wide range of alligator body sizes. The relationship between fundamental frequency and subglottal pressure is significant in some individuals at some isolated points, such as call onset and position of maximum fundamental frequency. The relationship is not consistent over large segments of the call. Fundamental frequency can change faster than expected by pressure changes alone, suggesting an active motor pattern controls frequency and is intrinsic to the larynx. We utilized a two-mass vocal fold model to test whether abduction and adduction could generate this motor pattern. The fine-tuned interplay between subglottal pressure and glottal adduction can achieve frequency modulations much larger than those resulting from subglottal pressure variations alone and of similar magnitude, as observed in alligator calls. We conclude that the alligator larynx represents a sound source with only two control parameters (subglottal pressure and vocal fold adduction) in contrast to the mammalian larynx in which three parameters can be altered to modulate frequency (subglottal pressure, vocal fold adduction and length/tension). PMID:21865521
Birds, primates, and spoken language origins: behavioral phenotypes and neurobiological substrates

PubMed Central

Petkov, Christopher I.; Jarvis, Erich D.

2012-01-01

Vocal learners such as humans and songbirds can learn to produce elaborate patterns of structurally organized vocalizations, whereas many other vertebrates such as non-human primates and most other bird groups either cannot or do so to a very limited degree. To explain the similarities among humans and vocal-learning birds and the differences with other species, various theories have been proposed. One set of theories are motor theories, which underscore the role of the motor system as an evolutionary substrate for vocal production learning. For instance, the motor theory of speech and song perception proposes enhanced auditory perceptual learning of speech in humans and song in birds, which suggests a considerable level of neurobiological specialization. Another, a motor theory of vocal learning origin, proposes that the brain pathways that control the learning and production of song and speech were derived from adjacent motor brain pathways. Another set of theories are cognitive theories, which address the interface between cognition and the auditory-vocal domains to support language learning in humans. Here we critically review the behavioral and neurobiological evidence for parallels and differences between the so-called vocal learners and vocal non-learners in the context of motor and cognitive theories. In doing so, we note that behaviorally vocal-production learning abilities are more distributed than categorical, as are the auditory-learning abilities of animals. We propose testable hypotheses on the extent of the specializations and cross-species correspondences suggested by motor and cognitive theories. We believe that determining how spoken language evolved is likely to become clearer with concerted efforts in testing comparative data from many non-human animal species. PMID:22912615
Audio-vocal system regulation in children with autism spectrum disorders.

PubMed

Russo, Nicole; Larson, Charles; Kraus, Nina

2008-06-01

Do children with autism spectrum disorders (ASD) respond similarly to perturbations in auditory feedback as typically developing (TD) children? Presentation of pitch-shifted voice auditory feedback to vocalizing participants reveals a close coupling between the processing of auditory feedback and vocal motor control. This paradigm was used to test the hypothesis that abnormalities in the audio-vocal system would negatively impact ASD compensatory responses to perturbed auditory feedback. Voice fundamental frequency (F(0)) was measured while children produced an /a/ sound into a microphone. The voice signal was fed back to the subjects in real time through headphones. During production, the feedback was pitch shifted (-100 cents, 200 ms) at random intervals for 80 trials. Averaged voice F(0) responses to pitch-shifted stimuli were calculated and correlated with both mental and language abilities as tested via standardized tests. A subset of children with ASD produced larger responses to perturbed auditory feedback than TD children, while the other children with ASD produced significantly lower response magnitudes. Furthermore, robust relationships between language ability, response magnitude and time of peak magnitude were identified. Because auditory feedback helps to stabilize voice F(0) (a major acoustic cue of prosody) and individuals with ASD have problems with prosody, this study identified potential mechanisms of dysfunction in the audio-vocal system for voice pitch regulation in some children with ASD. Objectively quantifying this deficit may inform both the assessment of a subgroup of ASD children with prosody deficits, as well as remediation strategies that incorporate pitch training.

Evaluation of aerodynamic characteristics of a coupled fluid-structure system using generalized Bernoulli’s principle: An application to vocal folds vibration

PubMed Central

Zhang, Lucy T.; Yang, Jubiao

2017-01-01

In this work we explore the aerodynamics flow characteristics of a coupled fluid-structure interaction system using a generalized Bernoulli equation derived directly from the Cauchy momentum equations. Unlike the conventional Bernoulli equation where incompressible, inviscid, and steady flow conditions are assumed, this generalized Bernoulli equation includes the contributions from compressibility, viscous, and unsteadiness, which could be essential in defining aerodynamic characteristics. The application of the derived Bernoulli’s principle is on a fully-coupled fluid-structure interaction simulation of the vocal folds vibration. The coupled system is simulated using the immersed finite element method where compressible Navier-Stokes equations are used to describe the air and an elastic pliable structure to describe the vocal fold. The vibration of the vocal fold works to open and close the glottal flow. The aerodynamics flow characteristics are evaluated using the derived Bernoulli’s principles for a vibration cycle in a carefully partitioned control volume based on the moving structure. The results agree very well to experimental observations, which validate the strategy and its use in other types of flow characteristics that involve coupled fluid-structure interactions. PMID:29527541
Evaluation of aerodynamic characteristics of a coupled fluid-structure system using generalized Bernoulli's principle: An application to vocal folds vibration.

PubMed

Zhang, Lucy T; Yang, Jubiao

2016-12-01

In this work we explore the aerodynamics flow characteristics of a coupled fluid-structure interaction system using a generalized Bernoulli equation derived directly from the Cauchy momentum equations. Unlike the conventional Bernoulli equation where incompressible, inviscid, and steady flow conditions are assumed, this generalized Bernoulli equation includes the contributions from compressibility, viscous, and unsteadiness, which could be essential in defining aerodynamic characteristics. The application of the derived Bernoulli's principle is on a fully-coupled fluid-structure interaction simulation of the vocal folds vibration. The coupled system is simulated using the immersed finite element method where compressible Navier-Stokes equations are used to describe the air and an elastic pliable structure to describe the vocal fold. The vibration of the vocal fold works to open and close the glottal flow. The aerodynamics flow characteristics are evaluated using the derived Bernoulli's principles for a vibration cycle in a carefully partitioned control volume based on the moving structure. The results agree very well to experimental observations, which validate the strategy and its use in other types of flow characteristics that involve coupled fluid-structure interactions.
Systemic Hydration: Relating Science to Clinical Practice in Vocal Health

PubMed Central

Hartley, Naomi A.; Thibeault, Susan L.

2014-01-01

Objectives To examine the current state of the science regarding the role of systemic hydration in vocal function and health. Study Design Literature Review Methods Literature search spanning multiple disciplines, including speech-language pathology, nutrition and dietetics, medicine, sports and exercise science, physiology and biomechanics. Results The relationship between hydration and physical function is an area of common interest amongst multiple professions. Each discipline provides valuable insight into the connection between performance and water balance, as well as complimentary methods of investigation. Existing voice literature suggests a relationship between hydration and voice production, however the underlying mechanisms are not yet defined and a treatment effect for systemic hydration remains to be demonstrated. Literature from other disciplines sheds light on methodological shortcomings and in some cases offers an alternative explanation for observed phenomena. Conclusions A growing body of literature in the field of voice science is documenting a relationship between hydration and vocal function, however greater understanding is required to guide best practice in the maintenance of vocal health and management of voice disorders. Integration of knowledge and technical expertise from multiple disciplines facilitates analysis of existing literature and provides guidance as to future research. PMID:24880674
Pre-Lingual Communication and Attachment Behavior.

ERIC Educational Resources Information Center

Modarressi, Taghi; McCulloch, Duncan

Infant's crying may have an important mediating role in the formation of attachment behavior. The earliest vocalizations are discussed in terms of an acoustic communications model in which the baby's vocal repertoire becomes incorporated into a closed-loop, feedback system with his mother. Certain pre-lingual "signals" may be associated with those…
Vocal Pitch Discrimination in the Motor System

ERIC Educational Resources Information Center

D'Ausilio, Alessandro; Bufalari, Ilaria; Salmas, Paola; Busan, Pierpaolo; Fadiga, Luciano

2011-01-01

Speech production can be broadly separated into two distinct components: Phonation and Articulation. These two aspects require the efficient control of several phono-articulatory effectors. Speech is indeed generated by the vibration of the vocal-folds in the larynx (F0) followed by "filtering" by articulators, to select certain resonant…
The vocal repertoire in a solitary foraging carnivore, Cynictis penicillata, may reflect facultative sociality.

PubMed

Le Roux, Aliza; Cherry, Michael I; Manser, Marta B

2009-05-01

We describe the vocal repertoire of a facultatively social carnivore, the yellow mongoose, Cynictis penicillata. Using a combination of close-range observations, recordings and experiments with simulated predators, we were able to obtain clear descriptions of call structure and function for a wide range of calls used by this herpestid. The vocal repertoire of the yellow mongooses comprised ten call types, half of which were used in appeasing or fearful contexts and half in aggressive interactions. Data from this study suggest that the yellow mongoose uses an urgency-based alarm calling system, indicating high and low urgency through two distinct call types. Compared to solitary mongooses, the yellow mongoose has a large proportion of 'friendly' vocalisations that enhance group cohesion, but its vocal repertoire is smaller and less context-specific than those of obligate social species. This study of the vocal repertoire of the yellow mongoose is, to our knowledge, the most complete to have been conducted on a facultatively social species in its natural habitat.
The vocal repertoire in a solitary foraging carnivore, Cynictis penicillata, may reflect facultative sociality

NASA Astrophysics Data System (ADS)

Le Roux, Aliza; Cherry, Michael I.; Manser, Marta B.

2009-05-01

We describe the vocal repertoire of a facultatively social carnivore, the yellow mongoose, Cynictis penicillata. Using a combination of close-range observations, recordings and experiments with simulated predators, we were able to obtain clear descriptions of call structure and function for a wide range of calls used by this herpestid. The vocal repertoire of the yellow mongooses comprised ten call types, half of which were used in appeasing or fearful contexts and half in aggressive interactions. Data from this study suggest that the yellow mongoose uses an urgency-based alarm calling system, indicating high and low urgency through two distinct call types. Compared to solitary mongooses, the yellow mongoose has a large proportion of ‘friendly’ vocalisations that enhance group cohesion, but its vocal repertoire is smaller and less context-specific than those of obligate social species. This study of the vocal repertoire of the yellow mongoose is, to our knowledge, the most complete to have been conducted on a facultatively social species in its natural habitat.
Computation of physiological human vocal fold parameters by mathematical optimization of a biomechanical model

PubMed Central

Yang, Anxiong; Stingl, Michael; Berry, David A.; Lohscheller, Jörg; Voigt, Daniel; Eysholdt, Ulrich; Döllinger, Michael

2011-01-01

With the use of an endoscopic, high-speed camera, vocal fold dynamics may be observed clinically during phonation. However, observation and subjective judgment alone may be insufficient for clinical diagnosis and documentation of improved vocal function, especially when the laryngeal disease lacks any clear morphological presentation. In this study, biomechanical parameters of the vocal folds are computed by adjusting the corresponding parameters of a three-dimensional model until the dynamics of both systems are similar. First, a mathematical optimization method is presented. Next, model parameters (such as pressure, tension and masses) are adjusted to reproduce vocal fold dynamics, and the deduced parameters are physiologically interpreted. Various combinations of global and local optimization techniques are attempted. Evaluation of the optimization procedure is performed using 50 synthetically generated data sets. The results show sufficient reliability, including 0.07 normalized error, 96% correlation, and 91% accuracy. The technique is also demonstrated on data from human hemilarynx experiments, in which a low normalized error (0.16) and high correlation (84%) values were achieved. In the future, this technique may be applied to clinical high-speed images, yielding objective measures with which to document improved vocal function of patients with voice disorders. PMID:21877808
Performance of a reduced-order FSI model for flow-induced vocal fold vibration

NASA Astrophysics Data System (ADS)

Chang, Siyuan; Luo, Haoxiang; Luo's lab Team

2016-11-01

Vocal fold vibration during speech production involves a three-dimensional unsteady glottal jet flow and three-dimensional nonlinear tissue mechanics. A full 3D fluid-structure interaction (FSI) model is computationally expensive even though it provides most accurate information about the system. On the other hand, an efficient reduced-order FSI model is useful for fast simulation and analysis of the vocal fold dynamics, which is often needed in procedures such as optimization and parameter estimation. In this work, we study the performance of a reduced-order model as compared with the corresponding full 3D model in terms of its accuracy in predicting the vibration frequency and deformation mode. In the reduced-order model, we use a 1D flow model coupled with a 3D tissue model. Two different hyperelastic tissue behaviors are assumed. In addition, the vocal fold thickness and subglottal pressure are varied for systematic comparison. The result shows that the reduced-order model provides consistent predictions as the full 3D model across different tissue material assumptions and subglottal pressures. However, the vocal fold thickness has most effect on the model accuracy, especially when the vocal fold is thin. Supported by the NSF.
FoxP2 Expression in a Highly Vocal Teleost Fish with Comparisons to Tetrapods.

PubMed

Pengra, Ian G G; Marchaterre, Margaret A; Bass, Andrew H

2018-04-19

Motivated by studies of speech deficits in humans, several studies over the past two decades have investigated the potential role of a forkhead domain transcription factor, FoxP2, in the central control of acoustic signaling/vocalization among vertebrates. Comparative neuroanatomical studies that mainly include mammalian and avian species have mapped the distribution of FoxP2 expression in multiple brain regions that imply a greater functional significance beyond vocalization that might be shared broadly across vertebrate lineages. To date, reports for teleost fish have been limited in number and scope to nonvocal species. Here, we map the neuroanatomical distribution of FoxP2 mRNA expression in a highly vocal teleost, the plainfin midshipman (Porichthys notatus). We report an extensive overlap between FoxP2 expression and vocal, auditory, and steroid-signaling systems with robust expression at multiple sites in the telencephalon, the preoptic area, the diencephalon, and the midbrain. Label was far more restricted in the hindbrain though robust in one region of the reticular formation. A comparison with other teleosts and tetrapods suggests an evolutionarily conserved FoxP2 phenotype important to vocal-acoustic and, more broadly, sensorimotor function among vertebrates. © 2018 S. Karger AG, Basel.
[Applicability of voice acoustic analysis with vocal loading testto diagnostics of occupational voice diseases].

PubMed

Niebudek-Bogusz, Ewa; Sliwińska-Kowalska, Mariola

2006-01-01

An assessment of the vocal system, as a part of the medical certification of occupational diseases, should be objective and reliable. Therefore, interest in the method of acoustic voice analysis enabling objective assessment of voice parameters is still growing. The aim of the present study was to evaluate the applicability of acoustic analysis with vocal loading test to the diagnostics of occupational voice disorders. The results of acoustic voice analysis were compared using IRIS software for phoniatrics, before and after a 30-min vocal loading test in 35 female teachers with diagnosed occupational voice disorders (group I) and in 31 female teachers with functional dysphonia (group II). In group I, vocal effort produced significant abnormalities in voice acoustic parameters, compared to group II. These included significantly increased mean fundamental frequency (Fo) value (by 11 Hz) and worsened jitter, shimmer and NHR parameters. Also, the percentage of subjects showing abnormalities in voice acoustic analysis was higher in this group. Conducting voice acoustic analysis before and after the vocal loading test makes it possible to objectively confirm irreversible voice impairments in persons with work-related pathologies of the larynx, which is essential for medical certification of occupational voice diseases.
Social calls provide novel insights into the evolution of vocal learning

PubMed Central

Sewall, Kendra B.; Young, Anna M.; Wright, Timothy F.

2016-01-01

Learned song is among the best-studied models of animal communication. In oscine songbirds, where learned song is most prevalent, it is used primarily for intrasexual selection and mate attraction. Learning of a different class of vocal signals, known as contact calls, is found in a diverse array of species, where they are used to mediate social interactions among individuals. We argue that call learning provides a taxonomically rich system for studying testable hypotheses for the evolutionary origins of vocal learning. We describe and critically evaluate four nonmutually exclusive hypotheses for the origin and current function of vocal learning of calls, which propose that call learning (1) improves auditory detection and recognition, (2) signals local knowledge, (3) signals group membership, or (4) allows for the encoding of more complex social information. We propose approaches to testing these four hypotheses but emphasize that all of them share the idea that social living, not sexual selection, is a central driver of vocal learning. Finally, we identify future areas for research on call learning that could provide new perspectives on the origins and mechanisms of vocal learning in both animals and humans. PMID:28163325
Vocal communication in a complex multi-level society: constrained acoustic structure and flexible call usage in Guinea baboons.

PubMed

Maciej, Peter; Ndao, Ibrahima; Hammerschmidt, Kurt; Fischer, Julia

2013-09-23

To understand the evolution of acoustic communication in animals, it is important to distinguish between the structure and the usage of vocal signals, since both aspects are subject to different constraints. In terrestrial mammals, the structure of calls is largely innate, while individuals have a greater ability to actively initiate or withhold calls. In closely related taxa, one would therefore predict a higher flexibility in call usage compared to call structure. In the present study, we investigated the vocal repertoire of free living Guinea baboons (Papio papio) and examined the structure and usage of the animals' vocal signals. Guinea baboons live in a complex multi-level social organization and exhibit a largely tolerant and affiliative social style, contrary to most other baboon taxa. To classify the vocal repertoire of male and female Guinea baboons, cluster analyses were used and focal observations were conducted to assess the usage of vocal signals in the particular contexts. In general, the vocal repertoire of Guinea baboons largely corresponded to the vocal repertoire other baboon taxa. The usage of calls, however, differed considerably from other baboon taxa and corresponded with the specific characteristics of the Guinea baboons' social behaviour. While Guinea baboons showed a diminished usage of contest and display vocalizations (a common pattern observed in chacma baboons), they frequently used vocal signals during affiliative and greeting interactions. Our study shows that the call structure of primates is largely unaffected by the species' social system (including grouping patterns and social interactions), while the usage of calls can be more flexibly adjusted, reflecting the quality of social interactions of the individuals. Our results support the view that the primary function of social signals is to regulate social interactions, and therefore the degree of competition and cooperation may be more important to explain variation in call usage than grouping patterns or group size.
Vocal communication in a complex multi-level society: constrained acoustic structure and flexible call usage in Guinea baboons

PubMed Central

2013-01-01

Background To understand the evolution of acoustic communication in animals, it is important to distinguish between the structure and the usage of vocal signals, since both aspects are subject to different constraints. In terrestrial mammals, the structure of calls is largely innate, while individuals have a greater ability to actively initiate or withhold calls. In closely related taxa, one would therefore predict a higher flexibility in call usage compared to call structure. In the present study, we investigated the vocal repertoire of free living Guinea baboons (Papio papio) and examined the structure and usage of the animals’ vocal signals. Guinea baboons live in a complex multi-level social organization and exhibit a largely tolerant and affiliative social style, contrary to most other baboon taxa. To classify the vocal repertoire of male and female Guinea baboons, cluster analyses were used and focal observations were conducted to assess the usage of vocal signals in the particular contexts. Results In general, the vocal repertoire of Guinea baboons largely corresponded to the vocal repertoire other baboon taxa. The usage of calls, however, differed considerably from other baboon taxa and corresponded with the specific characteristics of the Guinea baboons’ social behaviour. While Guinea baboons showed a diminished usage of contest and display vocalizations (a common pattern observed in chacma baboons), they frequently used vocal signals during affiliative and greeting interactions. Conclusions Our study shows that the call structure of primates is largely unaffected by the species’ social system (including grouping patterns and social interactions), while the usage of calls can be more flexibly adjusted, reflecting the quality of social interactions of the individuals. Our results support the view that the primary function of social signals is to regulate social interactions, and therefore the degree of competition and cooperation may be more important to explain variation in call usage than grouping patterns or group size. PMID:24059742
Finding the Beat: From Socially Coordinated Vocalizations in Songbirds to Rhythmic Entrainment in Humans.

PubMed

Benichov, Jonathan I; Globerson, Eitan; Tchernichovski, Ofer

2016-01-01

Humans and oscine songbirds share the rare capacity for vocal learning. Songbirds have the ability to acquire songs and calls of various rhythms through imitation. In several species, birds can even coordinate the timing of their vocalizations with other individuals in duets that are synchronized with millisecond-accuracy. It is not known, however, if songbirds can perceive rhythms holistically nor if they are capable of spontaneous entrainment to complex rhythms, in a manner similar to humans. Here we review emerging evidence from studies of rhythm generation and vocal coordination across songbirds and humans. In particular, recently developed experimental methods have revealed neural mechanisms underlying the temporal structure of song and have allowed us to test birds' abilities to predict the timing of rhythmic social signals. Surprisingly, zebra finches can readily learn to anticipate the calls of a "vocal robot" partner and alter the timing of their answers to avoid jamming, even in reference to complex rhythmic patterns. This capacity resembles, to some extent, human predictive motor response to an external beat. In songbirds, this is driven, at least in part, by the forebrain song system, which controls song timing and is essential for vocal learning. Building upon previous evidence for spontaneous entrainment in human and non-human vocal learners, we propose a comparative framework for future studies aimed at identifying shared mechanism of rhythm production and perception across songbirds and humans.
Animal vocal sequences: not the Markov chains we thought they were.

PubMed

Kershenbaum, Arik; Bowles, Ann E; Freeberg, Todd M; Jin, Dezhe Z; Lameira, Adriano R; Bohn, Kirsten

2014-10-07

Many animals produce vocal sequences that appear complex. Most researchers assume that these sequences are well characterized as Markov chains (i.e. that the probability of a particular vocal element can be calculated from the history of only a finite number of preceding elements). However, this assumption has never been explicitly tested. Furthermore, it is unclear how language could evolve in a single step from a Markovian origin, as is frequently assumed, as no intermediate forms have been found between animal communication and human language. Here, we assess whether animal taxa produce vocal sequences that are better described by Markov chains, or by non-Markovian dynamics such as the 'renewal process' (RP), characterized by a strong tendency to repeat elements. We examined vocal sequences of seven taxa: Bengalese finches Lonchura striata domestica, Carolina chickadees Poecile carolinensis, free-tailed bats Tadarida brasiliensis, rock hyraxes Procavia capensis, pilot whales Globicephala macrorhynchus, killer whales Orcinus orca and orangutans Pongo spp. The vocal systems of most of these species are more consistent with a non-Markovian RP than with the Markovian models traditionally assumed. Our data suggest that non-Markovian vocal sequences may be more common than Markov sequences, which must be taken into account when evaluating alternative hypotheses for the evolution of signalling complexity, and perhaps human language origins. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
Auditory responses in the amygdala to social vocalizations

NASA Astrophysics Data System (ADS)

Gadziola, Marie A.

The underlying goal of this dissertation is to understand how the amygdala, a brain region involved in establishing the emotional significance of sensory input, contributes to the processing of complex sounds. The general hypothesis is that communication calls of big brown bats (Eptesicus fuscus) transmit relevant information about social context that is reflected in the activity of amygdalar neurons. The first specific aim analyzed social vocalizations emitted under a variety of behavioral contexts, and related vocalizations to an objective measure of internal physiological state by monitoring the heart rate of vocalizing bats. These experiments revealed a complex acoustic communication system among big brown bats in which acoustic cues and call structure signal the emotional state of a sender. The second specific aim characterized the responsiveness of single neurons in the basolateral amygdala to a range of social syllables. Neurons typically respond to the majority of tested syllables, but effectively discriminate among vocalizations by varying the response duration. This novel coding strategy underscores the importance of persistent firing in the general functioning of the amygdala. The third specific aim examined the influence of acoustic context by characterizing both the behavioral and neurophysiological responses to natural vocal sequences. Vocal sequences differentially modify the internal affective state of a listening bat, with lower aggression vocalizations evoking the greatest change in heart rate. Amygdalar neurons employ two different coding strategies: low background neurons respond selectively to very few stimuli, whereas high background neurons respond broadly to stimuli but demonstrate variation in response magnitude and timing. Neurons appear to discriminate the valence of stimuli, with aggression sequences evoking robust population-level responses across all sound levels. Further, vocal sequences show improved discrimination among stimuli compared to isolated syllables, and this improved discrimination is expressed in part by the timing of action potentials. Taken together, these data support the hypothesis that big brown bat social vocalizations transmit relevant information about the social context that is encoded within the discharge pattern of amygdalar neurons ultimately responsible for coordinating appropriate social behaviors. I further propose that vocalization-evoked amygdalar activity will have significant impact on subsequent sensory processing and plasticity.
Evidence of a Vocalic Proto-System in the Baboon (Papio papio) Suggests Pre-Hominin Speech Precursors

PubMed Central

Boë, Louis-Jean; Berthommier, Frédéric; Legou, Thierry; Captier, Guillaume; Kemp, Caralyn; Sawallis, Thomas R.; Becker, Yannick; Rey, Arnaud; Fagot, Joël

2017-01-01

Language is a distinguishing characteristic of our species, and the course of its evolution is one of the hardest problems in science. It has long been generally considered that human speech requires a low larynx, and that the high larynx of nonhuman primates should preclude their producing the vowel systems universally found in human language. Examining the vocalizations through acoustic analyses, tongue anatomy, and modeling of acoustic potential, we found that baboons (Papio papio) produce sounds sharing the F1/F2 formant structure of the human [ɨ æ ɑ ɔ u] vowels, and that similarly with humans those vocalic qualities are organized as a system on two acoustic-anatomic axes. This confirms that hominoids can produce contrasting vowel qualities despite a high larynx. It suggests that spoken languages evolved from ancient articulatory skills already present in our last common ancestor with Cercopithecoidea, about 25 MYA. PMID:28076426
Articulatory speech synthesis and speech production modelling

NASA Astrophysics Data System (ADS)

Huang, Jun

This dissertation addresses the problem of speech synthesis and speech production modelling based on the fundamental principles of human speech production. Unlike the conventional source-filter model, which assumes the independence of the excitation and the acoustic filter, we treat the entire vocal apparatus as one system consisting of a fluid dynamic aspect and a mechanical part. We model the vocal tract by a three-dimensional moving geometry. We also model the sound propagation inside the vocal apparatus as a three-dimensional nonplane-wave propagation inside a viscous fluid described by Navier-Stokes equations. In our work, we first propose a combined minimum energy and minimum jerk criterion to estimate the dynamic vocal tract movements during speech production. Both theoretical error bound analysis and experimental results show that this method can achieve very close match at the target points and avoid the abrupt change in articulatory trajectory at the same time. Second, a mechanical vocal fold model is used to compute the excitation signal of the vocal tract. The advantage of this model is that it is closely coupled with the vocal tract system based on fundamental aerodynamics. As a result, we can obtain an excitation signal with much more detail than the conventional parametric vocal fold excitation model. Furthermore, strong evidence of source-tract interaction is observed. Finally, we propose a computational model of the fricative and stop types of sounds based on the physical principles of speech production. The advantage of this model is that it uses an exogenous process to model the additional nonsteady and nonlinear effects due to the flow mode, which are ignored by the conventional source- filter speech production model. A recursive algorithm is used to estimate the model parameters. Experimental results show that this model is able to synthesize good quality fricative and stop types of sounds. Based on our dissertation work, we carefully argue that the articulatory speech production model has the potential to flexibly synthesize natural-quality speech sounds and to provide a compact computational model for speech production that can be beneficial to a wide range of areas in speech signal processing.
Julius Eichberg: String and Vocal Instruction in Nineteenth-Century Boston.

ERIC Educational Resources Information Center

Howe, Sondra Wieland

1996-01-01

Reviews the career and contributions of Julius Eichberg (1824-93), a pioneer in string and vocal instruction in Boston (Massachusetts). Eichberg founded the Boston Conservatory, supervised music education for the Boston public school system, and taught teacher-training courses. In addition, he composed choral works and operas, and edited several…

Discussion: Changes in Vocal Production and Auditory Perception after Hair Cell Regeneration.

ERIC Educational Resources Information Center

Ryals, Brenda M.; Dooling, Robert J.

2000-01-01

A bird study found that with sufficient time and training after hair cell and hearing loss and hair cell regeneration, the mature avian auditory system can accommodate input from a newly regenerated periphery sufficiently to allow for recognition of previously familiar vocalizations and the learning of new complex acoustic classifications.…
Developing technologies for bioacoustic vocal profiling as a viable component of integrative medical diagnostics and treatment

NASA Astrophysics Data System (ADS)

Edwards, Sharry K.

2005-04-01

Over the past 20+ years the pioneering field of Human Bioacoustics, which includes voice spectral analysis, has begun to model the frequencies and architecture of human vocalizations to identify the innate mathematical templates found within the various system of the human body. Using the idea that the voice is a holographic representation of health and wellness, these non-invasive techniques are being advanced to the extent that a computerized Vocal Profile, using a system of Frequency Equivalents, can be used to accurately quantify, organize, interpret, define, and extrapolate biometric information from the human voice. This information, in turn, provides the opportunity to predict, direct, and maintain intrinsic form and function. This novel approach has provided an accumulation of significant data but until recently has been without an efficient biological framework of reference. The emerging Mathematical Model being assembled through Human Bioacoustic research likely has the potential to allow Vocal Profiling to be used to predict and monitor health issues from the very first cries of a newborn through the frequency foundations of disease and aging.
An immersed-boundary method for flow–structure interaction in biological systems with application to phonation

PubMed Central

Luo, Haoxiang; Mittal, Rajat; Zheng, Xudong; Bielamowicz, Steven A.; Walsh, Raymond J.; Hahn, James K.

2008-01-01

A new numerical approach for modeling a class of flow–structure interaction problems typically encountered in biological systems is presented. In this approach, a previously developed, sharp-interface, immersed-boundary method for incompressible flows is used to model the fluid flow and a new, sharp-interface Cartesian grid, immersed boundary method is devised to solve the equations of linear viscoelasticity that governs the solid. The two solvers are coupled to model flow–structure interaction. This coupled solver has the advantage of simple grid generation and efficient computation on simple, single-block structured grids. The accuracy of the solid-mechanics solver is examined by applying it to a canonical problem. The solution methodology is then applied to the problem of laryngeal aerodynamics and vocal fold vibration during human phonation. This includes a three-dimensional eigen analysis for a multi-layered vocal fold prototype as well as two-dimensional, flow-induced vocal fold vibration in a modeled larynx. Several salient features of the aerodynamics as well as vocal-fold dynamics are presented. PMID:19936017
Eye-movements and Voice as Interface Modalities to Computer Systems

NASA Astrophysics Data System (ADS)

Farid, Mohsen M.; Murtagh, Fionn D.

2003-03-01

We investigate the visual and vocal modalities of interaction with computer systems. We focus our attention on the integration of visual and vocal interface as possible replacement and/or additional modalities to enhance human-computer interaction. We present a new framework for employing eye gaze as a modality of interface. While voice commands, as means of interaction with computers, have been around for a number of years, integration of both the vocal interface and the visual interface, in terms of detecting user's eye movements through an eye-tracking device, is novel and promises to open the horizons for new applications where a hand-mouse interface provides little or no apparent support to the task to be accomplished. We present an array of applications to illustrate the new framework and eye-voice integration.
Vocal Fold Vibration Following Surgical Intervention in Three Vocal Pathologies: A Preliminary Study.

PubMed

Chen, Wenli; Woo, Peak; Murry, Thomas

2017-09-01

High-speed videoendoscopy captures the cycle-to-cycle vibratory motion of each individual vocal fold in normal and severely disordered phonation. Therefore, it provides a direct method to examine the specific vibratory changes following vocal fold surgery. The purpose of this study was to examine the vocal fold vibratory pattern changes in the surgically treated pathologic vocal fold and the contralateral vocal fold in three vocal pathologies: vocal polyp (n = 3), paresis or paralysis (n = 3), and scar (n = 3). Digital kymography was used to extract high-speed kymographic vocal fold images at the mid-membranous region of the vocal fold. Spectral analysis was subsequently applied to the digital kymography to quantify the cycle-to-cycle movements of each vocal fold, expressed as a spectrum. Surgical modification resulted in significantly improved spectral power of the treated pathologic vocal fold. Furthermore, the contralateral vocal fold also presented with improved spectral power irrespective of vocal pathology. In comparison with normal vocal fold spectrum, postsurgical vocal fold vibrations continued to demonstrate decreased vibratory amplitude in both vocal folds. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Paradigms and progress in vocal fold restoration.

PubMed

Ford, Charles N

2008-09-01

Science advances occur through orderly steps, puzzle-solving leaps, or divergences from the accepted disciplinary matrix that occasionally result in a revolutionary paradigm shift. Key advances must overcome bias, criticism, and rejection. Examples in biological science include use of embryonic stem cells, recognition of Helicobacter pylori in the etiology of ulcer disease, and the evolution of species. Our work in vocal fold restoration reflects these patterns. We progressed through phases of tissue replacement with fillers and biological implants, to current efforts at vocal fold regeneration through tissue engineering, and face challenges of a new "systems biology" paradigm embracing genomics and proteomics.
Life Experience of Patients With Unilateral Vocal Fold Paralysis.

PubMed

Francis, David O; Sherman, Ariel E; Hovis, Kristen L; Bonnet, Kemberlee; Schlundt, David; Garrett, C Gaelyn; Davies, Louise

2018-05-01

Clinicians and patients benefit when they have a clear understanding of how medical conditions influence patients' life experiences. Patients' perspectives on life with unilateral vocal fold paralysis have not been well described. To promote patient-centered care by characterizing the patient experiences of living with unilateral vocal fold paralysis. This study used mixed methods: surveys using the voice and dysphagia handicap indexes (VHI and DHI) and semistructured interviews with adults with unilateral vocal cord paralysis recruited from a tertiary voice center. Recorded interviews were transcribed, coded using a hierarchical coding system, and analyzed using an iterative inductive-deductive approach. Symptom domains of the patient experience. In 36 patients (26 [72%] were female, and the median age and interquartile range [IQR] were 63 years [48-68 years]; median interview duration, 42 minutes), median VHI and DHI scores were 96 (IQR, 77-108) and 55.5 (IQR, 35-89) at the time of interviews, respectively. Frustration, isolation, fear, and altered self-identity were primary themes permeating patients' experiences. Frustrations related to limitations in communication, employment, and the medical system. Sources of fear included a loss of control, fear of further dysfunction or permanent disability, concern for health consequences (eg, aspiration pneumonia), and/or an inability to call for help in emergency situations. These experiences were modified by the following factors: resilience, self-efficacy, perceived sense of control, and social support systems. Effects of unilateral vocal fold paralysis extend beyond impaired voice and other somatic symptoms. Awareness of the extent to which these patients experience frustration, isolation, fear, and altered self-identity is important. A patient-centered approach to optimizing unilateral vocal fold paralysis treatment is enhanced by an understanding of both the physical dimension of this condition and how patients cope with the considerable emotional and social consequences. Recognizing the psychosocial dimensions of disease allows clinicians to communicate more effectively, be more empathetic, and to better personalize treatment plans, which may lead to improved patient care and patient satisfaction.
A Mozart is not a Pavarotti: singers outperform instrumentalists on foreign accent imitation

PubMed Central

Christiner, Markus; Reiterer, Susanne Maria

2015-01-01

Recent findings have shown that people with higher musical aptitude were also better in oral language imitation tasks. However, whether singing capacity and instrument playing contribute differently to the imitation of speech has been ignored so far. Research has just recently started to understand that instrumentalists develop quite distinct skills when compared to vocalists. In the same vein the role of the vocal motor system in language acquisition processes has poorly been investigated as most investigations (neurobiological and behavioral) favor to examine speech perception. We set out to test whether the vocal motor system can influence an ability to learn, produce and perceive new languages by contrasting instrumentalists and vocalists. Therefore, we investigated 96 participants, 27 instrumentalists, 33 vocalists and 36 non-musicians/non-singers. They were tested for their abilities to imitate foreign speech: unknown language (Hindi), second language (English) and their musical aptitude. Results revealed that both instrumentalists and vocalists have a higher ability to imitate unintelligible speech and foreign accents than non-musicians/non-singers. Within the musician group, vocalists outperformed instrumentalists significantly. Conclusion: First, adaptive plasticity for speech imitation is not reliant on audition alone but also on vocal-motor induced processes. Second, vocal flexibility of singers goes together with higher speech imitation aptitude. Third, vocal motor training, as of singers, may speed up foreign language acquisition processes. PMID:26379537
A Mozart is not a Pavarotti: singers outperform instrumentalists on foreign accent imitation.

PubMed

Christiner, Markus; Reiterer, Susanne Maria

2015-01-01

Recent findings have shown that people with higher musical aptitude were also better in oral language imitation tasks. However, whether singing capacity and instrument playing contribute differently to the imitation of speech has been ignored so far. Research has just recently started to understand that instrumentalists develop quite distinct skills when compared to vocalists. In the same vein the role of the vocal motor system in language acquisition processes has poorly been investigated as most investigations (neurobiological and behavioral) favor to examine speech perception. We set out to test whether the vocal motor system can influence an ability to learn, produce and perceive new languages by contrasting instrumentalists and vocalists. Therefore, we investigated 96 participants, 27 instrumentalists, 33 vocalists and 36 non-musicians/non-singers. They were tested for their abilities to imitate foreign speech: unknown language (Hindi), second language (English) and their musical aptitude. Results revealed that both instrumentalists and vocalists have a higher ability to imitate unintelligible speech and foreign accents than non-musicians/non-singers. Within the musician group, vocalists outperformed instrumentalists significantly. First, adaptive plasticity for speech imitation is not reliant on audition alone but also on vocal-motor induced processes. Second, vocal flexibility of singers goes together with higher speech imitation aptitude. Third, vocal motor training, as of singers, may speed up foreign language acquisition processes.
Vibrational dynamics of vocal folds using nonlinear normal modes.

PubMed

Pinheiro, Alan P; Kerschen, Gaëtan

2013-08-01

Many previous works involving physical models, excised and in vivo larynges have pointed out nonlinear vibration in vocal folds during voice production. Moreover, theoretical studies involving mechanical modeling of these folds have tried to gain a profound understanding of the observed nonlinear phenomena. In this context, the present work uses the nonlinear normal mode theory to investigate the nonlinear modal behavior of 16 subjects using a two-mass mechanical modeling of the vocal folds. The free response of the conservative system at different energy levels is considered to assess the impact of the structural nonlinearity of the vocal fold tissues. The results show very interesting and complex nonlinear phenomena including frequency-energy dependence, subharmonic regimes and, in some cases, modal interactions, entrainment and bifurcations. Copyright © 2012 IPEM. Published by Elsevier Ltd. All rights reserved.
Automatic classification of animal vocalizations

NASA Astrophysics Data System (ADS)

Clemins, Patrick J.

2005-11-01

Bioacoustics, the study of animal vocalizations, has begun to use increasingly sophisticated analysis techniques in recent years. Some common tasks in bioacoustics are repertoire determination, call detection, individual identification, stress detection, and behavior correlation. Each research study, however, uses a wide variety of different measured variables, called features, and classification systems to accomplish these tasks. The well-established field of human speech processing has developed a number of different techniques to perform many of the aforementioned bioacoustics tasks. Melfrequency cepstral coefficients (MFCCs) and perceptual linear prediction (PLP) coefficients are two popular feature sets. The hidden Markov model (HMM), a statistical model similar to a finite autonoma machine, is the most commonly used supervised classification model and is capable of modeling both temporal and spectral variations. This research designs a framework that applies models from human speech processing for bioacoustic analysis tasks. The development of the generalized perceptual linear prediction (gPLP) feature extraction model is one of the more important novel contributions of the framework. Perceptual information from the species under study can be incorporated into the gPLP feature extraction model to represent the vocalizations as the animals might perceive them. By including this perceptual information and modifying parameters of the HMM classification system, this framework can be applied to a wide range of species. The effectiveness of the framework is shown by analyzing African elephant and beluga whale vocalizations. The features extracted from the African elephant data are used as input to a supervised classification system and compared to results from traditional statistical tests. The gPLP features extracted from the beluga whale data are used in an unsupervised classification system and the results are compared to labels assigned by experts. The development of a framework from which to build animal vocalization classifiers will provide bioacoustics researchers with a consistent platform to analyze and classify vocalizations. A common framework will also allow studies to compare results across species and institutions. In addition, the use of automated classification techniques can speed analysis and uncover behavioral correlations not readily apparent using traditional techniques.
Performance of a reduced-order FSI model for flow-induced vocal fold vibration

NASA Astrophysics Data System (ADS)

Luo, Haoxiang; Chang, Siyuan; Chen, Ye; Rousseau, Bernard; PhonoSim Team

2017-11-01

Vocal fold vibration during speech production involves a three-dimensional unsteady glottal jet flow and three-dimensional nonlinear tissue mechanics. A full 3D fluid-structure interaction (FSI) model is computationally expensive even though it provides most accurate information about the system. On the other hand, an efficient reduced-order FSI model is useful for fast simulation and analysis of the vocal fold dynamics, which can be applied in procedures such as optimization and parameter estimation. In this work, we study performance of a reduced-order model as compared with the corresponding full 3D model in terms of its accuracy in predicting the vibration frequency and deformation mode. In the reduced-order model, we use a 1D flow model coupled with a 3D tissue model that is the same as in the full 3D model. Two different hyperelastic tissue behaviors are assumed. In addition, the vocal fold thickness and subglottal pressure are varied for systematic comparison. The result shows that the reduced-order model provides consistent predictions as the full 3D model across different tissue material assumptions and subglottal pressures. However, the vocal fold thickness has most effect on the model accuracy, especially when the vocal fold is thin.
Using Ambulatory Voice Monitoring to Investigate Common Voice Disorders: Research Update

PubMed Central

Mehta, Daryush D.; Van Stan, Jarrad H.; Zañartu, Matías; Ghassemi, Marzyeh; Guttag, John V.; Espinoza, Víctor M.; Cortés, Juan P.; Cheyne, Harold A.; Hillman, Robert E.

2015-01-01

Many common voice disorders are chronic or recurring conditions that are likely to result from inefficient and/or abusive patterns of vocal behavior, referred to as vocal hyperfunction. The clinical management of hyperfunctional voice disorders would be greatly enhanced by the ability to monitor and quantify detrimental vocal behaviors during an individual’s activities of daily life. This paper provides an update on ongoing work that uses a miniature accelerometer on the neck surface below the larynx to collect a large set of ambulatory data on patients with hyperfunctional voice disorders (before and after treatment) and matched-control subjects. Three types of analysis approaches are being employed in an effort to identify the best set of measures for differentiating among hyperfunctional and normal patterns of vocal behavior: (1) ambulatory measures of voice use that include vocal dose and voice quality correlates, (2) aerodynamic measures based on glottal airflow estimates extracted from the accelerometer signal using subject-specific vocal system models, and (3) classification based on machine learning and pattern recognition approaches that have been used successfully in analyzing long-term recordings of other physiological signals. Preliminary results demonstrate the potential for ambulatory voice monitoring to improve the diagnosis and treatment of common hyperfunctional voice disorders. PMID:26528472
Prosthetic avian vocal organ controlled by a freely behaving bird based on a low dimensional model of the biomechanical periphery.

PubMed

Arneodo, Ezequiel M; Perl, Yonatan Sanz; Goller, Franz; Mindlin, Gabriel B

2012-01-01

Because of the parallels found with human language production and acquisition, birdsong is an ideal animal model to study general mechanisms underlying complex, learned motor behavior. The rich and diverse vocalizations of songbirds emerge as a result of the interaction between a pattern generator in the brain and a highly nontrivial nonlinear periphery. Much of the complexity of this vocal behavior has been understood by studying the physics of the avian vocal organ, particularly the syrinx. A mathematical model describing the complex periphery as a nonlinear dynamical system leads to the conclusion that nontrivial behavior emerges even when the organ is commanded by simple motor instructions: smooth paths in a low dimensional parameter space. An analysis of the model provides insight into which parameters are responsible for generating a rich variety of diverse vocalizations, and what the physiological meaning of these parameters is. By recording the physiological motor instructions elicited by a spontaneously singing muted bird and computing the model on a Digital Signal Processor in real-time, we produce realistic synthetic vocalizations that replace the bird's own auditory feedback. In this way, we build a bio-prosthetic avian vocal organ driven by a freely behaving bird via its physiologically coded motor commands. Since it is based on a low-dimensional nonlinear mathematical model of the peripheral effector, the emulation of the motor behavior requires light computation, in such a way that our bio-prosthetic device can be implemented on a portable platform.
The Vocal Tract Organ: A New Musical Instrument Using 3-D Printed Vocal Tracts.

PubMed

Howard, David M

2017-10-27

The advent and now increasingly widespread availability of 3-D printers is transforming our understanding of the natural world by enabling observations to be made in a tangible manner. This paper describes the use of 3-D printed models of the vocal tract for different vowels that are used to create an acoustic output when stimulated with an appropriate sound source in a new musical instrument: the Vocal Tract Organ. The shape of each printed vocal tract is recovered from magnetic resonance imaging. It sits atop a loudspeaker to which is provided an acoustic L-F model larynx input signal that is controlled by the notes played on a musical instrument digital interface device such as a keyboard. The larynx input is subject to vibrato with extent and frequency adjustable as desired within the ranges usually found for human singing. Polyphonic inputs for choral singing textures can be applied via a single loudspeaker and vocal tract, invoking the approximation of linearity in the voice production system, thereby making multiple vowel stops a possibility while keeping the complexity of the instrument in reasonable check. The Vocal Tract Organ offers a much more human and natural sounding result than the traditional Vox Humana stops found in larger pipe organs, offering the possibility of enhancing pipe organs of the future as well as becoming the basis for a "multi-vowel" chamber organ in its own right. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
IMRT for Image-Guided Single Vocal Cord Irradiation

DOE Office of Scientific and Technical Information (OSTI.GOV)

Osman, Sarah O.S., E-mail: s.osman@erasmusmc.nl; Astreinidou, Eleftheria; Boer, Hans C.J. de

2012-02-01

Purpose: We have been developing an image-guided single vocal cord irradiation technique to treat patients with stage T1a glottic carcinoma. In the present study, we compared the dose coverage to the affected vocal cord and the dose delivered to the organs at risk using conventional, intensity-modulated radiotherapy (IMRT) coplanar, and IMRT non-coplanar techniques. Methods and Materials: For 10 patients, conventional treatment plans using two laterally opposed wedged 6-MV photon beams were calculated in XiO (Elekta-CMS treatment planning system). An in-house IMRT/beam angle optimization algorithm was used to obtain the coplanar and non-coplanar optimized beam angles. Using these angles, the IMRTmore » plans were generated in Monaco (IMRT treatment planning system, Elekta-CMS) with the implemented Monte Carlo dose calculation algorithm. The organs at risk included the contralateral vocal cord, arytenoids, swallowing muscles, carotid arteries, and spinal cord. The prescription dose was 66 Gy in 33 fractions. Results: For the conventional plans and coplanar and non-coplanar IMRT plans, the population-averaged mean dose {+-} standard deviation to the planning target volume was 67 {+-} 1 Gy. The contralateral vocal cord dose was reduced from 66 {+-} 1 Gy in the conventional plans to 39 {+-} 8 Gy and 36 {+-} 6 Gy in the coplanar and non-coplanar IMRT plans, respectively. IMRT consistently reduced the doses to the other organs at risk. Conclusions: Single vocal cord irradiation with IMRT resulted in good target coverage and provided significant sparing of the critical structures. This has the potential to improve the quality-of-life outcomes after RT and maintain the same local control rates.« less
A novel model for examining recovery of phonation after vocal nerve damage.

PubMed

Bhama, Prabhat K; Hillel, Allen D; Merati, Albert L; Perkel, David J

2011-05-01

Recurrent laryngeal nerve injury remains a dominant clinical issue in laryngology. To date, no animal model of laryngeal reinnervation has offered an outcome measure that can reflect the degree of recovery based on vocal function. We present an avian model system for studying recovery of learned vocalizations after nerve injury. Prospective animal study. Digital recordings of bird song were made from 11 adult male zebra finches; nine birds underwent bilateral crushing of the nerve supplying the vocal organ, and two birds underwent sham surgery. Songs from all the birds were then recorded regularly and analyzed based on temporal and spectral characteristics using computer software. Indices were calculated to indicate the degree of similarity between preoperative and postoperative song. Nerve crush caused audible differences in song quality and significant drops (P<0.05) in measured spectral and, to a lesser degree, temporal indices. Spectral indices recovered significantly (mean=43.0%; standard deviation [SD]=40.7; P<0.02), and there was an insignificant trend toward recovery of temporal index (mean=28.0%; SD=41.4; P=0.0771). In five of the nine (56%) birds, there was a greater than 50% recovery of spectral indices within a 4-week period. Two birds exhibited substantially less recovery of spectral indices and two birds had a persistent decline in spectral indices. Recovery of temporal index was highly variable as well, ranging from persistent further declines of 45.1% to recovery of 87%. Neither sham bird exhibited significant (P>0.05) differences in song after nerve crush. The songbird model system allows functional analysis of learned vocalization after surgical damage to vocal nerves. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Contextual cuing contributes to the independent modification of multiple internal models for vocal control

PubMed Central

Keough, Dwayne

2011-01-01

Research on the control of visually guided limb movements indicates that the brain learns and continuously updates an internal model that maps the relationship between motor commands and sensory feedback. A growing body of work suggests that an internal model that relates motor commands to sensory feedback also supports vocal control. There is evidence from arm-reaching studies that shows that when provided with a contextual cue, the motor system can acquire multiple internal models, which allows an animal to adapt to different perturbations in diverse contexts. In this study we show that trained singers can rapidly acquire multiple internal models regarding voice fundamental frequency (F0). These models accommodate different perturbations to ongoing auditory feedback. Participants heard three musical notes and reproduced each one in succession. The musical targets could serve as a contextual cue to indicate which direction (up or down) feedback would be altered on each trial; however, participants were not explicitly instructed to use this strategy. When participants were gradually exposed to altered feedback adaptation was observed immediately following vocal onset. Aftereffects were target specific and did not influence vocal productions on subsequent trials. When target notes were no longer a contextual cue, adaptation occurred during altered feedback trials and evidence for trial-by-trial adaptation was found. These findings indicate that the brain is exceptionally sensitive to the deviations between auditory feedback and the predicted consequence of a motor command during vocalization. Moreover, these results indicate that, with contextual cues, the vocal control system may maintain multiple internal models that are capable of independent modification during different tasks or environments. PMID:21346208
Chimpanzee vocal signaling points to a multimodal origin of human language.

PubMed

Taglialatela, Jared P; Russell, Jamie L; Schaeffer, Jennifer A; Hopkins, William D

2011-04-20

The evolutionary origin of human language and its neurobiological foundations has long been the object of intense scientific debate. Although a number of theories have been proposed, one particularly contentious model suggests that human language evolved from a manual gestural communication system in a common ape-human ancestor. Consistent with a gestural origins theory are data indicating that chimpanzees intentionally and referentially communicate via manual gestures, and the production of manual gestures, in conjunction with vocalizations, activates the chimpanzee Broca's area homologue--a region in the human brain that is critical for the planning and execution of language. However, it is not known if this activity observed in the chimpanzee Broca's area is the result of the chimpanzees producing manual communicative gestures, communicative sounds, or both. This information is critical for evaluating the theory that human language evolved from a strictly manual gestural system. To this end, we used positron emission tomography (PET) to examine the neural metabolic activity in the chimpanzee brain. We collected PET data in 4 subjects, all of whom produced manual communicative gestures. However, 2 of these subjects also produced so-called attention-getting vocalizations directed towards a human experimenter. Interestingly, only the two subjects that produced these attention-getting sounds showed greater mean metabolic activity in the Broca's area homologue as compared to a baseline scan. The two subjects that did not produce attention-getting sounds did not. These data contradict an exclusive "gestural origins" theory for they suggest that it is vocal signaling that selectively activates the Broca's area homologue in chimpanzees. In other words, the activity observed in the Broca's area homologue reflects the production of vocal signals by the chimpanzees, suggesting that this critical human language region was involved in vocal signaling in the common ancestor of both modern humans and chimpanzees.
Biosimulation of Inflammation and Healing in Surgically Injured Vocal Folds

PubMed Central

Li, Nicole Y. K.; Vodovotz, Yoram; Hebda, Patricia A.; Abbott, Katherine Verdolini

2010-01-01

Objectives The pathogenesis of vocal fold scarring is complex and remains to be deciphered. The current study is part of research endeavors aimed at applying systems biology approaches to address the complex biological processes involved in the pathogenesis of vocal fold scarring and other lesions affecting the larynx. Methods We developed a computational agent-based model (ABM) to quantitatively characterize multiple cellular and molecular interactions involved in inflammation and healing in vocal fold mucosa after surgical trauma. The ABM was calibrated with empirical data on inflammatory mediators (eg, tumor necrosis factor) and extracellular matrix components (eg, hyaluronan) from published studies on surgical vocal fold injury in the rat population. Results The simulation results reproduced and predicted trajectories seen in the empirical data from the animals. Moreover, the ABM studies suggested that hyaluronan fragments might be the clinical surrogate of tissue damage, a key variable that in these simulations both is enhanced by and further induces inflammation. Conclusions A relatively simple ABM such as the one reported in this study can provide new understanding of laryngeal wound healing and generate working hypotheses for further wet-lab studies. PMID:20583741

Combining Multiobjective Optimization and Cluster Analysis to Study Vocal Fold Functional Morphology

PubMed Central

Palaparthi, Anil; Riede, Tobias

2017-01-01

Morphological design and the relationship between form and function have great influence on the functionality of a biological organ. However, the simultaneous investigation of morphological diversity and function is difficult in complex natural systems. We have developed a multiobjective optimization (MOO) approach in association with cluster analysis to study the form-function relation in vocal folds. An evolutionary algorithm (NSGA-II) was used to integrate MOO with an existing finite element model of the laryngeal sound source. Vocal fold morphology parameters served as decision variables and acoustic requirements (fundamental frequency, sound pressure level) as objective functions. A two-layer and a three-layer vocal fold configuration were explored to produce the targeted acoustic requirements. The mutation and crossover parameters of the NSGA-II algorithm were chosen to maximize a hypervolume indicator. The results were expressed using cluster analysis and were validated against a brute force method. Results from the MOO and the brute force approaches were comparable. The MOO approach demonstrated greater resolution in the exploration of the morphological space. In association with cluster analysis, MOO can efficiently explore vocal fold functional morphology. PMID:24771563
Biosimulation of inflammation and healing in surgically injured vocal folds.

PubMed

Li, Nicole Y K; Vodovotz, Yoram; Hebda, Patricia A; Abbott, Katherine Verdolini

2010-06-01

The pathogenesis of vocal fold scarring is complex and remains to be deciphered. The current study is part of research endeavors aimed at applying systems biology approaches to address the complex biological processes involved in the pathogenesis of vocal fold scarring and other lesions affecting the larynx. We developed a computational agent-based model (ABM) to quantitatively characterize multiple cellular and molecular interactions involved in inflammation and healing in vocal fold mucosa after surgical trauma. The ABM was calibrated with empirical data on inflammatory mediators (eg, tumor necrosis factor) and extracellular matrix components (eg, hyaluronan) from published studies on surgical vocal fold injury in the rat population. The simulation results reproduced and predicted trajectories seen in the empirical data from the animals. Moreover, the ABM studies suggested that hyaluronan fragments might be the clinical surrogate of tissue damage, a key variable that in these simulations both is enhanced by and further induces inflammation. A relatively simple ABM such as the one reported in this study can provide new understanding of laryngeal wound healing and generate working hypotheses for further wet-lab studies.
The Vocal Jazz Ensemble: Systemic Interactions in the Creation of Three University Programs

ERIC Educational Resources Information Center

Letson, Stephanie Austin

2010-01-01

This study examined the experiences of three vocal jazz ensemble directors who influenced the field through their successful programs at the university level. These directors, Dr. Gene Aitken, Professor Larry Lapin, and Dr. Stephen Zegree, were chosen because of their national reputations as well as their program's longevity and success. The…
Coding of vocalizations by single neurons in ventrolateral prefrontal cortex.

PubMed

Plakke, Bethany; Diltz, Mark D; Romanski, Lizabeth M

2013-11-01

Neuronal activity in single prefrontal neurons has been correlated with behavioral responses, rules, task variables and stimulus features. In the non-human primate, neurons recorded in ventrolateral prefrontal cortex (VLPFC) have been found to respond to species-specific vocalizations. Previous studies have found multisensory neurons which respond to simultaneously presented faces and vocalizations in this region. Behavioral data suggests that face and vocal information are inextricably linked in animals and humans and therefore may also be tightly linked in the coding of communication calls in prefrontal neurons. In this study we therefore examined the role of VLPFC in encoding vocalization call type information. Specifically, we examined previously recorded single unit responses from the VLPFC in awake, behaving rhesus macaques in response to 3 types of species-specific vocalizations made by 3 individual callers. Analysis of responses by vocalization call type and caller identity showed that ∼19% of cells had a main effect of call type with fewer cells encoding caller. Classification performance of VLPFC neurons was ∼42% averaged across the population. When assessed at discrete time bins, classification performance reached 70 percent for coos in the first 300 ms and remained above chance for the duration of the response period, though performance was lower for other call types. In light of the sub-optimal classification performance of the majority of VLPFC neurons when only vocal information is present, and the recent evidence that most VLPFC neurons are multisensory, the potential enhancement of classification with the addition of accompanying face information is discussed and additional studies recommended. Behavioral and neuronal evidence has shown a considerable benefit in recognition and memory performance when faces and voices are presented simultaneously. In the natural environment both facial and vocalization information is present simultaneously and neural systems no doubt evolved to integrate multisensory stimuli during recognition. This article is part of a Special Issue entitled "Communication Sounds and the Brain: New Directions and Perspectives". Copyright © 2013 Elsevier B.V. All rights reserved.
At the interface of the auditory and vocal motor systems: NIf and its role in vocal processing, production and learning.

PubMed

Lewandowski, Brian; Vyssotski, Alexei; Hahnloser, Richard H R; Schmidt, Marc

2013-06-01

Communication between auditory and vocal motor nuclei is essential for vocal learning. In songbirds, the nucleus interfacialis of the nidopallium (NIf) is part of a sensorimotor loop, along with auditory nucleus avalanche (Av) and song system nucleus HVC, that links the auditory and song systems. Most of the auditory information comes through this sensorimotor loop, with the projection from NIf to HVC representing the largest single source of auditory information to the song system. In addition to providing the majority of HVC's auditory input, NIf is also the primary driver of spontaneous activity and premotor-like bursting during sleep in HVC. Like HVC and RA, two nuclei critical for song learning and production, NIf exhibits behavioral-state dependent auditory responses and strong motor bursts that precede song output. NIf also exhibits extended periods of fast gamma oscillations following vocal production. Based on the converging evidence from studies of physiology and functional connectivity it would be reasonable to expect NIf to play an important role in the learning, maintenance, and production of song. Surprisingly, however, lesions of NIf in adult zebra finches have no effect on song production or maintenance. Only the plastic song produced by juvenile zebra finches during the sensorimotor phase of song learning is affected by NIf lesions. In this review, we carefully examine what is known about NIf at the anatomical, physiological, and behavioral levels. We reexamine conclusions drawn from previous studies in the light of our current understanding of the song system, and establish what can be said with certainty about NIf's involvement in song learning, maintenance, and production. Finally, we review recent theories of song learning integrating possible roles for NIf within these frameworks and suggest possible parallels between NIf and sensorimotor areas that form part of the neural circuitry for speech processing in humans. Copyright © 2013 Elsevier Ltd. All rights reserved.
At the interface of the auditory and vocal motor systems: NIf and its role in vocal processing, production and learning

PubMed Central

Lewandowski, Brian; Vyssotski, Alexei; Hahnloser, Richard H.R.; Schmidt, Marc

2015-01-01

Communication between auditory and vocal motor nuclei is essential for vocal learning. In songbirds, the nucleus interfacialis of the nidopallium (NIf) is part of a sensorimotor loop, along with auditory nucleus avalanche (Av) and song system nucleus HVC, that links the auditory and song systems. Most of the auditory information comes through this sensorimotor loop, with the projection from NIf to HVC representing the largest single source of auditory information to the song system. In addition to providing the majority of HVC’s auditory input, NIf is also the primary driver of spontaneous activity and premotor-like bursting during sleep in HVC. Like HVC and RA, two nuclei critical for song learning and production, NIf exhibits behavioral-state dependent auditory responses and strong motor bursts that precede song output. NIf also exhibits extended periods of fast gamma oscillations following vocal production. Based on the converging evidence from studies of physiology and functional connectivity it would be reasonable to expect NIf to play an important role in the learning, maintenance, and production of song. Surprisingly, however, lesions of NIf in adult zebra finches have no effect on song production or maintenance. Only the plastic song produced by juvenile zebra finches during the sensorimotor phase of song learning is affected by NIf lesions. In this review, we carefully examine what is known about NIf at the anatomical, physiological, and behavioral levels. We reexamine conclusions drawn from previous studies in the light of our current understanding of the song system, and establish what can be said with certainty about NIf’s involvement in song learning, maintenance, and production. Finally, we review recent theories of song learning integrating possible roles for NIf within these frameworks and suggest possible parallels between NIf and sensorimotor areas that form part of the neural circuitry for speech processing in humans. PMID:23603062
Voice tuning with new instruments for type II thyroplasty in the treatment of adductor spasmodic dysphonia.

PubMed

Sanuki, Tetsuji; Yumoto, Eiji; Toya, Yutaka; Kumai, Yoshihiko

2016-10-01

Adductor spasmodic dysphonia is a rare voice disorder characterized by strained and strangled voice quality with intermittent phonatory breaks and adductory vocal fold spasms. Type II thyroplasty differs from previous treatments in that this surgery does not involve any surgical intervention into the laryngeal muscle, nerve or vocal folds. Type II thyroplasty intervenes in the thyroid cartilage, which is unrelated to the lesion. This procedure, conducted with the aim of achieving lateralization of the vocal folds, requires utmost surgical caution due to the extreme delicacy of the surgical site, critically sensitive adjustment, and difficult procedures to maintain the incised cartilages at a correct position. During surgery, the correct separation of the incised cartilage edges with voice monitoring is the most important factor determining surgical success and patient satisfaction. We designed new surgical instruments: a thyroid cartilage elevator for undermining the thyroid cartilage, and spacer devices to gauge width while performing voice monitoring. These devices were designed to prevent surgical complications, and to aid in selecting the optimal size of titanium bridges while temporally maintaining a separation during voice monitoring. We designed new surgical instruments, including a thyroid cartilage elevator and spacer devices. Precise surgical procedures and performing voice tuning during surgery with the optimal separation width of the thyroid cartilage are key points for surgical success. We introduce the technique of voice tuning using these surgical tools in order to achieve a better outcome with minimal surgical complications. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
The effect of age of cochlear implantation on vocal characteristics in children.

PubMed

Knight, Kerry; Ducasse, Simone; Coetzee, Ashley; van der Linde, Jeannie; Louw, Anel

2016-06-27

Early cochlear implantation aids auditory feedback and supports better communication and self-monitoring of the voice. The objective of this study was to determine whether the age of cochlear implantation has an impact on vocal development in children implanted before age 4. The study consisted of 19 participants in total. All implant recipients (experimental group) were 3-5 years post-implantation, including four prelingual (0-2 years) and five perilingual (2-4 years) implant recipients. The control group consisted of 10 children whose hearing was within normal limits between the ages 3-6 years and 10 months, which was compared to the experimental group. Established paediatric norms were used for additional comparison. A questionnaire was used to gather information from each of the participant's caregivers to determine whether other personal and contextual factors had an impact on voice production. An acoustic analysis was conducted for each participant using the Multi-Dimensional Voice Program of the Computerized Speech Lab. When the experimental group and the control group were compared, similar results were yielded for fundamental frequency and short-term perturbation (jitter and shimmer). More variability was noted in long-term frequency and amplitude measures, with significantly higher differences, and therefore further outside the norms, in the prelingual group when compared to the perilingual and control groups. In this study, age of implantation did not impact vocal characteristics. Further research should include larger sample sizes, with participants that are age and gender matched.
Laryngeal Aerodynamics in Children with Hearing Impairment versus Age and Height Matched Normal Hearing Peers.

PubMed

Das, Barshapriya; Chatterjee, Indranil; Kumar, Suman

2013-01-01

Lack of proper auditory feedback in hearing-impaired subjects results in functional voice disorder. It is directly related to discoordination of intrinsic and extrinsic laryngeal muscles and disturbed contraction and relaxation of antagonistic muscles. A total of twenty children in the age range of 5-10 years were considered for the study. They were divided into two groups: normal hearing children and hearing aid user children. Results showed a significant difference in the vital capacity, maximum sustained phonation, and fast adduction abduction rate having equal variance for normal and hearing aid user children, respectively, but no significant difference was found in the peak flow value with being statistically significant. A reduced vital capacity in hearing aid user children suggests a limited use of the lung volume for speech production. It may be inferred from the study that the hearing aid user children have poor vocal proficiency which is reflected in their voice. The use of voicing component in hearing impaired subjects is seen due to improper auditory feedback. It was found that there was a significant difference in the vital capacity, maximum sustained phonation (MSP), and fast adduction abduction rate and no significant difference in the peak flow.
Amplitude Modulations of Acoustic Communication Signals

NASA Astrophysics Data System (ADS)

Turesson, Hjalmar K.

2011-12-01

In human speech, amplitude modulations at 3 -- 8 Hz are important for discrimination and detection. Two different neurophysiological theories have been proposed to explain this effect. The first theory proposes that, as a consequence of neocortical synaptic dynamics, signals that are amplitude modulated at 3 -- 8 Hz are propagated better than un-modulated signals, or signals modulated above 8 Hz. This suggests that neural activity elicited by vocalizations modulated at 3 -- 8 Hz is optimally transmitted, and the vocalizations better discriminated and detected. The second theory proposes that 3 -- 8 Hz amplitude modulations interact with spontaneous neocortical oscillations. Specifically, vocalizations modulated at 3 -- 8 Hz entrain local populations of neurons, which in turn, modulate the amplitude of high frequency gamma oscillations. This suggests that vocalizations modulated at 3 -- 8 Hz should induce stronger cross-frequency coupling. Similar to human speech, we found that macaque monkey vocalizations also are amplitude modulated between 3 and 8 Hz. Humans and macaque monkeys share similarities in vocal production, implying that the auditory systems subserving perception of acoustic communication signals also share similarities. Based on the similarities between human speech and macaque monkey vocalizations, we addressed how amplitude modulated vocalizations are processed in the auditory cortex of macaque monkeys, and what behavioral relevance modulations may have. Recording single neuron activity, as well as, the activity of local populations of neurons allowed us to test both of the neurophysiological theories presented above. We found that single neuron responses to vocalizations amplitude modulated at 3 -- 8 Hz resulted in better stimulus discrimination than vocalizations lacking 3 -- 8 Hz modulations, and that the effect most likely was mediated by synaptic dynamics. In contrast, we failed to find support for the oscillation-based model proposing a coupling between 3 -- 8 Hz oscillations and gamma band amplitude. In a behavioral experiment, we found that 3 -- 8 amplitude modulations improved auditory detection in noise. In conclusion, our results suggest that, as in human speech, 3 -- 8 Hz amplitude modulations have a behaviorally important effect, and that this effect probably is mediated by synaptic dynamics.
Artificially lengthened and constricted vocal tract in vocal training methods.

PubMed

Bele, Irene Velsvik

2005-01-01

It is common practice in vocal training to make use of vocal exercise techniques that involve partial occlusion of the vocal tract. Various techniques are used; some of them form an occlusion within the front part of the oral cavity or at the lips. Another vocal exercise technique involves lengthening the vocal tract; for example, the method of phonation into small tubes. This essay presents some studies made on the effects of various vocal training methods that involve an artificially lengthened and constricted vocal tract. The influence of sufficient acoustic impedance on vocal fold vibration and economical voice production is presented.
Dual Neural Network Model for the Evolution of Speech and Language.

PubMed

Hage, Steffen R; Nieder, Andreas

2016-12-01

Explaining the evolution of speech and language poses one of the biggest challenges in biology. We propose a dual network model that posits a volitional articulatory motor network (VAMN) originating in the prefrontal cortex (PFC; including Broca's area) that cognitively controls vocal output of a phylogenetically conserved primary vocal motor network (PVMN) situated in subcortical structures. By comparing the connections between these two systems in human and nonhuman primate brains, we identify crucial biological preadaptations in monkeys for the emergence of a language system in humans. This model of language evolution explains the exclusiveness of non-verbal communication sounds (e.g., cries) in infants with an immature PFC, as well as the observed emergence of non-linguistic vocalizations in adults after frontal lobe pathologies. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
Cetacean vocal learning and communication.

PubMed

Janik, Vincent M

2014-10-01

The cetaceans are one of the few mammalian clades capable of vocal production learning. Evidence for this comes from synchronous changes in song patterns of baleen whales and experimental work on toothed whales in captivity. While baleen whales like many vocal learners use this skill in song displays that are involved in sexual selection, toothed whales use learned signals in individual recognition and the negotiation of social relationships. Experimental studies demonstrated that dolphins can use learned signals referentially. Studies on wild dolphins demonstrated how this skill appears to be useful in their own communication system, making them an interesting subject for comparative communication studies. Copyright © 2014. Published by Elsevier Ltd.
Whistle register: a preliminary investigation by HSDI visualization and acoustics on female cases

NASA Astrophysics Data System (ADS)

Di Corcia, Antonio; Fussi, Franco

2012-02-01

In this study we investigated laryngeal behaviors involved during vocal production of highest female vocal ranges in Flute in M3 Register, in Whistle Register and in a newly formulated by us, Hiss Register. Observations were carried with stroboscopy and High Speed Digital Imaging and with spectrographic and psycho-acoustic analysis by means of a software system having a wide spectral range (0-20.000 Hz). Results indicate that at the highest pitch vocal folds vibration is absent or significantly reduced, glottic contact is incomplete. These acoustic form of extreme pitch levels comprised intra-harmonic noise and overtones within 10 to 18 kHz range.
A new measure of child vocal reciprocity in children with autism spectrum disorder.

PubMed

Harbison, Amy L; Woynaroski, Tiffany G; Tapp, Jon; Wade, Joshua W; Warlaumont, Anne S; Yoder, Paul J

2018-06-01

Children's vocal development occurs in the context of reciprocal exchanges with a communication partner who models "speechlike" productions. We propose a new measure of child vocal reciprocity, which we define as the degree to which an adult vocal response increases the probability of an immediately following child vocal response. Vocal reciprocity is likely to be associated with the speechlikeness of vocal communication in young children with autism spectrum disorder (ASD). Two studies were conducted to test the utility of the new measure. The first used simulated vocal samples with randomly sequenced child and adult vocalizations to test the accuracy of the proposed index of child vocal reciprocity. The second was an empirical study of 21 children with ASD who were preverbal or in the early stages of language development. Daylong vocal samples collected in the natural environment were computer analyzed to derive the proposed index of child vocal reciprocity, which was highly stable when derived from two daylong vocal samples and was associated with speechlikeness of vocal communication. This association was significant even when controlling for chance probability of child vocalizations to adult vocal responses, probability of adult vocalizations, or probability of child vocalizations. A valid measure of children's vocal reciprocity might eventually improve our ability to predict which children are on track to develop useful speech and/or are most likely to respond to language intervention. A link to a free, publicly-available software program to derive the new measure of child vocal reciprocity is provided. Autism Res 2018, 11: 903-915. © 2018 International Society for Autism Research, Wiley Periodicals, Inc. Children and adults often engage in back-and-forth vocal exchanges. The extent to which they do so is believed to support children's early speech and language development. Two studies tested a new measure of child vocal reciprocity using computer-generated and real-life vocal samples of young children with autism collected in natural settings. The results provide initial evidence of accuracy, test-retest reliability, and validity of the new measure of child vocal reciprocity. A sound measure of children's vocal reciprocity might improve our ability to predict which children are on track to develop useful speech and/or are most likely to respond to language intervention. A free, publicly-available software program and manuals are provided. © 2018 International Society for Autism Research, Wiley Periodicals, Inc.
Information content and acoustic structure of male African elephant social rumbles

PubMed Central

Stoeger, Angela S.; Baotic, Anton

2016-01-01

Until recently, the prevailing theory about male African elephants (Loxodonta africana) was that, once adult and sexually mature, males are solitary and targeted only at finding estrous females. While this is true during the state of ‘musth’ (a condition characterized by aggressive behavior and elevated androgen levels), ‘non-musth’ males exhibit a social system seemingly based on companionship, dominance and established hierarchies. Research on elephant vocal communication has so far focused on females, and very little is known about the acoustic structure and the information content of male vocalizations. Using the source and filter theory approach, we analyzed social rumbles of 10 male African elephants. Our results reveal that male rumbles encode information about individuality and maturity (age and size), with formant frequencies and absolute fundamental frequency values having the most informative power. This first comprehensive study on male elephant vocalizations gives important indications on their potential functional relevance for male-male and male-female communication. Our results suggest that, similar to the highly social females, future research on male elephant vocal behavior will reveal a complex communication system in which social knowledge, companionship, hierarchy, reproductive competition and the need to communicate over long distances play key roles. PMID:27273586
Social functioning and autonomic nervous system sensitivity across vocal and musical emotion in Williams syndrome and autism spectrum disorder.

PubMed

Järvinen, Anna; Ng, Rowena; Crivelli, Davide; Neumann, Dirk; Arnold, Andrew J; Woo-VonHoogenstyn, Nicholas; Lai, Philip; Trauner, Doris; Bellugi, Ursula

2016-01-01

Both Williams syndrome (WS) and autism spectrum disorders (ASD) are associated with unusual auditory phenotypes with respect to processing vocal and musical stimuli, which may be shaped by the atypical social profiles that characterize the syndromes. Autonomic nervous system (ANS) reactivity to vocal and musical emotional stimuli was examined in 12 children with WS, 17 children with ASD, and 20 typically developing (TD) children, and related to their level of social functioning. The results of this small-scale study showed that after controlling for between-group differences in cognitive ability, all groups showed similar emotion identification performance across conditions. Additionally, in ASD, lower autonomic reactivity to human voice, and in TD, to musical emotion, was related to more normal social functioning. Compared to TD, both clinical groups showed increased arousal to vocalizations. A further result highlighted uniquely increased arousal to music in WS, contrasted with a decrease in arousal in ASD and TD. The ASD and WS groups exhibited arousal patterns suggestive of diminished habituation to the auditory stimuli. The results are discussed in the context of the clinical presentation of WS and ASD. © 2015 Wiley Periodicals, Inc.
Acoustic analysis of trill sounds.

PubMed

Dhananjaya, N; Yegnanarayana, B; Bhaskararao, Peri

2012-04-01

In this paper, the acoustic-phonetic characteristics of steady apical trills--trill sounds produced by the periodic vibration of the apex of the tongue--are studied. Signal processing methods, namely, zero-frequency filtering and zero-time liftering of speech signals, are used to analyze the excitation source and the resonance characteristics of the vocal tract system, respectively. Although it is natural to expect the effect of trilling on the resonances of the vocal tract system, it is interesting to note that trilling influences the glottal source of excitation as well. The excitation characteristics derived using zero-frequency filtering of speech signals are glottal epochs, strength of impulses at the glottal epochs, and instantaneous fundamental frequency of the glottal vibration. Analysis based on zero-time liftering of speech signals is used to study the dynamic resonance characteristics of vocal tract system during the production of trill sounds. Qualitative analysis of trill sounds in different vowel contexts, and the acoustic cues that may help spotting trills in continuous speech are discussed.
The effects of preventive vocal hygiene education on the vocal hygiene habits and perceptual vocal characteristics of training singers.

PubMed

Broaddus-Lawrence, P L; Treole, K; McCabe, R B; Allen, R L; Toppin, L

2000-03-01

The purpose of the present study was to determine the effects of vocal hygiene education on the vocal hygiene behaviors and perceptual vocal characteristics of untrained singers. Eleven adult untrained singers served as subjects. They attended four 1-hour class sessions on vocal hygiene, including anatomy and physiology of the phonatory mechanism, vocally abusive behaviors, voice disorders commonly seen in singers, and measures to prevent voice disorders. Pre- and postinstruction surveys were used to record subjects' vocal abuses and their perceptions of their speaking and singing voice. They also rated their perceived value of vocal hygiene education. Results revealed minimal changes in vocal hygiene behaviors and perceptual voice characteristics. The subjects did report a high degree of benefit and learning, however.
[The comparative assessment of the vocal function in the professional voice users and non-occupational voice users in the late adulthood].

PubMed

Pavlikhin, O G; Romanenko, S G; Krasnikova, D I; Lesogorova, E V; Yakovlev, V S

The objective of the present study was to evaluate the clinical and functional condition of the voice apparatus in the elderly patients and to elaborate recommendations for the prevention of disturbances of the vocal function in the professional voice users. This comprehensive study involved 95 patients including the active professional voice users (n=48) and 45 non-occupational voice users at the age from 61 to 82 years with the employment history varying from 32 to 51 years. The study was designed to obtain the voice characteristics by means of the subjective auditory assessment, microlaryngoscopy, video laryngostroboscopy, determination of maximum phonation time (MPT), and computer-assisted acoustic analysis of the voice with the use of the MDVP Kay Pentaxy system. The level of anxiety of the patients was estimated based on the results of the HADS questionnaire study. It is concluded that the majority of the disturbances of the vocal function in the professional voice users have the functional nature. It is concluded that the method of neuro-muscular electrophonopedic stimulation (NMEPS) of laryngeal muscles is the method of choice for the diagnostics of the vocal function of the voice users in the late adulthood. It is recommended that the professional vocal load for such subjects should not exceed 12-14 hours per week. Rational psychotherapy must constitute an important component of the system of measures intended to support the working capacity of the voice users belonging to this age group.

fMRI Mapping of Brain Activity Associated with the Vocal Production of Consonant and Dissonant Intervals.

PubMed

González-García, Nadia; Rendón, Pablo L

2017-05-23

The neural correlates of consonance and dissonance perception have been widely studied, but not the neural correlates of consonance and dissonance production. The most straightforward manner of musical production is singing, but, from an imaging perspective, it still presents more challenges than listening because it involves motor activity. The accurate singing of musical intervals requires integration between auditory feedback processing and vocal motor control in order to correctly produce each note. This protocol presents a method that permits the monitoring of neural activations associated with the vocal production of consonant and dissonant intervals. Four musical intervals, two consonant and two dissonant, are used as stimuli, both for an auditory discrimination test and a task that involves first listening to and then reproducing given intervals. Participants, all female vocal students at the conservatory level, were studied using functional Magnetic Resonance Imaging (fMRI) during the performance of the singing task, with the listening task serving as a control condition. In this manner, the activity of both the motor and auditory systems was observed, and a measure of vocal accuracy during the singing task was also obtained. Thus, the protocol can also be used to track activations associated with singing different types of intervals or with singing the required notes more accurately. The results indicate that singing dissonant intervals requires greater participation of the neural mechanisms responsible for the integration of external feedback from the auditory and sensorimotor systems than does singing consonant intervals.
Refinements in modeling the passive properties of laryngeal soft tissue.

PubMed

Hunter, Eric J; Titze, Ingo R

2007-07-01

The nonlinear viscoelastic passive properties of three canine intrinsic laryngeal muscles, the lateral cricoarytenoid (LCA), the posterior cricoarytenoid (PCA), and the interarytenoid (IA), were fit to the parameters of a modified Kelvin model. These properties were compared with those of the thyroarytenoid (TA) and cricothyroid (CT) muscles, as well as previously unpublished viscoelastic characteristics of the human vocal ligament. Passive parameters of the modified Kelvin model were summarized for the vocal ligament, mucosa, and all five laryngeal muscles. Results suggest that the LCA, PCA, and IA muscles are functionally different from the TA and CT muscles in their load-bearing capacity. Furthermore, the LCA, PCA, and IA have a much larger stress-strain hysteresis effect than has been previously reported for the TA and CT or the vocal ligament. The variation in this effect suggests that the connective tissue within the TA and CT muscles is somehow similar to the vocal ligament but different from the LCA, PCA, or IA muscles. Further demonstrating the potential significance of grouping tissues in the laryngeal system by functional groups in the laryngeal system was the unique finding that, over their working elongation range, the LCA and PCA were nearly as exponentially stiff as the vocal ligament. This paper was written in conjunction with an online technical report (http://www.ncvs.org/ncvs/library/tech) in which comprehensive muscle data and sensitivity analysis, as well as downloadable data files and computer scripts, are made available.
Acoustic Analysis and Electroglottography in Elite Vocal Performers.

PubMed

Villafuerte-Gonzalez, Rocio; Valadez-Jimenez, Victor M; Sierra-Ramirez, Jose A; Ysunza, Pablo Antonio; Chavarria-Villafuerte, Karen; Hernandez-Lopez, Xochiquetzal

2017-05-01

Acoustic analysis of voice (AAV) and electroglottography (EGG) have been used for assessing vocal quality in patients with voice disorders. The effectiveness of these procedures for detecting mild disturbances in vocal quality in elite vocal performers has been controversial. To compare acoustic parameters obtained by AAV and EGG before and after vocal training to determine the effectiveness of these procedures for detecting vocal improvements in elite vocal performers. Thirty-three elite vocal performers were studied. The study group included 14 males and 19 females, ages 18-40 years, without a history of voice disorders. Acoustic parameters were obtained through AAV and EGG before and after vocal training using the Linklater method. Nonsignificant differences (P > 0.05) were found between values of fundamental frequency (F 0 ), shimmer, and jitter obtained by both procedures before vocal training. Mean F 0 was similar after vocal training. Jitter percentage as measured by AAV showed nonsignificant differences (P > 0.05) before and after vocal training. Shimmer percentage as measured by AAV demonstrated a significant reduction (P < 0.05) after vocal training. As measured by EGG after vocal training, shimmer and jitter were significantly reduced (P < 0.05); open quotient was significantly increased (P < 0.05); and irregularity was significantly reduced (P < 0.05). AAV and EGG were effective for detecting improvements in vocal function after vocal training in male and female elite vocal performers undergoing vocal training. EGG demonstrated better efficacy for detecting improvements and provided additional parameters as compared to AAV. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Prosthetic Avian Vocal Organ Controlled by a Freely Behaving Bird Based on a Low Dimensional Model of the Biomechanical Periphery

PubMed Central

Arneodo, Ezequiel M.; Perl, Yonatan Sanz; Goller, Franz; Mindlin, Gabriel B.

2012-01-01

Because of the parallels found with human language production and acquisition, birdsong is an ideal animal model to study general mechanisms underlying complex, learned motor behavior. The rich and diverse vocalizations of songbirds emerge as a result of the interaction between a pattern generator in the brain and a highly nontrivial nonlinear periphery. Much of the complexity of this vocal behavior has been understood by studying the physics of the avian vocal organ, particularly the syrinx. A mathematical model describing the complex periphery as a nonlinear dynamical system leads to the conclusion that nontrivial behavior emerges even when the organ is commanded by simple motor instructions: smooth paths in a low dimensional parameter space. An analysis of the model provides insight into which parameters are responsible for generating a rich variety of diverse vocalizations, and what the physiological meaning of these parameters is. By recording the physiological motor instructions elicited by a spontaneously singing muted bird and computing the model on a Digital Signal Processor in real-time, we produce realistic synthetic vocalizations that replace the bird's own auditory feedback. In this way, we build a bio-prosthetic avian vocal organ driven by a freely behaving bird via its physiologically coded motor commands. Since it is based on a low-dimensional nonlinear mathematical model of the peripheral effector, the emulation of the motor behavior requires light computation, in such a way that our bio-prosthetic device can be implemented on a portable platform. PMID:22761555
Intraoperative handheld probe for 3D imaging of pediatric benign vocal fold lesions using optical coherence tomography (Conference Presentation)

NASA Astrophysics Data System (ADS)

Benboujja, Fouzi; Garcia, Jordan; Beaudette, Kathy; Strupler, Mathias; Hartnick, Christopher J.; Boudoux, Caroline

2016-02-01

Excessive and repetitive force applied on vocal fold tissue can induce benign vocal fold lesions. Children affected suffer from chronic hoarseness. In this instance, the vibratory ability of the folds, a complex layered microanatomy, becomes impaired. Histological findings have shown that lesions produce a remodeling of sup-epithelial vocal fold layers. However, our understanding of lesion features and development is still limited. Indeed, conventional imaging techniques do not allow a non-invasive assessment of sub-epithelial integrity of the vocal fold. Furthermore, it remains challenging to differentiate these sub-epithelial lesions (such as bilateral nodules, polyps and cysts) from a clinical perspective, as their outer surfaces are relatively similar. As treatment strategy differs for each lesion type, it is critical to efficiently differentiate sub-epithelial alterations involved in benign lesions. In this study, we developed an optical coherence tomography (OCT) based handheld probe suitable for pediatric laryngological imaging. The probe allows for rapid three-dimensional imaging of vocal fold lesions. The system is adapted to allow for high-resolution intra-operative imaging. We imaged 20 patients undergoing direct laryngoscopy during which we looked at different benign pediatric pathologies such as bilateral nodules, cysts and laryngeal papillomatosis and compared them to healthy tissue. We qualitatively and quantitatively characterized laryngeal pathologies and demonstrated the added advantage of using 3D OCT imaging for lesion discrimination and margin assessment. OCT evaluation of the integrity of the vocal cord could yield to a better pediatric management of laryngeal diseases.
From mouth to hand: gesture, speech, and the evolution of right-handedness.

PubMed

Corballis, Michael C

2003-04-01

The strong predominance of right-handedness appears to be a uniquely human characteristic, whereas the left-cerebral dominance for vocalization occurs in many species, including frogs, birds, and mammals. Right-handedness may have arisen because of an association between manual gestures and vocalization in the evolution of language. I argue that language evolved from manual gestures, gradually incorporating vocal elements. The transition may be traced through changes in the function of Broca's area. Its homologue in monkeys has nothing to do with vocal control, but contains the so-called "mirror neurons," the code for both the production of manual reaching movements and the perception of the same movements performed by others. This system is bilateral in monkeys, but predominantly left-hemispheric in humans, and in humans is involved with vocalization as well as manual actions. There is evidence that Broca's area is enlarged on the left side in Homo habilis, suggesting that a link between gesture and vocalization may go back at least two million years, although other evidence suggests that speech may not have become fully autonomous until Homo sapiens appeared some 170,000 years ago, or perhaps even later. The removal of manual gesture as a necessary component of language may explain the rapid advance of technology, allowing late migrations of Homo sapiens from Africa to replace all other hominids in other parts of the world, including the Neanderthals in Europe and Homo erectus in Asia. Nevertheless, the long association of vocalization with manual gesture left us a legacy of right-handedness.
Localization and Divergent Profiles of Estrogen Receptors and Aromatase in the Vocal and Auditory Networks of a Fish with Alternative Mating Tactics

PubMed Central

Fergus, Daniel J.; Bass, Andrew H.

2013-01-01

Estrogens play a salient role in the development and maintenance of both male and female nervous systems and behaviors. The plainfin midshipman (Porichthys notatus), a teleost fish, has two male reproductive morphs that follow alternative mating tactics and diverge in multiple somatic, hormonal and neural traits, including the central control of morph-specific vocal behaviors. After we identified duplicate estrogen receptors (ERβ1 and ERβ2) in midshipman, we developed antibodies to localize protein expression in the central vocal-acoustic networks and saccule, the auditory division of the inner ear. As in other teleost species, ERβ1 and ERβ2 were robustly expressed in the telencephalon and hypothalamus in vocal-acoustic and other brain regions shown previously to exhibit strong expression of ERα and aromatase (estrogen synthetase, CYP19) in midshipman. Like aromatase, ERβ1 label co-localized with glial fibrillary acidic protein (GFAP) in telencephalic radial glial cells. Quantitative PCR revealed similar patterns of transcript abundance across reproductive morphs for ERβ1, ERβ2, ERα and aromatase in the forebrain and saccule. In contrast, transcript abundance for ERs and aromatase varied significantly between morphs in and around the sexually polymorphic vocal motor nucleus (VMN). Together, the results suggest that VMN is the major estrogen target within the estrogen-sensitive hindbrain vocal network that directly determines the duration, frequency and amplitude of morph-specific vocalizations. Comparable regional differences in steroid receptor abundances likely regulate morph-specific behaviors in males and females of other species exhibiting alternative reproductive tactics. PMID:23460422
Differential Expression of Glutamate Receptors in Avian Neural Pathways for Learned Vocalization

PubMed Central

WADA, KAZUHIRO; SAKAGUCHI, HIRONOBU; JARVIS, ERICH D.; HAGIWARA, MASATOSHI

2008-01-01

Learned vocalization, the substrate for human language, is a rare trait. It is found in three distantly related groups of birds—parrots, hummingbirds, and songbirds. These three groups contain cerebral vocal nuclei for learned vocalization not found in their more closely related vocal nonlearning relatives. Here, we cloned 21 receptor subunits/subtypes of all four glutamate receptor families (AMPA, kainate, NMDA, and metabotropic) and examined their expression in vocal nuclei of songbirds. We also examined expression of a subset of these receptors in vocal nuclei of hummingbirds and parrots, as well as in the brains of dove species as examples of close vocal nonlearning relatives. Among the 21 subunits/subtypes, 19 showed higher and/or lower prominent differential expression in songbird vocal nuclei relative to the surrounding brain subdivisions in which the vocal nuclei are located. This included relatively lower levels of all four AMPA subunits in lMAN, strikingly higher levels of the kainite subunit GluR5 in the robust nucleus of the arcopallium (RA), higher and lower levels respectively of the NMDA subunits NR2A and NR2B in most vocal nuclei and lower levels of the metabotropic group I subtypes (mGluR1 and -5) in most vocal nuclei and the group II subtype (mGluR2), showing a unique expression pattern of very low levels in RA and very high levels in HVC. The splice variants of AMPA subunits showed further differential expression in vocal nuclei. Some of the receptor subunits/subtypes also showed differential expression in hummingbird and parrot vocal nuclei. The magnitude of differential expression in vocal nuclei of all three vocal learners was unique compared with the smaller magnitude of differences found for nonvocal areas of vocal learners and vocal nonlearners. Our results suggest that evolution of vocal learning was accompanied by differential expression of a conserved gene family for synaptic transmission and plasticity in vocal nuclei. They also suggest that neural activity and signal transduction in vocal nuclei of vocal learners will be different relative to the surrounding brain areas. PMID:15236466
Automatic and quantitative measurement of laryngeal video stroboscopic images.

PubMed

Kuo, Chung-Feng Jeffrey; Kuo, Joseph; Hsiao, Shang-Wun; Lee, Chi-Lung; Lee, Jih-Chin; Ke, Bo-Han

2017-01-01

The laryngeal video stroboscope is an important instrument for physicians to analyze abnormalities and diseases in the glottal area. Stroboscope has been widely used around the world. However, without quantized indices, physicians can only make subjective judgment on glottal images. We designed a new laser projection marking module and applied it onto the laryngeal video stroboscope to provide scale conversion reference parameters for glottal imaging and to convert the physiological parameters of glottis. Image processing technology was used to segment the important image regions of interest. Information of the glottis was quantified, and the vocal fold image segmentation system was completed to assist clinical diagnosis and increase accuracy. Regarding image processing, histogram equalization was used to enhance glottis image contrast. The center weighted median filters image noise while retaining the texture of the glottal image. Statistical threshold determination was used for automatic segmentation of a glottal image. As the glottis image contains saliva and light spots, which are classified as the noise of the image, noise was eliminated by erosion, expansion, disconnection, and closure techniques to highlight the vocal area. We also used image processing to automatically identify an image of vocal fold region in order to quantify information from the glottal image, such as glottal area, vocal fold perimeter, vocal fold length, glottal width, and vocal fold angle. The quantized glottis image database was created to assist physicians in diagnosing glottis diseases more objectively.
Effect of high-dose vocal fold injection of cidofovir and bevacizumab in a porcine model.

PubMed

Ahmed, Mostafa M; Connor, Matthew P; Palazzolo, Mitzi; Thompson, Michelle E; Lospinoso, Josh; O'Connor, Peter; Howard, N Scott; Maturo, Stephen C

2017-03-01

Perform a follow-up study to investigate the histologic impact of high-dose intralaryngeal cidofovir injections in porcine vocal cords, either alone or in combination with bevacizumab, and compared to saline controls. This was an in vivo study involving 24 pigs with blinded pathologist review of specimens. Six groups were created, with four subjects in each group. Each subject received 10 or 20 mg of either cidofovir or bevacizumab alone, or in combination, injected into the right vocal cord. The left vocal fold was used as a saline control. Three separate injections were made at 2-week intervals. Larynges were harvested at 8 and 12 weeks, stained with hematoxylin and eosin and trichrome stain, and reviewed for histologic changes by two blinded pathologists. Minimal inflammation, edema, and atypia were noted with all treatments. Increased glandular inflammation was noted with 10 mg bevacizumab (P < 0.05), which decreased when combined with 10 mg cidofovir (P < 0.05). No lamina propria or muscle fibrosis was observed. Drug duration had no statistically significant histologic impact. High-dose cidofovir and bevacizumab do not induce detrimental vocal fold changes. Combination cidofovir and bevacizumab do not cause vocal fold scarring. Further work is needed to assess systemic concentration with this high-dose combination in humans. N/A. Laryngoscope, 127:671-675, 2017. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Neural Correlates of the Lombard Effect in Primate Auditory Cortex

PubMed Central

Eliades, Steven J.

2012-01-01

Speaking is a sensory-motor process that involves constant self-monitoring to ensure accurate vocal production. Self-monitoring of vocal feedback allows rapid adjustment to correct perceived differences between intended and produced vocalizations. One important behavior in vocal feedback control is a compensatory increase in vocal intensity in response to noise masking during vocal production, commonly referred to as the Lombard effect. This behavior requires mechanisms for continuously monitoring auditory feedback during speaking. However, the underlying neural mechanisms are poorly understood. Here we show that when marmoset monkeys vocalize in the presence of masking noise that disrupts vocal feedback, the compensatory increase in vocal intensity is accompanied by a shift in auditory cortex activity toward neural response patterns seen during vocalizations under normal feedback condition. Furthermore, we show that neural activity in auditory cortex during a vocalization phrase predicts vocal intensity compensation in subsequent phrases. These observations demonstrate that the auditory cortex participates in self-monitoring during the Lombard effect, and may play a role in the compensation of noise masking during feedback-mediated vocal control. PMID:22855821
The perceptual features of vocal fatigue as self-reported by a group of actors and singers.

PubMed

Kitch, J A; Oates, J

1994-09-01

Performers (10 actors/10 singers) rated via a self-report questionnaire the severity of their voice-related changes when vocally fatigued. Similar frequency patterns and perceptual features of vocal fatigue were found across subjects. Actors rated "power" aspects (e.g., voice projection) and singers rated vocal dynamic aspects (e.g., pitch range) of their voices as most affected when vocally fatigued. Vocal fatigue was evidenced by changes in kinesthetic/proprioceptive sensations and vocal dynamics. The causes and context of vocal fatigue were vocal misuse, being "run down," high performance demands, and using high pitch/volume levels. Further research is needed to delineate the perceptual features of "normal" levels of vocal fatigue and its possible causes.
A simple-shear rheometer for linear viscoelastic characterization of vocal fold tissues at phonatory frequencies.

PubMed

Chan, Roger W; Rodriguez, Maritza L

2008-08-01

Previous studies reporting the linear viscoelastic shear properties of the human vocal fold cover or mucosa have been based on torsional rheometry, with measurements limited to low audio frequencies, up to around 80 Hz. This paper describes the design and validation of a custom-built, controlled-strain, linear, simple-shear rheometer system capable of direct empirical measurements of viscoelastic shear properties at phonatory frequencies. A tissue specimen was subjected to simple shear between two parallel, rigid acrylic plates, with a linear motor creating a translational sinusoidal displacement of the specimen via the upper plate, and the lower plate transmitting the harmonic shear force resulting from the viscoelastic response of the specimen. The displacement of the specimen was measured by a linear variable differential transformer whereas the shear force was detected by a piezoelectric transducer. The frequency response characteristics of these system components were assessed by vibration experiments with accelerometers. Measurements of the viscoelastic shear moduli (G' and G") of a standard ANSI S2.21 polyurethane material and those of human vocal fold cover specimens were made, along with estimation of the system signal and noise levels. Preliminary results showed that the rheometer can provide valid and reliable rheometric data of vocal fold lamina propria specimens at frequencies of up to around 250 Hz, well into the phonatory range.
The Emotional Communication in Hearing Questionnaire (EMO-CHeQ): Development and Evaluation.

PubMed

Singh, Gurjit; Liskovoi, Lisa; Launer, Stefan; Russo, Frank

2018-06-11

The objectives of this research were to develop and evaluate a self-report questionnaire (the Emotional Communication in Hearing Questionnaire or EMO-CHeQ) designed to assess experiences of hearing and handicap when listening to signals that contain vocal emotion information. Study 1 involved internet-based administration of a 42-item version of the EMO-CHeQ to 586 adult participants (243 with self-reported normal hearing [NH], 193 with self-reported hearing impairment but no reported use of hearing aids [HI], and 150 with self-reported hearing impairment and use of hearing aids [HA]). To better understand the factor structure of the EMO-CHeQ and eliminate redundant items, an exploratory factor analysis was conducted. Study 2 involved laboratory-based administration of a 16-item version of the EMO-CHeQ to 32 adult participants (12 normal hearing/near normal hearing (NH/nNH), 10 HI, and 10 HA). In addition, participants completed an emotion-identification task under audio and audiovisual conditions. In study 1, the exploratory factor analysis yielded an interpretable solution with four factors emerging that explained a total of 66.3% of the variance in performance the EMO-CHeQ. Item deletion resulted in construction of the 16-item EMO-CHeQ. In study 1, both the HI and HA group reported greater vocal emotion communication handicap on the EMO-CHeQ than on the NH group, but differences in handicap were not observed between the HI and HA group. In study 2, the same pattern of reported handicap was observed in individuals with audiometrically verified hearing as was found in study 1. On the emotion-identification task, no group differences in performance were observed in the audiovisual condition, but group differences were observed in the audio alone condition. Although the HI and HA group exhibited similar emotion-identification performance, both groups performed worse than the NH/nNH group, thus suggesting the presence of behavioral deficits that parallel self-reported vocal emotion communication handicap. The EMO-CHeQ was significantly and strongly (r = -0.64) correlated with performance on the emotion-identification task for listeners with hearing impairment. The results from both studies suggest that the EMO-CHeQ appears to be a reliable and ecologically valid measure to rapidly assess experiences of hearing and handicap when listening to signals that contain vocal emotion information.This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License 4.0 (CCBY-NC-ND), where it is permissible to download and share the work provided it is properly cited. The work cannot be changed in any way or used commercially without permission from the journal.
An Automated Procedure for Evaluating Song Imitation

PubMed Central

Mandelblat-Cerf, Yael; Fee, Michale S.

2014-01-01

Songbirds have emerged as an excellent model system to understand the neural basis of vocal and motor learning. Like humans, songbirds learn to imitate the vocalizations of their parents or other conspecific “tutors.” Young songbirds learn by comparing their own vocalizations to the memory of their tutor song, slowly improving until over the course of several weeks they can achieve an excellent imitation of the tutor. Because of the slow progression of vocal learning, and the large amounts of singing generated, automated algorithms for quantifying vocal imitation have become increasingly important for studying the mechanisms underlying this process. However, methodologies for quantifying song imitation are complicated by the highly variable songs of either juvenile birds or those that learn poorly because of experimental manipulations. Here we present a method for the evaluation of song imitation that incorporates two innovations: First, an automated procedure for selecting pupil song segments, and, second, a new algorithm, implemented in Matlab, for computing both song acoustic and sequence similarity. We tested our procedure using zebra finch song and determined a set of acoustic features for which the algorithm optimally differentiates between similar and non-similar songs. PMID:24809510
Measurements of vocal fold tissue viscoelasticity: Approaching the male phonatory frequency range

NASA Astrophysics Data System (ADS)

Chan, Roger W.

2004-06-01

Viscoelastic shear properties of human vocal fold tissues have been reported previously. However, data have only been obtained at very low frequencies (<=15 Hz). This necessitates data extrapolation to the frequency range of phonation based on constitutive modeling and time-temperature superposition. This study attempted to obtain empirical measurements at higher frequencies with the use of a controlled strain torsional rheometer, with a design of directly controlling input strain that introduced significantly smaller system inertial errors compared to controlled stress rheometry. Linear viscoelastic shear properties of the vocal fold mucosa (cover) from 17 canine larynges were quantified at frequencies of up to 50 Hz. Consistent with previous data, results showed that the elastic shear modulus (G'), viscous shear modulus (G''), and damping ratio (ζ) of the vocal fold mucosa were relatively constant across 0.016-50 Hz, whereas the dynamic viscosity (ɛ') decreased monotonically with frequency. Constitutive characterization of the empirical data by a quasilinear viscoelastic model and a statistical network model demonstrated trends of viscoelastic behavior at higher frequencies generally following those observed at lower frequencies. These findings supported the use of controlled strain rheometry for future investigations of the viscoelasticity of vocal fold tissues and phonosurgical biomaterials at phonatory frequencies.
Laughter as an approach to vocal evolution: The bipedal theory.

PubMed

Provine, Robert R

2017-02-01

Laughter is a simple, stereotyped, innate, human play vocalization that is ideal for the study of vocal evolution. The basic approach of describing the act of laughter and when we do it has revealed a variety of phenomena of social, linguistic, and neurological significance. Findings include the acoustic structure of laughter, the minimal voluntary control of laughter, the punctuation effect (which describes the placement of laughter in conversation and indicates the dominance of speech over laughter), and the role of laughter in human matching and mating. Especially notable is the use of laughter to discover why humans can speak and other apes cannot. Quadrupeds, including our primate ancestors, have a 1:1 relation between breathing and stride because their thorax must absorb forelimb impacts during running. The direct link between breathing and locomotion limits vocalizations to short, simple utterances, such as the characteristic panting chimpanzee laugh (one sound per inward or outward breath). The evolution of bipedal locomotion freed the respiration system of its support function during running, permitting greater breath control and the selection for human-type laughter (a parsed exhalation), and subsequently the virtuosic, sustained, expiratory vocalization of speech. This is the basis of the bipedal theory of speech evolution.
Automated Vocal Analysis of Children with Hearing Loss and Their Typical and Atypical Peers

PubMed Central

VanDam, Mark; Oller, D. Kimbrough; Ambrose, Sophie E.; Gray, Sharmistha; Richards, Jeffrey A.; Xu, Dongxin; Gilkerson, Jill; Silbert, Noah H.; Moeller, Mary Pat

2014-01-01

Objectives This study investigated automatic assessment of vocal development in children with hearing loss as compared with children who are typically developing, have language delays, and autism spectrum disorder. Statistical models are examined for performance in a classification model and to predict age within the four groups of children. Design The vocal analysis system analyzed over 1900 whole-day, naturalistic acoustic recordings from 273 toddlers and preschoolers comprising children who were typically developing, hard of hearing, language delayed, or autistic. Results Samples from children who were hard-of-hearing patterned more similarly to those of typically-developing children than to the language-delayed or autistic samples. The statistical models were able to classify children from the four groups examined and estimate developmental age based on automated vocal analysis. Conclusions This work shows a broad similarity between children with hearing loss and typically developing children, although children with hearing loss show some delay in their production of speech. Automatic acoustic analysis can now be used to quantitatively compare vocal development in children with and without speech-related disorders. The work may serve to better distinguish among various developmental disorders and ultimately contribute to improved intervention. PMID:25587667
Reinforcement of Infant Vocalizations through Contingent Vocal Imitation

ERIC Educational Resources Information Center

Pelaez, Martha; Virues-Ortega, Javier; Gewirtz, Jacob L.

2011-01-01

Maternal vocal imitation of infant vocalizations is highly prevalent during face-to-face interactions of infants and their caregivers. Although maternal vocal imitation has been associated with later verbal development, its potentially reinforcing effect on infant vocalizations has not been explored experimentally. This study examined the…
Limiting parental interaction during vocal development affects acoustic call structure in marmoset monkeys

PubMed Central

2018-01-01

Human vocal development is dependent on learning by imitation through social feedback between infants and caregivers. Recent studies have revealed that vocal development is also influenced by parental feedback in marmoset monkeys, suggesting vocal learning mechanisms in nonhuman primates. Marmoset infants that experience more contingent vocal feedback than their littermates develop vocalizations more rapidly, and infant marmosets with limited parental interaction exhibit immature vocal behavior beyond infancy. However, it is yet unclear whether direct parental interaction is an obligate requirement for proper vocal development because all monkeys in the aforementioned studies were able to produce the adult call repertoire after infancy. Using quantitative measures to compare distinct call parameters and vocal sequence structure, we show that social interaction has a direct impact not only on the maturation of the vocal behavior but also on acoustic call structures during vocal development. Monkeys with limited parental interaction during development show systematic differences in call entropy, a measure for maturity, compared with their normally raised siblings. In addition, different call types were occasionally uttered in motif-like sequences similar to those exhibited by vocal learners, such as birds and humans, in early vocal development. These results indicate that a lack of parental interaction leads to long-term disturbances in the acoustic structure of marmoset vocalizations, suggesting an imperative role for social interaction in proper primate vocal development. PMID:29651461

Limiting parental interaction during vocal development affects acoustic call structure in marmoset monkeys.

PubMed

Gultekin, Yasemin B; Hage, Steffen R

2018-04-01

Human vocal development is dependent on learning by imitation through social feedback between infants and caregivers. Recent studies have revealed that vocal development is also influenced by parental feedback in marmoset monkeys, suggesting vocal learning mechanisms in nonhuman primates. Marmoset infants that experience more contingent vocal feedback than their littermates develop vocalizations more rapidly, and infant marmosets with limited parental interaction exhibit immature vocal behavior beyond infancy. However, it is yet unclear whether direct parental interaction is an obligate requirement for proper vocal development because all monkeys in the aforementioned studies were able to produce the adult call repertoire after infancy. Using quantitative measures to compare distinct call parameters and vocal sequence structure, we show that social interaction has a direct impact not only on the maturation of the vocal behavior but also on acoustic call structures during vocal development. Monkeys with limited parental interaction during development show systematic differences in call entropy, a measure for maturity, compared with their normally raised siblings. In addition, different call types were occasionally uttered in motif-like sequences similar to those exhibited by vocal learners, such as birds and humans, in early vocal development. These results indicate that a lack of parental interaction leads to long-term disturbances in the acoustic structure of marmoset vocalizations, suggesting an imperative role for social interaction in proper primate vocal development.
Singing proficiency in congenital amusia: imitation helps.

PubMed

Tremblay-Champoux, Alexandra; Dalla Bella, Simone; Phillips-Silver, Jessica; Lebrun, Marie-Andrée; Peretz, Isabelle

2010-09-01

Singing out of tune characterizes congenital amusia. Here, we examine whether an aid to memory improves singing by studying vocal imitation in 11 amusic adults and 11 matched controls. Participants sang a highly familiar melody on the original lyrics and on the syllable /la/ in three conditions. First, they sang the melody from memory. Second, they sang it after hearing a model, and third, they sang in unison with the model. Results show that amusic individuals benefit from singing by imitation, whether singing after the model or in unison with the model. The amusics who were the most impaired in memory benefited most, particularly when singing on the syllable /la/. Nevertheless, singing remains poor on the pitch dimension; rhythm was intact and unaffected by imitation. These results point to memory as a source of impairment in poor singing, and to imitation as a possible aid for poor singers.
Technology-aided leisure and communication: Opportunities for persons with advanced Parkinson's disease.

PubMed

Lancioni, Giulio; Singh, Nirbhay; O'Reilly, Mark; Sigafoos, Jeff; D'Amico, Fiora; Sasanelli, Giovanni; Denitto, Floriana; Lang, Russell

2016-12-01

This study investigated whether simple technology-aided programs could be used to promote leisure and communication engagement in three persons with advanced Parkinson's disease. The programs included music and video options, which were combined with (a) text messaging and telephone calls for the first participant, (b) verbal statements/requests, text messaging, and reading for the second participant, and (c) verbal statements/requests and prayers for the third participant. The participants could activate those options via hand movement or vocal emission and specific microswitches. All three participants were successful in activating the options available. The mean cumulative frequencies of option activations were about five per 15-min session for the first two participants and about four per 10-min session for the third participant. The results were considered encouraging and relevant given the limited amount of evidence available on helping persons with advanced Parkinson's disease with leisure and communication.
Histopathologic study of human vocal fold mucosa unphonated over a decade.

PubMed

Sato, Kiminori; Umeno, Hirohito; Ono, Takeharu; Nakashima, Tadashi

2011-12-01

Mechanotransduction caused by vocal fold vibration could possibly be an important factor in the maintenance of extracellular matrices and layered structure of the human adult vocal fold mucosa as a vibrating tissue after the layered structure has been completed. Vocal fold stellate cells (VFSCs) in the human maculae flavae of the vocal fold mucosa are inferred to be involved in the metabolism of extracellular matrices of the vocal fold mucosa. Maculae flavae are also considered to be an important structure in the growth and development of the human vocal fold mucosa. Tension caused by phonation (vocal fold vibration) is hypothesized to stimulate the VFSCs to accelerate production of extracellular matrices. A human adult vocal fold mucosa unphonated over a decade was investigated histopathologically. Vocal fold mucosa unphonated for 11 years and 2 months of a 64-year-old male with cerebral hemorrhage was investigated by light and electron microscopy. The vocal fold mucosae (including maculae flavae) were atrophic. The vocal fold mucosa did not have a vocal ligament, Reinke's space or a layered structure. The lamina propria appeared as a uniform structure. Morphologically, the VFSCs synthesized fewer extracellular matrices, such as fibrous protein and glycosaminoglycan. Consequently, VFSCs appeared to decrease their level of activity.
Coos, booms, and hoots: The evolution of closed-mouth vocal behavior in birds.

PubMed

Riede, Tobias; Eliason, Chad M; Miller, Edward H; Goller, Franz; Clarke, Julia A

2016-08-01

Most birds vocalize with an open beak, but vocalization with a closed beak into an inflating cavity occurs in territorial or courtship displays in disparate species throughout birds. Closed-mouth vocalizations generate resonance conditions that favor low-frequency sounds. By contrast, open-mouth vocalizations cover a wider frequency range. Here we describe closed-mouth vocalizations of birds from functional and morphological perspectives and assess the distribution of closed-mouth vocalizations in birds and related outgroups. Ancestral-state optimizations of body size and vocal behavior indicate that closed-mouth vocalizations are unlikely to be ancestral in birds and have evolved independently at least 16 times within Aves, predominantly in large-bodied lineages. Closed-mouth vocalizations are rare in the small-bodied passerines. In light of these results and body size trends in nonavian dinosaurs, we suggest that the capacity for closed-mouth vocalization was present in at least some extinct nonavian dinosaurs. As in birds, this behavior may have been limited to sexually selected vocal displays, and hence would have co-occurred with open-mouthed vocalizations. © 2016 The Author(s). Evolution © 2016 The Society for the Study of Evolution.
The Vocal Repertoire of Adult and Neonate Giant Otters (Pteronura brasiliensis)

PubMed Central

Mumm, Christina A. S.; Knörnschild, Mirjam

2014-01-01

Animals use vocalizations to exchange information about external events, their own physical or motivational state, or about individuality and social affiliation. Infant babbling can enhance the development of the full adult vocal repertoire by providing ample opportunity for practice. Giant otters are very social and frequently vocalizing animals. They live in highly cohesive groups, generally including a reproductive pair and their offspring born in different years. This basic social structure may vary in the degree of relatedness of the group members. Individuals engage in shared group activities and different social roles and thus, the social organization of giant otters provides a basis for complex and long-term individual relationships. We recorded and analysed the vocalizations of adult and neonate giant otters from wild and captive groups. We classified the adult vocalizations according to their acoustic structure, and described their main behavioural context. Additionally, we present the first description of vocalizations uttered in babbling bouts of new born giant otters. We expected to find 1) a sophisticated vocal repertoire that would reflect the species’ complex social organisation, 2) that giant otter vocalizations have a clear relationship between signal structure and function, and 3) that the vocal repertoire of new born giant otters would comprise age-specific vocalizations as well as precursors of the adult repertoire. We found a vocal repertoire with 22 distinct vocalization types produced by adults and 11 vocalization types within the babbling bouts of the neonates. A comparison within the otter subfamily suggests a relation between vocal and social complexity, with the giant otters being the socially and vocally most complex species. PMID:25391142
Variability of normal vocal fold dynamics for different vocal loading in one healthy subject investigated by phonovibrograms.

PubMed

Doellinger, Michael; Lohscheller, Joerg; McWhorter, Andrew; Kunduk, Melda

2009-03-01

We investigate the potential of high-speed digital imaging technique (HSI) and the phonovibrogram (PVG) analysis in normal vocal fold dynamics by studying the effects of continuous voice use (vocal loading) during the workday. One healthy subject was recorded at sustained phonation 13 times within 2 consecutive days in the morning before and in the afternoon after vocal loading, respectively. Vocal fold dynamics were extracted and visualized by PVGs. The characteristic PVG patterns were extracted representing vocal fold vibration types. The parameter values were then analyzed by statistics regarding vocal load, left-right PVG asymmetries, anterior-posterior PVG asymmetries, and opening-closing differences. For the first time, the direct impact of vocal load could be determined by analyzing vocal fold dynamics. For same vocal loading conditions, equal dynamical behavior of the vocal folds were confirmed. Comparison of recordings performed in the morning with the recordings after work revealed significant changes in vibration behavior, indicating impact of occurring vocal load. Left-right asymmetries in vocal fold dynamics were found confirming earlier assumptions. Different dynamics between opening and closing procedure as well as for anterior and posterior parts were found. Constant voice usage stresses the vocal folds even in healthy subjects and can be detected by applying the PVG technique. Furthermore, left-right PVG asymmetries do occur in healthy voice to a certain extent. HSI in combination with PVG analysis seems to be a promising tool for investigation of vocal fold fatigue and pathologies resulting in small forms of dynamical changes.
Learned Vocal Variation Is Associated with Abrupt Cryptic Genetic Change in a Parrot Species Complex

PubMed Central

Ribot, Raoul F. H.; Buchanan, Katherine L.; Endler, John A.; Joseph, Leo; Bennett, Andrew T. D.; Berg, Mathew L.

2012-01-01

Contact zones between subspecies or closely related species offer valuable insights into speciation processes. A typical feature of such zones is the presence of clinal variation in multiple traits. The nature of these traits and the concordance among clines are expected to influence whether and how quickly speciation will proceed. Learned signals, such as vocalizations in species having vocal learning (e.g. humans, many birds, bats and cetaceans), can exhibit rapid change and may accelerate reproductive isolation between populations. Therefore, particularly strong concordance among clines in learned signals and population genetic structure may be expected, even among continuous populations in the early stages of speciation. However, empirical evidence for this pattern is often limited because differences in vocalisations between populations are driven by habitat differences or have evolved in allopatry. We tested for this pattern in a unique system where we may be able to separate effects of habitat and evolutionary history. We studied geographic variation in the vocalizations of the crimson rosella (Platycercus elegans) parrot species complex. Parrots are well known for their life-long vocal learning and cognitive abilities. We analysed contact calls across a ca 1300 km transect encompassing populations that differed in neutral genetic markers and plumage colour. We found steep clinal changes in two acoustic variables (fundamental frequency and peak frequency position). The positions of the two clines in vocal traits were concordant with a steep cline in microsatellite-based genetic variation, but were discordant with the steep clines in mtDNA, plumage and habitat. Our study provides new evidence that vocal variation, in a species with vocal learning, can coincide with areas of restricted gene flow across geographically continuous populations. Our results suggest that traits that evolve culturally can be strongly associated with reduced gene flow between populations, and therefore may promote speciation, even in the absence of other barriers. PMID:23227179
Applicability of Cone Beam Computed Tomography to the Assessment of the Vocal Tract before and after Vocal Exercises in Normal Subjects.

PubMed

Garcia, Elisângela Zacanti; Yamashita, Hélio Kiitiro; Garcia, Davi Sousa; Padovani, Marina Martins Pereira; Azevedo, Renata Rangel; Chiari, Brasília Maria

2016-01-01

Cone beam computed tomography (CBCT), which represents an alternative to traditional computed tomography and magnetic resonance imaging, may be a useful instrument to study vocal tract physiology related to vocal exercises. This study aims to evaluate the applicability of CBCT to the assessment of variations in the vocal tract of healthy individuals before and after vocal exercises. Voice recordings and CBCT images before and after vocal exercises performed by 3 speech-language pathologists without vocal complaints were collected and compared. Each participant performed 1 type of exercise, i.e., Finnish resonance tube technique, prolonged consonant "b" technique, or chewing technique. The analysis consisted of an acoustic analysis and tomographic imaging. Modifications of the vocal tract settings following vocal exercises were properly detected by CBCT, and changes in the acoustic parameters were, for the most part, compatible with the variations detected in image measurements. CBCT was shown to be capable of properly assessing the changes in vocal tract settings promoted by vocal exercises. © 2017 S. Karger AG, Basel.
What songbirds teach us about learning

NASA Astrophysics Data System (ADS)

Brainard, Michael S.; Doupe, Allison J.

2002-05-01

Bird fanciers have known for centuries that songbirds learn their songs. This learning has striking parallels to speech acquisition: like humans, birds must hear the sounds of adults during a sensitive period, and must hear their own voice while learning to vocalize. With the discovery and investigation of discrete brain structures required for singing, songbirds are now providing insights into neural mechanisms of learning. Aided by a wealth of behavioural observations and species diversity, studies in songbirds are addressing such basic issues in neuroscience as perceptual and sensorimotor learning, developmental regulation of plasticity, and the control and function of adult neurogenesis.
Dependence of phonation threshold pressure on vocal tract acoustics and vocal fold tissue mechanics.

PubMed

Chan, Roger W; Titze, Ingo R

2006-04-01

Analytical and computer simulation studies have shown that the acoustic impedance of the vocal tract as well as the viscoelastic properties of vocal fold tissues are critical for determining the dynamics and the energy transfer mechanism of vocal fold oscillation. In the present study, a linear, small-amplitude oscillation theory was revised by taking into account the propagation of a mucosal wave and the inertive reactance (inertance) of the supraglottal vocal tract as the major energy transfer mechanisms for flow-induced self-oscillation of the vocal fold. Specifically, analytical results predicted that phonation threshold pressure (Pth) increases with the viscous shear properties of the vocal fold, but decreases with vocal tract inertance. This theory was empirically tested using a physical model of the larynx, where biological materials (fat, hyaluronic acid, and fibronectin) were implanted into the vocal fold cover to investigate the effect of vocal fold tissue viscoelasticity on Pth. A uniform-tube supraglottal vocal tract was also introduced to examine the effect of vocal tract inertance on Pth. Results showed that Pth decreased with the inertive impedance of the vocal tract and increased with the viscous shear modulus (G") or dynamic viscosity (eta') of the vocal fold cover, consistent with theoretical predictions. These findings supported the potential biomechanical benefits of hyaluronic acid as a surgical bioimplant for repairing voice disorders involving the superficial layer of the lamina propria, such as scarring, sulcus vocalis, atrophy, and Reinke's edema.
Sensory-motor interactions for vocal pitch monitoring in non-primary human auditory cortex.

PubMed

Greenlee, Jeremy D W; Behroozmand, Roozbeh; Larson, Charles R; Jackson, Adam W; Chen, Fangxiang; Hansen, Daniel R; Oya, Hiroyuki; Kawasaki, Hiroto; Howard, Matthew A

2013-01-01

The neural mechanisms underlying processing of auditory feedback during self-vocalization are poorly understood. One technique used to study the role of auditory feedback involves shifting the pitch of the feedback that a speaker receives, known as pitch-shifted feedback. We utilized a pitch shift self-vocalization and playback paradigm to investigate the underlying neural mechanisms of audio-vocal interaction. High-resolution electrocorticography (ECoG) signals were recorded directly from auditory cortex of 10 human subjects while they vocalized and received brief downward (-100 cents) pitch perturbations in their voice auditory feedback (speaking task). ECoG was also recorded when subjects passively listened to playback of their own pitch-shifted vocalizations. Feedback pitch perturbations elicited average evoked potential (AEP) and event-related band power (ERBP) responses, primarily in the high gamma (70-150 Hz) range, in focal areas of non-primary auditory cortex on superior temporal gyrus (STG). The AEPs and high gamma responses were both modulated by speaking compared with playback in a subset of STG contacts. From these contacts, a majority showed significant enhancement of high gamma power and AEP responses during speaking while the remaining contacts showed attenuated response amplitudes. The speaking-induced enhancement effect suggests that engaging the vocal motor system can modulate auditory cortical processing of self-produced sounds in such a way as to increase neural sensitivity for feedback pitch error detection. It is likely that mechanisms such as efference copies may be involved in this process, and modulation of AEP and high gamma responses imply that such modulatory effects may affect different cortical generators within distinctive functional networks that drive voice production and control.
Sensory-Motor Interactions for Vocal Pitch Monitoring in Non-Primary Human Auditory Cortex

PubMed Central

Larson, Charles R.; Jackson, Adam W.; Chen, Fangxiang; Hansen, Daniel R.; Oya, Hiroyuki; Kawasaki, Hiroto; Howard, Matthew A.

2013-01-01

The neural mechanisms underlying processing of auditory feedback during self-vocalization are poorly understood. One technique used to study the role of auditory feedback involves shifting the pitch of the feedback that a speaker receives, known as pitch-shifted feedback. We utilized a pitch shift self-vocalization and playback paradigm to investigate the underlying neural mechanisms of audio-vocal interaction. High-resolution electrocorticography (ECoG) signals were recorded directly from auditory cortex of 10 human subjects while they vocalized and received brief downward (−100 cents) pitch perturbations in their voice auditory feedback (speaking task). ECoG was also recorded when subjects passively listened to playback of their own pitch-shifted vocalizations. Feedback pitch perturbations elicited average evoked potential (AEP) and event-related band power (ERBP) responses, primarily in the high gamma (70–150 Hz) range, in focal areas of non-primary auditory cortex on superior temporal gyrus (STG). The AEPs and high gamma responses were both modulated by speaking compared with playback in a subset of STG contacts. From these contacts, a majority showed significant enhancement of high gamma power and AEP responses during speaking while the remaining contacts showed attenuated response amplitudes. The speaking-induced enhancement effect suggests that engaging the vocal motor system can modulate auditory cortical processing of self-produced sounds in such a way as to increase neural sensitivity for feedback pitch error detection. It is likely that mechanisms such as efference copies may be involved in this process, and modulation of AEP and high gamma responses imply that such modulatory effects may affect different cortical generators within distinctive functional networks that drive voice production and control. PMID:23577157
Subglottal pressure, tracheal airflow, and intrinsic laryngeal muscle activity during rat ultrasound vocalization

PubMed Central

2011-01-01

Vocal production requires complex planning and coordination of respiratory, laryngeal, and vocal tract movements, which are incompletely understood in most mammals. Rats produce a variety of whistles in the ultrasonic range that are of communicative relevance and of importance as a model system, but the sources of acoustic variability were mostly unknown. The goal was to identify sources of fundamental frequency variability. Subglottal pressure, tracheal airflow, and electromyographic (EMG) data from two intrinsic laryngeal muscles were measured during 22-kHz and 50-kHz call production in awake, spontaneously behaving adult male rats. During ultrasound vocalization, subglottal pressure ranged between 0.8 and 1.9 kPa. Pressure differences between call types were not significant. The relation between fundamental frequency and subglottal pressure within call types was inconsistent. Experimental manipulations of subglottal pressure had only small effects on fundamental frequency. Tracheal airflow patterns were also inconsistently associated with frequency. Pressure and flow seem to play a small role in regulation of fundamental frequency. Muscle activity, however, is precisely regulated and very sensitive to alterations, presumably because of effects on resonance properties in the vocal tract. EMG activity of cricothyroid and thyroarytenoid muscle was tonic in calls with slow or no fundamental frequency modulations, like 22-kHz and flat 50-kHz calls. Both muscles showed brief high-amplitude, alternating bursts at rates up to 150 Hz during production of frequency-modulated 50-kHz calls. A differentiated and fine regulation of intrinsic laryngeal muscles is critical for normal ultrasound vocalization. Many features of the laryngeal muscle activation pattern during ultrasound vocalization in rats are shared with other mammals. PMID:21832032
Female harbor seal (Phoca vitulina) behavioral response to playbacks of underwater male acoustic advertisement displays

PubMed Central

Blades, Brittany; Parks, Susan E.

2018-01-01

During the breeding season, male harbor seals (Phoca vitulina) make underwater acoustic displays using vocalizations known as roars. These roars have been shown to function in territory establishment in some breeding areas and have been hypothesized to be important for female choice, but the function of these sounds remains unresolved. This study consisted of a series of playback experiments in which captive female harbor seals were exposed to recordings of male roars to determine if females respond to recordings of male vocalizations and whether or not they respond differently to roars from categories with different acoustic characteristics. The categories included roars with characteristics of dominant males (longest duration, lowest frequency), subordinate males (shortest duration, highest frequency), combinations of call parameters from dominant and subordinate males (long duration, high frequency and short duration, low frequency), and control playbacks of water noise and water noise with tonal signals in the same frequency range as male signals. Results indicate that overall females have a significantly higher level of response to playbacks that imitate male vocalizations when compared to control playbacks of water noise. Specifically, there was a higher level of response to playbacks representing dominant male vocalization when compared to the control playbacks. For most individuals, there was a greater response to playbacks representing dominant male vocalizations compared to playbacks representing subordinate male vocalizations; however, there was no statistical difference between those two playback types. Additionally, there was no difference between the playbacks of call parameter combinations and the controls. Investigating female preference for male harbor seal vocalizations is a critical step in understanding the harbor seal mating system and further studies expanding on this captive study will help shed light on this important issue. PMID:29607261
Female harbor seal (Phoca vitulina) behavioral response to playbacks of underwater male acoustic advertisement displays.

PubMed

Matthews, Leanna P; Blades, Brittany; Parks, Susan E

2018-01-01

During the breeding season, male harbor seals ( Phoca vitulina ) make underwater acoustic displays using vocalizations known as roars. These roars have been shown to function in territory establishment in some breeding areas and have been hypothesized to be important for female choice, but the function of these sounds remains unresolved. This study consisted of a series of playback experiments in which captive female harbor seals were exposed to recordings of male roars to determine if females respond to recordings of male vocalizations and whether or not they respond differently to roars from categories with different acoustic characteristics. The categories included roars with characteristics of dominant males (longest duration, lowest frequency), subordinate males (shortest duration, highest frequency), combinations of call parameters from dominant and subordinate males (long duration, high frequency and short duration, low frequency), and control playbacks of water noise and water noise with tonal signals in the same frequency range as male signals. Results indicate that overall females have a significantly higher level of response to playbacks that imitate male vocalizations when compared to control playbacks of water noise. Specifically, there was a higher level of response to playbacks representing dominant male vocalization when compared to the control playbacks. For most individuals, there was a greater response to playbacks representing dominant male vocalizations compared to playbacks representing subordinate male vocalizations; however, there was no statistical difference between those two playback types. Additionally, there was no difference between the playbacks of call parameter combinations and the controls. Investigating female preference for male harbor seal vocalizations is a critical step in understanding the harbor seal mating system and further studies expanding on this captive study will help shed light on this important issue.
The objective vocal quality, vocal risk factors, vocal complaints, and corporal pain in Dutch female students training to be speech-language pathologists during the 4 years of study.

PubMed

Van Lierde, Kristiane M; D'haeseleer, Evelien; Wuyts, Floris L; De Ley, Sophia; Geldof, Ruben; De Vuyst, Julie; Sofie, Claeys

2010-09-01

The purpose of the present cross-sectional study was to determine the objective vocal quality and the vocal characteristics (vocal risk factors, vocal and corporal complaints) in 197 female students in speech-language pathology during the 4 years of study. The objective vocal quality was measured by means of the Dysphonia Severity Index (DSI). Perceptual voice assessment, the Voice Handicap Index (VHI), questionnaires addressing vocal risks, and vocal and corporal complaints during and/or after voice usage were performed. Speech-language pathology (SLP) students have a borderline vocal quality corresponding to a DSI% of 68. The analysis of variance revealed no significant change of the objective vocal quality between the first bachelor year and the master year. No psychosocial handicapping effect of the voice was observed by means of the VHI total, though there was an effect at the functional VHI level in addition to some vocal complaints. Ninety-three percent of the student SLPs reported the presence of corporal pain during and/or after speaking. In particular, sore throat and headache were mentioned as the prevalent corporal pain symptoms. A longitudinal study of the objective vocal quality of the same subjects during their career as an SLP might provide new insights. 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Can vocal conditioning trigger a semiotic ratchet in marmosets?

PubMed

Turesson, Hjalmar K; Ribeiro, Sidarta

2015-01-01

The complexity of human communication has often been taken as evidence that our language reflects a true evolutionary leap, bearing little resemblance to any other animal communication system. The putative uniqueness of the human language poses serious evolutionary and ethological challenges to a rational explanation of human communication. Here we review ethological, anatomical, molecular, and computational results across several species to set boundaries for these challenges. Results from animal behavior, cognitive psychology, neurobiology, and semiotics indicate that human language shares multiple features with other primate communication systems, such as specialized brain circuits for sensorimotor processing, the capability for indexical (pointing) and symbolic (referential) signaling, the importance of shared intentionality for associative learning, affective conditioning and parental scaffolding of vocal production. The most substantial differences lie in the higher human capacity for symbolic compositionality, fast vertical transmission of new symbols across generations, and irreversible accumulation of novel adaptive behaviors (cultural ratchet). We hypothesize that increasingly-complex vocal conditioning of an appropriate animal model may be sufficient to trigger a semiotic ratchet, evidenced by progressive sign complexification, as spontaneous contact calls become indexes, then symbols and finally arguments (strings of symbols). To test this hypothesis, we outline a series of conditioning experiments in the common marmoset (Callithrix jacchus). The experiments are designed to probe the limits of vocal communication in a prosocial, highly vocal primate 35 million years far from the human lineage, so as to shed light on the mechanisms of semiotic complexification and cultural transmission, and serve as a naturalistic behavioral setting for the investigation of language disorders.
Can vocal conditioning trigger a semiotic ratchet in marmosets?

PubMed Central

Turesson, Hjalmar K.; Ribeiro, Sidarta

2015-01-01

The complexity of human communication has often been taken as evidence that our language reflects a true evolutionary leap, bearing little resemblance to any other animal communication system. The putative uniqueness of the human language poses serious evolutionary and ethological challenges to a rational explanation of human communication. Here we review ethological, anatomical, molecular, and computational results across several species to set boundaries for these challenges. Results from animal behavior, cognitive psychology, neurobiology, and semiotics indicate that human language shares multiple features with other primate communication systems, such as specialized brain circuits for sensorimotor processing, the capability for indexical (pointing) and symbolic (referential) signaling, the importance of shared intentionality for associative learning, affective conditioning and parental scaffolding of vocal production. The most substantial differences lie in the higher human capacity for symbolic compositionality, fast vertical transmission of new symbols across generations, and irreversible accumulation of novel adaptive behaviors (cultural ratchet). We hypothesize that increasingly-complex vocal conditioning of an appropriate animal model may be sufficient to trigger a semiotic ratchet, evidenced by progressive sign complexification, as spontaneous contact calls become indexes, then symbols and finally arguments (strings of symbols). To test this hypothesis, we outline a series of conditioning experiments in the common marmoset (Callithrix jacchus). The experiments are designed to probe the limits of vocal communication in a prosocial, highly vocal primate 35 million years far from the human lineage, so as to shed light on the mechanisms of semiotic complexification and cultural transmission, and serve as a naturalistic behavioral setting for the investigation of language disorders. PMID:26500583
University Vocal Training and Vocal Health of Music Educators and Music Therapists

ERIC Educational Resources Information Center

Baker, Vicki D.; Cohen, Nicki

2017-01-01

The purpose of this study was to describe the university vocal training and vocal health of music educators and music therapists. The participants (N = 426), music educators (n = 351) and music therapists (n = 75), completed a survey addressing demographics, vocal training, voice usage, and vocal health. Both groups reported singing at least 50%…

Monkey vocal tracts are speech-ready.

PubMed

Fitch, W Tecumseh; de Boer, Bart; Mathur, Neil; Ghazanfar, Asif A

2016-12-01

For four decades, the inability of nonhuman primates to produce human speech sounds has been claimed to stem from limitations in their vocal tract anatomy, a conclusion based on plaster casts made from the vocal tract of a monkey cadaver. We used x-ray videos to quantify vocal tract dynamics in living macaques during vocalization, facial displays, and feeding. We demonstrate that the macaque vocal tract could easily produce an adequate range of speech sounds to support spoken language, showing that previous techniques based on postmortem samples drastically underestimated primate vocal capabilities. Our findings imply that the evolution of human speech capabilities required neural changes rather than modifications of vocal anatomy. Macaques have a speech-ready vocal tract but lack a speech-ready brain to control it.
The effect of age of cochlear implantation on vocal characteristics in children

PubMed Central

Knight, Kerry; Ducasse, Simone; Coetzee, Ashley; Louw, Anel

2016-01-01

Background Early cochlear implantation aids auditory feedback and supports better communication and self-monitoring of the voice. The objective of this study was to determine whether the age of cochlear implantation has an impact on vocal development in children implanted before age 4. Method and procedures The study consisted of 19 participants in total. All implant recipients (experimental group) were 3–5 years post-implantation, including four prelingual (0–2 years) and five perilingual (2–4 years) implant recipients. The control group consisted of 10 children whose hearing was within normal limits between the ages 3–6 years and 10 months, which was compared to the experimental group. Established paediatric norms were used for additional comparison. A questionnaire was used to gather information from each of the participant’s caregivers to determine whether other personal and contextual factors had an impact on voice production. An acoustic analysis was conducted for each participant using the Multi-Dimensional Voice Program of the Computerized Speech Lab. Results When the experimental group and the control group were compared, similar results were yielded for fundamental frequency and short-term perturbation (jitter and shimmer). More variability was noted in long-term frequency and amplitude measures, with significantly higher differences, and therefore further outside the norms, in the prelingual group when compared to the perilingual and control groups. Conclusion In this study, age of implantation did not impact vocal characteristics. Further research should include larger sample sizes, with participants that are age and gender matched. PMID:27380914
Emotional expressions in voice and music: same code, same effect?

PubMed

Escoffier, Nicolas; Zhong, Jidan; Schirmer, Annett; Qiu, Anqi

2013-08-01

Scholars have documented similarities in the way voice and music convey emotions. By using functional magnetic resonance imaging (fMRI) we explored whether these similarities imply overlapping processing substrates. We asked participants to trace changes in either the emotion or pitch of vocalizations and music using a joystick. Compared to music, vocalizations more strongly activated superior and middle temporal cortex, cuneus, and precuneus. However, despite these differences, overlapping rather than differing regions emerged when comparing emotion with pitch tracing for music and vocalizations, respectively. Relative to pitch tracing, emotion tracing activated medial superior frontal and anterior cingulate cortex regardless of stimulus type. Additionally, we observed emotion specific effects in primary and secondary auditory cortex as well as in medial frontal cortex that were comparable for voice and music. Together these results indicate that similar mechanisms support emotional inferences from vocalizations and music and that these mechanisms tap on a general system involved in social cognition. Copyright © 2011 Wiley Periodicals, Inc.
Vocal Dose Measures: Quantifying Accumulated Vibration Exposure in Vocal Fold Tissues

PubMed Central

Titze, Ingo R.; Švec, Jan G.; Popolo, Peter S.

2011-01-01

To measure the exposure to self-induced tissue vibration in speech, three vocal doses were defined and described: distance dose, which accumulates the distance that tissue particles of the vocal folds travel in an oscillatory trajectory; energy dissipation dose, which accumulates the total amount of heat dissipated over a unit volume of vocal fold tissues; and time dose, which accumulates the total phonation time. These doses were compared to a previously used vocal dose measure, the vocal loading index, which accumulates the number of vibration cycles of the vocal folds. Empirical rules for viscosity and vocal fold deformation were used to calculate all the doses from the fundamental frequency (F0) and sound pressure level (SPL) values of speech. Six participants were asked to read in normal, monotone, and exaggerated speech and the doses associated with these vocalizations were calculated. The results showed that large F0 and SPL variations in speech affected the dose measures, suggesting that accumulation of phonation time alone is insufficient. The vibration exposure of the vocal folds in normal speech was related to the industrial limits for hand-transmitted vibration, in which the safe distance dose was derived to be about 500 m. This limit was found rather low for vocalization; it was related to a comparable time dose of about 17 min of continuous vocalization, or about 35 min of continuous reading with normal breathing and unvoiced segments. The voicing pauses in normal speech and dialogue effectively prolong the safe time dose. The derived safety limits for vocalization will likely require refinement based on a more detailed knowledge of the differences in hand and vocal fold tissue morphology and their response to vibrational stress, and on the effect of recovery of the vocal fold tissue during voicing pauses. PMID:12959470
Neural FoxP2 and FoxP1 expression in the budgerigar, an avian species with adult vocal learning.

PubMed

Hara, Erina; Perez, Jemima M; Whitney, Osceola; Chen, Qianqian; White, Stephanie A; Wright, Timothy F

2015-04-15

Vocal learning underlies acquisition of both language in humans and vocal signals in some avian taxa. These bird groups and humans exhibit convergent developmental phases and associated brain pathways for vocal communication. The transcription factor FoxP2 plays critical roles in vocal learning in humans and songbirds. Another member of the forkhead box gene family, FoxP1 also shows high expression in brain areas involved in vocal learning and production. Here, we investigate FoxP2 and FoxP1 mRNA and protein in adult male budgerigars (Melopsittacus undulatus), a parrot species that exhibits vocal learning as both juveniles and adults. To examine these molecules in adult vocal learners, we compared their expression patterns in the budgerigar striatal nucleus involved in vocal learning, magnocellular nucleus of the medial striatum (MMSt), across birds with different vocal states, such as vocalizing to a female (directed), vocalizing alone (undirected), and non-vocalizing. We found that both FoxP2 mRNA and protein expressions were consistently lower in MMSt than in the adjacent striatum regardless of the vocal states, whereas previous work has shown that songbirds exhibit down-regulation in the homologous region, Area X, only after singing alone. In contrast, FoxP1 levels were high in MMSt compared to the adjacent striatum in all groups. Taken together these results strengthen the general hypothesis that FoxP2 and FoxP1 have specialized expression in vocal nuclei across a range of taxa, and suggest that the adult vocal plasticity seen in budgerigars may be a product of persistent down-regulation of FoxP2 in MMSt. Copyright © 2015 Elsevier B.V. All rights reserved.
Neural FoxP2 and FoxP1 expression in the budgerigar, an avian species with adult vocal learning

PubMed Central

Hara, Erina; Perez, Jemima M.; Whitney, Osceola; Chen, Qianqian; White, Stephanie A.; Wright, Timothy F.

2015-01-01

Vocal learning underlies acquisition of both language in humans and vocal signals in some avian taxa. These bird groups and humans exhibit convergent developmental phases and associated brain pathways for vocal communication. The transcription factor FoxP2 plays critical roles in vocal learning in humans and songbirds. Another member of the forkhead box gene family, FoxP1 also shows high expression in brain areas involved in vocal learning and production. Here, we investigate FoxP2 and FoxP1 mRNA and protein in adult male budgerigars (Melopsittacus undulatus), a parrot species that exhibits vocal learning as both juveniles and adults. To examine these molecules in adult vocal learners, we compared their expression patterns in the budgerigar striatal nucleus involved in vocal learning, magnocellular nucleus of the medial striatum (MMSt), across birds with different vocal states, such as vocalizing to a female (directed), vocalizing alone (undirected), and non-vocalizing. We found that both FoxP2 mRNA and protein expressions were consistently lower in MMSt than in the adjacent striatum regardless of the vocal states, whereas previous work has shown that songbirds exhibit downregulation in the homologous region, Area X, only after singing alone. In contrast, FoxP1 levels were high in MMSt compared to the adjacent striatum in all groups. Taken together these results strengthen the general hypothesis that FoxP2 and FoxP1 have specialized expression in vocal nuclei across a range of taxa, and suggest that the adult vocal plasticity seen in budgerigars may be a product of persistent down-regulation of FoxP2 in MMSt. PMID:25601574
Vocal dose measures: quantifying accumulated vibration exposure in vocal fold tissues.

PubMed

Titze, Ingo R; Svec, Jan G; Popolo, Peter S

2003-08-01

To measure the exposure to self-induced tissue vibration in speech, three vocal doses were defined and described: distance dose, which accumulates the distance that tissue particles of the vocal folds travel in an oscillatory trajectory; energy dissipation dose, which accumulates the total amount of heat dissipated over a unit volume of vocal fold tissues; and time dose, which accumulates the total phonation time. These doses were compared to a previously used vocal dose measure, the vocal loading index, which accumulates the number of vibration cycles of the vocal folds. Empirical rules for viscosity and vocal fold deformation were used to calculate all the doses from the fundamental frequency (F0) and sound pressure level (SPL) values of speech. Six participants were asked to read in normal, monotone, and exaggerated speech and the doses associated with these vocalizations were calculated. The results showed that large F0 and SPL variations in speech affected the dose measures, suggesting that accumulation of phonation time alone is insufficient. The vibration exposure of the vocal folds in normal speech was related to the industrial limits for hand-transmitted vibration, in which the safe distance dose was derived to be about 500 m. This limit was found rather low for vocalization; it was related to a comparable time dose of about 17 min of continuous vocalization, or about 35 min of continuous reading with normal breathing and unvoiced segments. The voicing pauses in normal speech and dialogue effectively prolong the safe time dose. The derived safety limits for vocalization will likely require refinement based on a more detailed knowledge of the differences in hand and vocal fold tissue morphology and their response to vibrational stress, and on the effect of recovery of the vocal fold tissue during voicing pauses.
Singers' Vocal Function Knowledge Levels, Sensorimotor Self-awareness of Vocal Tract, and Impact of Functional Voice Rehabilitation on the Vocal Function Knowledge and Self-awareness of Vocal Tract.

PubMed

Sielska-Badurek, Ewelina; Osuch-Wójcikiewicz, Ewa; Sobol, Maria; Kazanecka, Ewa; Niemczyk, Kazimierz

2017-01-01

This study investigated vocal function knowledge and vocal tract sensorimotor self-awareness and the impact of functional voice rehabilitation on vocal function knowledge and self-awareness. This is a prospective, randomized study. Twenty singers (study group [SG]) completed a questionnaire before and after functional voice rehabilitation. Twenty additional singers, representing the control group, also completed the questionnaire without functional voice rehabilitation at a 3-month interval. The questionnaire consisted of three parts. The first part evaluated the singers' attitude to the anatomical and physiological knowledge of the vocal tract and their self-esteem of the knowledge level. The second part assessed the theoretical knowledge of the singers' vocal tract physiology. The third part of the questionnaire assessed singers' sensorimotor self-awareness of the vocal tract. The results showed that most singers indicated that knowledge of the vocal tract's anatomy and physiology is useful (59% SG, 67% control group). However, 75% of all participants defined their knowledge of the vocal tract's anatomy and physiology as weak or inadequate. In the SG, vocal function knowledge at the first assessment was 45%. After rehabilitation, the level increased to 67.7%. Vocal tract sensorimotor self-awareness initially was 38.9% in SG but rose to 66.7%. Findings of the study suggest that classical singers lack knowledge about the physiology of the vocal mechanism, especially the breathing patterns. In addition, they have low sensorimotor self-awareness of their vocal tract. The results suggest that singers would benefit from receiving services from phoniatrists and speech-language pathologists during their voice training. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Sensorimotor learning in children and adults: Exposure to frequency-altered auditory feedback during speech production.

PubMed

Scheerer, N E; Jacobson, D S; Jones, J A

2016-02-09

Auditory feedback plays an important role in the acquisition of fluent speech; however, this role may change once speech is acquired and individuals no longer experience persistent developmental changes to the brain and vocal tract. For this reason, we investigated whether the role of auditory feedback in sensorimotor learning differs across children and adult speakers. Participants produced vocalizations while they heard their vocal pitch predictably or unpredictably shifted downward one semitone. The participants' vocal pitches were measured at the beginning of each vocalization, before auditory feedback was available, to assess the extent to which the deviant auditory feedback modified subsequent speech motor commands. Sensorimotor learning was observed in both children and adults, with participants' initial vocal pitch increasing following trials where they were exposed to predictable, but not unpredictable, frequency-altered feedback. Participants' vocal pitch was also measured across each vocalization, to index the extent to which the deviant auditory feedback was used to modify ongoing vocalizations. While both children and adults were found to increase their vocal pitch following predictable and unpredictable changes to their auditory feedback, adults produced larger compensatory responses. The results of the current study demonstrate that both children and adults rapidly integrate information derived from their auditory feedback to modify subsequent speech motor commands. However, these results also demonstrate that children and adults differ in their ability to use auditory feedback to generate compensatory vocal responses during ongoing vocalization. Since vocal variability also differed across the children and adult groups, these results also suggest that compensatory vocal responses to frequency-altered feedback manipulations initiated at vocalization onset may be modulated by vocal variability. Copyright © 2015 IBRO. Published by Elsevier Ltd. All rights reserved.
Modeling vocalization with ECoG cortical activity recorded during vocal production in the macaque monkey.

PubMed

Fukushima, Makoto; Saunders, Richard C; Fujii, Naotaka; Averbeck, Bruno B; Mishkin, Mortimer

2014-01-01

Vocal production is an example of controlled motor behavior with high temporal precision. Previous studies have decoded auditory evoked cortical activity while monkeys listened to vocalization sounds. On the other hand, there have been few attempts at decoding motor cortical activity during vocal production. Here we recorded cortical activity during vocal production in the macaque with a chronically implanted electrocorticographic (ECoG) electrode array. The array detected robust activity in motor cortex during vocal production. We used a nonlinear dynamical model of the vocal organ to reduce the dimensionality of `Coo' calls produced by the monkey. We then used linear regression to evaluate the information in motor cortical activity for this reduced representation of calls. This simple linear model accounted for circa 65% of the variance in the reduced sound representations, supporting the feasibility of using the dynamical model of the vocal organ for decoding motor cortical activity during vocal production.
[Observation of the dysphonia severity index in evaluating curative effect of vocal cord polyp surgery].

PubMed

Zhou, Zhou; Ge, Pingjiang; Liu, Qian; Liu, Ming; Zhang, Wei

2015-08-01

To investigate the applicability of the eysphonia severity index (DSI) in evaluating effects of surgery between before and after groups of vocal polyp patients. Analyses of measurement data pre and pro-surgery of 70 vocal polyp patients and 35 no voice disorders volunteers (control group). The voice quality was measured subjectively with the voice handicap index (VHI), the GRBAS and fiber electronic laryngoscopy. Measures of maximum phonation time (MPT), shimmer and jitter were obtained for each subject by using DiVAS 2.30 (XION, Germany). The DiVAS 2.30 had spotanenously calculate the scores of DSI. Using SPSS 17.0 to find the differences of DSI scores among the three groups by one-way ANOVA variance analysis. And finding out of the correlation with DSI scores and VHI scores, GRBAS, MPT, jitter and shimmer. DSI improved significantly after surgery in the vocal polyps group (mean difference DSI -2.92 and 1.87, respectively) and also in the control group (mean difference DSI -2.92 and 2.30, respectively). However, no significant difference between the control group and the after surgery group. By using Pearson correlation analysis, this study observed a strong correlation between the DSI scores and the VHI scores, the values of GRBAS, shimmer (P < 0.01). DSI is an effective and high accuracy multi-parameter system for evaluation of vocal cord polyp patients as an independent assessment of dysphonia. DSI also can be used in evaluation of the effects of the vocal polyps surgery.
Magnetic resonance imaging of the brain and vocal tract: Applications to the study of speech production and language learning.

PubMed

Carey, Daniel; McGettigan, Carolyn

2017-04-01

The human vocal system is highly plastic, allowing for the flexible expression of language, mood and intentions. However, this plasticity is not stable throughout the life span, and it is well documented that adult learners encounter greater difficulty than children in acquiring the sounds of foreign languages. Researchers have used magnetic resonance imaging (MRI) to interrogate the neural substrates of vocal imitation and learning, and the correlates of individual differences in phonetic "talent". In parallel, a growing body of work using MR technology to directly image the vocal tract in real time during speech has offered primarily descriptive accounts of phonetic variation within and across languages. In this paper, we review the contribution of neural MRI to our understanding of vocal learning, and give an overview of vocal tract imaging and its potential to inform the field. We propose methods by which our understanding of speech production and learning could be advanced through the combined measurement of articulation and brain activity using MRI - specifically, we describe a novel paradigm, developed in our laboratory, that uses both MRI techniques to for the first time map directly between neural, articulatory and acoustic data in the investigation of vocalisation. This non-invasive, multimodal imaging method could be used to track central and peripheral correlates of spoken language learning, and speech recovery in clinical settings, as well as provide insights into potential sites for targeted neural interventions. Copyright © 2016 Elsevier Ltd. All rights reserved.
Vocal communication in African elephants (Loxodonta africana).

PubMed

Soltis, Joseph

2010-01-01

Research on vocal communication in African elephants has increased in recent years, both in the wild and in captivity, providing an opportunity to present a comprehensive review of research related to their vocal behavior. Current data indicate that the vocal repertoire consists of perhaps nine acoustically distinct call types, "rumbles" being the most common and acoustically variable. Large vocal production anatomy is responsible for the low-frequency nature of rumbles, with fundamental frequencies in the infrasonic range. Additionally, resonant frequencies of rumbles implicate the trunk in addition to the oral cavity in shaping the acoustic structure of rumbles. Long-distance communication is thought possible because low-frequency sounds propagate more faithfully than high-frequency sounds, and elephants respond to rumbles at distances of up to 2.5 km. Elephant ear anatomy appears designed for detecting low frequencies, and experiments demonstrate that elephants can detect infrasonic tones and discriminate small frequency differences. Two vocal communication functions in the African elephant now have reasonable empirical support. First, closely bonded but spatially separated females engage in rumble exchanges, or "contact calls," that function to coordinate movement or reunite animals. Second, both males and females produce "mate attraction" rumbles that may advertise reproductive states to the opposite sex. Additionally, there is evidence that the structural variation in rumbles reflects the individual identity, reproductive state, and emotional state of callers. Growth in knowledge about the communication system of the African elephant has occurred from a rich combination of research on wild elephants in national parks and captive elephants in zoological parks.
Construction and Characterization of a Novel Vocal Fold Bioreactor

PubMed Central

Zerdoum, Aidan B.; Tong, Zhixiang; Bachman, Brendan; Jia, Xinqiao

2014-01-01

In vitro engineering of mechanically active tissues requires the presentation of physiologically relevant mechanical conditions to cultured cells. To emulate the dynamic environment of vocal folds, a novel vocal fold bioreactor capable of producing vibratory stimulations at fundamental phonation frequencies is constructed and characterized. The device is composed of a function generator, a power amplifier, a speaker selector and parallel vibration chambers. Individual vibration chambers are created by sandwiching a custom-made silicone membrane between a pair of acrylic blocks. The silicone membrane not only serves as the bottom of the chamber but also provides a mechanism for securing the cell-laden scaffold. Vibration signals, generated by a speaker mounted underneath the bottom acrylic block, are transmitted to the membrane aerodynamically by the oscillating air. Eight identical vibration modules, fixed on two stationary metal bars, are housed in an anti-humidity chamber for long-term operation in a cell culture incubator. The vibration characteristics of the vocal fold bioreactor are analyzed non-destructively using a Laser Doppler Vibrometer (LDV). The utility of the dynamic culture device is demonstrated by culturing cellular constructs in the presence of 200-Hz sinusoidal vibrations with a mid-membrane displacement of 40 µm. Mesenchymal stem cells cultured in the bioreactor respond to the vibratory signals by altering the synthesis and degradation of vocal fold-relevant, extracellular matrix components. The novel bioreactor system presented herein offers an excellent in vitro platform for studying vibration-induced mechanotransduction and for the engineering of functional vocal fold tissues. PMID:25145349
Construction and characterization of a novel vocal fold bioreactor.

PubMed

Zerdoum, Aidan B; Tong, Zhixiang; Bachman, Brendan; Jia, Xinqiao

2014-08-01

In vitro engineering of mechanically active tissues requires the presentation of physiologically relevant mechanical conditions to cultured cells. To emulate the dynamic environment of vocal folds, a novel vocal fold bioreactor capable of producing vibratory stimulations at fundamental phonation frequencies is constructed and characterized. The device is composed of a function generator, a power amplifier, a speaker selector and parallel vibration chambers. Individual vibration chambers are created by sandwiching a custom-made silicone membrane between a pair of acrylic blocks. The silicone membrane not only serves as the bottom of the chamber but also provides a mechanism for securing the cell-laden scaffold. Vibration signals, generated by a speaker mounted underneath the bottom acrylic block, are transmitted to the membrane aerodynamically by the oscillating air. Eight identical vibration modules, fixed on two stationary metal bars, are housed in an anti-humidity chamber for long-term operation in a cell culture incubator. The vibration characteristics of the vocal fold bioreactor are analyzed non-destructively using a Laser Doppler Vibrometer (LDV). The utility of the dynamic culture device is demonstrated by culturing cellular constructs in the presence of 200-Hz sinusoidal vibrations with a mid-membrane displacement of 40 µm. Mesenchymal stem cells cultured in the bioreactor respond to the vibratory signals by altering the synthesis and degradation of vocal fold-relevant, extracellular matrix components. The novel bioreactor system presented herein offers an excellent in vitro platform for studying vibration-induced mechanotransduction and for the engineering of functional vocal fold tissues.
A Mechanism for Frequency Modulation in Songbirds Shared with Humans

PubMed Central

Margoliash, Daniel

2013-01-01

In most animals that vocalize, control of fundamental frequency is a key element for effective communication. In humans, subglottal pressure controls vocal intensity but also influences fundamental frequency during phonation. Given the underlying similarities in the biomechanical mechanisms of vocalization in humans and songbirds, songbirds offer an attractive opportunity to study frequency modulation by pressure. Here, we present a novel technique for dynamic control of subsyringeal pressure in zebra finches. By regulating the opening of a custom-built fast valve connected to the air sac system, we achieved partial or total silencing of specific syllables, and could modify syllabic acoustics through more complex manipulations of air sac pressure. We also observed that more nuanced pressure variations over a limited interval during production of a syllable concomitantly affected the frequency of that syllable segment. These results can be explained in terms of a mathematical model for phonation that incorporates a nonlinear description for the vocal source capable of generating the observed frequency modulations induced by pressure variations. We conclude that the observed interaction between pressure and frequency was a feature of the source, not a result of feedback control. Our results indicate that, beyond regulating phonation or its absence, regulation of pressure is important for control of fundamental frequencies of vocalizations. Thus, although there are separate brainstem pathways for syringeal and respiratory control of song production, both can affect airflow and frequency. We hypothesize that the control of pressure and frequency is combined holistically at higher levels of the vocalization pathways. PMID:23825417
A sensorimotor area in the songbird brain is required for production of vocalizations in the song learning period of development.

PubMed

Piristine, Hande C; Choetso, Tenzin; Gobes, Sharon M H

2016-11-01

Sensory feedback is essential for acquiring and maintaining complex motor behaviors, including birdsong. In zebra finches, auditory feedback reaches the song control circuits primarily through the nucleus interfacialis nidopalii (Nif), which provides excitatory input to HVC (proper name)-a premotor region essential for the production of learned vocalizations. Despite being one of the major inputs to the song control pathway, the role of Nif in generating vocalizations is not well understood. To address this, we transiently inactivated Nif in late juvenile zebra finches. Upon Nif inactivation (in both hemispheres or on one side only), birds went from singing stereotyped zebra finch song to uttering highly variable and unstructured vocalizations resembling sub-song, an early juvenile song form driven by a basal ganglia circuit. Simultaneously inactivating Nif and LMAN (lateral magnocellular nucleus of the anterior nidopallium), the output nucleus of a basal ganglia circuit, inhibited song production altogether. These results suggest that Nif is required for generating the premotor drive for song. Permanent Nif lesions, in contrast, have only transient effects on vocal production, with song recovering within a day. The sensorimotor nucleus Nif thus produces a premotor drive to the motor pathway that is acutely required for generating learned vocalizations, but once permanently removed, the song system can compensate for its absence. © 2016 Wiley Periodicals, Inc. Develop Neurobiol 76: 1213-1225, 2016. © 2016 Wiley Periodicals, Inc.
A mechanism for frequency modulation in songbirds shared with humans.

PubMed

Amador, Ana; Margoliash, Daniel

2013-07-03

In most animals that vocalize, control of fundamental frequency is a key element for effective communication. In humans, subglottal pressure controls vocal intensity but also influences fundamental frequency during phonation. Given the underlying similarities in the biomechanical mechanisms of vocalization in humans and songbirds, songbirds offer an attractive opportunity to study frequency modulation by pressure. Here, we present a novel technique for dynamic control of subsyringeal pressure in zebra finches. By regulating the opening of a custom-built fast valve connected to the air sac system, we achieved partial or total silencing of specific syllables, and could modify syllabic acoustics through more complex manipulations of air sac pressure. We also observed that more nuanced pressure variations over a limited interval during production of a syllable concomitantly affected the frequency of that syllable segment. These results can be explained in terms of a mathematical model for phonation that incorporates a nonlinear description for the vocal source capable of generating the observed frequency modulations induced by pressure variations. We conclude that the observed interaction between pressure and frequency was a feature of the source, not a result of feedback control. Our results indicate that, beyond regulating phonation or its absence, regulation of pressure is important for control of fundamental frequencies of vocalizations. Thus, although there are separate brainstem pathways for syringeal and respiratory control of song production, both can affect airflow and frequency. We hypothesize that the control of pressure and frequency is combined holistically at higher levels of the vocalization pathways.
Familiarity with a vocal category biases the compartmental expression of Arc/Arg3.1 in core auditory cortex.

PubMed

Ivanova, Tamara N; Gross, Christina; Mappus, Rudolph C; Kwon, Yong Jun; Bassell, Gary J; Liu, Robert C

2017-12-01

Learning to recognize a stimulus category requires experience with its many natural variations. However, the mechanisms that allow a category's sensorineural representation to be updated after experiencing new exemplars are not well understood, particularly at the molecular level. Here we investigate how a natural vocal category induces expression in the auditory system of a key synaptic plasticity effector immediate early gene, Arc/Arg3.1 , which is required for memory consolidation. We use the ultrasonic communication system between mouse pups and adult females to study whether prior familiarity with pup vocalizations alters how Arc is engaged in the core auditory cortex after playback of novel exemplars from the pup vocal category. A computerized, 3D surface-assisted cellular compartmental analysis, validated against manual cell counts, demonstrates significant changes in the recruitment of neurons expressing Arc in pup-experienced animals (mothers and virgin females "cocaring" for pups) compared with pup-inexperienced animals (pup-naïve virgins), especially when listening to more familiar, natural calls compared to less familiar but similarly recognized tonal model calls. Our data support the hypothesis that the kinetics of Arc induction to refine cortical representations of sensory categories is sensitive to the familiarity of the sensory experience. © 2017 Ivanova et al.; Published by Cold Spring Harbor Laboratory Press.
Selective impairment of song learning following lesions of a forebrain nucleus in the juvenile zebra finch.

PubMed

Sohrabji, F; Nordeen, E J; Nordeen, K W

1990-01-01

Area X, a large sexually dimorphic nucleus in the avian ventral forebrain, is part of a highly discrete system of interconnected nuclei that have been implicated in either song learning or adult song production. Previously, this nucleus has been included in the song system because of its substantial connections with other vocal control nuclei, and because its volume is positively correlated with the capacity for song. In order to directly assess the role of Area X in song behavior, this nucleus was bilaterally lesioned in both juvenile and adult zebra finches, using ibotenic acid. We report here that lesioning Area X disrupts normal song development in juvenile birds, but does not affect the production of stereotyped song by adult birds. Although juvenile-lesioned birds were consistently judged as being in earlier stages of vocal development than age-matched controls, they continued to produce normal song-like vocalizations. Thus, unlike the lateral magnocellular nucleus of the anterior neostriatum, another avian forebrain nucleus implicated in song learning, Area X does not seem to be necessary for sustaining production of juvenile song. Rather, the behavioral results suggest Area X is important for either the acquisition of a song model or the improvement of song through vocal practice.

Role of immediate recurrent laryngeal nerve reconstruction in surgery for thyroid cancers with fixed vocal cords.

PubMed

Iwaki, Shinobu; Maeda, Tatsuyoshi; Saito, Miki; Otsuki, Naoki; Takahashi, Miki; Wakui, Emi; Shinomiya, Hirotaka; Morimoto, Koichi; Inoue, Hiroyuki; Masuoka, Hiroo; Miyauchi, Akira; Nibu, Ken-Ichi

2017-03-01

Quality of voice after immediate recurrent laryngeal nerve (RLN) reconstruction in thyroid cancers has not been thoroughly studied. Thirteen patients with fixed vocal cords (fixed vocal cord group) and 8 patients with intact or impaired mobile vocal cords (mobile vocal cord group) who had immediate RLN reconstruction simultaneously with total thyroidectomy, and patients who had arytenoid adduction and thyroplasty for vocal cord paralysis caused by previous surgery (arytenoid adduction thyroplasty group) were enrolled in this study. Preoperative phonation efficiency index was significantly lower (p = .008) in the fixed vocal cord group than in the mobile vocal cord group. One year after surgery, all voice parameters of the patients in the fixed vocal cord group had improved, compared with their preoperative data. The fixed vocal cord group had attained satisfactory voice qualities equivalent to those of the mobile vocal cord group in terms of various voice parameters. The present results support the idea that immediate RLN reconstruction at the time of surgery for thyroid cancers may spare the need for subsequent arytenoid adduction thyroplasty even in the patients with preoperatively fixed vocal cords. © 2016 Wiley Periodicals, Inc. Head Neck 39: 427-431, 2017. © 2016 Wiley Periodicals, Inc.
Numerical Simulation of the Self-Oscillations of the Vocal Folds and of the Resulting Acoustic Phenomena in the Vocal Tract

NASA Astrophysics Data System (ADS)

Švancara, P.; Horáček, J.; Švec, J. G.

The study presents a three-dimensional (3D) finite element (FE) model of the flow-induced self-oscillation of the human vocal folds in interaction with acoustics of simplified vocal tract models. The 3D vocal tract models of the acoustic spaces shaped for simulation of phonation of Czech vowels [a:], [i:] and [u:] were created by converting the data from the magnetic resonance images (MRI). For modelling of the fluid-structure interaction, explicit coupling scheme with separated solvers for fluid and structure domain was utilized. The FE model comprises vocal folds pretension before starting phonation, large deformations of the vocal fold tissue, vocal-fold collisions, fluid-structure interaction, morphing the fluid mesh according to the vocal-fold motion (Arbitrary Lagrangian-Eulerian approach), unsteady viscous compressible airflow described by the Navier-Stokes equations and airflow separation. The developed FE model enables to study the relationship between flow-induced vibrations of the vocal folds and acoustic wave propagation in the vocal tract and can also be used to simulate for example pathological changes in the vocal fold tissue and their influence on the voice production.
The effect of voice amplification on occupational vocal dose in elementary school teachers.

PubMed

Gaskill, Christopher S; O'Brien, Shenendoah G; Tinter, Sara R

2012-09-01

Two elementary school teachers, one with and one without a history of vocal complaints, wore a vocal dosimeter all day at school for a 3-week period. In the second week, each teacher wore a portable voice amplifier. Each teacher showed a reduction in vocal intensity during the week of amplification, with a larger effect for the teacher with vocal difficulties. This teacher also showed a decrease in hourly vocal fold distance dose as measured by the dosimeter despite incurring longer phonation times. Fundamental frequency and vocal fold cycle dose did not appear to be affected by the use of amplification during the teaching day. Both teachers showed evidence of a possible moderate effect of adjusting vocal intensity in the week after amplification, possibly as a means to recalibrate their perceived vocal loudness. This study demonstrates the usefulness of both vocal dosimetry and amplification in monitoring and modifying vocal dose in an occupational setting and reinforces previous data suggesting the effectiveness of amplification in reducing the vocal load in schoolteachers. Implications of the data for future research regarding prevention and treatment of occupational voice disorders are discussed. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Responses of primate frontal cortex neurons during natural vocal communication.

PubMed

Miller, Cory T; Thomas, A Wren; Nummela, Samuel U; de la Mothe, Lisa A

2015-08-01

The role of primate frontal cortex in vocal communication and its significance in language evolution have a controversial history. While evidence indicates that vocalization processing occurs in ventrolateral prefrontal cortex neurons, vocal-motor activity has been conjectured to be primarily subcortical and suggestive of a distinctly different neural architecture from humans. Direct evidence of neural activity during natural vocal communication is limited, as previous studies were performed in chair-restrained animals. Here we recorded the activity of single neurons across multiple regions of prefrontal and premotor cortex while freely moving marmosets engaged in a natural vocal behavior known as antiphonal calling. Our aim was to test whether neurons in marmoset frontal cortex exhibited responses during vocal-signal processing and/or vocal-motor production in the context of active, natural communication. We observed motor-related changes in single neuron activity during vocal production, but relatively weak sensory responses for vocalization processing during this natural behavior. Vocal-motor responses occurred both prior to and during call production and were typically coupled to the timing of each vocalization pulse. Despite the relatively weak sensory responses a population classifier was able to distinguish between neural activity that occurred during presentations of vocalization stimuli that elicited an antiphonal response and those that did not. These findings are suggestive of the role that nonhuman primate frontal cortex neurons play in natural communication and provide an important foundation for more explicit tests of the functional contributions of these neocortical areas during vocal behaviors. Copyright © 2015 the American Physiological Society.
Responses of primate frontal cortex neurons during natural vocal communication

PubMed Central

Thomas, A. Wren; Nummela, Samuel U.; de la Mothe, Lisa A.

2015-01-01

The role of primate frontal cortex in vocal communication and its significance in language evolution have a controversial history. While evidence indicates that vocalization processing occurs in ventrolateral prefrontal cortex neurons, vocal-motor activity has been conjectured to be primarily subcortical and suggestive of a distinctly different neural architecture from humans. Direct evidence of neural activity during natural vocal communication is limited, as previous studies were performed in chair-restrained animals. Here we recorded the activity of single neurons across multiple regions of prefrontal and premotor cortex while freely moving marmosets engaged in a natural vocal behavior known as antiphonal calling. Our aim was to test whether neurons in marmoset frontal cortex exhibited responses during vocal-signal processing and/or vocal-motor production in the context of active, natural communication. We observed motor-related changes in single neuron activity during vocal production, but relatively weak sensory responses for vocalization processing during this natural behavior. Vocal-motor responses occurred both prior to and during call production and were typically coupled to the timing of each vocalization pulse. Despite the relatively weak sensory responses a population classifier was able to distinguish between neural activity that occurred during presentations of vocalization stimuli that elicited an antiphonal response and those that did not. These findings are suggestive of the role that nonhuman primate frontal cortex neurons play in natural communication and provide an important foundation for more explicit tests of the functional contributions of these neocortical areas during vocal behaviors. PMID:26084912
How small could a pup sound? The physical bases of signaling body size in harbor seals

PubMed Central

Gross, Stephanie; Garcia, Maxime; Rubio-Garcia, Ana; de Boer, Bart

2017-01-01

Abstract Vocal communication is a crucial aspect of animal behavior. The mechanism which most mammals use to vocalize relies on three anatomical components. First, air overpressure is generated inside the lower vocal tract. Second, as the airstream goes through the glottis, sound is produced via vocal fold vibration. Third, this sound is further filtered by the geometry and length of the upper vocal tract. Evidence from mammalian anatomy and bioacoustics suggests that some of these three components may covary with an animal’s body size. The framework provided by acoustic allometry suggests that, because vocal tract length (VTL) is more strongly constrained by the growth of the body than vocal fold length (VFL), VTL generates more reliable acoustic cues to an animal’s size. This hypothesis is often tested acoustically but rarely anatomically, especially in pinnipeds. Here, we test the anatomical bases of the acoustic allometry hypothesis in harbor seal pups Phoca vitulina. We dissected and measured vocal tract, vocal folds, and other anatomical features of 15 harbor seals post-mortem. We found that, while VTL correlates with body size, VFL does not. This suggests that, while body growth puts anatomical constraints on how vocalizations are filtered by harbor seals’ vocal tract, no such constraints appear to exist on vocal folds, at least during puppyhood. It is particularly interesting to find anatomical constraints on harbor seals’ vocal tracts, the same anatomical region partially enabling pups to produce individually distinctive vocalizations. PMID:29492005
Distinct Neural Activities in Premotor Cortex during Natural Vocal Behaviors in a New World Primate, the Common Marmoset (Callithrix jacchus).

PubMed

Roy, Sabyasachi; Zhao, Lingyun; Wang, Xiaoqin

2016-11-30

Although evidence from human studies has long indicated the crucial role of the frontal cortex in speech production, it has remained uncertain whether the frontal cortex in nonhuman primates plays a similar role in vocal communication. Previous studies of prefrontal and premotor cortices of macaque monkeys have found neural signals associated with cue- and reward-conditioned vocal production, but not with self-initiated or spontaneous vocalizations (Coudé et al., 2011; Hage and Nieder, 2013), which casts doubt on the role of the frontal cortex of the Old World monkeys in vocal communication. A recent study of marmoset frontal cortex observed modulated neural activities associated with self-initiated vocal production (Miller et al., 2015), but it did not delineate whether these neural activities were specifically attributed to vocal production or if they may result from other nonvocal motor activity such as orofacial motor movement. In the present study, we attempted to resolve these issues and examined single neuron activities in premotor cortex during natural vocal exchanges in the common marmoset (Callithrix jacchus), a highly vocal New World primate. Neural activation and suppression were observed both before and during self-initiated vocal production. Furthermore, by comparing neural activities between self-initiated vocal production and nonvocal orofacial motor movement, we identified a subpopulation of neurons in marmoset premotor cortex that was activated or suppressed by vocal production, but not by orofacial movement. These findings provide clear evidence of the premotor cortex's involvement in self-initiated vocal production in natural vocal behaviors of a New World primate. Human frontal cortex plays a crucial role in speech production. However, it has remained unclear whether the frontal cortex of nonhuman primates is involved in the production of self-initiated vocalizations during natural vocal communication. Using a wireless multichannel neural recording technique, we observed in the premotor cortex neural activation and suppression both before and during self-initiated vocalizations when marmosets, a highly vocal New World primate species, engaged in vocal exchanges with conspecifics. A novel finding of the present study is the discovery of a subpopulation of premotor cortex neurons that was activated by vocal production, but not by orofacial movement. These observations provide clear evidence of the premotor cortex's involvement in vocal production in a New World primate species. Copyright © 2016 the authors 0270-6474/16/3612168-12$15.00/0.
Phase-Specific Vocalizations of Male Mice at the Initial Encounter during the Courtship Sequence

PubMed Central

Matsumoto, Yui K.; Okanoya, Kazuo

2016-01-01

Mice produce ultrasonic vocalizations featuring a variety of syllables. Vocalizations are observed during social interactions. In particular, males produce numerous syllables during courtship. Previous studies have shown that vocalizations change according to sexual behavior, suggesting that males vary their vocalizations depending on the phase of the courtship sequence. To examine this process, we recorded large sets of mouse vocalizations during male–female interactions and acoustically categorized these sounds into 12 vocal types. We found that males emitted predominantly short syllables during the first minute of interaction, more long syllables in the later phases, and mainly harmonic sounds during mounting. These context- and time-dependent changes in vocalization indicate that vocal communication during courtship in mice consists of at least three stages and imply that each vocalization type has a specific role in a phase of the courtship sequence. Our findings suggest that recording for a sufficiently long time and taking the phase of courtship into consideration could provide more insights into the role of vocalization in mouse courtship behavior in future study. PMID:26841117
Quantifying vocal fatigue recovery: Dynamic vocal recovery trajectories after a vocal loading exercise

PubMed Central

Hunter, Eric J.; Titze, Ingo R.

2012-01-01

Objectives To quantify the recovery of voice following a 2-hour vocal loading exercise (oral reading). Methods 86 adult participants tracked their voice recovery using short vocal tasks and perceptual ratings after an initial vocal loading exercise and for the following two days. Results Short-term recovery was apparent with 90% recovery within 4-6 hours and full recovery at 12-18 hours. Recovery was shown to be similar to a dermal wound healing trajectory. Conclusions The new recovery trajectory highlighted by the vocal loading exercise in the current study is called a vocal recovery trajectory. By comparing vocal fatigue to dermal wound healing, this trajectory is parallel to a chronic wound healing trajectory (as opposed to an acute wound healing trajectory). This parallel suggests that vocal fatigue from the daily use of the voice could be treated as a chronic wound, with the healing and repair mechanisms in a state of constant repair. In addition, there is likely a vocal fatigue threshold at which point the level of tissue damage would shift the chronic healing trajectory to an acute healing trajectory. PMID:19663377
Viscoelastic properties of rabbit vocal folds after augmentation.

PubMed

Hertegård, Stellan; Dahlqvist, Ake; Laurent, Claude; Borzacchiello, Assunta; Ambrosio, Luigi

2003-03-01

Vocal fold function is closely related to tissue viscoelasticity. Augmentation substances may alter the viscoelastic properties of vocal fold tissues and hence their vibratory capacity. We sought to investigate the viscoelastic properties of rabbit vocal folds in vitro after injections of various augmentation substances. Polytetrafluoroethylene (Teflon), cross-linked collagen (Zyplast), and cross-linked hyaluronan, hylan b gel (Hylaform) were injected into the lamina propria and the thyroarytenoid muscle of rabbit vocal folds. Dynamic viscosity of the injected vocal fold as a function of frequency was measured with a Bohlin parallel-plate rheometer during small-amplitude oscillation. All injected vocal folds showed a decreasing dynamic viscosity with increasing frequency. Vocal fold samples injected with hylan b gel showed the lowest dynamic viscosity, quite close to noninjected control samples. Vocal folds injected with polytetrafluoroethylene showed the highest dynamic viscosity followed by the collagen samples. The data indicated that hylan b gel in short-term renders the most natural viscoelastic properties to the vocal fold among the substances tested. This is of importance to restore/preserve the vibratory capacity of the vocal folds when glottal insufficiency is treated with injections.
On the role of the reticular formation in vocal pattern generation.

PubMed

Jürgens, Uwe; Hage, Steffen R

2007-09-04

This review is an attempt to localize the brain region responsible for pattern generation of species-specific vocalizations. A catalogue is set up, listing the criteria considered to be essential for a vocal pattern generator. According to this catalogue, a vocal pattern generator should show vocalization-correlated activity, starting before vocal onset and reflecting specific acoustic features of the vocalization. Artificial activation by electrical or glutamatergic stimulation should produce artificially sounding vocalization. Lesioning is expected to have an inhibitory or deteriorating effect on vocalization. Anatomically, a vocal pattern generator can be assumed to have direct or, at least, oligosynaptic connections with all the motoneuron pools involved in phonation. A survey of the literature reveals that the only area meeting all these criteria is a region, reaching from the parvocellular pontine reticular formation just above the superior olive through the lateral reticular formation around the facial nucleus and nucleus ambiguus down to the caudalmost medulla, including the dorsal and ventral reticular nuclei and nucleus retroambiguus. It is proposed that vocal pattern generation takes place within this whole region.
Preliminary experiments to quantify liquid movement under mimetic vocal fold vibrational forces.

PubMed

Titze, Ingo R; Klemuk, Sarah; Lu, Xiaoying

2014-07-01

Hydration of vocal fold tissues is essential for self-sustained oscillation. Normal regulatory processes of liquid transport to and from the vocal folds would be expected through the autonomic systems, but the possibility exists that liquid movement may occur locally due to vibrational pressures. Such movement may cause regions of lower or higher concentrations of liquid viscosity and therewith changes in phonation threshold pressure. Hyaluronic acid, a glycosaminoglycan that attracts large quantities of free water, may be a key molecule for transporting or localizing liquids. Some preliminary experiments are reported in which attempts were made to move low-concentration HA liquids with vibration. None of the experiments was conclusive, but collectively they lay some groundwork for future explorations.
[The autoimmune rheumatic disease and laryngeal pathology].

PubMed

Osipenko, E V; Kotel'nikova, N M

Vocal disorders make up one of the autoimmune pathological conditions characterized by multiple organ system dysfunction. Laryngeal pathology in this condition has an autoimmune nature; it is highly diverse and poorly explored. The objective of the present work based on the analysis of the relevant literature publications was to study clinical manifestations of the autoimmune rheumatic disease affecting the larynx. 'Bamboo nodes' on the vocal folds is a rare manifestation of laryngeal autoimmune diseases. We found out references to 49 cases of this condition in the available literature. All the patients were women presenting with autoimmune diseases. The present review highlights the problems pertaining to etiology of 'bamboo nodes' on the vocal folds and the method for the treatment of this condition.
Relation of structural and vibratory kinematics of the vocal folds to two acoustic measures of breathy voice based on computational modeling

PubMed Central

Samlan, Robin A.; Story, Brad H.

2011-01-01

Purpose To relate vocal fold structure and kinematics to two acoustic measures: cepstral peak prominence (CPP) and the amplitude of the first harmonic relative to the second (H1-H2). Method A computational, kinematic model of the medial surfaces of the vocal folds was used to specify features of vocal fold structure and vibration in a manner consistent with breathy voice. Four model parameters were altered: degree of vocal fold adduction, surface bulging, vibratory nodal point, and supraglottal constriction. CPP and H1-H2 were measured from simulated glottal area, glottal flow and acoustic waveforms and related to the underlying vocal fold kinematics. Results CPP decreased with increased separation of the vocal processes, whereas the nodal point location had little effect. H1-H2 increased as a function of separation of the vocal processes in the range of 1–1.5 mm and decreased with separation > 1.5 mm. Conclusions CPP is generally a function of vocal process separation. H1*-H2* will increase or decrease with vocal process separation based on vocal fold shape, pivot point for the rotational mode, and supraglottal vocal tract shape, limiting its utility as an indicator of breathy voice. Future work will relate the perception of breathiness to vocal fold kinematics and acoustic measures. PMID:21498582
An End-User Participatory Approach to Collaboratively Refine HIV Care Data, The New York State Experience.

PubMed

Swain, Carol-Ann; Sawicki, Steven; Addison, Diane; Katz, Benjamin; Piersanti, Kelly; Baim-Lance, Abigail; Gordon, Daniel; Anderson, Bridget J; Nash, Denis; Steinbock, Clemens; Agins, Bruce

2018-04-02

Existing data dissemination structures primarily rely on top-down approaches. Unless designed with the end user in mind, this may impair data-driven clinical improvements to Human Immunodeficiency Virus (HIV) prevention and care. In this study, we implemented a data visualization activity to create region-specific data presentations collaboratively with HIV providers, consumers of HIV care, and New York State (NYS) Department of Health AIDS Institute staff for use in local HIV care decision-making. Data from the NYS HIV Surveillance Registry (2009-2013) and HIV care facilities (2010-2015) participating in a Health Resources and Services Administration (HRSA) Systems Linkages and Access to Care project were used. Each data package incorporated visuals for: linkage to HIV care, retention in care and HIV viral suppression. End-users were vocal about their data needs and their capacity to interpret public health data. This experience suggests that data dissemination strategies should incorporate input from the end user to improve comprehension and optimize HIV care.
A case of bilateral vocal fold mucosal bridges, bilateral trans-vocal fold type III sulci vocales, and an intracordal polyp.

PubMed

Tan, Melin; Pitman, Michael J

2011-07-01

We present a patient with a novel finding of bilateral mucosal bridges, bilateral type III trans-vocal fold sulci vocales, and a vocal fold polyp. Although sulci and mucosal bridges occur in the vocal folds, it is rare to find multiples of these lesions in a single patient, and it is even more uncommon when they occur in conjunction with a vocal fold polyp. To our knowledge, this is the first description of a vocal fold polyp in combination with multiple vocal fold bridges and multiple type III sulci vocales in a single patient. To describe and visually present the diagnosis and treatment of a patient with an intracordal polyp, bilateral mucosal bridges, as well as bilateral type III trans-vocal fold sulci vocales. Presentation of a set of high definition intraoperative photos displaying the extent of the vocal fold lesions and the resection of the intracordal polyp. This patient presented with only 6 months of significant dysphonia. It was felt that the recent change in voice was because of the polyp and not the bridges or sulci vocales. Considering the patient's presentation and the possible morbidity of resection of mucosal bridges and sulci, only the polyp was excised. Postoperatively, the patient's voice returned to his acceptable mild baseline dysphonia, and the benefit has persisted 6 months postoperatively. The combination of bilateral mucosal bridges, bilateral type III sulcus vocalis, and an intracordal polyp in one patient is rare if not novel. Treatment of the polyp alone returned the patient's voice to his lifelong baseline of mild dysphonia. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Temporal Lobe Epilepsy Alters Auditory-motor Integration For Voice Control

PubMed Central

Li, Weifeng; Chen, Ziyi; Yan, Nan; Jones, Jeffery A.; Guo, Zhiqiang; Huang, Xiyan; Chen, Shaozhen; Liu, Peng; Liu, Hanjun

2016-01-01

Temporal lobe epilepsy (TLE) is the most common drug-refractory focal epilepsy in adults. Previous research has shown that patients with TLE exhibit decreased performance in listening to speech sounds and deficits in the cortical processing of auditory information. Whether TLE compromises auditory-motor integration for voice control, however, remains largely unknown. To address this question, event-related potentials (ERPs) and vocal responses to vocal pitch errors (1/2 or 2 semitones upward) heard in auditory feedback were compared across 28 patients with TLE and 28 healthy controls. Patients with TLE produced significantly larger vocal responses but smaller P2 responses than healthy controls. Moreover, patients with TLE exhibited a positive correlation between vocal response magnitude and baseline voice variability and a negative correlation between P2 amplitude and disease duration. Graphical network analyses revealed a disrupted neuronal network for patients with TLE with a significant increase of clustering coefficients and path lengths as compared to healthy controls. These findings provide strong evidence that TLE is associated with an atypical integration of the auditory and motor systems for vocal pitch regulation, and that the functional networks that support the auditory-motor processing of pitch feedback errors differ between patients with TLE and healthy controls. PMID:27356768
Campbell's monkeys concatenate vocalizations into context-specific call sequences

PubMed Central

Ouattara, Karim; Lemasson, Alban; Zuberbühler, Klaus

2009-01-01

Primate vocal behavior is often considered irrelevant in modeling human language evolution, mainly because of the caller's limited vocal control and apparent lack of intentional signaling. Here, we present the results of a long-term study on Campbell's monkeys, which has revealed an unrivaled degree of vocal complexity. Adult males produced six different loud call types, which they combined into various sequences in highly context-specific ways. We found stereotyped sequences that were strongly associated with cohesion and travel, falling trees, neighboring groups, nonpredatory animals, unspecific predatory threat, and specific predator classes. Within the responses to predators, we found that crowned eagles triggered four and leopards three different sequences, depending on how the caller learned about their presence. Callers followed a number of principles when concatenating sequences, such as nonrandom transition probabilities of call types, addition of specific calls into an existing sequence to form a different one, or recombination of two sequences to form a third one. We conclude that these primates have overcome some of the constraints of limited vocal control by combinatorial organization. As the different sequences were so tightly linked to specific external events, the Campbell's monkey call system may be the most complex example of ‘proto-syntax’ in animal communication known to date. PMID:20007377
Factors associated with voice therapy outcomes in the treatment of presbyphonia.

PubMed

Mau, Ted; Jacobson, Barbara H; Garrett, C Gaelyn

2010-06-01

Age, vocal fold atrophy, glottic closure pattern, and the burden of medical problems are associated with voice therapy outcomes for presbyphonia. Retrospective. Records of patients seen over a 3-year period at a voice center were screened. Inclusion criteria consisted of age over 55 years, primary complaint of hoarseness, presence of vocal fold atrophy on examination, and absence of laryngeal or neurological pathology. Videostroboscopic examinations on initial presentation were reviewed. Voice therapy outcomes were assessed with the American Speech-Language-Hearing Association National Outcomes Measurement System scale. Statistical analysis was performed with Spearman rank correlation and chi(2) tests. Sixty-seven patients were included in the study. Of the patients, 85% demonstrated improvement with voice therapy. The most common type of glottic closure consisted of a slit gap. Gender or age had no effect on voice therapy outcomes. Larger glottic gaps on initial stroboscopy examination and more pronounced vocal fold atrophy were weakly correlated with less improvement from voice therapy. A weak correlation was also found between the number of chronic medical conditions and poorer outcomes from voice therapy. The degree of clinician-determined improvement in vocal function from voice therapy is independent of patient age but is influenced by the degree of vocal fold atrophy, glottic closure pattern, and the patient's burden of medical problems.
Vocal Tones Influence Young Children’s Responses to Prohibitions

PubMed Central

Dahl, Audun; Tran, Amy Q.

2016-01-01

Vocal reactions to child transgressions convey information about the nature of those transgressions. The present research investigated children’s ability to make use of such vocal reactions. Study 1 investigated infants’ compliance with a vocal prohibition telling them to stay away from a toy. Compared to younger infants, older infants showed greater compliance with prohibitions elicited by moral (interpersonal harm) transgressions, but not with prohibitions elicited by pragmatic (inconvenience) transgressions. Study 2 investigated preschoolers’ use of firm-stern vocalizations (associated with moral transgressions) and positive vocalizations (associated with pragmatic transgressions). Most children guessed that the firm-stern vocalizations were uttered in response to a moral transgression and the positive vocalization were uttered in response to a pragmatic transgression. These two studies suggest that children use vocal tones, along with other experiences, to guide their compliance with and interpretation of prohibitions. PMID:27518810

High-speed imaging of vocal fold vibrations and larynx movements within vocalizations of different vowels.

PubMed

Maurer, D; Hess, M; Gross, M

1996-12-01

Theoretic investigations of the "source-filter" model have indicated a pronounced acoustic interaction of glottal source and vocal tract. Empirical investigations of formant pattern variations apart from changes in vowel identity have demonstrated a direct relationship between the fundamental frequency and the patterns. As a consequence of both findings, independence of phonation and articulation may be limited in the speech process. Within the present study, possible interdependence of phonation and phoneme was investigated: vocal fold vibrations and larynx position for vocalizations of different vowels in a healthy man and woman were examined by high-speed light-intensified digital imaging. We found 1) different movements of the vocal folds for vocalizations of different vowel identities within one speaker and at similar fundamental frequency, and 2) constant larynx position within vocalization of one vowel identity, but different positions for vocalizations of different vowel identities. A possible relationship between the vocal fold vibrations and the phoneme is discussed.
An annotated dataset of Egyptian fruit bat vocalizations across varying contexts and during vocal ontogeny.

PubMed

Prat, Yosef; Taub, Mor; Pratt, Ester; Yovel, Yossi

2017-10-03

Animal acoustic communication research depends on our ability to record the vocal behaviour of different species. Only rarely do we have the opportunity to continuously follow the vocal behaviour of a group of individuals of the same species for a long period of time. Here, we provide a database of Egyptian fruit bat vocalizations, which were continuously recorded in the lab in several groups simultaneously for more than a year. The dataset includes almost 300,000 files, a few seconds each, containing social vocalizations and representing the complete vocal repertoire used by the bats in the experiment period. Around 90,000 files are annotated with details about the individuals involved in the vocal interactions, their behaviours and the context. Moreover, the data include the complete vocal ontogeny of pups, from birth to adulthood, in different conditions (e.g., isolated or in a group). We hope that this comprehensive database will stimulate studies that will enhance our understanding of bat, and mammal, social vocal communication.
The Risk of Vocal Fold Atrophy after Serial Corticosteroid Injections of the Vocal Fold.

PubMed

Shi, Lucy L; Giraldez-Rodriguez, Laureano A; Johns, Michael M

2016-11-01

The aim of this study was to illustrate the risk of vocal fold atrophy in patients who receive serial subepithelial steroid injections for vocal fold scar. This study is a retrospective case report of two patients who underwent a series of weekly subepithelial infusions of 10 mg/mL dexamethasone for benign vocal fold lesion. Shortly after the procedures, both patients developed a weak and breathy voice. The first patient was a 53-year-old man with radiation-induced vocal fold stiffness. Six injections were performed unilaterally, and 1 week later, he developed unilateral vocal fold atrophy with new glottal insufficiency. The second patient was a 67-year-old woman with severe vocal fold inflammation related to laryngitis and calcinosis, Raynaud's phenomenon, esophagean dysmotility, sclerodactyly, and telangiectasia (CREST) syndrome. Five injections were performed bilaterally, and 1 week later, she developed bilateral vocal fold atrophy with a large midline glottal gap during phonation. In both cases, the steroid-induced vocal atrophy resolved spontaneously after 4 months. Serial subepithelial steroid infusions of the vocal folds, although safe in the majority of patients, carry the risk of causing temporary vocal fold atrophy when given at short intervals. Copyright Â© 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Predicting Achievable Fundamental Frequency Ranges in Vocalization Across Species

PubMed Central

Titze, Ingo; Riede, Tobias; Mau, Ted

2016-01-01

Vocal folds are used as sound sources in various species, but it is unknown how vocal fold morphologies are optimized for different acoustic objectives. Here we identify two main variables affecting range of vocal fold vibration frequency, namely vocal fold elongation and tissue fiber stress. A simple vibrating string model is used to predict fundamental frequency ranges across species of different vocal fold sizes. While average fundamental frequency is predominantly determined by vocal fold length (larynx size), range of fundamental frequency is facilitated by (1) laryngeal muscles that control elongation and by (2) nonlinearity in tissue fiber tension. One adaptation that would increase fundamental frequency range is greater freedom in joint rotation or gliding of two cartilages (thyroid and cricoid), so that vocal fold length change is maximized. Alternatively, tissue layers can develop to bear a disproportionate fiber tension (i.e., a ligament with high density collagen fibers), increasing the fundamental frequency range and thereby vocal versatility. The range of fundamental frequency across species is thus not simply one-dimensional, but can be conceptualized as the dependent variable in a multi-dimensional morphospace. In humans, this could allow for variations that could be clinically important for voice therapy and vocal fold repair. Alternative solutions could also have importance in vocal training for singing and other highly-skilled vocalizations. PMID:27309543
Histopathologic investigations of the unphonated human child vocal fold mucosa.

PubMed

Sato, Kiminori; Umeno, Hirohito; Nakashima, Tadashi; Nonaka, Satoshi; Harabuchi, Yasuaki

2012-01-01

Vocal fold stellate cells (VFSCs) in the maculae flavae (MFe) located at both ends of the vocal fold mucosa are inferred to be involved in the metabolism of extracellular matrices. MFe are also considered to be an important structure in the growth and development of the human vocal fold mucosa. Tension caused by phonation (vocal fold vibration) is hypothesized to stimulate VFSCs to accelerate production of extracellular matrices. Human child vocal fold mucosae unphonated since birth were investigated histologically. Histologic analysis of human child vocal fold mucosa. Vocal fold mucosae, which have remained unphonated since birth, of two children (7 and 12 years old) with cerebral palsy were investigated by light and electron microscopy and compared with normal subjects. Vocal fold mucosae and MFe were hypoplastic and rudimentary and did not have a vocal ligament, Reinke's space, or the layered structure. The lamina propria appeared as a uniform structure. Some VFSCs in the MFe showed degeneration and not many vesicles were present at the periphery of the cytoplasm. The VFSCs synthesized fewer extracellular matrices, such as fibrous protein and glycosaminoglycan. The VFSCs appeared to have decreased activity. Vocal fold vibration (phonation) after birth is an important factor in the growth and development of the human vocal fold mucosa. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Central pattern generator for vocalization: is there a vertebrate morphotype?

PubMed

Bass, Andrew H

2014-10-01

Animals that generate acoustic signals for social communication are faced with two essential tasks: generate a temporally precise signal and inform the auditory system about the occurrence of one's own sonic signal. Recent studies of sound producing fishes delineate a hindbrain network comprised of anatomically distinct compartments coding equally distinct neurophysiological properties that allow an organism to meet these behavioral demands. A set of neural characters comprising a vocal-sonic central pattern generator (CPG) morphotype is proposed for fishes and tetrapods that shares evolutionary developmental origins with pectoral appendage motor systems. Copyright © 2014 Elsevier Ltd. All rights reserved.
Central pattern generator for vocalization: Is there a vertebrate morphotype?

PubMed Central

Bass, Andrew H.

2014-01-01

Animals that generate acoustic signals for social communication are faced with two essential tasks: generate a temporally precise signal and inform the auditory system about the occurrence of one’s own sonic signal. Recent studies of sound producing fishes delineate a hindbrain network comprised of anatomically distinct compartments coding equally distinct neurophysiological properties that allow an organism to meet these behavioral demands. A set of neural characters comprising a vocal-sonic central pattern generator (CPG) morphotype is proposed for fishes and tetrapods that shares evolutionary developmental origins with pectoral appendage motor systems. PMID:25050813
Viscoelasticity of rabbit vocal folds after injection augmentation.

PubMed

Dahlqvist, Ake; Gärskog, Ola; Laurent, Claude; Hertegård, Stellan; Ambrosio, Luigi; Borzacchiello, Assunta

2004-01-01

Vocal fold function is related to the viscoelasticity of the vocal fold tissue. Augmentation substances used for injection treatment of voice insufficiency may alter the viscoelastic properties of vocal folds and their vibratory capacity. The objective was to compare the mechanical properties (viscoelasticity) of various injectable substances and the viscoelasticity of rabbit vocal folds, 6 months after injection with one of these substances. Animal model. Cross-linked collagen (Zyplast), double cross-linked hyaluronan (hylan B gel), dextranomers in hyaluronan (DHIA), and polytetrafluoroethylene (Teflon) were injected into rabbit vocal folds. Six months after the injection, the animals were killed and the right- and left-side vocal folds were removed. Dynamic viscosity of the injected substances and the vocal folds was measured with a Bohlin parallel-plate rheometer during small-amplitude oscillation. All injected vocal folds showed a decreasing dynamic viscosity with increasing frequency. Hylan B gel and DiHA showed the lowest dynamic viscosity values, and vocal folds injected with these substances also showed the lowest dynamic viscosity (similar to noninjected control samples). Teflon (and vocal folds injected with Teflon) showed the highest dynamic viscosity values, followed by the collagen samples. Substances with low viscoelasticity alter the mechanical properties of the vocal fold to a lesser degree than substances with a high viscoelasticity. The data indicated that hylan B gel and DiHA render the most natural viscoelastic properties to the vocal folds. These substances seem to be appropriate for preserving or restoring the vibratory capacity of the vocal folds when glottal insufficiency is treated with augmentative injections.
Neural Representation of a Target Auditory Memory in a Cortico-Basal Ganglia Pathway

PubMed Central

Bottjer, Sarah W.

2013-01-01

Vocal learning in songbirds, like speech acquisition in humans, entails a period of sensorimotor integration during which vocalizations are evaluated via auditory feedback and progressively refined to achieve an imitation of memorized vocal sounds. This process requires the brain to compare feedback of current vocal behavior to a memory of target vocal sounds. We report the discovery of two distinct populations of neurons in a cortico-basal ganglia circuit of juvenile songbirds (zebra finches, Taeniopygia guttata) during vocal learning: (1) one in which neurons are selectively tuned to memorized sounds and (2) another in which neurons are selectively tuned to self-produced vocalizations. These results suggest that neurons tuned to learned vocal sounds encode a memory of those target sounds, whereas neurons tuned to self-produced vocalizations encode a representation of current vocal sounds. The presence of neurons tuned to memorized sounds is limited to early stages of sensorimotor integration: after learning, the incidence of neurons encoding memorized vocal sounds was greatly diminished. In contrast to this circuit, neurons known to drive vocal behavior through a parallel cortico-basal ganglia pathway show little selective tuning until late in learning. One interpretation of these data is that representations of current and target vocal sounds in the shell circuit are used to compare ongoing patterns of vocal feedback to memorized sounds, whereas the parallel core circuit has a motor-related role in learning. Such a functional subdivision is similar to mammalian cortico-basal ganglia pathways in which associative-limbic circuits mediate goal-directed responses, whereas sensorimotor circuits support motor aspects of learning. PMID:24005299
Passive Localization of Multiple Sources Using Widely-Spaced Arrays with Application to Marine Mammals

DTIC Science & Technology

2007-09-30

the behavioral ecology of marine mammals by simultaneously tracking multiple vocalizing individuals in space and time. OBJECTIVES The ...goal is to contribute to the behavioral ecology of marine mammals by simultaneously tracking multiple vocalizing individuals in space and time. 15...OA Graduate Traineeship for E-M Nosal) LONG-TERM GOALS The goal of our research is to develop systems that use a widely spaced hydrophone array
Vocal Tremor Analysis with the Vocal Demodulator.

ERIC Educational Resources Information Center

Winholtz, William S.; Ramig, Lorraine Olson

1992-01-01

This paper describes the Vocal Demodulator as a new device for analysis of vocal tremor. The Vocal Demodulator produces amplitude-demodulated and frequency-demodulated outputs and measures the frequency and level of low-frequency tremor components in sustained phonation. The paper describes quantification of the demodulation process, validation…
Singing voice detection for karaoke application

NASA Astrophysics Data System (ADS)

Shenoy, Arun; Wu, Yuansheng; Wang, Ye

2005-07-01

We present a framework to detect the regions of singing voice in musical audio signals. This work is oriented towards the development of a robust transcriber of lyrics for karaoke applications. The technique leverages on a combination of low-level audio features and higher level musical knowledge of rhythm and tonality. Musical knowledge of the key is used to create a song-specific filterbank to attenuate the presence of the pitched musical instruments. This is followed by subband processing of the audio to detect the musical octaves in which the vocals are present. Text processing is employed to approximate the duration of the sung passages using freely available lyrics. This is used to obtain a dynamic threshold for vocal/ non-vocal segmentation. This pairing of audio and text processing helps create a more accurate system. Experimental evaluation on a small database of popular songs shows the validity of the proposed approach. Holistic and per-component evaluation of the system is conducted and various improvements are discussed.
Pedagogical efficiency of melodic contour mapping technology as it relates to vocal timbre in singers of classical music repertoire.

PubMed

Barnes-Burroughs, Kathryn; Anderson, Edward E; Hughes, Thomas; Lan, William Y; Dent, Karl; Arnold, Sue; Dolter, Gerald; McNeil, Kathy

2007-11-01

The purpose of this investigation was to ascertain the pedagogical viability of computer-generated melodic contour mapping systems in the classical singing studio, as perceived by their resulting effect (if any) on vocal timbre when a singer's head and neck remained in a normal singing posture. The evaluation of data gathered during the course of the study indicates that the development of consistent vocal timbre produced by the classical singing student may be enhanced through visual/kinesthetic response to melodic contour inversion mapping, as it balances the singer's perception of melodic intervals in standard musical notation. Unexpectedly, it was discovered that the system, in its natural melodic contour mode, may also be useful for teaching a student to sing a consistent legato line. The results of the study also suggest that the continued development of this new technology for the general teaching studio, designed to address standard musical notation and a singer's visual/kinesthetic response to it, may indeed be useful.
Dysphonia and vocal fold telangiectasia in hereditary hemorrhagic telangiectasia.

PubMed

Chang, Joseph; Yung, Katherine C

2014-11-01

This case report is the first documentation of dysphonia and vocal fold telangiectasia as a complication of hereditary hemorrhagic telangiectasia (HHT). Case report of a 40-year-old man with HHT presenting with 2 years of worsening hoarseness. Hoarseness corresponded with a period of anticoagulation. Endoscopy revealed vocal fold scarring, vocal fold telangiectasias, and plica ventricular is suggestive of previous submucosal vocal fold hemorrhage and subsequent counterproductive compensation with ventricular phonation. Hereditary hemorrhagic telangiectasia may present as dysphonia with vocal fold telangiectasias and place patients at risk of vocal fold hemorrhage. © The Author(s) 2014.
Vocal fry may undermine the success of young women in the labor market.

PubMed

Anderson, Rindy C; Klofstad, Casey A; Mayew, William J; Venkatachalam, Mohan

2014-01-01

Vocal fry is speech that is low pitched and creaky sounding, and is increasingly common among young American females. Some argue that vocal fry enhances speaker labor market perceptions while others argue that vocal fry is perceived negatively and can damage job prospects. In a large national sample of American adults we find that vocal fry is interpreted negatively. Relative to a normal speaking voice, young adult female voices exhibiting vocal fry are perceived as less competent, less educated, less trustworthy, less attractive, and less hirable. The negative perceptions of vocal fry are stronger for female voices relative to male voices. These results suggest that young American females should avoid using vocal fry speech in order to maximize labor market opportunities.
Correlation between vocal tract symptoms and modern singing handicap index in church gospel singers.

PubMed

Pinheiro, Joel; Silverio, Kelly Cristina Alves; Siqueira, Larissa Thaís Donalonso; Ramos, Janine Santos; Brasolotto, Alcione Ghedini; Zambon, Fabiana; Behlau, Mara

2017-08-24

To verify the correlation between vocal tract discomfort symptoms and perceived voice handicaps in gospel singers, analyzing possible differences according to gender. 100 gospel singers volunteered, 50 male and 50 female. All participants answered two questionnaires: Vocal Tract Discomfort (VTD) scale and the Modern Singing Handicap Index (MSHI) that investigates the vocal handicap perceived by singers, linking the results of both instruments (p<0.05). Women presented more perceived handicaps and also more frequent and higher intensity vocal tract discomfort. Furthermore, the more frequent and intense the vocal tract symptoms, the higher the vocal handicap for singing. Female gospel singers present higher frequency and intensity of vocal tract discomfort symptoms, as well as higher voice handicap for singing than male gospel singers. The higher the frequency and intensity of the laryngeal symptoms, the higher the vocal handicap will be.
FE Modelling of the Fluid-Structure-Acoustic Interaction for the Vocal Folds Self-Oscillation

NASA Astrophysics Data System (ADS)

Švancara, Pavel; Horáček, J.; Hrůza, V.

The flow induced self-oscillation of the human vocal folds in interaction with acoustic processes in the simplified vocal tract model was explored by three-dimensional (3D) finite element (FE) model. Developed FE model includes vocal folds pretension before phonation, large deformations of the vocal fold tissue, vocal folds contact, fluid-structure interaction, morphing the fluid mesh according the vocal folds motion (Arbitrary Lagrangian-Eulerian approach), unsteady viscous compressible airflow described by the Navier-Stokes equations and airflow separation during the glottis closure. Iterative partitioned approach is used for modelling the fluid-structure interaction. Computed results prove that the developed model can be used for simulation of the vocal folds self-oscillation and resulting acoustic waves. The developed model enables to numerically simulate an influence of some pathological changes in the vocal fold tissue on the voice production.
Vocal cord collapse during phrenic nerve-paced respiration in congenital central hypoventilation syndrome.

PubMed

Domanski, Mark C; Preciado, Diego A

2012-01-01

Phrenic nerve pacing can be used to treat congenital central hypoventilation syndrome (CCHS). We report how the lack of normal vocal cord tone during phrenic paced respiration can result in passive vocal cord collapse and produce obstructive symptoms. We describe a case of passive vocal cord collapse during phrenic nerve paced respiration in a patient with CCHS. As far as we know, this is the first report of this etiology of airway obstruction. The patient, a 7-year-old with CCHS and normal waking vocal cord movement, continued to require nightly continuous positive airway pressure (CPAP) despite successful utilization of phrenic nerve pacers. On direct laryngoscopy, the patient's larynx was observed while the diaphragmatic pacers were sequentially engaged. No abnormal vocal cord stimulation was witnessed during engaging of either phrenic nerve stimulator. However, the lack of normal inspiratory vocal cord abduction during phrenic nerve-paced respiration resulted in vocal cord collapse and partial obstruction due to passive adduction of the vocal cords through the Bernoulli effect. Bilateral phrenic nerve stimulation resulted in more vocal cord collapse than unilateral stimulation. The lack of vocal cord abduction on inspiration presents a limit to phrenic nerve pacers.
A study of vocal nonlinearities in humpback whale songs: from production mechanisms to acoustic analysis.

PubMed

Cazau, Dorian; Adam, Olivier; Aubin, Thierry; Laitman, Jeffrey T; Reidenberg, Joy S

2016-10-10

Although mammalian vocalizations are predominantly harmonically structured, they can exhibit an acoustic complexity with nonlinear vocal sounds, including deterministic chaos and frequency jumps. Such sounds are normative events in mammalian vocalizations, and can be directly traceable to the nonlinear nature of vocal-fold dynamics underlying typical mammalian sound production. In this study, we give qualitative descriptions and quantitative analyses of nonlinearities in the song repertoire of humpback whales from the Ste Marie channel (Madagascar) to provide more insight into the potential communication functions and underlying production mechanisms of these features. A low-dimensional biomechanical modeling of the whale's U-fold (vocal folds homolog) is used to relate specific vocal mechanisms to nonlinear vocal features. Recordings of living humpback whales were searched for occurrences of vocal nonlinearities (instabilities). Temporal distributions of nonlinearities were assessed within sound units, and between different songs. The anatomical production sources of vocal nonlinearities and the communication context of their occurrences in recordings are discussed. Our results show that vocal nonlinearities may be a communication strategy that conveys information about the whale's body size and physical fitness, and thus may be an important component of humpback whale songs.
A study of vocal nonlinearities in humpback whale songs: from production mechanisms to acoustic analysis

NASA Astrophysics Data System (ADS)

Cazau, Dorian; Adam, Olivier; Aubin, Thierry; Laitman, Jeffrey T.; Reidenberg, Joy S.

2016-10-01

Although mammalian vocalizations are predominantly harmonically structured, they can exhibit an acoustic complexity with nonlinear vocal sounds, including deterministic chaos and frequency jumps. Such sounds are normative events in mammalian vocalizations, and can be directly traceable to the nonlinear nature of vocal-fold dynamics underlying typical mammalian sound production. In this study, we give qualitative descriptions and quantitative analyses of nonlinearities in the song repertoire of humpback whales from the Ste Marie channel (Madagascar) to provide more insight into the potential communication functions and underlying production mechanisms of these features. A low-dimensional biomechanical modeling of the whale’s U-fold (vocal folds homolog) is used to relate specific vocal mechanisms to nonlinear vocal features. Recordings of living humpback whales were searched for occurrences of vocal nonlinearities (instabilities). Temporal distributions of nonlinearities were assessed within sound units, and between different songs. The anatomical production sources of vocal nonlinearities and the communication context of their occurrences in recordings are discussed. Our results show that vocal nonlinearities may be a communication strategy that conveys information about the whale’s body size and physical fitness, and thus may be an important component of humpback whale songs.

Improvement of Vocal Pathologies Diagnosis Using High-Speed Videolaryngoscopy

PubMed Central

Tsuji, Domingos Hiroshi; Hachiya, Adriana; Dajer, Maria Eugenia; Ishikawa, Camila Cristina; Takahashi, Marystella Tomoe; Montagnoli, Arlindo Neto

2014-01-01

Introduction The study of the dynamic properties of vocal fold vibration is important for understanding the vocal production mechanism and the impact of organic and functional changes. The advent of high-speed videolaryngoscopy (HSV) has provided the possibility of seeing the real cycle of vocal fold vibration in detail through high sampling rate of successive frames and adequate spatial resolution. Objective To describe the technique, advantages, and limitations of using HSV and digital videokymography in the diagnosis of vocal pathologies. Methods We used HSV and digital videokymography to evaluate one normophonic individual and four patients with vocal fold pathologies (nodules, unilateral paralysis of the left vocal fold, intracordal cyst, and adductor spasmodic dysphonia). The vocal fold vibration parameters (glottic closure, vibrational symmetry, periodicity, mucosal wave, amplitude, and glottal cycle phases) were assessed. Results Differences in the vocal vibration parameters were observed and correlated with the pathophysiology. Conclusion HSV is the latest diagnostic tool in visual examination of vocal behavior and has considerable potential to refine our knowledge regarding the vocal fold vibration and voice production, as well as regarding the impact of pathologic conditions have on the mechanism of phonation. PMID:25992109
Resting-Associated Vocalization Emitted by Captive Asian House Shrews (Suncus murinus): Acoustic Structure and Variability in an Unusual Mammalian Vocalization

PubMed Central

Schneiderová, Irena; Zouhar, Jan

2014-01-01

Shrews have rich vocal repertoires that include vocalizations within the human audible frequency range and ultrasonic vocalizations. Here, we recorded and analyzed in detail the acoustic structure of a vocalization with unclear functional significance that was spontaneously produced by 15 adult, captive Asian house shrews (Suncus murinus) while they were lying motionless and resting in their nests. This vocalization was usually emitted repeatedly in a long series with regular intervals. It showed some structural variability; however, the shrews most frequently emitted a tonal, low-frequency vocalization with minimal frequency modulation and a low, non-vocal click that was clearly noticeable at its beginning. There was no effect of sex, but the acoustic structure of the analyzed vocalizations differed significantly between individual shrews. The encoded individuality was low, but it cannot be excluded that this individuality would allow discrimination of family members, i.e., a male and female with their young, collectively resting in a common nest. The question remains whether the Asian house shrews indeed perceive the presence of their mates, parents or young resting in a common nest via the resting-associated vocalization and whether they use it to discriminate among their family members. Additional studies are needed to explain the possible functional significance of resting-associated vocalizations emitted by captive Asian house shrews. Our study highlights that the acoustic communication of shrews is a relatively understudied topic, particularly considering that they are highly vocal mammals. PMID:25390304
The Role of Lexical Stress on the Use of Vocal Fry in Young Adult Female Speakers.

PubMed

Gibson, Todd A

2017-01-01

Vocal fry is a voice register often used by young adult women for sociolinguistic purposes. Some acoustic correlates of lexical stress, however, appear incompatible with the use of vocal fry. The objective of this study was to systematically examine the role of lexical stress in the use of vocal fry by young adult women. This is a semi-randomized controlled laboratory study. Fifty female undergraduate students were recorded repeating one-, two-, three-, and four-syllable nonwords that conformed to English phonotactics. Nonwords were presented in order from shorter to longer lengths, with stimuli randomized within syllable length. Perceptual analyses of recordings were augmented by acoustic analyses to identify each syllable in which vocal fry occurred. Eighty-six percent of participants produced at least one episode of vocal fry. Vocal fry was more likely to occur in unstressed than stressed position, and the likelihood increased as distance from the stressed syllable increased. There was considerable variability in the use of vocal fry. Frequent and infrequent users varied on the degree to which they used vocal fry in single-syllable nonwords. Vocal fry use persists among young adult women even in the absence of syntactic and pragmatic influences. Lexical stress appeared to dramatically reduce the use of vocal fry. Patterns of vocal fry use appeared to be different for frequent and infrequent users of this vocal register. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Relation of structural and vibratory kinematics of the vocal folds to two acoustic measures of breathy voice based on computational modeling.

PubMed

Samlan, Robin A; Story, Brad H

2011-10-01

To relate vocal fold structure and kinematics to 2 acoustic measures: cepstral peak prominence (CPP) and the amplitude of the first harmonic relative to the second (H1-H2). The authors used a computational, kinematic model of the medial surfaces of the vocal folds to specify features of vocal fold structure and vibration in a manner consistent with breathy voice. Four model parameters were altered: degree of vocal fold adduction, surface bulging, vibratory nodal point, and supraglottal constriction. CPP and H1-H2 were measured from simulated glottal area, glottal flow, and acoustic waveforms and were related to the underlying vocal fold kinematics. CPP decreased with increased separation of the vocal processes, whereas the nodal point location had little effect. H1-H2 increased as a function of separation of the vocal processes in the range of 1.0 mm to 1.5 mm and decreased with separation > 1.5 mm. CPP is generally a function of vocal process separation. H1*-H2* (see paragraph 6 of article text for an explanation of the asterisks) will increase or decrease with vocal process separation on the basis of vocal fold shape, pivot point for the rotational mode, and supraglottal vocal tract shape, limiting its utility as an indicator of breathy voice. Future work will relate the perception of breathiness to vocal fold kinematics and acoustic measures.
Night-to-Night Variability of Muscle Tone, Movements, and Vocalizations in Patients with REM Sleep Behavior Disorder

PubMed Central

Cygan, Fanny; Oudiette, Delphine; Leclair-Visonneau, Laurène; Leu-Semenescu, Smaranda; Arnulf, Isabelle

2010-01-01

Objectives: The video-polysomnographic criteria of REM sleep behavior disorder (RBD) have not been well described. We evaluated the between-night reproducibility of phasic and tonic enhanced muscle activity during REM sleep as well as the associated behaviors and vocalizations of the patients. Methods: Fifteen patients with clinical RBD underwent two consecutive video-polysomnographies. The amount of excessive phasic and tonic chin muscle activity during REM sleep was measured in 15 patients in 3-sec mini-epochs. The time spent with motor (minor, major, complex, and scenic) or vocal (sounds, mumblings, and comprehensible speeches) events was measured in 7 patients during REM sleep. Results: There was a good between-night agreement for tonic (Spearman rho = 0.55, p = 0.03; Kendall tau = 0.48, p = 0.01) but not for phasic (rho = 0.47, p = 0.1; tau = 0.31, p = 0.1) excessive chin muscle activity. On the video and audio recordings, the minor RBD behaviors tended to occur more frequently during the second night than the first, whereas the patients spoke longer during the first than the second night. Conclusion: The excessive tonic activity during REM sleep is a reliable marker of RBD. It could represent the extent of dysfunction in the permissive atonia systems. In contrast, the more variable phasic activity and motor/vocal events could be more dependent on dream content (executive systems). Citation: Cygan F; Oudiette D; Leclair-Visonneau L; Leu-Semenescu S; Arnulf I. Night-to-night variability of muscle tone, movements, and vocalizations in patients with REM sleep behavior disorder. J Clin Sleep Med 2010;6(6):551-555. PMID:21206543
MARTI: man-machine animation real-time interface

NASA Astrophysics Data System (ADS)

Jones, Christian M.; Dlay, Satnam S.

1997-05-01

The research introduces MARTI (man-machine animation real-time interface) for the realization of natural human-machine interfacing. The system uses simple vocal sound-tracks of human speakers to provide lip synchronization of computer graphical facial models. We present novel research in a number of engineering disciplines, which include speech recognition, facial modeling, and computer animation. This interdisciplinary research utilizes the latest, hybrid connectionist/hidden Markov model, speech recognition system to provide very accurate phone recognition and timing for speaker independent continuous speech, and expands on knowledge from the animation industry in the development of accurate facial models and automated animation. The research has many real-world applications which include the provision of a highly accurate and 'natural' man-machine interface to assist user interactions with computer systems and communication with one other using human idiosyncrasies; a complete special effects and animation toolbox providing automatic lip synchronization without the normal constraints of head-sets, joysticks, and skilled animators; compression of video data to well below standard telecommunication channel bandwidth for video communications and multi-media systems; assisting speech training and aids for the handicapped; and facilitating player interaction for 'video gaming' and 'virtual worlds.' MARTI has introduced a new level of realism to man-machine interfacing and special effect animation which has been previously unseen.
Development of neural responsivity to vocal sounds in higher level auditory cortex of songbirds

PubMed Central

Miller-Sims, Vanessa C.

2014-01-01

Like humans, songbirds learn vocal sounds from “tutors” during a sensitive period of development. Vocal learning in songbirds therefore provides a powerful model system for investigating neural mechanisms by which memories of learned vocal sounds are stored. This study examined whether NCM (caudo-medial nidopallium), a region of higher level auditory cortex in songbirds, serves as a locus where a neural memory of tutor sounds is acquired during early stages of vocal learning. NCM neurons respond well to complex auditory stimuli, and evoked activity in many NCM neurons habituates such that the response to a stimulus that is heard repeatedly decreases to approximately one-half its original level (stimulus-specific adaptation). The rate of neural habituation serves as an index of familiarity, being low for familiar sounds, but high for novel sounds. We found that response strength across different song stimuli was higher in NCM neurons of adult zebra finches than in juveniles, and that only adult NCM responded selectively to tutor song. The rate of habituation across both tutor song and novel conspecific songs was lower in adult than in juvenile NCM, indicating higher familiarity and a more persistent response to song stimuli in adults. In juvenile birds that have memorized tutor vocal sounds, neural habituation was higher for tutor song than for a familiar conspecific song. This unexpected result suggests that the response to tutor song in NCM at this age may be subject to top-down influences that maintain the tutor song as a salient stimulus, despite its high level of familiarity. PMID:24694936
Vocal performance affects metabolic rate in dolphins: implications for animals communicating in noisy environments.

PubMed

Holt, Marla M; Noren, Dawn P; Dunkin, Robin C; Williams, Terrie M

2015-06-01

Many animals produce louder, longer or more repetitious vocalizations to compensate for increases in environmental noise. Biological costs of increased vocal effort in response to noise, including energetic costs, remain empirically undefined in many taxa, particularly in marine mammals that rely on sound for fundamental biological functions in increasingly noisy habitats. For this investigation, we tested the hypothesis that an increase in vocal effort would result in an energetic cost to the signaler by experimentally measuring oxygen consumption during rest and a 2 min vocal period in dolphins that were trained to vary vocal loudness across trials. Vocal effort was quantified as the total acoustic energy of sounds produced. Metabolic rates during the vocal period were, on average, 1.2 and 1.5 times resting metabolic rate (RMR) in dolphin A and B, respectively. As vocal effort increased, we found that there was a significant increase in metabolic rate over RMR during the 2 min following sound production in both dolphins, and in total oxygen consumption (metabolic cost of sound production plus recovery costs) in the dolphin that showed a wider range of vocal effort across trials. Increases in vocal effort, as a consequence of increases in vocal amplitude, repetition rate and/or duration, are consistent with behavioral responses to noise in free-ranging animals. Here, we empirically demonstrate for the first time in a marine mammal, that these vocal modifications can have an energetic impact at the individual level and, importantly, these data provide a mechanistic foundation for evaluating biological consequences of vocal modification in noise-polluted habitats. © 2015. Published by The Company of Biologists Ltd.
Endoscopic laterofixation in bilateral vocal cords paralysis in children.

PubMed

Lidia, Zawadzka-Glos; Magdalena, Frackiewicz; Mieczyslaw, Chmielik

2010-06-01

Vocal cords paralysis is the second most frequent cause of laryngeal stridor in children. Symptoms of congenital vocal cords paralysis can occur shortly after birth or later. Vocal cords paralysis can be unilateral or bilateral. Symptoms of unilateral paralysis include hoarse weeping or stridor during a deep inhalation. In children unilateral vocal cords paralysis often retreats spontaneously or can be completely compensated. Children with bilateral vocal cords paralysis present mainly breathing disorders while phonation is normal. Symptoms are different, starting from complete occlusion of respiratory tracts and ending on small symptoms connected with the lack of effort tolerance. When symptoms are severe, patients from this group require a tracheotomy. The lack of restoration of normal function of vocal cords or lack of complete compensation and maintenance of symptoms are an indication for surgical treatment. The aim of this study is to present results of the treatment of bilateral vocal cords paralysis in children using the endoscopic method of laterofixation of vocal cords. In the Pediatric ENT Department between 1998 and 2009 sixty four children with dyspnoea and/or phonation disorders caused by vocal cords paralysis were treated. In ten cases laterofixation of vocal cords was performed, in most cases with good result. In this article the authors present the method of endoscopic laterofixation and achieved results. Endoscopic laterofixation of vocal cords in children is a safe and an easy method of surgical treatment of bilateral vocal cords paralysis. This method can be used as a first and often as a one stage treatment of vocal cords paralysis. In some cases this procedure is insufficient and has to be completed with other methods. Copyright (c) 2010 Elsevier Ireland Ltd. All rights reserved.
Female Presence and Estrous State Influence Mouse Ultrasonic Courtship Vocalizations

PubMed Central

Hanson, Jessica L.; Hurley, Laura M.

2012-01-01

The laboratory mouse is an emerging model for context-dependent vocal signaling and reception. Mouse ultrasonic vocalizations are robustly produced in social contexts. In adults, male vocalization during courtship has become a model of interest for signal-receiver interactions. These vocalizations can be grouped into syllable types that are consistently produced by different subspecies and strains of mice. Vocalizations are unique to individuals, vary across development, and depend on social housing conditions. The behavioral significance of different syllable types, including the contexts in which different vocalizations are made and the responses listeners have to different types of vocalizations, is not well understood. We examined the effect of female presence and estrous state on male vocalizations by exploring the use of syllable types and the parameters of syllables during courtship. We also explored correlations between vocalizations and other behaviors. These experimental manipulations produced four main findings: 1) vocalizations varied among males, 2) the production of USVs and an increase in the use of a specific syllable type were temporally related to mounting behavior, 3) the frequency (kHz), bandwidth, and duration of syllables produced by males were influenced by the estrous phase of female partners, and 4) syllable types changed when females were removed. These findings show that mouse ultrasonic courtship vocalizations are sensitive to changes in female phase and presence, further demonstrating the context-sensitivity of these calls. PMID:22815817
[Varices of the vocal cord: report of 21 cases].

PubMed

Li, Jin-rang; Sun, Jian-jun

2006-04-01

To study the diagnosis and treatment of varices of the vocal cord. The clinical data of 21 cases with varix of vocal cord were analyzed. All the patients presented hoarseness. There were 15 female and 6 male cases with their ages ranged from 23 to 68 years (median 44 years old). The varix was found on the right vocal cord in 12 cases, on the left vocal cord in 9 cases. Isolated varix existed on the vocal cord in 10 cases, varix with vocal cord polyps or nodules in 10 cases, varix with vocal cord paralysis in 1 case. All the patients were diagnosed under the laryngovideoscopy. The lesions appeared on the superior surface of the vocal cord. Varices manifested as abnormally dilated capillary running in the anterior to posterior direction in 6 cases, as clusters of capillary in 3 cases, as a dot or small sheet or short line of capillary in 12 cases. The varices were disappeared in 2 of 8 cases with vocal cord varices and polyps after removed the polyps. The varices of others patients had no change after following up for more than 6 months, but one patient happened hemorrhage of the contralateral vocal cord. Varices are most commonly seen in female. Laryngovideoscopy is the key in determining the vocal fold varices. Management of patients with a varix includes medical therapy, speech therapy, and occasionally surgical vaporization.
Viscoelastic measurements after vocal fold scarring in rabbits--short-term results after hyaluronan injection.

PubMed

Hertegård, S; Dahlqvist, A; Goodyer, E

2006-07-01

The scarring model resulted in significant damage and elevated viscoelasticity of the lamina propria. Hyaluronan preparations may alter viscoelasticity in scarred rabbit vocal folds. Vocal fold scarring results in stiffness of the lamina propria and severe voice problems. The aims of this study were to examine the degree of scarring achieved in the experiment and to measure the viscoelastic properties after injection of hyaluronan in rabbit vocal folds. Twenty-two vocal folds from 15 New Zealand rabbits were scarred, 8 vocal folds were controls. After 8 weeks 12 of the scarred vocal folds received injections with 2 types of cross-linked hyaluronan products and 10 scarred folds were injected with saline. After 11 more weeks the animals were sacrificed. After dissection, 15 vocal folds were frozen for viscoelastic measurements, whereas 14 vocal folds were prepared and stained. Measurements were made of the lamina propria thickness. Viscoelasticity was measured on intact vocal folds with a linear skin rheometer (LSR) adapted to laryngeal measurements. Measurements on the digitized slides showed a thickened lamina propria in the scarred samples as compared with the normal vocal folds (p<0.05). The viscoelastic analysis showed a tendency to stiffening of the scarred vocal folds as compared with the normal controls (p=0.05). There was large variation in stiffness between the two injected hyaluronan products.
Altered vocal fold kinematics in synthetic self-oscillating models that employ adipose tissue as a lateral boundary condition.

NASA Astrophysics Data System (ADS)

Saidi, Hiba; Erath, Byron D.

2015-11-01

The vocal folds play a major role in human communication by initiating voiced sound production. During voiced speech, the vocal folds are set into sustained vibrations. Synthetic self-oscillating vocal fold models are regularly employed to gain insight into flow-structure interactions governing the phonation process. Commonly, a fixed boundary condition is applied to the lateral, anterior, and posterior sides of the synthetic vocal fold models. However, physiological observations reveal the presence of adipose tissue on the lateral surface between the thyroid cartilage and the vocal folds. The goal of this study is to investigate the influence of including this substrate layer of adipose tissue on the dynamics of phonation. For a more realistic representation of the human vocal folds, synthetic multi-layer vocal fold models have been fabricated and tested while including a soft lateral layer representative of adipose tissue. Phonation parameters have been collected and are compared to those of the standard vocal fold models. Results show that vocal fold kinematics are affected by adding the adipose tissue layer as a new boundary condition.
Quantitative Analysis of Vocal Fold Vibration in Vocal Fold Paralysis With the Use of High-speed Digital Imaging.

PubMed

Yamauchi, Akihito; Yokonishi, Hisayuki; Imagawa, Hiroshi; Sakakibara, Ken-Ichi; Nito, Takaharu; Tayama, Niro

2016-11-01

The goal of this work was to objectively elucidate the vibratory characteristics of vocal fold paralysis (VFP) using high-speed digital imaging (HSDI). HSDI was performed in 29 vocally healthy subjects (12 women and 17 men) and in 107 patients with VFP (40 women and 67 men). Then, the HSDI data were evaluated by visual-perceptual rating, single-line kymography, multiline kymography, laryngotopography, and glottal area waveform analysis. Patients with VFP compared with vocally healthy subjects revealed more frequent incomplete glottal closure, greater asymmetry in amplitude, mucosal wave, frequency, and phase, as well as larger open quotient, smaller speed index, larger maximal and minimal glottal area, and smaller glottal area difference. Paralyzed vocal folds in VFP revealed reduced mucosal wave than nonparalyzed vocal folds in VFP or in intact vocal folds in vocally healthy subjects. HSDI was effective in documenting the characteristics of vocal fold vibrations in patients with VFP and in exploring the vibratory disturbance for estimating the severity of dysphonia. Copyright Â© 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Singers' interest and knowledge levels of vocal function and dysfunction: survey findings.

PubMed

Braun-Janzen, Colleen; Zeine, Lina

2009-07-01

A questionnaire investigating the levels of interest in and knowledge of vocal function and dysfunction was completed by 129 singers. Those with professional singing experience indicated significantly greater interest and higher perceived knowledge levels than amateurs in areas of vocal anatomy and physiology, vocal hygiene, and functional vocal pathologies. Greater interest levels, but not higher perceived knowledge levels were reported by professional singers (PSs) in the area of the role of the speech-language pathologist (SLP) and the voice. Professionals answered significantly more knowledge-based questions correctly than amateurs in all areas except the role of the SLP and the voice. However, findings indicated wide variability in knowledge levels of both groups. Singing teachers (STs) within the group significantly outperformed the remainder of the group in areas of vocal anatomy and physiology, vocal hygiene, and functional vocal pathologies. Scores of the choir directors (CDs) within the group were not significantly superior to the remainder of the group except in the area of functional vocal pathologies. Implications for a preventative approach to vocal health are discussed.
Interstitial protein alterations in rabbit vocal fold with scar.

PubMed

Thibeault, Susan L; Bless, Diane M; Gray, Steven D

2003-09-01

Fibrous and interstitial proteins compose the extracellular matrix of the vocal fold lamina propria and account for its biomechanic properties. Vocal fold scarring is characterized by altered biomechanical properties, which create dysphonia. Although alterations of the fibrous proteins have been confirmed in the rabbit vocal fold scar, interstitial proteins, which are known to be important in wound repair, have not been investigated to date. Using a rabbit model, interstitial proteins decorin, fibromodulin, and fibronectin were examined immunohistologically, two months postinduction of vocal fold scar by means of forcep biopsy. Significantly decreased decorin and fibromodulin with significantly increased fibronectin characterized scarred vocal fold tissue. The implications of altered interstitial proteins levels and their affect on the fibrous proteins will be discussed in relation to increased vocal fold stiffness and viscosity, which characterizes vocal fold scar.
The forgotten cause of stridor in the emergency department.

PubMed

Ng, Tian-Tee

2017-01-01

Paradoxical Vocal Fold Movement Disorder is where the larynx exhibits paradoxical vocal cords closure during respiration, creating partial airway obstruction. Causes of vocal fold movement disorder are multifactorial, and patients describe tightness of throat, difficulty getting air in, have stridor, and do not respond to inhalers. We propose using transnasal laryngoscopy examination, which will show narrowing of vocal cords on inspiration, and The Pittsburgh Vocal Cord Dysfunction Index with a cutoff score of ≥4 to distinguish vocal fold movement disorder from asthma and other causes of stridor. Management of paradoxical vocal fold movement disorder involves a combination of pharmacological, psychological, psychiatric, and speech training. Paradoxical vocal fold movement disorder is a very treatable cause of stridor, so long as it is identified and other organic causes are excluded.
Evolutionary Origins for Social Vocalization in a Vertebrate Hindbrain–Spinal Compartment

PubMed Central

Bass, Andrew H.; Gilland, Edwin H.; Baker, Robert

2008-01-01

The macroevolutionary events leading to neural innovations for social communication, such as vocalization, are essentially unexplored. Many fish vocalize during female courtship and territorial defense, as do amphibians, birds, and mammals. Here, we map the neural circuitry for vocalization in larval fish and show that the vocal network develops in a segment-like region across the most caudal hindbrain and rostral spinal cord. Taxonomic analysis demonstrates a highly conserved pattern between fish and all major lineages of vocal tetrapods. We propose that the vocal basis for acoustic communication among vertebrates evolved from an ancestrally shared developmental compartment already present in the early fishes. PMID:18635807
Non-song vocalizations of pygmy blue whales in Geographe Bay, Western Australia.

PubMed

Recalde-Salas, A; Salgado Kent, C P; Parsons, M J G; Marley, S A; McCauley, R D

2014-05-01

Non-song vocalizations of migrating pygmy blue whales (Balaenoptera musculus brevicauda) in Western Australia are described. Simultaneous land-based visual observations and underwater acoustic recordings detected 27 groups in Geographe Bay, WA over 2011 to 2012. Six different vocalizations were recorded that were not repeated in a pattern or in association with song, and thus were identified as non-song vocalizations. Five of these were not previously described for this population. Their acoustic characteristics and context are presented. Given that 56% of groups vocalized, 86% of which produced non-song vocalizations and 14% song units, the inclusion of non-song vocalizations in passive-acoustic monitoring is proposed.
The value of ASSR threshold-based bilateral hearing aid fitting in children with difficult or unreliable behavioral audiometry.

PubMed

Vlastarakos, Petros V; Vasileiou, Alexandra; Nikolopoulos, Thomas P

2017-12-01

We conducted an analysis to assess the relative contribution of auditory brainstem response (ABR) testing and auditory steady-state response (ASSR) testing in providing appropriate hearing aid fitting in hearing-impaired children with difficult or unreliable behavioral audiometry. Of 150 infants and children who had been referred to us for hearing assessment as part of a neonatal hearing screening and cochlear implantation program, we identified 5 who exhibited significant discrepancies between click-ABR and ASSR testing results and difficult or unreliable behavioral audiometry. Hearing aid fitting in pediatric cochlear implant candidates for a trial period of 3 to 6 months is a common practice in many implant programs, but monitoring the progress of the amplified infants and providing appropriate hearing aid fitting can be challenging. If we accept the premise that we can assess the linguistic progress of amplified infants with an acceptable degree of certainty, the auditory behavior that we are monitoring presupposes appropriate bilateral hearing aid fitting. This may become very challenging in young children, or even in older children with difficult or unreliable behavioral audiometry results. This challenge can be addressed by using data from both ABR and ASSR testing. Fitting attempts that employ data from only ABR testing provide amplification that involves the range of spoken language but is not frequency-specific. Hearing aid fitting should also incorporate and take into account ASSR data because reliance on ABR testing alone might compromise the validity of the monitoring process. In conclusion, we believe that ASSR threshold-based bilateral hearing aid fitting is necessary to provide frequency-specific amplification of hearing and appropriate propulsion in the prelinguistic vocalizations of monitored infants.

Protective Effect of Astaxanthin on Vocal Fold Injury and Inflammation Due to Vocal Loading: A Clinical Trial.

PubMed

Kaneko, Mami; Kishimoto, Yo; Suzuki, Ryo; Kawai, Yoshitaka; Tateya, Ichiro; Hirano, Shigeru

2017-05-01

Professional voice users, such as singers and teachers, are at greater risk of developing vocal fold injury from excessive use of voice; thus, protection of the vocal fold is essential. One of the most important factors that aggravates injury is the production of reactive oxygen species at the wound site. The purpose of the current study was to assess the effect of astaxanthin, a strong antioxidant, on the protection of the vocal fold from injury and inflammation due to vocal loading. This study is an institutional review board-approved human clinical trial. Ten male subjects underwent a 60-minute vocal loading session and received vocal assessments prior to, immediately after, and 30 minutes postvocal loading (AST(-) status). All subjects were then prescribed 24 mg/day of astaxanthin for 28 days, after which they received the same vocal task and assessments (AST(+) status). Phonatory parameters were compared between both groups. Aerodynamic assessment, acoustic analysis, and GRBAS scale (grade, roughness, breathiness, asthenia, and strain) were significantly worse in the AST(-) status immediately after vocal loading, but improved by 30 minutes after loading. In contrast, none of the phonatory parameters in the AST(+) status were statistically worse, even when measured immediately after vocal loading. No allergic responses or adverse effects were observed after administration of astaxanthin. The current results suggest that astaxanthin can protect the vocal fold from injury and inflammation caused by vocal loading possibly through the regulation of oxidative stress. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Learning to detect vocal hyperfunction from ambulatory neck-surface acceleration features: initial results for vocal fold nodules.

PubMed

Ghassemi, Marzyeh; Van Stan, Jarrad H; Mehta, Daryush D; Zañartu, Matías; Cheyne, Harold A; Hillman, Robert E; Guttag, John V

2014-06-01

Voice disorders are medical conditions that often result from vocal abuse/misuse which is referred to generically as vocal hyperfunction. Standard voice assessment approaches cannot accurately determine the actual nature, prevalence, and pathological impact of hyperfunctional vocal behaviors because such behaviors can vary greatly across the course of an individual's typical day and may not be clearly demonstrated during a brief clinical encounter. Thus, it would be clinically valuable to develop noninvasive ambulatory measures that can reliably differentiate vocal hyperfunction from normal patterns of vocal behavior. As an initial step toward this goal we used an accelerometer taped to the neck surface to provide a continuous, noninvasive acceleration signal designed to capture some aspects of vocal behavior related to vocal cord nodules, a common manifestation of vocal hyperfunction. We gathered data from 12 female adult patients diagnosed with vocal fold nodules and 12 control speakers matched for age and occupation. We derived features from weeklong neck-surface acceleration recordings by using distributions of sound pressure level and fundamental frequency over 5-min windows of the acceleration signal and normalized these features so that intersubject comparisons were meaningful. We then used supervised machine learning to show that the two groups exhibit distinct vocal behaviors that can be detected using the acceleration signal. We were able to correctly classify 22 of the 24 subjects, suggesting that in the future measures of the acceleration signal could be used to detect patients with the types of aberrant vocal behaviors that are associated with hyperfunctional voice disorders.
Reproduction of mouse-pup ultrasonic vocalizations by nanocrystalline silicon thermoacoustic emitter

NASA Astrophysics Data System (ADS)

Kihara, Takashi; Harada, Toshihiro; Kato, Masahiro; Nakano, Kiyoshi; Murakami, Osamu; Kikusui, Takefumi; Koshida, Nobuyoshi

2006-01-01

As one of the functional properties of ultrasound generator based on efficient thermal transfer at the nanocrystalline silicon (nc-Si) layer surface, its potential as an ultrasonic simulator of vocalization signals is demonstrated by using the acoustic data of mouse-pup calls. The device composed of a surface-heating thin-film electrode, an nc-Si layer, and a single-crystalline silicon (c-Si) wafer, exhibits an almost completely flat frequency response over a wide range without any mechanical surface vibration systems. It is shown that the fabricated emitter can reproduce digitally recorded ultrasonic mouse-pups vocalizations very accurately in terms of the call duration, frequency dispersion, and sound pressure level. The thermoacoustic nc-Si device provides a powerful physical means for the understanding of ultrasonic communication mechanisms in various living animals.
Night-to-night variability of muscle tone, movements, and vocalizations in patients with REM sleep behavior disorder.

PubMed

Cygan, Fanny; Oudiette, Delphine; Leclair-Visonneau, Laurène; Leu-Semenescu, Smaranda; Arnulf, Isabelle

2010-12-15

The video-polysomnographic criteria of REM sleep behavior disorder (RBD) have not been well described. We evaluated the between-night reproducibility of phasic and tonic enhanced muscle activity during REM sleep as well as the associated behaviors and vocalizations of the patients. Fifteen patients with clinical RBD underwent two consecutive video-polysomnographies. The amount of excessive phasic and tonic chin muscle activity during REM sleep was measured in 15 patients in 3-sec mini-epochs. The time spent with motor (minor, major, complex, and scenic) or vocal (sounds, mumblings, and comprehensible speeches) events was measured in 7 patients during REM sleep. There was a good between-night agreement for tonic (Spearman rho = 0.55, p = 0.03; Kendall tau = 0.48, p = 0.01) but not for phasic (rho = 0.47, p = 0.1; tau = 0.31, p = 0.1) excessive chin muscle activity. On the video and audio recordings, the minor RBD behaviors tended to occur more frequently during the second night than the first, whereas the patients spoke longer during the first than the second night. The excessive tonic activity during REM sleep is a reliable marker of RBD. It could represent the extent of dysfunction in the permissive atonia systems. In contrast, the more variable phasic activity and motor/vocal events could be more dependent on dream content (executive systems).
Impact of call center work in subjective voice symptoms and complaints--an analytic study.

PubMed

Rechenberg, Leila; Goulart, Bárbara Niegia Garcia de; Roithmann, Renato

2011-12-01

To estimate the prevalence of vocal symptoms, occupational risk factors, associated symptoms and their impact on the professional activity of the telemarketers. Cross-section analytical study with 124 telemarketers and 109 administrative workers (control group) selected from a random sample stratified by gender. The subjects answered an anonymous self-administered questionnaire involving issues related to the presence of vocal symptoms, potential risk factors for dysphonia, and vocal impact of symptoms in professional activity. The presence of one or more voice symptoms that occurred daily or weekly was considered positive for the presence of vocal symptoms. The prevalence of vocal symptoms was found in 33% of telemarketers and in 21% of the control group, indicating an association between vocal symptoms and the activity of the telemarketer. When adjusted for confounders, this association remained in the sense of risk. In telemarketers, the sensation of dry air, ambient noise, and lack of vocal rest were the most frequently reported complaints reported by those presenting vocal symptoms. Almost 70% of telemarketers with vocal symptoms reported that these symptoms interfere with their professional activity. The rate of absenteeism by vocal symptoms in this group was 29%. Vocal symptoms are common in most telemarketers when compared to their peer controls, and significantly affect their job performance.
Mesenchymal Stem Cell Therapy for the Treatment of Vocal Fold Scarring: A Systematic Review of Preclinical Studies

PubMed Central

Wingstrand, Vibe Lindeblad; Jensen, David H.; Bork, Kristian; Sebbesen, Lars; Balle, Jesper; Fischer-Nielsen, Anne; von Buchwald, Christian

2016-01-01

Objectives Therapy with mesenchymal stem cells exhibits potential for the development of novel interventions for many diseases and injuries. The use of mesenchymal stem cells in regenerative therapy for vocal fold scarring exhibited promising results to reduce stiffness and enhance the biomechanical properties of injured vocal folds. This study evaluated the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring. Data Sources PubMed, Embase, the Cochrane Library and Google Scholar were searched. Methods Controlled studies that assessed the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring were included. Primary outcomes were viscoelastic properties and mucosal wave amplitude. Results Seven preclinical animal studies (n = 152 single vocal folds) were eligible for inclusion. Evaluation of viscoelastic parameters revealed a decreased dynamic viscosity (η’) and elastic modulus (G’), i.e., decreased resistance and stiffness, in scarred vocal folds treated with mesenchymal stem cells compared to non-treated scarred vocal folds. Mucosal wave amplitude was increased in scarred vocal folds treated with mesenchymal stem cells vs. non-treated scarred vocal folds. Conclusion The results from these studies suggest an increased regenerative effect of therapy with mesenchymal stem cells for scarred vocal folds and are encouraging for further clinical studies. PMID:27631373
Mesenchymal Stem Cell Therapy for the Treatment of Vocal Fold Scarring: A Systematic Review of Preclinical Studies.

PubMed

Wingstrand, Vibe Lindeblad; Grønhøj Larsen, Christian; Jensen, David H; Bork, Kristian; Sebbesen, Lars; Balle, Jesper; Fischer-Nielsen, Anne; von Buchwald, Christian

2016-01-01

Therapy with mesenchymal stem cells exhibits potential for the development of novel interventions for many diseases and injuries. The use of mesenchymal stem cells in regenerative therapy for vocal fold scarring exhibited promising results to reduce stiffness and enhance the biomechanical properties of injured vocal folds. This study evaluated the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring. PubMed, Embase, the Cochrane Library and Google Scholar were searched. Controlled studies that assessed the biomechanical effects of mesenchymal stem cell therapy for the treatment of vocal fold scarring were included. Primary outcomes were viscoelastic properties and mucosal wave amplitude. Seven preclinical animal studies (n = 152 single vocal folds) were eligible for inclusion. Evaluation of viscoelastic parameters revealed a decreased dynamic viscosity (η') and elastic modulus (G'), i.e., decreased resistance and stiffness, in scarred vocal folds treated with mesenchymal stem cells compared to non-treated scarred vocal folds. Mucosal wave amplitude was increased in scarred vocal folds treated with mesenchymal stem cells vs. non-treated scarred vocal folds. The results from these studies suggest an increased regenerative effect of therapy with mesenchymal stem cells for scarred vocal folds and are encouraging for further clinical studies.
Using nonlinear methods to quantify changes in infant limb movements and vocalizations.

PubMed

Abney, Drew H; Warlaumont, Anne S; Haussman, Anna; Ross, Jessica M; Wallot, Sebastian

2014-01-01

The pairing of dynamical systems theory and complexity science brings novel concepts and methods to the study of infant motor development. Accordingly, this longitudinal case study presents a new approach to characterizing the dynamics of infant limb and vocalization behaviors. A single infant's vocalizations and limb movements were recorded from 51-days to 305-days of age. On each recording day, accelerometers were placed on all four of the infant's limbs and an audio recorder was worn on the child's chest. Using nonlinear time series analysis methods, such as recurrence quantification analysis and Allan factor, we quantified changes in the stability and multiscale properties of the infant's behaviors across age as well as how these dynamics relate across modalities and effectors. We observed that particular changes in these dynamics preceded or coincided with the onset of various developmental milestones. For example, the largest changes in vocalization dynamics preceded the onset of canonical babbling. The results show that nonlinear analyses can help to understand the functional co-development of different aspects of infant behavior.
Effect of a Single Musical Cakra Activation Manoeuvre on Body Temperature: An Exploratory Study

PubMed Central

Sumathy, Sundar; Parmar, Parin N

2016-01-01

Cakra activation/balancing and music therapy are part of the traditional Indian healing system. Little is known about effect of musical (vocal) technique of cakra activation on body temperature. We conducted a single-session exploratory study to evaluate effects of a single musical (vocal) cakra activation manoeuvre on body temperature in controlled settings. Seven healthy adults performed a single musical (vocal) cakra activation manoeuvre for approximately 12 minutes in controlled environmental conditions. Pre- and post-manoeuvre body temperatures were recorded with a clinical mercury thermometer. After a single manoeuvre, increase in body temperature was recorded in all seven subjects. The range of increase in body temperature was from 0.2°F to 1.4°F; with mean temperature rise being 0.5°F and median temperature rise being 0.4°F. We conclude that a single session of musical (vocal) technique of cakra activation elevated body temperatures in all 7 subjects. Further research is required to study effects of various cakra activation techniques on body temperature and other physiological parameters. PMID:28182030
Using nonlinear methods to quantify changes in infant limb movements and vocalizations

PubMed Central

Abney, Drew H.; Warlaumont, Anne S.; Haussman, Anna; Ross, Jessica M.; Wallot, Sebastian

2014-01-01

The pairing of dynamical systems theory and complexity science brings novel concepts and methods to the study of infant motor development. Accordingly, this longitudinal case study presents a new approach to characterizing the dynamics of infant limb and vocalization behaviors. A single infant's vocalizations and limb movements were recorded from 51-days to 305-days of age. On each recording day, accelerometers were placed on all four of the infant's limbs and an audio recorder was worn on the child's chest. Using nonlinear time series analysis methods, such as recurrence quantification analysis and Allan factor, we quantified changes in the stability and multiscale properties of the infant's behaviors across age as well as how these dynamics relate across modalities and effectors. We observed that particular changes in these dynamics preceded or coincided with the onset of various developmental milestones. For example, the largest changes in vocalization dynamics preceded the onset of canonical babbling. The results show that nonlinear analyses can help to understand the functional co-development of different aspects of infant behavior. PMID:25161629
Harmonic template neurons in primate auditory cortex underlying complex sound processing

PubMed Central

Feng, Lei

2017-01-01

Harmonicity is a fundamental element of music, speech, and animal vocalizations. How the auditory system extracts harmonic structures embedded in complex sounds and uses them to form a coherent unitary entity is not fully understood. Despite the prevalence of sounds rich in harmonic structures in our everyday hearing environment, it has remained largely unknown what neural mechanisms are used by the primate auditory cortex to extract these biologically important acoustic structures. In this study, we discovered a unique class of harmonic template neurons in the core region of auditory cortex of a highly vocal New World primate, the common marmoset (Callithrix jacchus), across the entire hearing frequency range. Marmosets have a rich vocal repertoire and a similar hearing range to that of humans. Responses of these neurons show nonlinear facilitation to harmonic complex sounds over inharmonic sounds, selectivity for particular harmonic structures beyond two-tone combinations, and sensitivity to harmonic number and spectral regularity. Our findings suggest that the harmonic template neurons in auditory cortex may play an important role in processing sounds with harmonic structures, such as animal vocalizations, human speech, and music. PMID:28096341
Effect of a Single Musical Cakra Activation Manoeuvre on Body Temperature: An Exploratory Study.

PubMed

Sumathy, Sundar; Parmar, Parin N

2016-01-01

Cakra activation/balancing and music therapy are part of the traditional Indian healing system. Little is known about effect of musical (vocal) technique of cakra activation on body temperature. We conducted a single-session exploratory study to evaluate effects of a single musical (vocal) cakra activation manoeuvre on body temperature in controlled settings. Seven healthy adults performed a single musical (vocal) cakra activation manoeuvre for approximately 12 minutes in controlled environmental conditions. Pre- and post-manoeuvre body temperatures were recorded with a clinical mercury thermometer. After a single manoeuvre, increase in body temperature was recorded in all seven subjects. The range of increase in body temperature was from 0.2°F to 1.4°F; with mean temperature rise being 0.5°F and median temperature rise being 0.4°F. We conclude that a single session of musical (vocal) technique of cakra activation elevated body temperatures in all 7 subjects. Further research is required to study effects of various cakra activation techniques on body temperature and other physiological parameters.
Psychosocial Intervention for Young Children With Chronic Tics

ClinicalTrials.gov

2018-06-18

Tourette's Syndrome; Tourette's Disorder; Tourette's Disease; Tourette Disorder; Tourette Disease; Tic Disorder, Combined Vocal and Multiple Motor; Multiple Motor and Vocal Tic Disorder, Combined; Gilles de La Tourette's Disease; Gilles de la Tourette Syndrome; Gilles De La Tourette's Syndrome; Combined Vocal and Multiple Motor Tic Disorder; Combined Multiple Motor and Vocal Tic Disorder; Chronic Motor and Vocal Tic Disorder
A Rat Excised Larynx Model of Vocal Fold Scar

ERIC Educational Resources Information Center

Welham, Nathan V.; Montequin, Douglas W.; Tateya, Ichiro; Tateya, Tomoko; Choi, Seong Hee; Bless, Diane M.

2009-01-01

Purpose: To develop and evaluate a rat excised larynx model for the measurement of acoustic, aerodynamic, and vocal fold vibratory changes resulting from vocal fold scar. Method: Twenty-four 4-month-old male Sprague-Dawley rats were assigned to 1 of 4 experimental groups: chronic vocal fold scar, chronic vocal fold scar treated with 100-ng basic…
Site specific passive acoustic detection and densities of humpback whale calls off the coast of California

NASA Astrophysics Data System (ADS)

Helble, Tyler Adam

Passive acoustic monitoring of marine mammal calls is an increasingly important method for assessing population numbers, distribution, and behavior. Automated methods are needed to aid in the analyses of the recorded data. When a mammal vocalizes in the marine environment, the received signal is a filtered version of the original waveform emitted by the marine mammal. The waveform is reduced in amplitude and distorted due to propagation effects that are influenced by the bathymetry and environment. It is important to account for these effects to determine a site-specific probability of detection for marine mammal calls in a given study area. A knowledge of that probability function over a range of environmental and ocean noise conditions allows vocalization statistics from recordings of single, fixed, omnidirectional sensors to be compared across sensors and at the same sensor over time with less bias and uncertainty in the results than direct comparison of the raw statistics. This dissertation focuses on both the development of new tools needed to automatically detect humpback whale vocalizations from single-fixed omnidirectional sensors as well as the determination of the site-specific probability of detection for monitoring sites off the coast of California. Using these tools, detected humpback calls are "calibrated" for environmental properties using the site-specific probability of detection values, and presented as call densities (calls per square kilometer per time). A two-year monitoring effort using these calibrated call densities reveals important biological and ecological information on migrating humpback whales off the coast of California. Call density trends are compared between the monitoring sites and at the same monitoring site over time. Call densities also are compared to several natural and human-influenced variables including season, time of day, lunar illumination, and ocean noise. The results reveal substantial differences in call densities between the two sites which were not noticeable using uncorrected (raw) call counts. Additionally, a Lombard effect was observed for humpback whale vocalizations in response to increasing ocean noise. The results presented in this thesis develop techniques to accurately measure marine mammal abundances from passive acoustic sensors.
Objective Measurement of Vocal Fatigue in Classically Trained Singers: A Pilot Study of Vocal Dosimetry Data

PubMed Central

Carroll, Thomas; Nix, John; Hunter, Eric; Titze, Ingo; Abaza, Mona

2016-01-01

Objectives To evaluate vocal fatigue by using objective and subjective measurements of dose recorded by the National Center for Voice and Speech (NCVS) Dosimeter™ (Dosimeter). Study Design and Setting Seven subjects completed a two-week study period. The Dosimeter recorded vocal load, soft phonation tasks and subjective soft voice ratings. Three vocal doses (time, distance, and cycle) were measured in classical singers' larynges during an intensive practice period. Results Spikes in vocal load are reflected as harsher subjective ratings on the same day as well as 24–72 hours later. When at least 48 hours of vocal rest occurred before a vocal load, improved subjective evaluations were seen after the load. Conclusions The NCVS Dosimeter appears to be an effective tool for data collection on prolonged use of the voice. Significance This is the first multi-day study comparing objective and subjective data on vocal fatigue in a group of professional singers. PMID:17011424
The value of vocalizing: Five-month-old infants associate their own noncry vocalizations with responses from caregivers

PubMed Central

Goldstein, Michael H.; Schwade, Jennifer A.; Bornstein, Marc H.

2014-01-01

The early noncry vocalizations of infants are salient social signals. Caregivers spontaneously respond to 30-50% of these sounds, and their responsiveness to infants' prelinguistic noncry vocalizations facilitates the development of phonology and speech. Have infants learned that their vocalizations influence the behavior of social partners? If infants have learned the contingency between their vocalizing and the social responses of others, they should show an extinction burst when the contingency is removed, increasing their rate of noncry vocalizing then decreasing. Thirty-eight 5-month-olds were tested in the still-face paradigm, during which they engaged in a 2-min still-face interaction with an unfamiliar adult. When the adult assumed a still face, infants showed an extinction burst. This pattern of infant vocalizations suggests that 5-month-olds have learned the social efficacy of their vocalizations on caregivers' behavior. Furthermore, the magnitude of 5-month infants' extinction bursts predicted their language comprehension at 13 months. PMID:19489893
Vocal tract length and acoustics of vocalization in the domestic dog (Canis familiaris).

PubMed

Riede, T; Fitch, T

1999-10-01

The physical nature of the vocal tract results in the production of formants during vocalisation. In some animals (including humans), receivers can derive information (such as body size) about sender characteristics on the basis of formant characteristics. Domestication and selective breeding have resulted in a high variability in head size and shape in the dog (Canis familiaris), suggesting that there might be large differences in the vocal tract length, which could cause formant behaviour to affect interbreed communication. Lateral radiographs were made of dogs from several breeds ranging in size from a Yorkshire terrier (2.5 kg) to a German shepherd (50 kg) and were used to measure vocal tract length. In addition, we recorded an acoustic signal (growling) from some dogs. Significant correlations were found between vocal tract length, body mass and formant dispersion, suggesting that formant dispersion can deliver information about the body size of the vocalizer. Because of the low correlation between vocal tract length and the first formant, we predict a non-uniform vocal tract shape.
A fast and flexible MRI system for the study of dynamic vocal tract shaping.

PubMed

Lingala, Sajan Goud; Zhu, Yinghua; Kim, Yoon-Chul; Toutios, Asterios; Narayanan, Shrikanth; Nayak, Krishna S

2017-01-01

The aim of this work was to develop and evaluate an MRI-based system for study of dynamic vocal tract shaping during speech production, which provides high spatial and temporal resolution. The proposed system utilizes (a) custom eight-channel upper airway coils that have high sensitivity to upper airway regions of interest, (b) two-dimensional golden angle spiral gradient echo acquisition, (c) on-the-fly view-sharing reconstruction, and (d) off-line temporal finite difference constrained reconstruction. The system also provides simultaneous noise-cancelled and temporally aligned audio. The system is evaluated in 3 healthy volunteers, and 1 tongue cancer patient, with a broad range of speech tasks. We report spatiotemporal resolutions of 2.4 × 2.4 mm 2 every 12 ms for single-slice imaging, and 2.4 × 2.4 mm 2 every 36 ms for three-slice imaging, which reflects roughly 7-fold acceleration over Nyquist sampling. This system demonstrates improved temporal fidelity in capturing rapid vocal tract shaping for tasks, such as producing consonant clusters in speech, and beat-boxing sounds. Novel acoustic-articulatory analysis was also demonstrated. A synergistic combination of custom coils, spiral acquisitions, and constrained reconstruction enables visualization of rapid speech with high spatiotemporal resolution in multiple planes. Magn Reson Med 77:112-125, 2017. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Development of echolocation calls and neural selectivity for echolocation calls in the pallid bat.

PubMed

Razak, Khaleel A; Fuzessery, Zoltan M

2015-10-01

Studies of birdsongs and neural selectivity for songs have provided important insights into principles of concurrent behavioral and auditory system development. Relatively little is known about mammalian auditory system development in terms of vocalizations or other behaviorally relevant sounds. This review suggests echolocating bats are suitable mammalian model systems to understand development of auditory behaviors. The simplicity of echolocation calls with known behavioral relevance and strong neural selectivity provides a platform to address how natural experience shapes cortical receptive field (RF) mechanisms. We summarize recent studies in the pallid bat that followed development of echolocation calls and cortical processing of such calls. We also discuss similar studies in the mustached bat for comparison. These studies suggest: (1) there are different developmental sensitive periods for different acoustic features of the same vocalization. The underlying basis is the capacity for some components of the RF to be modified independent of others. Some RF computations and maps involved in call processing are present even before the cochlea is mature and well before use of echolocation in flight. Others develop over a much longer time course. (2) Normal experience is required not just for refinement, but also for maintenance, of response properties that develop in an experience independent manner. (3) Experience utilizes millisecond range changes in timing of inhibitory and excitatory RF components as substrates to shape vocalization selectivity. We suggest that bat species and call diversity provide a unique opportunity to address developmental constraints in the evolution of neural mechanisms of vocalization processing. © 2014 Wiley Periodicals, Inc.

Development of echolocation calls and neural selectivity for echolocation calls in the pallid bat

PubMed Central

Razak, Khaleel A.; Fuzessery, Zoltan M.

2014-01-01

Studies of birdsongs and neural selectivity for songs have provided important insights into principles of concurrent behavioral and auditory system development. Relatively little is known about mammalian auditory system development in terms of vocalizations, or other behaviorally relevant sounds. This review suggests echolocating bats are suitable mammalian model systems to understand development of auditory behaviors. The simplicity of echolocation calls with known behavioral relevance and strong neural selectivity provides a platform to address how natural experience shapes cortical receptive field (RF) mechanisms. We summarize recent studies in the pallid bat that followed development of echolocation calls and cortical processing of such calls. We also discuss similar studies in the mustached bat for comparison. These studies suggest: (1) there are different developmental sensitive periods for different acoustic features of the same vocalization. The underlying basis is the capacity for some components of the RF to be modified independent of others. Some RF computations and maps involved in call processing are present even before the cochlea is mature and well before use of echolocation in flight. Others develop over a much longer time course. (2) Normal experience is required not just for refinement, but also for maintenance, of response properties that develop in an experience independent manner. (3) Experience utilizes millisecond range changes in timing of inhibitory and excitatory RF components as substrates to shape vocalization selectivity. We suggest that bat species and call diversity provide a unique opportunity to address developmental constraints in the evolution of neural mechanisms of vocalization processing. PMID:25142131
Neuroanatomical Evidence for Catecholamines as Modulators of Audition and Acoustic Behavior in a Vocal Teleost.

PubMed

Forlano, Paul M; Sisneros, Joseph A

2016-01-01

The plainfin midshipman fish (Porichthys notatus) is a well-studied model to understand the neural and endocrine mechanisms underlying vocal-acoustic communication across vertebrates. It is well established that steroid hormones such as estrogen drive seasonal peripheral auditory plasticity in female Porichthys in order to better encode the male's advertisement call. However, little is known of the neural substrates that underlie the motivation and coordinated behavioral response to auditory social signals. Catecholamines, which include dopamine and noradrenaline, are good candidates for this function, as they are thought to modulate the salience of and reinforce appropriate behavior to socially relevant stimuli. This chapter summarizes our recent studies which aimed to characterize catecholamine innervation in the central and peripheral auditory system of Porichthys as well as test the hypotheses that innervation of the auditory system is seasonally plastic and catecholaminergic neurons are activated in response to conspecific vocalizations. Of particular significance is the discovery of direct dopaminergic innervation of the saccule, the main hearing end organ, by neurons in the diencephalon, which also robustly innervate the cholinergic auditory efferent nucleus in the hindbrain. Seasonal changes in dopamine innervation in both these areas appear dependent on reproductive state in females and may ultimately function to modulate the sensitivity of the peripheral auditory system as an adaptation to the seasonally changing soundscape. Diencephalic dopaminergic neurons are indeed active in response to exposure to midshipman vocalizations and are in a perfect position to integrate the detection and appropriate motor response to conspecific acoustic signals for successful reproduction.
Vocal learning beyond imitation: mechanisms of adaptive vocal development in songbirds and human infants

PubMed Central

Tchernichovski, Ofer; Marcus, Gary

2014-01-01

Studies of vocal learning in songbirds typically focus on the acquisition of sensory templates for song imitation and on the consequent process of matching song production to templates. However, functional vocal development also requires the capacity to adaptively diverge from sensory templates, and to flexibly assemble vocal units. Examples of adaptive divergence include the corrective imitation of abnormal songs, and the decreased tendency to copy overabundant syllables. Such frequency-dependent effects might mirror tradeoffs between the assimilation of group identity (culture) while establishing individual and flexibly expressive songs. Intriguingly, although the requirements for vocal plasticity vary across songbirds, and more so between birdsong and language, the capacity to flexibly assemble vocal sounds develops in a similar, stepwise manner across species. Therefore, universal features of vocal learning go well beyond the capacity to imitate. PMID:25005823
Nonlinear dynamic mechanism of vocal tremor from voice analysis and model simulations

NASA Astrophysics Data System (ADS)

Zhang, Yu; Jiang, Jack J.

2008-09-01

Nonlinear dynamic analysis and model simulations are used to study the nonlinear dynamic characteristics of vocal folds with vocal tremor, which can typically be characterized by low-frequency modulation and aperiodicity. Tremor voices from patients with disorders such as paresis, Parkinson's disease, hyperfunction, and adductor spasmodic dysphonia show low-dimensional characteristics, differing from random noise. Correlation dimension analysis statistically distinguishes tremor voices from normal voices. Furthermore, a nonlinear tremor model is proposed to study the vibrations of the vocal folds with vocal tremor. Fractal dimensions and positive Lyapunov exponents demonstrate the evidence of chaos in the tremor model, where amplitude and frequency play important roles in governing vocal fold dynamics. Nonlinear dynamic voice analysis and vocal fold modeling may provide a useful set of tools for understanding the dynamic mechanism of vocal tremor in patients with laryngeal diseases.
The Vocal Repertoire of the Domesticated Zebra Finch: a Data Driven Approach to Decipher the Information-bearing Acoustic Features of Communication Signals

PubMed Central

Elie, Julie E.; Theunissen, Frédéric E.

2018-01-01

Although a universal code for the acoustic features of animal vocal communication calls may not exist, the thorough analysis of the distinctive acoustical features of vocalization categories is important not only to decipher the acoustical code for a specific species but also to understand the evolution of communication signals and the mechanisms used to produce and understand them. Here, we recorded more than 8,000 examples of almost all the vocalizations of the domesticated zebra finch, Taeniopygia guttata: vocalizations produced to establish contact, to form and maintain pair bonds, to sound an alarm, to communicate distress or to advertise hunger or aggressive intents. We characterized each vocalization type using complete representations that avoided any a priori assumptions on the acoustic code, as well as classical bioacoustics measures that could provide more intuitive interpretations. We then used these acoustical features to rigorously determine the potential information-bearing acoustical features for each vocalization type using both a novel regularized classifier and an unsupervised clustering algorithm. Vocalization categories are discriminated by the shape of their frequency spectrum and by their pitch saliency (noisy to tonal vocalizations) but not particularly by their fundamental frequency. Notably, the spectral shape of zebra finch vocalizations contains peaks or formants that vary systematically across categories and that would be generated by active control of both the vocal organ (source) and the upper vocal tract (filter). PMID:26581377
The vocal sac of Hylodidae (Amphibia, Anura): Phylogenetic and functional implications of a unique morphology.

PubMed

Elias-Costa, Agustin J; Montesinos, Rachel; Grant, Taran; Faivovich, Julián

2017-11-01

Anuran vocal sacs are elastic chambers that recycle exhaled air during vocalizations and are present in males of most species of frogs. Most knowledge of the diversity of vocal sacs relates to external morphology; detailed information on internal anatomy is available for few groups of frogs. Frogs of the family Hylodidae, which is endemic to the Atlantic Forest of Brazil and adjacent Argentina and Paraguay, have three patterns of vocal sac morphology-that is, single, subgular; paired, lateral; and absent. The submandibular musculature and structure of the vocal sac mucosa (the internal wall of the vocal sac) of exemplar species of this family and relatives were studied. In contrast to previous accounts, we found that all species of Crossodactylus and Hylodes possess paired, lateral vocal sacs, with the internal mucosa of each sac being separate from the contralateral one. Unlike all other frogs for which data are available, the mucosa of the vocal sacs in these genera is not supported externally by the mm. intermandibularis and interhyoideus. Rather, the vocal sac mucosa projects through the musculature and is free in the submandibular lymphatic sac. The presence of paired, lateral vocal sacs, the internal separation of the sac mucosae, and their projection through the m. interhyoideus are synapomorphies of the family. Furthermore, the specific configuration of the m. interhyoideus allows asymmetric inflation of paired vocal sacs, a feature only reported in species of these diurnal, stream-dwelling frogs. © 2017 Wiley Periodicals, Inc.
Oral breathing challenge in participants with vocal attrition.

PubMed

Sivasankar, Mahalakshmi; Fisher, Kimberly V

2003-12-01

Vocal folds undergo osmotic challenge by mouth breathing during singing, exercising, and loud speaking. Just 15 min of obligatory oral breathing, to dry the vocal folds, increases phonation threshold pressure (Pth) and expiratory vocal effort in healthy speakers (M. Sivasankar & K. Fisher, 2002). We questioned whether oral breathing is more detrimental to phonation in healthy participants with a history of temporary vocal attrition. The effects of a 15-min oral or nasal breathing challenge on Pth and perceived expiratory vocal effort were compared for participants reporting symptoms of vocal attrition (N = 18, ages 19-38 years) and normal controls (N = 20, ages 19-33 years). Post-challenge-prechallenge differences in Pth (deltaPth) and effort (deltaEffort) revealed that oral breathing, but not nasal breathing, increased Pth (p < .001 ) and effort (p < .001) at low, comfortable, and high pitch. deltaPth was significantly greater in participants with vocal attrition than in normal controls (p < .001). Nasal breathing reduced Pth for all controls but not for all participants reporting vocal attrition. deltaPth was significantly and linearly correlated with deltaEffort (rvocal attrition = .81, p < .001; rcontrol = .84, p < .001). We speculate that the greater increases in Pth in participants reporting vocal attrition may result from delayed or inadequate compensatory response to superficial laryngeal dehydration. Obligatory oral breathing may place voice users at risk for exacerbating vocal attrition. That sol layer depletion by obligatory oral breathing increased Pth and vocal effort provides support for the role of superficial hydration in maintaining ease of phonation.
Intra- and interobserver agreement for fetal cerebral measurements in 3D-ultrasonography.

PubMed

Albers, Maria E W A; Buisman, Erato T I A; Kahn, René S; Franx, Arie; Onland-Moret, N Charlotte; de Heus, Roel

2018-04-10

The aim of this study is to evaluate intra- and interobserver agreement for measurement of intracranial, cerebellar, and thalamic volume with the Virtual Organ Computer-aided AnaLysis (VOCAL) technique in three-dimensional ultrasound images, in comparison to two-dimensional measurements of these brain structures. Three-dimensional ultrasound images of the brains of 80 fetuses at 20-24 weeks' gestational age were obtained from YOUth, a Dutch prospective cohort study. Two observers performed offline measurement of the occipitofrontal diameter, intracranial volume, transcerebellar diameter, cerebellar volume, and thalamic width, area, and volume, independently. VOCAL was used for calculation of the volumes. The two-way random, single measures intraclass correlation coefficient (ICC) was used for analysis of agreement and Bland-Altman plots were configured. Intra- and interobserver agreement was almost perfect for occipitofrontal diameter (intra ICC 0.88, 95% CI 0.82-0.92; inter ICC 0.91, 95% CI 0.85-0.94), intracranial volume (intra ICC 0.96, 95% CI 0.91-0.98; inter ICC 0.97, 95% CI 0.96-0.98) and transcerebellar diameter (intra ICC 0.91, 95% CI 0.86-0.94; inter ICC 0.86, 95% CI 0.78-0.910). For cerebellar volume, the intraobserver agreement was almost perfect (0.85, 95% CI 0.76-0.90), whereas the interobserver agreement was substantial (0.75, 95% CI 0.44-0.88). Agreement was only moderate for thalamic measurements. Bland-Altman plots for the volume measurements are normally distributed with acceptable mean differences and 95% limits of agreement. The intra- and interobserver agreement of the measurement of intracranial and cerebellar volume with VOCAL was almost perfect. These measurements are therefore reliable, and can be used to investigate fetal brain development. Thalamic measurements are not reliable enough. © 2018 Wiley Periodicals, Inc.
Towards endoscopic ultrafast laser microsurgery of vocal folds

NASA Astrophysics Data System (ADS)

Hoy, Christopher L.; Everett, W. Neil; Yildirim, Murat; Kobler, James; Zeitels, Steven M.; Ben-Yakar, Adela

2012-03-01

Vocal fold scarring is a predominant cause of voice disorders yet lacks a reliable treatment method. The injection of soft biomaterials to improve mechanical compliance of the vocal folds has emerged as a promising treatment. Here, we study the use of precise femtosecond laser microsurgery to ablate subsurface voids, with a goal of eventually creating a plane in dense subepithelial scar tissue into which biomaterials can be injected for their improved localization. Specifically, we demonstrate the ablation of small subepithelial voids in porcine vocal fold tissue up to 120 µm below the surface such that larger voids in the active area of vocal fold mucosa (~3×10 mm2) can eventually be ablated in about 3 min. We use sub-µJ, 776-nm pulses from a compact femtosecond fiber laser system operating at a 500-kHz repetition rate. The use of relatively high repetition rates, with a small number of overlapping pulses, is critical to achieving ablation in a very short time while still avoiding significant heat deposition. Additionally, we use the same laser for nonlinear optical imaging to provide visual feedback of tissue structure and to confirm successful ablation. The ablation parameters, including pulse duration, pulse energy, spot size, and scanning speed, are comparable to the specifications in our recently developed miniaturized femtosecond laser surgery probes, illustrating the feasibility of developing an ultrafast laser surgical instrument.
Vocal cord paralysis after surgery to the descending thoracic aorta via left posterolateral thoracotomy.

PubMed

Ohta, Noriyuki; Mori, Takahiko

2007-11-01

Vocal cord paralysis is one of the frequently encountered complications after aortic surgery. However, reports of vocal cord paralysis after aortic surgery have been limited. In a retrospective cohort study of vocal cord paralysis after aortic surgery at a general hospital, we sought factors related to its development after aortic surgery to the descending thoracic aorta via left posterolateral thoracotomy. We reviewed data for a total of 69 patients who, between 1989 and 1995, underwent aortic surgery to the descending thoracic aorta. We assessed factors associated with the development of vocal cord paralysis and postoperative complications. Postoperative vocal cord paralysis appeared in 19 patients. Multiple logistic regression analysis revealed two risk factors for vocal cord paralysis: chronic dilatation of the aorta at the left subclavian artery (odds ratio = 8.67) and anastomosis proximal to the left subclavian artery (odds ratio = 17.7). The duration of mechanical ventilation was significantly prolonged for patients with vocal cord paralysis. Certain surgical factors associated with left subclavian artery increase the risk of vocal cord paralysis after surgery on the descending thoracic aorta. Vocal cord paralysis after aortic surgery did not increase aspiration pneumonia but was associated with pulmonary complications.
Convergence of calls as animals form social bonds, active compensation for noisy communication channels, and the evolution of vocal learning in mammals.

PubMed

Tyack, Peter L

2008-08-01

The classic evidence for vocal production learning involves imitation of novel, often anthropogenic sounds. Among mammals, this has been reported for dolphins, elephants, harbor seals, and humans. A broader taxonomic distribution has been reported for vocal convergence, where the acoustic properties of calls from different individuals converge when they are housed together in captivity or form social bonds in the wild. Vocal convergence has been demonstrated for animals as diverse as songbirds, parakeets, hummingbirds, bats, elephants, cetaceans, and primates. For most species, call convergence is thought to reflect a group-distinctive identifier, with shared calls reflecting and strengthening social bonds. A ubiquitous function for vocal production learning that is starting to receive attention involves modifying signals to improve communication in a noisy channel. Pooling data on vocal imitation, vocal convergence, and compensation for noise suggests a wider taxonomic distribution of vocal production learning among mammals than has been generally appreciated. The wide taxonomic distribution of this evidence for vocal production learning suggests that perhaps more of the neural underpinnings for vocal production learning are in place in mammals than is usually recognized. (c) 2008 APA, all rights reserved
Assessing and treating vocal stereotypy in children with autism.

PubMed

Ahearn, William H; Clark, Kathy M; MacDonald, Rebecca P F; Chung, Bo In

2007-01-01

Previous research implies that stereotypic behavior tends to be maintained by the sensory consequences produced by engaging in the response. Few investigations, however, have focused on vocal stereotypy. The current study examined the noncommunicative vocalizations of 4 children with an autism spectrum disorder. First, functional analyses were conducted in an attempt to identify the function of each child's behavior. For each of the participants, it was found that vocal stereotypy was likely not maintained by the social consequences. Following assessment, response interruption and redirection (RIRD) was implemented in an ABAB design to determine whether vocal stereotypy could be successfully redirected. RIRD involved a teacher issuing a series of vocal demands the child readily complied with during regular academic programming. Vocal demands were presented contingent on the occurrence of vocal stereotypy and were continuously presented until the child complied with three consecutively issued demands without emitting vocal stereotypy. For each child, RIRD produced levels of vocal stereotypy substantially lower than those observed in baseline. For 3 of the children, an increase in appropriate communication was also observed. The children's teachers were trained to implement RIRD. Brief follow-up probes and anecdotal information implied that the treatment had a positive impact in the natural environment.
Assessing and Treating Vocal Stereotypy in children with Autism

PubMed Central

Ahearn, William H; Clark, Kathy M; MacDonald, Rebecca P.F; In Chung, Bo

2007-01-01

Previous research implies that stereotypic behavior tends to be maintained by the sensory consequences produced by engaging in the response. Few investigations, however, have focused on vocal stereotypy. The current study examined the noncommunicative vocalizations of 4 children with an autism spectrum disorder. First, functional analyses were conducted in an attempt to identify the function of each child's behavior. For each of the participants, it was found that vocal stereotypy was likely not maintained by the social consequences. Following assessment, response interruption and redirection (RIRD) was implemented in an ABAB design to determine whether vocal stereotypy could be successfully redirected. RIRD involved a teacher issuing a series of vocal demands the child readily complied with during regular academic programming. Vocal demands were presented contingent on the occurrence of vocal stereotypy and were continuously presented until the child complied with three consecutively issued demands without emitting vocal stereotypy. For each child, RIRD produced levels of vocal stereotypy substantially lower than those observed in baseline. For 3 of the children, an increase in appropriate communication was also observed. The children's teachers were trained to implement RIRD. Brief follow-up probes and anecdotal information implied that the treatment had a positive impact in the natural environment. PMID:17624067
The relationship between prelinguistic vocalization and later expressive vocabulary in young children with developmental delay.

PubMed

McCathren, R B; Yoder, P J; Warren, S F

1999-08-01

This study tested the relationship between prelinguistic vocalization and expressive vocabulary 1 year later in young children with mild to moderate developmental delays. Three vocalization variables were tested: rate of all vocalization, rate of vocalizations with consonants, and rate of vocalizations used interactively. The 58 toddlers in the study were 17-34 months old, not sensory impaired, and had Bayley Mental Development Indices (Bayley, 1969; Bayley, 1993) from 35-85. In addition, the children had fewer than 3 words in their expressive vocabularies and during classroom observation each showed at least one instance of intentional prelinguistic communication before testing. Selected sections of the Communication and Symbolic Behavior Scales procedures (CSBS; Wetherby & Prizant, 1993) were administered at the beginning and at the end of the study. The vocal measures were obtained in the initial CSBS session. One measure of expressive vocabulary was obtained in the CSBS session at the end of the study. In addition, expressive vocabulary was measured in a nonstructured play session at the end of the study. We predicted that rate of vocalization, rate of vocalizations with consonants, and rate of vocalizations used interactively would all be positively related to later expressive vocabulary. The results confirmed the predictions.
Comprehensive Outcome Researches of Intralesional Steroid Injection on Benign Vocal Fold Lesions.

PubMed

Wang, Chi-Te; Lai, Mei-Shu; Hsiao, Tzu-Yu

2015-09-01

This study investigated multidimensional treatment outcomes, including prognostic factors and side effects of vocal fold steroid injection (VFSI). We recruited 126 consecutive patients, including patients with 49 nodules, 47 polyps, and 30 mucus retention cysts. All the patients received VFSI under local anesthesia in the office settings. Treatment outcomes were evaluated 1 and 2 months after the procedure, including endoscopic evaluation, perceptual voice quality (GRB scores), acoustic analysis, and 10-item Voice Handicap Index (VHI-10). More than 80% of the patients reported subjective improvements after VFSI. Objective measurements revealed significant improvements from baseline in most of the outcome parameters (P<0.05). Higher occupational vocal demands and fibrotic vocal nodules were significantly associated with poorer clinical responses as measured by the VHI-10 and GRB scores, respectively. For vocal polyps, dysphonia for more than 12 months were significantly associated with higher postoperative VHI-10 scores, whereas patients with laryngopharyngeal reflux (LPR) showed significantly poor postoperative voice quality as measured by GRB scores. Side effects after VFSI included hematoma (27%), triamcinolone deposits (4%), and vocal atrophy (1%), which resolved spontaneously within 1-2 months. Presentation with vocal fold ectasias/varicosities and higher vocal demands were significantly correlated with postoperative vocal hematoma. This study demonstrated significant improvements after VFSI in vocal nodules, polyps, and cysts. Occupational vocal demand and subtypes of vocal nodules are closely related to the treatment outcomes after VFSI, whereas symptom duration and LPR were significant prognostic factors for VFSI treatment outcomes in vocal polyps. Side effects after receiving VFSI were mostly self-limited without sequel, whereas the incidence rates might be varied by the injection approach and the timing for postoperative follow-up. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Functional morphology of the Alligator mississippiensis larynx with implications for vocal production.

PubMed

Riede, Tobias; Li, Zhiheng; Tokuda, Isao T; Farmer, Colleen G

2015-04-01

Sauropsid vocalization is mediated by the syrinx in birds and the larynx in extant reptiles; but whereas avian vocal production has received much attention, the vocal mechanism of basal reptilians is poorly understood. The American alligator (Alligator mississippiensis) displays a large vocal repertoire during mating and in parent-offspring interactions. Although vocal outputs of these behaviors have received some attention, the underlying mechanism of sound production remains speculative. Here, we investigate the laryngeal anatomy of juvenile and adult animals by macroscopic and histological methods. Observations of the cartilaginous framework and associated muscles largely corroborate earlier findings, but one muscle, the cricoarytenoideus, exhibits a heretofore unknown extrinsic insertion that has important implications for effective regulation of vocal fold length and tension. Histological investigation of the larynx revealed a layered vocal fold morphology. The thick lamina propria consists of non-homogenous extracellular matrix containing collagen fibers that are tightly packed below the epithelium but loosely organized deep inside the vocal fold. We found few elastic fibers but comparatively high proportions of hyaluronan. Similar organizational complexity is also seen in mammalian vocal folds and the labia of the avian syrinx: convergent morphologies that suggest analogous mechanisms for sound production. In tensile tests, alligator vocal folds demonstrated a linear stress-strain behavior in the low strain region and nonlinear stress responses at strains larger than 15%, which is similar to mammalian vocal fold tissue. We have integrated morphological and physiological data in a two-mass vocal fold model, providing a systematic description of the possible acoustic space that could be available to an alligator larynx. Mapping actual call production onto possible acoustic space validates the model's predictions. © 2015. Published by The Company of Biologists Ltd.
Precise Motor Control Enables Rapid Flexibility in Vocal Behavior of Marmoset Monkeys.

PubMed

Pomberger, Thomas; Risueno-Segovia, Cristina; Löschner, Julia; Hage, Steffen R

2018-03-05

Investigating the evolution of human speech is difficult and controversial because human speech surpasses nonhuman primate vocal communication in scope and flexibility [1-3]. Monkey vocalizations have been assumed to be largely innate, highly affective, and stereotyped for over 50 years [4, 5]. Recently, this perception has dramatically changed. Current studies have revealed distinct learning mechanisms during vocal development [6-8] and vocal flexibility, allowing monkeys to cognitively control when [9, 10], where [11], and what to vocalize [10, 12, 13]. However, specific call features (e.g., duration, frequency) remain surprisingly robust and stable in adult monkeys, resulting in rather stereotyped and discrete call patterns [14]. Additionally, monkeys seem to be unable to modulate their acoustic call structure under reinforced conditions beyond natural constraints [15, 16]. Behavioral experiments have shown that monkeys can stop sequences of calls immediately after acoustic perturbation but cannot interrupt ongoing vocalizations, suggesting that calls consist of single impartible pulses [17, 18]. Using acoustic perturbation triggered by the vocal behavior itself and quantitative measures of resulting vocal adjustments, we show that marmoset monkeys are capable of producing calls with durations beyond the natural boundaries of their repertoire by interrupting ongoing vocalizations rapidly after perturbation onset. Our results indicate that marmosets are capable of interrupting vocalizations only at periodic time points throughout calls, further supported by the occurrence of periodically segmented phees. These ideas overturn decades-old concepts on primate vocal pattern generation, indicating that vocalizations do not consist of one discrete call pattern but are built of many sequentially uttered units, like human speech. Copyright © 2018 The Author(s). Published by Elsevier Ltd.. All rights reserved.
The Distribution and Severity of Tremor in Speech Structures of Persons with Vocal Tremor.

PubMed

Hemmerich, Abby L; Finnegan, Eileen M; Hoffman, Henry T

2017-05-01

Vocal tremor may be associated with cyclic oscillations in the pulmonary, laryngeal, velopharyngeal, or oral regions. This study aimed to correlate the overall severity of vocal tremor with the distribution and severity of tremor in structures involved. Endoscopic and clinical examinations were completed on 20 adults with vocal tremor and two age-matched controls during sustained phonation. Two judges rated the severity of vocal tremor and the severity of tremor affecting each of 13 structures. Participants with mild vocal tremor typically presented with tremor in three laryngeal structures, moderate vocal tremor in five structures (laryngeal and another region), and severe vocal tremor in eight structures affecting all regions. The severity of tremor was lowest (mean = 1.2 out of 3) in persons with mild vocal tremor and greater in persons with moderate (mean = 1.5) and severe vocal tremor (mean = 1.4). Laryngeal structures were most frequently (95%) and severely (1.7 out of 3) affected, followed by velopharynx (40% occurrence, 1.3 severity), pulmonary (40% occurrence, 1.1 severity), and oral (40% occurrence, 1.0 severity) regions. Regression analyses indicated tremor severity of the supraglottic structures, and vertical laryngeal movement contributed most to vocal tremor severity during sustained phonation (r = 0.77, F = 16.17, P < 0.0001). A strong positive correlation (r = 0.72) was found between the Tremor Index and the severity of the vocal tremor during sustained phonation. It is useful to obtain a wide endoscopic view of the larynx to visualize tremor, which is rarely isolated to the true vocal folds alone. Published by Elsevier Inc.
Current Understanding and Future Directions for Vocal Fold Mechanobiology

PubMed Central

Li, Nicole Y.K.; Heris, Hossein K.; Mongeau, Luc

2013-01-01

The vocal folds, which are located in the larynx, are the main organ of voice production for human communication. The vocal folds are under continuous biomechanical stress similar to other mechanically active organs, such as the heart, lungs, tendons and muscles. During speech and singing, the vocal folds oscillate at frequencies ranging from 20 Hz to 3 kHz with amplitudes of a few millimeters. The biomechanical stress associated with accumulated phonation is believed to alter vocal fold cell activity and tissue structure in many ways. Excessive phonatory stress can damage tissue structure and induce a cell-mediated inflammatory response, resulting in a pathological vocal fold lesion. On the other hand, phonatory stress is one major factor in the maturation of the vocal folds into a specialized tri-layer structure. One specific form of vocal fold oscillation, which involves low impact and large amplitude excursion, is prescribed therapeutically for patients with mild vocal fold injuries. Although biomechanical forces affect vocal fold physiology and pathology, there is little understanding of how mechanical forces regulate these processes at the cellular and molecular level. Research into vocal fold mechanobiology has burgeoned over the past several years. Vocal fold bioreactors are being developed in several laboratories to provide a biomimic environment that allows the systematic manipulation of physical and biological factors on the cells of interest in vitro. Computer models have been used to simulate the integrated response of cells and proteins as a function of phonation stress. The purpose of this paper is to review current research on the mechanobiology of the vocal folds as it relates to growth, pathogenesis and treatment as well as to propose specific research directions that will advance our understanding of this subject. PMID:24812638
Vocal cysts: clinical, endoscopic, and surgical aspects.

PubMed

Martins, Regina Helena Garcia; Santana, Marcela Ferreira; Tavares, Elaine Lara Mendes

2011-01-01

Vocal cysts are benign laryngeal lesions, which affect children and adults. They can be classified as epidermic or mucous-retention cyst. The objective was to study the clinical, endoscopic, and surgical aspects of vocal cysts. We reviewed the medical charts of 72 patients with vocal cysts, considering age, gender, occupation, time of vocal symptoms, nasosinusal and gastroesophageal symptoms, vocal abuse, tabagism, alcoholism, associated lesions, treatment, and histological details. Of the 72 cases, 46 were adults (36 females and 10 male) and 26 were children (eight girls and 18 boys). As far as occupation is concerned, there was a higher incidence of students and teachers. All the patients had symptoms of chronic hoarseness. Nasosinusal (27.77%) and gastroesophageal (32%) symptoms were not relevant. Vocal abuse was reported by 45.83%, smoking by 18%, and alcoholism by 8.4% of the patients. Unilateral cysts were seen in 93% of the cases, 22 patients had associated lesions, such as bridge, sulcus vocalis, and microweb. Surgical treatment was performed in 46 cases. Histological analysis of the epidermic cysts revealed a cavity with caseous content, covered by stratified squamous epithelium, often keratinized. Mucous cysts presented mucous content, and the walls were coated by a cylindrical ciliated epithelium. Vocal cysts are benign vocal fold lesions that affect children and adults, being often associated with vocal overuse, which frequently affects people who use their voices professionally. Vocal symptoms are chronic in course, often times since childhood, and the treatment of choice is surgical removal. A careful examination of the vocal folds is necessary during surgery, because other laryngeal lesions may be associated with vocal cysts. Copyright Â© 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.

Microscopie non-lineaire pour l'imagerie des cordes vocales

NASA Astrophysics Data System (ADS)

Deterre, Romain

The vocal cords are two folds of epithelial tissues located in the larynx and are involved in production of the human voice. Despite their apparent simplicity, their internal structure is complex. Each fold can be divided into several layers with different mechanical properties. The gold standard for studying their structure - histology - has the inconvenience of being very invasive. Non-linear microscopy is an optical imaging technique which allows images to be taken in depth within samples in a non invasive manner. It also offers intrinsic contrasts, allowing the identification of certain fibrous proteins - elastin and collagen - which are responsible for the mechanical properties of epithelious tissues. The main goal of this research project was to assess nonlinear microscopy's performances for vocal fold imaging. The study has been broken down in two separate tasks. The first one was to evaluate the nonlinear modalities contrast against histology. For that purpose, we chose to first take images of thin samples and compare them to the corresponding histological slides. The second task was to make tests to transcribe the results obtained to in vivo imaging. A custom-built nonlinear imaging system was used for these experiments. It was developed to allow acquisition of wide-field images. A C++ based software was developped to control the microscope and allow treatment and visualization of the images. After being built, the system was further tested to check its performances in comparison with the theoretical limit as described in the literature. Thin slices of vocal folds were obtained from the team of Pr Christopher J. Hartnick from Massachusetts Eye and Ear Infirmary, Harvard Medical School. Specialists from his team analysed the histological samples to extract structural data from the vocal folds. A good correlation was measured between histological and nonlinear data. A first step in evaluating the possibility for translating these results towards in vivo imaging was performed during this project. A swine's larynx was obtained, and vocal folds were extracted for imaging purposes. This experiment showed that it is indeed possible to localize various macrostructures of the tissues with nonlinear microscopy.
Interactive voice technology: Variations in the vocal utterances of speakers performing a stress-inducing task

NASA Astrophysics Data System (ADS)

Mosko, J. D.; Stevens, K. N.; Griffin, G. R.

1983-08-01

Acoustical analyses were conducted of words produced by four speakers in a motion stress-inducing situation. The aim of the analyses was to document the kinds of changes that occur in the vocal utterances of speakers who are exposed to motion stress and to comment on the implications of these results for the design and development of voice interactive systems. The speakers differed markedly in the types and magnitudes of the changes that occurred in their speech. For some speakers, the stress-inducing experimental condition caused an increase in fundamental frequency, changes in the pattern of vocal fold vibration, shifts in vowel production and changes in the relative amplitudes of sounds containing turbulence noise. All speakers showed greater variability in the experimental condition than in more relaxed control situation. The variability was manifested in the acoustical characteristics of individual phonetic elements, particularly in speech sound variability observed serve to unstressed syllables. The kinds of changes and variability observed serve to emphasize the limitations of speech recognition systems based on template matching of patterns that are stored in the system during a training phase. There is need for a better understanding of these phonetic modifications and for developing ways of incorporating knowledge about these changes within a speech recognition system.
Dopaminergic Contributions to Vocal Learning

PubMed Central

Hoffmann, Lukas A.; Saravanan, Varun; Wood, Alynda N.; He, Li

2016-01-01

Although the brain relies on auditory information to calibrate vocal behavior, the neural substrates of vocal learning remain unclear. Here we demonstrate that lesions of the dopaminergic inputs to a basal ganglia nucleus in a songbird species (Bengalese finches, Lonchura striata var. domestica) greatly reduced the magnitude of vocal learning driven by disruptive auditory feedback in a negative reinforcement task. These lesions produced no measureable effects on the quality of vocal performance or the amount of song produced. Our results suggest that dopaminergic inputs to the basal ganglia selectively mediate reinforcement-driven vocal plasticity. In contrast, dopaminergic lesions produced no measurable effects on the birds' ability to restore song acoustics to baseline following the cessation of reinforcement training, suggesting that different forms of vocal plasticity may use different neural mechanisms. SIGNIFICANCE STATEMENT During skill learning, the brain relies on sensory feedback to improve motor performance. However, the neural basis of sensorimotor learning is poorly understood. Here, we investigate the role of the neurotransmitter dopamine in regulating vocal learning in the Bengalese finch, a songbird with an extremely precise singing behavior that can nevertheless be reshaped dramatically by auditory feedback. Our findings show that reduction of dopamine inputs to a region of the songbird basal ganglia greatly impairs vocal learning but has no detectable effect on vocal performance. These results suggest a specific role for dopamine in regulating vocal plasticity. PMID:26888928
Conversational Entrainment of Vocal Fry in Young Adult Female American English Speakers.

PubMed

Borrie, Stephanie A; Delfino, Christine R

2017-07-01

Conversational entrainment, the natural tendency for people to modify their behaviors to more closely match their communication partner, is examined as one possible mechanism modulating the prevalence of vocal fry in the speech of young American women engaged in spoken dialogue. Twenty young adult female American English speakers engaged in two spoken dialogue tasks-one with a young adult female American English conversational partner who exhibited substantial vocal fry and one with a young adult female American English conversational partner who exhibited quantifiably less vocal fry. Dialogues were analyzed for proportion of vocal fry, by speaker, and two measures of communicative success (efficiency and enjoyment). Participants employed significantly more vocal fry when conversing with the partner who exhibited substantial vocal fry than when conversing with the partner who exhibited quantifiably less vocal fry. Further, greater similarity between communication partners in their use of vocal fry tracked with higher scores of communicative efficiency and communicative enjoyment. Conversational entrainment offers a mechanistic framework that may be used to explain, to some degree, the frequency with which vocal fry is employed by young American women engaged in spoken dialogue. Further, young American women who modulated their vocal patterns during dialogue to match those of their conversational partner gained more efficiency and enjoyment from their interactions, demonstrating the cognitive and social benefits of entrainment. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Assessing Vocal Development in Infants and Toddlers

PubMed Central

Nathani, Suneeti; Ertmer, David J.

2012-01-01

The purpose of this study was to examine changes in prelinguistic vocal productions during the first 20 months of life. Vocalizations were classified into 23 mutually exclusive and exhaustive types, and grouped into five ascending levels using the Stark Assessment of Early Vocal Development-Revised (SAEVD-R). Data from 30 typically developing infants, aged 0–20 months, show that older infants attained higher developmental levels on the SAEVD-R than younger infants. Infants 0–2, 3–5, and 6–8 months of age primarily produced vocalizations from Levels 1 (Reflexive), 2 (Control of Phonation), and 3 (Expansion). Infants 9–20 months of age also produced vocalizations from Level 4 (Basic Canonical Syllables). Only infants from 16–20 months of age produced Level 5 (Advanced Forms) vocalizations in significant quantities. The outcomes indicate that the SAEVD-R is a valuable instrument for evaluating prelinguistic vocal development. PMID:16728333
Phonetogram changes for trained singers over a nine-month period of vocal training.

PubMed

LeBorgne, Wendy DeLeo; Weinrich, Barbara D

2002-03-01

Professional vocalists encounter demands requiring voluntary control of phonation, while utilizing a considerable range of frequency and intensity. These quantifiable acoustic events can be measured and represented in a phonetogram. Previous research has compared the phonetograms of trained and untrained voices and found significant differences between these groups. This study was designed to assess the effects of vocal training for singers over a period of nine months. Phonetogram contour changes were examined, with the primary focus on expansion of frequency range and/or intensity control. Twenty-one first-year, master's level, vocal music students, who were engaged in an intensive vocal performance curriculum, participated in this study. Following nine months of vocal training, significant differences were revealed in the subjects' mean frequency range and minimum vocal intensity across frequency levels. There was no significant difference for the mean maximum vocal intensity across frequency levels following vocal training.
The Effect of Vocal Hygiene and Behavior Modification Instruction on the Self-Reported Vocal Health Habits of Public School Music Teachers

ERIC Educational Resources Information Center

Hackworth, Rhonda S.

2007-01-01

This study examined the effects of vocal hygiene and behavior modification instruction on self-reported behaviors of music teachers. Subjects (N = 76) reported daily behaviors for eight weeks: water consumption, warm-up, talking over music/noise, vocal rest, nonverbal commands, and vocal problems. Subjects were in experimental group 1 or 2, or the…
Cross-system effects of dysphagia treatment on dysphonia: a case report

PubMed Central

LaGorio, Lisa A; Carnaby-Mann, Giselle D; Crary, Michael A

2008-01-01

Traditionally, treatment of dysphagia and dysphonia has followed a specificity approach whereby treatment plans have focused on each dysfunction individually. Recently however, a therapeutic cross-system effect has been proposed between these two dysfunctions. At least one study has demonstrated swallowing improvement in subjects who completed a dysphonia treatment program. However, we are unaware of any evidence demonstrating the converse effect. In this paper, we present a case-report of a 74 year old male who demonstrated improvement in selected vocal parameters after completion of a dysphagia therapy program. Dysphagia therapy resulted in improved laryngeal function in this subject. Results implicate improved vocal fold tension with increased glottal closure. Further investigation into the potential for this cross-system effect is warranted. PMID:18667069
Preservation of viscoelastic properties of rabbit vocal folds after implantation of hyaluronic Acid-based biomaterials.

PubMed

Choi, Jeong-Seok; Kim, Nahn Ju; Klemuk, Sarah; Jang, Yun Ho; Park, In Suh; Ahn, Kyung Hyun; Lim, Jae-Yol; Kim, Young-Mo

2012-09-01

To compare the rheological characteristics of structurally different hyaluronic acid (HA)-based biomaterials that are presently used for phonosurgery and to investigate their influence on the viscoelastic properties of vocal folds after implantation in an in vivo rabbit model. In vitro and in vivo rheometric investigation. Experimental laboratory, Inha and Seoul National Universities. Viscoelastic shear properties of 3 HA-based biomaterials (Rofilan, Restylane, and Reviderm) were measured with a strain-controlled rheometer. These biomaterials were injected into the deep layers of rabbit vocal folds, and viscoelastic moduli of the injected vocal folds were determined 2 months after the injection. The vocal fold specimens were observed using a light microscope and a transmission electron microscope. All HA-based biomaterials showed similar levels of shear viscosity, which were slightly higher than that of human vocal folds reported in previous studies. Compared with noninjected control vocal folds, there were no significant differences in the magnitudes of both elastic shear modulus (G') and viscous modulus (G") of injected vocal folds among all of the materials. Light microscopic images showed that all materials were observed in the deep layers of vocal folds and electron scanning images revealed that injected HA particles were homogeneously distributed in regions of collagenous fibers. HA-based biomaterials could preserve the viscoelastic properties of the vocal folds, when they were injected into vocal folds in an in vivo rabbit model. However, further studies on the influence of the biomaterials on the viscoelasticity of human vocal folds in ECM surroundings are still needed.
Voice Use Among Music Theory Teachers: A Voice Dosimetry and Self-Assessment Study.

PubMed

Schiller, Isabel S; Morsomme, Dominique; Remacle, Angélique

2017-07-25

This study aimed (1) to investigate music theory teachers' professional and extra-professional vocal loading and background noise exposure, (2) to determine the correlation between vocal loading and background noise, and (3) to determine the correlation between vocal loading and self-evaluation data. Using voice dosimetry, 13 music theory teachers were monitored for one workweek. The parameters analyzed were voice sound pressure level (SPL), fundamental frequency (F0), phonation time, vocal loading index (VLI), and noise SPL. Spearman correlation was used to correlate vocal loading parameters (voice SPL, F0, and phonation time) and noise SPL. Each day, the subjects self-assessed their voice using visual analog scales. VLI and self-evaluation data were correlated using Spearman correlation. Vocal loading parameters and noise SPL were significantly higher in the professional than in the extra-professional environment. Voice SPL, phonation time, and female subjects' F0 correlated positively with noise SPL. VLI correlated with self-assessed voice quality, vocal fatigue, and amount of singing and speaking voice produced. Teaching music theory is a profession with high vocal demands. More background noise is associated with increased vocal loading and may indirectly increase the risk for voice disorders. Correlations between VLI and self-assessments suggest that these teachers are well aware of their vocal demands and feel their effect on voice quality and vocal fatigue. Visual analog scales seem to represent a useful tool for subjective vocal loading assessment and associated symptoms in these professional voice users. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
The Interaction of Surface Hydration and Vocal Loading on Voice Measures.

PubMed

Fujiki, Robert Brinton; Chapleau, Abigail; Sundarrajan, Anusha; McKenna, Victoria; Sivasankar, M Preeti

2017-03-01

Vocal loading tasks provide insight regarding the mechanisms underlying healthy laryngeal function. Determining the manner in which the larynx can most efficiently be loaded is a complex task. The goal of this study was to determine if vocal loading could be achieved in 30 minutes by altering phonatory mode. Owing to the fact that surface hydration facilitates efficient vocal fold oscillation, the effects of environmental humidity on vocal loading were also examined. This study also investigated whether the detrimental effects of vocal loading could be attenuated by increasing environmental humidity. Sixteen vocally healthy adults (8 men, 8 women) completed a 30-minute vocal loading task in low and moderate humidity. The order of humidities was counterbalanced across subjects. The vocal loading task consisted of reading with elevated pitch and pressed vocal quality and low pitch and pressed and/or raspy vocal quality in the presence of 65 dB ambient, multi-talker babble noise. Significant effects were observed for (1) cepstral peak prominence on soft sustained phonation at 10th and 80th pitches, (2) perceived phonatory effort, and (3) perceived tiredness ratings. No loading effects were observed for cepstral peak prominence on the rainbow passage, although fundamental frequency on the rainbow passage increased post loading. No main effect was observed for humidity. Following a 30-minute vocal loading task involving altering laryngeal vibratory mode in combination with increased volume. Also, moderate environmental humidity did not significantly attenuate the negative effects of loading. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Convergent Differential Regulation of Parvalbumin in the Brains of Vocal Learners

PubMed Central

Hara, Erina; Rivas, Miriam V.; Ward, James M.; Okanoya, Kazuo; Jarvis, Erich D.

2012-01-01

Spoken language and learned song are complex communication behaviors found in only a few species, including humans and three groups of distantly related birds – songbirds, parrots, and hummingbirds. Despite their large phylogenetic distances, these vocal learners show convergent behaviors and associated brain pathways for vocal communication. However, it is not clear whether this behavioral and anatomical convergence is associated with molecular convergence. Here we used oligo microarrays to screen for genes differentially regulated in brain nuclei necessary for producing learned vocalizations relative to adjacent brain areas that control other behaviors in avian vocal learners versus vocal non-learners. A top candidate gene in our screen was a calcium-binding protein, parvalbumin (PV). In situ hybridization verification revealed that PV was expressed significantly higher throughout the song motor pathway, including brainstem vocal motor neurons relative to the surrounding brain regions of all distantly related avian vocal learners. This differential expression was specific to PV and vocal learners, as it was not found in avian vocal non-learners nor for control genes in learners and non-learners. Similar to the vocal learning birds, higher PV up-regulation was found in the brainstem tongue motor neurons used for speech production in humans relative to a non-human primate, macaques. These results suggest repeated convergent evolution of differential PV up-regulation in the brains of vocal learners separated by more than 65–300 million years from a common ancestor and that the specialized behaviors of learned song and speech may require extra calcium buffering and signaling. PMID:22238614
Evolution of Courtship Songs in Xenopus : Vocal Pattern Generation and Sound Production.

PubMed

Leininger, Elizabeth C; Kelley, Darcy B

2015-01-01

The extant species of African clawed frogs (Xenopus and Silurana) provide an opportunity to link the evolution of vocal characters to changes in the responsible cellular and molecular mechanisms. In this review, we integrate several robust lines of research: evolutionary trajectories of Xenopus vocalizations, cellular and circuit-level mechanisms of vocalization in selected Xenopus model species, and Xenopus evolutionary history and speciation mechanisms. Integrating recent findings allows us to generate and test specific hypotheses about the evolution of Xenopus vocal circuits. We propose that reduced vocal sex differences in some Xenopus species result from species-specific losses of sexually differentiated neural and neuromuscular features. Modification of sex-hormone-regulated developmental mechanisms is a strong candidate mechanism for reduced vocal sex differences.
Social coordination in animal vocal interactions. Is there any evidence of turn-taking? The starling as an animal model

PubMed Central

Henry, Laurence; Craig, Adrian J. F. K.; Lemasson, Alban; Hausberger, Martine

2015-01-01

Turn-taking in conversation appears to be a common feature in various human cultures and this universality raises questions about its biological basis and evolutionary trajectory. Functional convergence is a widespread phenomenon in evolution, revealing sometimes striking functional similarities between very distant species even though the mechanisms involved may be different. Studies on mammals (including non-human primates) and bird species with different levels of social coordination reveal that temporal and structural regularities in vocal interactions may depend on the species' social structure. Here we test the hypothesis that turn-taking and associated rules of conversations may be an adaptive response to the requirements of social life, by testing the applicability of turn-taking rules to an animal model, the European starling. Birdsong has for many decades been considered as one of the best models of human language and starling songs have been well described in terms of vocal production and perception. Starlings do have vocal interactions where alternating patterns predominate. Observational and experimental data on vocal interactions reveal that (1) there are indeed clear temporal and structural regularities, (2) the temporal and structural patterning is influenced by the immediate social context, the general social situation, the individual history, and the internal state of the emitter. Comparison of phylogenetically close species of Sturnids reveals that the alternating pattern of vocal interactions varies greatly according to the species' social structure, suggesting that interactional regularities may have evolved together with social systems. These findings lead to solid bases of discussion on the evolution of communication rules in relation to social evolution. They will be discussed also in terms of processes, at the light of recent neurobiological findings. PMID:26441787
Relationship between patient-perceived vocal handicap and clinician-rated level of vocal dysfunction.

PubMed

Childs, Lesley F; Bielinski, Clifford; Toles, Laura; Hamilton, Amy; Deane, Janis; Mau, Ted

2015-01-01

The relationship between patient-reported vocal handicap and clinician-rated measures of vocal dysfunction is not understood. This study aimed to determine if a correlation exists between the Voice Handicap Index-10 (VHI-10) and the Voice Functional Communication Measure rating in the National Outcomes Measurement System (NOMS). Retrospective case series. Four hundred and nine voice evaluations over 12 months at a tertiary voice center were reviewed. The VHI-10 and NOMS scores, diagnoses, and potential comorbid factors were collected and analyzed. For the study population as a whole, there was a moderate negative correlation between the NOMS rating and the VHI-10 (Pearson r = -0.57). However, for a given NOMS level, there could be considerable spread in the VHI-10. In addition, as the NOMS decreased stepwise below level 4, there was a corresponding increase in the VHI-10. However, a similar trend in VHI-10 was not observed for NOMS above level 4, indicating the NOMS versus VHI-10 correlation was not linear. Among diagnostic groups, the strongest correlation was found for subjects with functional dysphonia. The NOMS versus VHI-10 correlation was not affected by gender or the coexistence of a psychiatric diagnosis. A simple relationship between VHI-10 and NOMS rating does not exist. Patients with mild vocal dysfunction have a less direct relationship between their NOMS ratings and the VHI-10. These findings provide insight into the interpretation of patient-perceived and clinician-rated measures of vocal function and may allow for better management of expectations and patient counseling in the treatment of voice disorders. © 2014 The American Laryngological, Rhinological and Otological Society, Inc.
Experiments on Analysing Voice Production: Excised (Human, Animal) and In Vivo (Animal) Approaches

PubMed Central

Döllinger, Michael; Kobler, James; Berry, David A.; Mehta, Daryush D.; Luegmair, Georg; Bohr, Christopher

2015-01-01

Experiments on human and on animal excised specimens as well as in vivo animal preparations are so far the most realistic approaches to simulate the in vivo process of human phonation. These experiments do not have the disadvantage of limited space within the neck and enable studies of the actual organ necessary for phonation, i.e., the larynx. The studies additionally allow the analysis of flow, vocal fold dynamics, and resulting acoustics in relation to well-defined laryngeal alterations. Purpose of Review This paper provides an overview of the applications and usefulness of excised (human/animal) specimen and in vivo animal experiments in voice research. These experiments have enabled visualization and analysis of dehydration effects, vocal fold scarring, bifurcation and chaotic vibrations, three-dimensional vibrations, aerodynamic effects, and mucosal wave propagation along the medial surface. Quantitative data will be shown to give an overview of measured laryngeal parameter values. As yet, a full understanding of all existing interactions in voice production has not been achieved, and thus, where possible, we try to indicate areas needing further study. Recent Findings A further motivation behind this review is to highlight recent findings and technologies related to the study of vocal fold dynamics and its applications. For example, studies of interactions between vocal tract airflow and generation of acoustics have recently shown that airflow superior to the glottis is governed by not only vocal fold dynamics but also by subglottal and supraglottal structures. In addition, promising new methods to investigate kinematics and dynamics have been reported recently, including dynamic optical coherence tomography, X-ray stroboscopy and three-dimensional reconstruction with laser projection systems. Finally, we touch on the relevance of vocal fold dynamics to clinical laryngology and to clinically-oriented research. PMID:26581597
Control of Vocal and Respiratory Patterns in Birdsong: Dissection of Forebrain and Brainstem Mechanisms Using Temperature

PubMed Central

Fee, Michale S.

2011-01-01

Learned motor behaviors require descending forebrain control to be coordinated with midbrain and brainstem motor systems. In songbirds, such as the zebra finch, regular breathing is controlled by brainstem centers, but when the adult songbird begins to sing, its breathing becomes tightly coordinated with forebrain-controlled vocalizations. The periods of silence (gaps) between song syllables are typically filled with brief breaths, allowing the bird to sing uninterrupted for many seconds. While substantial progress has been made in identifying the brain areas and pathways involved in vocal and respiratory control, it is not understood how respiratory and vocal control is coordinated by forebrain motor circuits. Here we combine a recently developed technique for localized brain cooling, together with recordings of thoracic air sac pressure, to examine the role of cortical premotor nucleus HVC (proper name) in respiratory-vocal coordination. We found that HVC cooling, in addition to slowing all song timescales as previously reported, also increased the duration of expiratory pulses (EPs) and inspiratory pulses (IPs). Expiratory pulses, like song syllables, were stretched uniformly by HVC cooling, but most inspiratory pulses exhibited non-uniform stretch of pressure waveform such that the majority of stretch occurred late in the IP. Indeed, some IPs appeared to change duration by the earlier or later truncation of an underlying inspiratory event. These findings are consistent with the idea that during singing the temporal structure of EPs is under the direct control of forebrain circuits, whereas that of IPs can be strongly influenced by circuits downstream of HVC, likely in the brainstem. An analysis of the temporal jitter of respiratory and vocal structure suggests that IPs may be initiated by HVC at the end of each syllable and terminated by HVC immediately before the onset of the next syllable. PMID:21980466
Audio-vocal responses of vocal fundamental frequency and formant during sustained vowel vocalizations in different noises.

PubMed

Lee, Shao-Hsuan; Hsiao, Tzu-Yu; Lee, Guo-She

2015-06-01

Sustained vocalizations of vowels [a], [i], and syllable [mə] were collected in twenty normal-hearing individuals. On vocalizations, five conditions of different audio-vocal feedback were introduced separately to the speakers including no masking, wearing supra-aural headphones only, speech-noise masking, high-pass noise masking, and broad-band-noise masking. Power spectral analysis of vocal fundamental frequency (F0) was used to evaluate the modulations of F0 and linear-predictive-coding was used to acquire first two formants. The results showed that while the formant frequencies were not significantly shifted, low-frequency modulations (<3 Hz) of F0 significantly increased with reduced audio-vocal feedback across speech sounds and were significantly correlated with auditory awareness of speakers' own voices. For sustained speech production, the motor speech controls on F0 may depend on a feedback mechanism while articulation should rely more on a feedforward mechanism. Power spectral analysis of F0 might be applied to evaluate audio-vocal control for various hearing and neurological disorders in the future. Copyright © 2015 Elsevier B.V. All rights reserved.
Bilateral Vocal Cord Palsy with Arnold Chiari Malformation: A Rare Case Series

PubMed Central

Arora, Nikhil; Meher, Ravi; Bhargava, Eishaan K.

2016-01-01

Stridor in paediatric age group is not an uncommon presentation to the ENT emergency. The range of differential diagnosis is vast. The presentation may vary from noisy breathing to severe respiratory distress and apnea. Early and meticulous diagnosis is crucial for the management as the condition may be life threatening. We report a rare case series of 3 infants with Arnold Chiari Malformation who presented to the hospital with stridor and were diagnosed with bilateral vocal cord palsy. These 3 infants had similar underlying neurological condition with hydrocephalus and raised intracranial pressure. Chiari malformation is the one of the most common congenital central nervous system anomaly associated with bilateral vocal cord paralysis. However, the presentation is rare. This article, thus, emphasizes the significance of early diagnosis and immediate management of this condition. PMID:27790480
An Investigation of Extinction-Induced Vocalizations

ERIC Educational Resources Information Center

Valentino, Amber L.; Shillingsburg, M. Alice; Call, Nathan A.; Burton, Britney; Bowen, Crystal N.

2011-01-01

Children with autism have significant communication delays. Although some children develop vocalizations through shaping and differential reinforcement, others rarely exhibit vocalizations, and alternative methods are targeted in intervention. However, vocal language often remains a goal for caregivers and clinicians. Thus, strategies to increase…

Vocal fold paralysis secondary to phonotrauma.

PubMed

Klein, Travis A L; Gaziano, Joy E; Ridley, Marion B

2014-01-01

A unique case of acute onset vocal fold paralysis secondary to phonotrauma is presented. The cause was forceful vocalization by a drill instructor on a firearm range. Imaging studies revealed extensive intralaryngeal and retropharyngeal hemorrhage. Laryngoscopy showed a complete left vocal fold paralysis. Relative voice rest was recommended, and the patient regained normal vocal fold mobility and function after approximately 12 weeks. Copyright © 2014 The Voice Foundation. All rights reserved.
Short-Term Effect of Two Semi-Occluded Vocal Tract Training Programs on the Vocal Quality of Future Occupational Voice Users: "Resonant Voice Training Using Nasal Consonants" versus "Straw Phonation"

ERIC Educational Resources Information Center

Meerschman, Iris; Van Lierde, Kristiane; Peeters, Karen; Meersman, Eline; Claeys, Sofie; D'haeseleer, Evelien

2017-01-01

Purpose: The purpose of this study was to determine the short-term effect of 2 semi-occluded vocal tract training programs, "resonant voice training using nasal consonants" versus "straw phonation," on the vocal quality of vocally healthy future occupational voice users. Method: A multigroup pretest--posttest randomized control…
Clinical practice: vocal nodules in dysphonic children.

PubMed

Martins, Regina Helena Garcia; Branco, Anete; Tavares, Elaine Lara Mendes; Gramuglia, Andrea Cristina Jóia

2013-09-01

Common among children, vocal symptoms are a cause of concern for parents who seek elucidation of their diagnosis and treatment. Vocal nodules are the major cause of dysphonias in children and are related to vocal abuse. We conducted a literature review considering clinical, physiopathological, epidemiological, and histological aspects of vocal nodules, as well as diagnostic methods, highlighting the main studies addressing this issue. The controversial points of treatments were also discussed.
Inadequate vocal hygiene habits associated with the presence of self-reported voice symptoms in telemarketers.

PubMed

Fuentes-López, Eduardo; Fuente, Adrian; Contreras, Karem V

2017-12-18

The aim of this study is to determine possible associations between vocal hygiene habits and self-reported vocal symptoms in telemarketers. A cross-sectional study that included 79 operators from call centres in Chile was carried out. Their vocal hygiene habits and self-reported symptoms were investigated using a validated and reliable questionnaire created for the purposes of this study. Forty-five percent of telemarketers reported having one or more vocal symptoms. Among them, 16.46% reported that their voices tense up when talking and 10.13% needed to clear their throat to make their voices clearer. Five percent mentioned that they always talk without taking a break and 40.51% reported using their voices in noisy environments. The number of working hours per day and inadequate vocal hygiene habits were associated with the presence of self-reported symptoms. Additionally, an interaction between the use of the voice in noisy environments and not taking breaks during the day was observed. Finally, the frequency of inadequate vocal hygiene habits was associated with the number of symptoms reported. Using the voice in noisy environments and talking without taking breaks were both associated with the presence of specific vocal symptoms. This study provides some evidence about the interaction between these two inadequate vocal hygiene habits that potentiates vocal symptoms.
Visualizing Collagen Network Within Human and Rhesus Monkey Vocal Folds Using Polarized Light Microscopy

PubMed Central

Julias, Margaret; Riede, Tobias; Cook, Douglas

2014-01-01

Objectives Collagen fiber content and orientation affect the viscoelastic properties of the vocal folds, determining oscillation characteristics during speech and other vocalization. The investigation and reconstruction of the collagen network in vocal folds remains a challenge, because the collagen network requires at least micron-scale resolution. In this study, we used polarized light microscopy to investigate the distribution and alignment of collagen fibers within the vocal folds. Methods Data were collected in sections of human and rhesus monkey (Macaca mulatta) vocal folds cut at 3 different angles and stained with picrosirius red. Results Statistically significant differences were found between different section angles, implying that more than one section angle is required to capture the network’s complexity. In the human vocal folds, the collagen fiber distribution continuously varied across the lamina propria (medial to lateral). Distinct differences in birefringence distribution were observed between the species. For the human vocal folds, high birefringence was observed near the thyroarytenoid muscle and near the epithelium. However, in the rhesus monkey vocal folds, high birefringence was observed near the epithelium, and lower birefringence was seen near the thyroarytenoid muscle. Conclusions The differences between the collagen networks in human and rhesus monkey vocal folds provide a morphological basis for differences in viscoelastic properties between species. PMID:23534129
Retention of Human-Induced Pluripotent Stem Cells (hiPS) With Injectable HA Hydrogels for Vocal Fold Engineering.

PubMed

Imaizumi, Mitsuyoshi; Li-Jessen, Nicole Y K; Sato, Yuka; Yang, David T; Thibeault, Susan L

2017-04-01

One prospective treatment option for vocal fold scarring is regeneration with an engineered scaffold containing induced pluripotent stem cells (iPS). In the present study, we investigated the feasibility of utilizing an injectable hyaluronic acid (HA) scaffold encapsulated with human-iPS cell (hiPS) for regeneration of vocal folds. Thirty athymic nude rats underwent unilateral vocal fold injury. Contralateral vocal folds served as uninjured controls. Hyaluronic acid hydrogel scaffold, HA hydrogel scaffold containing hiPS, and HA hydrogel scaffold containing hiPS with epidermal growth factor (EGF) were injected in both vocal folds immediately after surgery. One and 2 weeks after injection, larynges were excised for histology, immunohistochemistry, and fluorescence in situ hybridization (FISH). Presence of HA hydrogel was confirmed in vocal folds 1 and 2 weeks post injection. The FISH analysis confirmed the presence and viability of hiPS in the injected vocal folds. Histological results demonstrated that vocal folds injected with HA hydrogel scaffold containing EGF demonstrated less fibrosis than those with HA hydrogel only. Human-iPS survived in injured rat vocal folds. The HA hydrogel with hiPS and EGF ameliorated the fibrotic response. Additional work is necessary to optimize hiPS differentiation and further confirm the safety of hiPS for clinical applications.
The role of adjustment of expiratory effort in the control of vocal intensity: clinical assessment of phonatory function.

PubMed

Makiyama, Kiyoshi; Yoshihashi, Hidetaka; Mogitate, Manabu; Kida, Akinori

2005-04-01

To determine the role of the adjustment of expiratory effort in the control of vocal intensity. An intensity-loading test was performed by using the airway interruption method. Three groups of subjects were used: a control group thought to resemble normal vocal fold closure, a group of patients with Reinke's edema thought to represent increased mass at the level of the vocal folds, and a group with vocal fold paralysis that was thought to represent a group with lack of adequate vocal fold closure. In the control group, expiratory lung pressure and airway resistance slightly increased. In the patients with Reinke's edema, expiratory lung pressure, and airway resistance significantly increased. In this group, the voice intensity was controlled by laryngeal adjustment, but a greater expiratory effort was needed because of a greater increase in glottal resistance. In the patients with vocal cord paralysis, airway resistance did not increase even with a high-intensity voice. Vocal intensity was controlled by expiratory effort. If there is sufficient ability for laryngeal adjustment, vocal intensity is controlled primarily by laryngeal adjustment and by expiratory adjustment in response to increased glottal resistance. However, vocal intensity is controlled by expiratory effort when laryngeal adjustment ability is poor.
Mothers' tone of voice depends on the nature of infants' transgressions.

PubMed

Dahl, Audun; Sherlock, Briana R; Campos, Joseph J; Theunissen, Frédéric E

2014-08-01

Emotional vocal signals are important ways of communicating norms to young infants. The second year is a period of increase in various forms of child transgressions, but also a period when infants have limited linguistic abilities. Two studies investigated the hypothesis that mothers respond with different vocal emotional tones to 3 types of child transgressions: moral (harming others), prudential (harming oneself), and pragmatic (creating inconvenience, e.g., by spilling) transgressions. We used a combination of naturalistic observation (Study 1) and experimental manipulation (Study 2) to record, code, and analyze maternal vocal responses to child transgressions. Both studies showed that mothers were more likely to use intense, angry vocalizations in response to moral transgressions, fearful vocalizations in response to prudential transgressions, comforting vocalizations in response to pragmatic and prudential transgressions, and (in Study 2) playful vocalizations in response to pragmatic transgressions. Study 1 showed that this differential use of vocal tone is used systematically in everyday life. Study 2 allowed us to standardize the context of the maternal intervention and perform additional acoustical analyses. A combination of principal component analysis and linear discriminant analysis applied to pitch and intensity data provided quantitative measures of the differences in vocal responses. These differentiated vocal responses are likely contributors to children's acquisition of norms from early in life.
Discriminating Simulated Vocal Tremor Source Using Amplitude Modulation Spectra

PubMed Central

Carbonell, Kathy M.; Lester, Rosemary A.; Story, Brad H.; Lotto, Andrew J.

2014-01-01

Objectives/Hypothesis Sources of vocal tremor are difficult to categorize perceptually and acoustically. This paper describes a preliminary attempt to discriminate vocal tremor sources through the use of spectral measures of the amplitude envelope. The hypothesis is that different vocal tremor sources are associated with distinct patterns of acoustic amplitude modulations. Study Design Statistical categorization methods (discriminant function analysis) were used to discriminate signals from simulated vocal tremor with different sources using only acoustic measures derived from the amplitude envelopes. Methods Simulations of vocal tremor were created by modulating parameters of a vocal fold model corresponding to oscillations of respiratory driving pressure (respiratory tremor), degree of vocal fold adduction (adductory tremor) and fundamental frequency of vocal fold vibration (F0 tremor). The acoustic measures were based on spectral analyses of the amplitude envelope computed across the entire signal and within select frequency bands. Results The signals could be categorized (with accuracy well above chance) in terms of the simulated tremor source using only measures of the amplitude envelope spectrum even when multiple sources of tremor were included. Conclusions These results supply initial support for an amplitude-envelope based approach to identify the source of vocal tremor and provide further evidence for the rich information about talker characteristics present in the temporal structure of the amplitude envelope. PMID:25532813
The Effect of Classroom Capacity on Vocal Fatigue as Quantified by the Vocal Fatigue Index.

PubMed

Banks, Russell E; Bottalico, Pasquale; Hunter, Eric J

2017-01-01

Previous research has concluded that teachers are at a higher-than-normal risk for voice issues that can cause occupational limitations. While some risk factors have been identified, there are still many unknowns. A survey was distributed electronically with 506 female teacher respondents. The survey included questions to quantify three aspects of vocal fatigue as captured by the Vocal Fatigue Index (VFI): (1) general tiredness of voice (performance), (2) physical discomfort associated with voicing (pain), and (3) improvement of symptoms with rest (recovery). The effect of classroom capacity on US teachers' self-reported experience of vocal fatigue was analyzed. The results indicated that a classroom's capacity significantly affected teachers' reported amounts of vocal fatigue, while a teacher's age also appeared to significantly affect the reported amount of vocal fatigue. A quadratic rather than linear effect was seen, with the largest age effect occurring at around 40-45 years in all three factors of the VFI. Further factors which may affect vocal fatigue must be explored in future research. By understanding what increases the risk for vocal fatigue, educators and school administrators can take precautions to mitigate the occupational risk of short- and long-term vocal health issues in school teachers. © 2017 S. Karger AG, Basel.
Harbor Seal (Phoca vitulina) Reproductive Advertisement Behavior and the Effects of Vessel Noise

NASA Astrophysics Data System (ADS)

Matthews, Leanna P.

Harbor seals (Phoca vitulina) are a widely distributed pinniped species that mate underwater. Similar to other aquatically mating pinnipeds, male harbor seals produce vocalizations during the breeding season that function in male-male interactions and possibly as an attractant for females. I investigated multiple aspects of these reproductive advertisement displays in a population of harbor seals in Glacier Bay National Park and Preserve, Alaska. First, I looked at vocal production as a function of environmental variables, including season, daylight, and tidal state. Vocalizations were highly seasonal and detection of these vocalizations peaked in June and July, which correspond with the estimated time of breeding. Vocalizations also varied with light, with the lowest probability of detection during the day and the highest probability of detection at night. The high probability of detection corresponded to when females are known to forage. These results are similar to the vocal behavior of previously studied populations. However, unlike previously studied populations, the detection of harbor seal breeding vocalizations did not vary with tidal state. This is likely due to the location of the hydrophone, as it was not near the haul out and depth was therefore not significantly influenced by changes in tidal height. I also investigated the source levels and call parameters of vocalizations, as well as call rate and territoriality. The average source level of harbor seal breeding vocalizations was 144 dB re 1 ?Pa at 1 m and measurements ranged from 129 to 149 dB re 1 ?Pa. Analysis of call parameters indicated that vocalizations of harbor seals in Glacier Bay were similar in duration to other populations, but were much lower in frequency. During the breeding season, there were two discrete calling areas that likely represent two individual males; the average call rate in these display areas was approximately 1 call per minute. The harbor seal breeding season also overlaps with peak tourism in Glacier Bay, and the majority of tourists visit the park on a motorized vessel. Because of this overlap, I investigated the impacts of vessel noise on the vocal behavior of individual males. In the presence of vessel noise, male harbor seals increase the amplitude of their vocalizations, decrease the duration, and increase the minimum frequency. These vocal shifts are similar to studies of noise impacts on other species across taxa, but it is unknown how this could impact the reproductive success of male harbor seals. Finally, I looked at the role of female preference for male vocalizations. Using playbacks of male vocalizations to captive female harbor seals, I found that females have a higher response to vocalizations that correspond to dominant males. Females were less responsive to subordinate male vocalizations, which had a shorter duration and a higher frequency. Given that male harbor seals decrease the duration and increase the frequency of vocalizations in the presence of noise, it is possible that these vocalizations become less attractive in noise.
Collagen Content Limits Optical Coherence Tomography Image Depth in Porcine Vocal Fold Tissue.

PubMed

Garcia, Jordan A; Benboujja, Fouzi; Beaudette, Kathy; Rogers, Derek; Maurer, Rie; Boudoux, Caroline; Hartnick, Christopher J

2016-11-01

Vocal fold scarring, a condition defined by increased collagen content, is challenging to treat without a method of noninvasively assessing vocal fold structure in vivo. The goal of this study was to observe the effects of vocal fold collagen content on optical coherence tomography imaging to develop a quantifiable marker of disease. Excised specimen study. Massachusetts Eye and Ear Infirmary. Porcine vocal folds were injected with collagenase to remove collagen from the lamina propria. Optical coherence tomography imaging was performed preinjection and at 0, 45, 90, and 180 minutes postinjection. Mean pixel intensity (or image brightness) was extracted from images of collagenase- and control-treated hemilarynges. Texture analysis of the lamina propria at each injection site was performed to extract image contrast. Two-factor repeated measure analysis of variance and t tests were used to determine statistical significance. Picrosirius red staining was performed to confirm collagenase activity. Mean pixel intensity was higher at injection sites of collagenase-treated vocal folds than control vocal folds (P < .0001). Fold change in image contrast was significantly increased in collagenase-treated vocal folds than control vocal folds (P = .002). Picrosirius red staining in control specimens revealed collagen fibrils most prominent in the subepithelium and above the thyroarytenoid muscle. Specimens treated with collagenase exhibited a loss of these structures. Collagen removal from vocal fold tissue increases image brightness of underlying structures. This inverse relationship may be useful in treating vocal fold scarring in patients. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
Vocal Hyperfunction in Parents of Children With Attention Deficit Hyperactivity Disorder.

PubMed

Teresa, Garcia-Real; Díaz-Román, Tomás M

2016-05-01

The objective of this study was to evaluate the presence of habits and symptoms of vocal hyperfunction in the parents of children with attention deficit hyperactivity disorder (ADHD). Parents of 24 children with ADHD and 30 children of a control group completed a specific questionnaire to detect the hyperfunctional use of the voice (excessive talking, excessive loudness, talking too fast, and shouting), hoarseness, vocal fatigue, mental and physical fatigue, and the degree of parental concern for the vocal health of their child. Parents of children with ADHD spoke more often, faster, and stronger than the parents of the control group; in addition, they also used a louder volume than they usually used when they spoke to their children. The parents manifested more vocal, mental, and physical fatigue than the parents of the control group. There was a significant correlation between the "concern" for the vocal health of their children with respect to vocal symptoms of the children, the habits of vocal hyperfunctioning, and the symptoms suffered by the parents. These results suggest that the parents of children with ADHD change their vocal attitude when communicating with their children. Most likely, the increased concern of parents with ADHD children and their respective level of stress lead to hyperfunctional vocal usage. This subsequently leads to symptoms of vocal, physical, and mental fatigue at the end of the day. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Is laughter a better vocal change detector than a growl?

PubMed

Pinheiro, Ana P; Barros, Carla; Vasconcelos, Margarida; Obermeier, Christian; Kotz, Sonja A

2017-07-01

The capacity to predict what should happen next and to minimize any discrepancy between an expected and an actual sensory input (prediction error) is a central aspect of perception. Particularly in vocal communication, the effective prediction of an auditory input that informs the listener about the emotionality of a speaker is critical. What is currently unknown is how the perceived valence of an emotional vocalization affects the capacity to predict and detect a change in the auditory input. This question was probed in a combined event-related potential (ERP) and time-frequency analysis approach. Specifically, we examined the brain response to standards (Repetition Positivity) and to deviants (Mismatch Negativity - MMN), as well as the anticipatory response to the vocal sounds (pre-stimulus beta oscillatory power). Short neutral, happy (laughter), and angry (growls) vocalizations were presented both as standard and deviant stimuli in a passive oddball listening task while participants watched a silent movie and were instructed to ignore the vocalizations. MMN amplitude was increased for happy compared to neutral and angry vocalizations. The Repetition Positivity was enhanced for happy standard vocalizations. Induced pre-stimulus upper beta power was increased for happy vocalizations, and predicted the modulation of the standard Repetition Positivity. These findings indicate enhanced sensory prediction for positive vocalizations such as laughter. Together, the results suggest that positive vocalizations are more effective predictors in social communication than angry and neutral ones, possibly due to their high social significance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Pulsed dye laser-induced inflammatory response and extracellular matrix turnover in rat vocal folds and vocal fold fibroblasts.

PubMed

Lin, Ya; Yamashita, Masaru; Zhang, Jingxian; Ling, Changying; Welham, Nathan V

2009-10-01

Disruption of the vocal fold extracellular matrix (ECM) can induce a profound and refractory dysphonia. Pulsed dye laser (PDL) irradiation has shown early promise as a treatment modality for disordered ECM in patients with chronic vocal fold scar; however, there are limited data addressing the mechanism by which this laser energy might induce cellular and extracellular changes in vocal fold tissues. In this study, we examined the inflammatory and ECM modulating effects of PDL irradiation on normal vocal fold tissues and cultured vocal fold fibroblasts (VFFs). We evaluated the effects of 585 nm PDL irradiation on inflammatory cytokine and collagen/collagenase gene transcription in normal rat vocal folds in vivo (3-168 hours following delivery of approximately 39.46 J/cm(2) fluence) and VFFs in vitro (3-72 hours following delivery of 4.82 or 9.64 J/cm(2) fluence). We also examined morphological vocal fold tissue changes 3 hours, 1 week, and 1 month post-irradiation. PDL irradiation altered inflammatory cytokine and procollagen/collagenase expression at the transcript level, both in vitro and in vivo. Additionally, PDL irradiation induced an inflammatory repair process in vivo that was completed by 1 month with preservation of normal tissue morphology. PDL irradiation can modulate ECM turnover in phenotypically normal vocal folds. Additional work is required to determine if these findings extend to disordered ECM, such as is seen in vocal fold scar. Lasers Surg. Med. 41:585-594, 2009. (c) 2009 Wiley-Liss, Inc.
Vocal activity of lesser galagos (Galago spp.) at zoos.

PubMed

Schneiderová, Irena; Zouhar, Jan; Štefanská, Lucie; Bolfíková, Barbora Černá; Lhota, Stanislav; Brandl, Pavel

2016-01-01

Almost nothing is known about the natural vocal behavior of lesser galagos living in zoos. This is perhaps because they are usually kept in nocturnal exhibits separated from the visitors by a transparent and acoustically insulating glass barrier. The aim of the present study was therefore to fill this gap in knowledge of the vocal behavior of lesser galagos from zoos. This knowledge might be beneficial because the vocalizations of these small primates can be used for species determination. We performed a 10-day-long acoustic monitoring of vocal activity in each of seven various groups of Galago senegalensis and G. moholi living at four zoos. We quantitatively evaluated the occurrence of four loud vocalization types present in both species, including the most species-specific advertisement call. We found that qualitative as well as quantitative differences exist in the vocal behavior of the studied groups. We confirmed that the observed vocalization types can be collected from lesser galagos living at zoos, and the success can be increased by selecting larger and more diverse groups. We found two distinct patterns of diel vocal activity in the most vocally active groups. G. senegalensis groups were most vocally active at the beginning and at the end of their activity period, whereas one G. moholi group showed an opposite pattern. The latter is surprising, as it is generally accepted that lesser galagos emit advertisement calls especially at dawn and dusk, i.e., at the beginning and at the end of their diel activity. © 2016 Wiley Periodicals, Inc.
[Causes of vocal cord dyscinesia and its original factors after endotracheal intubation].

PubMed

Sun, Anke; Zhang, Tiezheng; Liu, Wenyuan; Tang, Weiwei; Guo, Xiaohong

2012-03-01

To research the causes of postintubation vocal cord dyskinesia and its contributing factors. The causes of vocal cord dyskinesia were confirmed by laryngoscope, three-dimensional spiral CT, stroboscope, and the analysis of therapy. The factors relevant to the causes of vocal cord dyskinesia were analysed based on the following elements: (1) the anatomic or pathological condition of patients or the technical skills of anesthetists. (2) emaciated or obese body and neck. (3) the age of patients. (4) the duration of endotracheal tube retention. (5) the types of operations. (6) anesthesia procedure. Among 135 patients, 128 cases (94.81%) manifested arytenoid dislocation, 7 cases (5.19%) vocal cord paralysis. The study showed that the vocal cord dyskinesia associated with anatomic or pathological condition of patients and technical skills of anesthetists (with intubation difficulty) accounted for 76.30%. The patients with relative emaciated body or neck accounted for 90.62% in cases without intubation difficulty. Age had no significant analytical relationship with vocal cord dyskinesia. Prolonged intubation (endotracheal tube retention over 12 hours) was accounted for only 17.64%. The incidence of vocal cord dyskinesia was nearly 0.5% in patients underwent cardio-thoracic surgery, accounting for 59.26% of all the patients. There are two major causes of vocal cord dyskinesia: arytenoid dislocation and vocal cord paralysis, and the rate of vocal cord dyskinesia could be reduced by the improvement of technical skill of anesthetists and/or sufficient attention to the intubation condition of patients.
Experimental analysis of the characteristics of artificial vocal folds.

PubMed

Misun, Vojtech; Svancara, Pavel; Vasek, Martin

2011-05-01

Specialized literature presents a number of models describing the function of the vocal folds. In most of those models, an emphasis is placed on the air flowing through the glottis and, further, on the effect of the parameters of the air alone (its mass, speed, and so forth). The article focuses on the constructional definition of artificial vocal folds and their experimental analysis. The analysis is conducted for voiced source voice phonation and for the changing mean value of the subglottal pressure. The article further deals with the analysis of the pressure of the airflow through the vocal folds, which is cut (separated) into individual pulses by the vibrating vocal folds. The analysis results show that air pulse characteristics are relevant to voice generation, as they are produced by the flowing air and vibrating vocal folds. A number of artificial vocal folds have been constructed to date, and the aforementioned view of their phonation is confirmed by their analysis. The experiments have confirmed that man is able to consciously affect only two parameters of the source voice, that is, its fundamental frequency and voice intensity. The main forces acting on the vocal folds during phonation are as follows: subglottal air pressure and elastic and inertia forces of the vocal folds' structure. The correctness of the function of the artificial vocal folds is documented by the experimental verification of the spectra of several types of artificial vocal folds. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
A method for assessing the regional vibratory pattern of vocal folds by analysing the video recording of stroboscopy.

PubMed

Lee, J S; Kim, E; Sung, M W; Kim, K H; Sung, M Y; Park, K S

2001-05-01

Stroboscopy and kymography have been used to examine the motional abnormality of vocal folds and to visualise their regional vibratory pattern. In a previous study (Laryngoscope, 1999), we introduced the conceptual idea of videostrobokymography, in which we applied the concept of kymography on the pre-recorded video images using stroboscopy, and showed its possible clinical application to various disorders in vocal folds. However, a more detailed description about the software and the mathematical formulation used in this system is needed for the reproduction of similar systems. The composition of hardwares, user-interface and detail procedures including mathematical equations in videostrobokymography software is presented in this study. As an initial clinical trial, videostrobokymography was applied to the preoperative and postoperative videostroboscopic images of 15 patients with Reinke's edema. On preoperative examination, videostrobokymograms showed irregular pattern of mucosal wave and, in some patients, a relatively constant glottic gap during phonation. After the operation, the voice quality of all patients was improved in acoustic and aerodynamic assessments, and videostrobokymography showed clearly improved mucosal waves (change in open quotient: mean +/- SD= 0.11 +/- 0.05).
Cross-activation and Detraining Effects of Tongue Exercise in Aged Rats

PubMed Central

Schaser, Allison J.; Ciucci, Michelle R.; Connor, Nadine P.

2015-01-01

Voice and swallowing deficits can occur with aging. Tongue exercise paired with a swallow may be used to treat swallowing disorders, but may also benefit vocal function due to cross-system activation effects. It is unknown how exercise-based neuroplasticity contributes to behavior and maintenance following treatment. Eighty rats were used to examine behavioral parameters and changes in neurotrophins after tongue exercise paired with a swallow. Tongue forces and ultrasonic vocalizations were recorded before and after training/detraining in young and old rats. Tissue was analyzed for neurotrophin content. Results showed tongue exercise paired with a swallow was associated with increased tongue forces at all ages. Gains diminished after detraining in old rats. Age-related changes in vocalizations, neurotrophin 4 (NT4), and brain derived neurotrophic factor (BDNF) were found. Minimal cross-system activation effects were observed. Neuroplastic benefits were demonstrated with exercise in old rats through behavioral improvements and up-regulation of BDNF in the hypoglossal nucleus. Tongue exercise paired with a swallow should be developed, studied, and optimized in human clinical research to treat swallowing and voice disorders in elderly people. PMID:26477376

Further evaluation of response interruption and redirection as treatment for stereotypy.

PubMed

Ahrens, Erin N; Lerman, Dorothea C; Kodak, Tiffany; Worsdell, April S; Keegan, Courtney

2011-01-01

The effects of 2 forms of response interruption and redirection (RIRD)-motor RIRD and vocal RIRD-were examined with 4 boys with autism to evaluate further the effects of this intervention and its potential underlying mechanisms. In Experiment 1, the effects of motor RIRD and vocal RIRD on vocal stereotypy and appropriate vocalizations were compared for 2 participants. In Experiment 2, the effects of both RIRD procedures on both vocal and motor stereotypy and appropriate vocalizations were compared with 2 additional participants. Results suggested that RIRD was effective regardless of the procedural variation or topography of stereotypy and that vocal RIRD functioned as a punisher. This mechanism was further explored with 1 participant by manipulating the schedule of RIRD in Experiment 3. Results were consistent with the punishment interpretation.
FURTHER EVALUATION OF RESPONSE INTERRUPTION AND REDIRECTION AS TREATMENT FOR STEREOTYPY

PubMed Central

Ahrens, Erin N; Lerman, Dorothea C; Kodak, Tiffany; Worsdell, April S; Keegan, Courtney

2011-01-01

The effects of 2 forms of response interruption and redirection (RIRD)—motor RIRD and vocal RIRD—were examined with 4 boys with autism to evaluate further the effects of this intervention and its potential underlying mechanisms. In Experiment 1, the effects of motor RIRD and vocal RIRD on vocal stereotypy and appropriate vocalizations were compared for 2 participants. In Experiment 2, the effects of both RIRD procedures on both vocal and motor stereotypy and appropriate vocalizations were compared with 2 additional participants. Results suggested that RIRD was effective regardless of the procedural variation or topography of stereotypy and that vocal RIRD functioned as a punisher. This mechanism was further explored with 1 participant by manipulating the schedule of RIRD in Experiment 3. Results were consistent with the punishment interpretation. PMID:21541130
Vocal cord paralysis after aortic arch surgery: predictors and clinical outcome.

PubMed

Ohta, Noriyuki; Kuratani, Toru; Hagihira, Satoshi; Kazumi, Ken-Ichiro; Kaneko, Mitsunori; Mori, Takahiko

2006-04-01

This study is retrospective cohort study of data on vocal cord paralysis after aortic arch surgery collected during 14 years at a general hospital. We investigated factors in the development of vocal cord paralysis after aortic arch surgery and the effect of vocal cord paralysis on clinical course and outcome. We reviewed data for 182 patients who underwent aortic arch surgery for aortic arch aneurysm and aortic dissection between 1989 and 2003, of whom 58 patients had proximal aortic repair, 62 had distal arch repair, and 62 had total arch repair. We assessed factors associated with the development of vocal cord paralysis and examined in detail the clinical outcome of patients with vocal cord paralysis. Postoperative vocal cord paralysis occurred in 40 patients. Multiple logistic regression analysis revealed the following risk factors with odds ratios (OR) for vocal cord paralysis: extension of procedures into distal arch (OR, 17.0), chronic dilatation of the aorta at the left subclavian artery (OR, 9.14), and total arch repair (OR, 4.24). Adoption of open-style stent-grafts reduced the incidence of vocal cord paralysis (OR, 0.031). The postoperative occurrence of vocal cord paralysis itself emerges as an independent predictor of pulmonary complications (OR, 4.12) and leads to a longer duration of hospital stay. The risk of vocal cord paralysis after aortic arch surgery depends on surgical factors, such as aneurysmal involvement of the distal arch, or the application of newer, less invasive surgical procedures. Vocal cord paralysis after aortic arch surgery itself, under aggressive postoperative respiratory management, did not increase aspiration pneumonia but was associated with postoperative complications leading to higher hospital mortality and prolonged hospitalization.
Vocal education for the professional voice user and singer.

PubMed

Murry, T; Rosen, C A

2000-10-01

Providing education on voice-related anatomy, physiology, and vocal hygiene information is the responsibility of every voice care professional. This article discusses the importance of a vocal education program for singers and professional voice users. An outline of a vocal education lecture is provided.
Contextual influences on children's use of vocal affect cues during referential interpretation.

PubMed

Berman, Jared M J; Graham, Susan A; Chambers, Craig G

2013-01-01

In three experiments, we investigated 5-year-olds' sensitivity to speaker vocal affect during referential interpretation in cases where the indeterminacy is or is not resolved by speech information. In Experiment 1, analyses of eye gaze patterns and pointing behaviours indicated that 5-year-olds used vocal affect cues at the point where an ambiguous description was encountered. In Experiments 2 and 3, we used unambiguous situations to investigate how the referential context influences the ability to use affect cues earlier in the utterance. Here, we found a differential use of speaker vocal affect whereby 5-year-olds' referential hypotheses were influenced by negative vocal affect cues in advance of the noun, but not by positive affect cues. Together, our findings reveal how 5-year-olds use a speaker's vocal affect to identify potential referents in different contextual situations and also suggest that children may be more attuned to negative vocal affect than positive vocal affect, particularly early in an utterance.
Regulation of glottal closure and airflow in a three-dimensional phonation model: Implications for vocal intensity control

PubMed Central

Zhang, Zhaoyan

2015-01-01

Maintaining a small glottal opening across a large range of voice conditions is critical to normal voice production. This study investigated the effectiveness of vocal fold approximation and stiffening in regulating glottal opening and airflow during phonation, using a three-dimensional numerical model of phonation. The results showed that with increasing subglottal pressure the vocal folds were gradually pushed open, leading to increased mean glottal opening and flow rate. A small glottal opening and a mean glottal flow rate typical of human phonation can be maintained against increasing subglottal pressure by proportionally increasing the degree of vocal fold approximation for low to medium subglottal pressures and vocal fold stiffening at high subglottal pressures. Although sound intensity was primarily determined by the subglottal pressure, the results suggest that, to maintain small glottal opening as the sound intensity increases, one has to simultaneously tighten vocal fold approximation and/or stiffen the vocal folds, resulting in increased glottal resistance, vocal efficiency, and fundamental frequency. PMID:25698022
Contingent imitation increases verbal interaction in children with autism spectrum disorders.

PubMed

Ishizuka, Yuka; Yamamoto, Jun-Ichi

2016-11-01

Several studies have suggested that contingent adult imitation increase nonverbal communication, such as attention and proximity to adults, in children with autism spectrum disorders. However, few studies have shown the effect of contingent imitation on verbal communication. This study examined whether children with autism were able to promote verbal interaction such as vocal imitation, vocalization, and vocal turn-taking via contingent imitation. We used an alternating treatment design composed of the conditions of contingent imitation and control for six children with autism (aged 33-63 months). For contingent imitation condition, adults imitated children's vocalization immediately. For control condition, adults did not imitate but gave a vocal response immediately. Results showed that in contingent imitation condition, all children increased the number of vocal imitations and vocal turn-takings compared with control condition. The number of vocalizations increased in both condition for all children. Overall, it is suggested that all children promote verbal interaction via contingent imitation. © The Author(s) 2016.
Influence of the ventricular folds on a voice source with specified vocal fold motion1

PubMed Central

McGowan, Richard S.; Howe, Michael S.

2010-01-01

The unsteady drag on the vocal folds is the major source of sound during voiced speech. The drag force is caused by vortex shedding from the vocal folds. The influence of the ventricular folds (i.e., the “false” vocal folds that protrude into the vocal tract a short distance downstream of the glottis) on the drag and the voice source are examined in this paper by means of a theoretical model involving vortex sheets in a two-dimensional geometry. The effect of the ventricular folds on the output acoustic pressure is found to be small when the movement of the vocal folds is prescribed. It is argued that the effect remains small when fluid-structure interactions account for vocal fold movement. These conclusions can be justified mathematically when the characteristic time scale for change in the velocity of the glottal jet is large compared to the time it takes for a vortex disturbance to be convected through the vocal fold and ventricular fold region. PMID:20329852
Cross-fostering alters advertisement vocalizations of grasshopper mice (Onychomys): Evidence for the developmental stress hypothesis.

PubMed

Pasch, Bret; Abbasi, Mustafa Z; Wilson, Macey; Zhao, Daniel; Searle, Jeremy B; Webster, Michael S; Rice, Aaron N

2016-04-01

Nutritional stress can have lasting impacts on the development of traits involved in vocal production. Cross-fostering experiments are often used to examine the propensity for vocal learning in a variety of taxa, but few studies assess the influence of malnourishment that can occur as a byproduct of this technique. In this study, we reciprocally cross-fostered sister taxa of voluble grasshopper mice (genus Onychomys) to explore their propensity for vocal learning. Vocalizations of Onychomys leucogaster did not differ between control and cross-fostered animals, but cross-fostered Onychomys arenicola produced vocalizations that were higher in frequency in a direction away from tutors. These same animals exhibited a transient reduction in body mass early in development, indicative of malnutrition. Our findings simultaneously refute vocal learning and support the developmental stress hypothesis to highlight the importance of early ontogeny on the production of vocalizations later in life. Copyright © 2016 Elsevier Inc. All rights reserved.
Further evaluation of methods to identify matched stimulation.

PubMed

Rapp, John T

2007-01-01

The effects of preferred stimulation on the vocal stereotypy of 2 individuals were evaluated in two experiments. The results of Experiment 1 showed that (a) the vocal stereotypy of both participants persisted in the absence of social consequences, (b) 1 participant manipulated toys that did and did not produce auditory stimulation, but only sound-producing toys decreased his vocal stereotypy, and (c) only noncontingent music decreased vocal stereotypy for the other participant, but sterotypy paradoxically increased when toys were presented with music. Using a three-component multiple schedule, the results of Experiment 2 showed that the vocal stereotypy of both participants remained below preintervention levels following the removal of auditory stimulation and that 1 participant's vocal stereotypy increased following the removal of contingent reprimands. These patterns suggest that auditory stimulation functioned as an abolishing operation for vocal stereotypy and reprimands functioned as an establishing operation for vocal stereotypy. Together, the two experiments provide a method for identifying alternative stimulation that may substitute for automatically reinforced behavior.
Shear properties of vocal fold mucosal tissues and their effect on vocal fold oscillation

NASA Astrophysics Data System (ADS)

Chan, Roger Wai Kai

Viscoelastic shear properties of vocal fold mucosal tissues and phonosurgical biomaterials were measured with a parallel-plate rotational rheometer. Elastic, viscous and damping properties were quantified as a function of frequency (0.01 Hz to 15 Hz) for human vocal fold mucosal tissues (N = 15), implantable biomaterials commonly used in the treatment of vocal fold paralysis (Teflon, gelatin, and collagen) (the non-mucosal group), and biomaterials currently or potentially useful in the treatment of vocal fold mucosal defects (adipose tissue or fat, hyaluronic acid, and fibronectin) (the mucosal group). It was found that intersubject differences as large as an order of magnitude were often observed for the shear properties of vocal fold mucosal tissues, part of which may be age- and gender-related. Shear properties of the non-mucosal group biomaterials were often much higher than those of the mucosal group biomaterials, which were relatively close to the shear properties of mucosal tissues. Viscoelastic and rheological modeling showed that shear properties of human vocal fold mucosa may be described by a quasi-linear viscoelastic theory and a statistical network theory, based upon which extrapolations to audio frequencies were possible. A theory of small-amplitude vocal fold oscillation was revisited to describe the effects of tissue shear properties on vocal fold oscillation and phonation threshold pressure, a measure of the 'ease' of phonation and an objective indication of vocal function. It was found that phonation threshold pressure is directly related to the viscous shear modulus or the 'effective damping modulus', a concept proposed to quantify the effective amount of damping in vocal fold oscillation. The mucosal group biomaterials were incorporated into the artificial vocal fold mucosa of a physical model in order to empirically assess their effects on phonation threshold pressure. Results showed that higher threshold pressures were consistently observed for higher concentrations of hyaluronic acid and for hyaluronic acid mixed with fibronectin, in correlation with their differences in viscous shear modulus and effective damping modulus. Implications for phonosurgery were discussed in terms of the choice of optimal biomaterials for the surgical management of vocal fold mucosal defects and lamina propria deficiencies.
Neural Correlates of Vocal Production and Motor Control in Human Heschl's Gyrus

PubMed Central

Oya, Hiroyuki; Nourski, Kirill V.; Kawasaki, Hiroto; Larson, Charles R.; Brugge, John F.; Howard, Matthew A.; Greenlee, Jeremy D.W.

2016-01-01

The present study investigated how pitch frequency, a perceptually relevant aspect of periodicity in natural human vocalizations, is encoded in Heschl's gyrus (HG), and how this information may be used to influence vocal pitch motor control. We recorded local field potentials from multicontact depth electrodes implanted in HG of 14 neurosurgical epilepsy patients as they vocalized vowel sounds and received brief (200 ms) pitch perturbations at 100 Cents in their auditory feedback. Event-related band power responses to vocalizations showed sustained frequency following responses that tracked voice fundamental frequency (F0) and were significantly enhanced in posteromedial HG during speaking compared with when subjects listened to the playback of their own voice. In addition to frequency following responses, a transient response component within the high gamma frequency band (75–150 Hz) was identified. When this response followed the onset of vocalization, the magnitude of the response was the same for the speaking and playback conditions. In contrast, when this response followed a pitch shift, its magnitude was significantly enhanced during speaking compared with playback. We also observed that, in anterolateral HG, the power of high gamma responses to pitch shifts correlated with the magnitude of compensatory vocal responses. These findings demonstrate a functional parcellation of HG with neural activity that encodes pitch in natural human voice, distinguishes between self-generated and passively heard vocalizations, detects discrepancies between the intended and heard vocalization, and contains information about the resulting behavioral vocal compensations in response to auditory feedback pitch perturbations. SIGNIFICANCE STATEMENT The present study is a significant contribution to our understanding of sensor-motor mechanisms of vocal production and motor control. The findings demonstrate distinct functional parcellation of core and noncore areas within human auditory cortex on Heschl's gyrus that process natural human vocalizations and pitch perturbations in the auditory feedback. In addition, our data provide evidence for distinct roles of high gamma neural oscillations and frequency following responses for processing periodicity in human vocalizations during vocal production and motor control. PMID:26888939
Differential short-term memorisation for vocal and instrumental rhythms.

PubMed

Klyn, Niall A M; Will, Udo; Cheong, Yong-Jeon; Allen, Erin T

2016-07-01

This study explores differential processing of vocal and instrumental rhythms in short-term memory with three decision (same/different judgments) and one reproduction experiment. In the first experiment, memory performance declined for delayed versus immediate recall, with accuracy for the two rhythms being affected differently: Musicians performed better than non-musicians on clapstick but not on vocal rhythms, and musicians were better on vocal rhythms in the same than in the different condition. Results for the second experiment showed that concurrent sub-vocal articulation and finger-tapping differentially affected the two rhythms and same/different decisions, but produced no evidence for articulatory loop involvement in delayed decision tasks. In a third experiment, which tested rhythm reproduction, concurrent sub-vocal articulation decreased memory performance, with a stronger deleterious effect on the reproduction of vocal than of clapstick rhythms. This suggests that the articulatory loop may only be involved in delayed reproduction not in decision tasks. The fourth experiment tested whether differences between filled and empty rhythms (continuous vs. discontinuous sounds) can explain the different memorisation of vocal and clapstick rhythms. Though significant differences were found for empty and filled instrumental rhythms, the differences between vocal and clapstick can only be explained by considering additional voice specific features.
Vocal Features of Song and Speech: Insights from Schoenberg's Pierrot Lunaire.

PubMed

Merrill, Julia; Larrouy-Maestri, Pauline

2017-01-01

Similarities and differences between speech and song are often examined. However, the perceptual definition of these two types of vocalization is challenging. Indeed, the prototypical characteristics of speech or song support top-down processes, which influence listeners' perception of acoustic information. In order to examine vocal features associated with speaking and singing, we propose an innovative approach designed to facilitate bottom-up mechanisms in perceiving vocalizations by using material situated between speech and song: Speechsong. 25 participants were asked to evaluate 20 performances of a speechsong composition by Arnold Schoenberg, "Pierrot lunaire" op. 21 from 1912, evaluating 20 features of vocal-articulatory expression. Raters provided reliable judgments concerning the vocal features used by the performers and did not show strong appeal or specific expectations in reference to Schoenberg's piece. By examining the relationship between the vocal features and the impression of song or speech, the results confirm the importance of pitch (height, contour, range), but also point to the relevance of register, timbre, tension and faucal distance. Besides highlighting vocal features associated with speech and song, this study supports the relevance of the present approach of focusing on a theoretical middle category in order to better understand vocal expression in song and speech.
Effects of speech style, room acoustics, and vocal fatigue on vocal effort

PubMed Central

Bottalico, Pasquale; Graetzer, Simone; Hunter, Eric J.

2016-01-01

Vocal effort is a physiological measure that accounts for changes in voice production as vocal loading increases. It has been quantified in terms of sound pressure level (SPL). This study investigates how vocal effort is affected by speaking style, room acoustics, and short-term vocal fatigue. Twenty subjects were recorded while reading a text at normal and loud volumes in anechoic, semi-reverberant, and reverberant rooms in the presence of classroom babble noise. The acoustics in each environment were modified by creating a strong first reflection in the talker position. After each task, the subjects answered questions addressing their perception of the vocal effort, comfort, control, and clarity of their own voice. Variation in SPL for each subject was measured per task. It was found that SPL and self-reported effort increased in the loud style and decreased when the reflective panels were present and when reverberation time increased. Self-reported comfort and control decreased in the loud style, while self-reported clarity increased when panels were present. The lowest magnitude of vocal fatigue was experienced in the semi-reverberant room. The results indicate that early reflections may be used to reduce vocal effort without modifying reverberation time. PMID:27250179
Integrating perspectives on vocal performance and consistency

PubMed Central

Sakata, Jon T.; Vehrencamp, Sandra L.

2012-01-01

SUMMARY Recent experiments in divergent fields of birdsong have revealed that vocal performance is important for reproductive success and under active control by distinct neural circuits. Vocal consistency, the degree to which the spectral properties (e.g. dominant or fundamental frequency) of song elements are produced consistently from rendition to rendition, has been highlighted as a biologically important aspect of vocal performance. Here, we synthesize functional, developmental and mechanistic (neurophysiological) perspectives to generate an integrated understanding of this facet of vocal performance. Behavioral studies in the field and laboratory have found that vocal consistency is affected by social context, season and development, and, moreover, positively correlated with reproductive success. Mechanistic investigations have revealed a contribution of forebrain and basal ganglia circuits and sex steroid hormones to the control of vocal consistency. Across behavioral, developmental and mechanistic studies, a convergent theme regarding the importance of vocal practice in juvenile and adult songbirds emerges, providing a basis for linking these levels of analysis. By understanding vocal consistency at these levels, we gain an appreciation for the various dimensions of song control and plasticity and argue that genes regulating the function of basal ganglia circuits and sex steroid hormones could be sculpted by sexual selection. PMID:22189763
The vocal load of Reform Jewish cantors in the USA.

PubMed

Hapner, Edie; Gilman, Marina

2012-03-01

Jewish cantors comprise a subset of vocal professionals that is not well understood by vocal health professionals. This study aimed to document the vocal demands, vocal training, reported incidence of voice problems, and treatment-seeking behavior of Reform Jewish cantors. The study used a prospective observational design to anonymously query Reform Jewish cantors using a 35-item multiple-choice survey distributed online. Demographic information, medical history, vocal music training, cantorial duties, history of voice problems, and treatment-seeking behavior were addressed. Results indicated that many of the commonly associated risk factors for developing voice disorders were present in this population, including high vocal demands, reduced vocal downtime, allergies, and acid reflux. Greater than 65% of the respondents reported having had a voice problem that interfered with their ability to perform their duties at some time during their careers. Reform Jewish cantors are a population of occupational voice users who may be currently unidentified and underserved by vocal health professionals. The results of the survey suggest that Reform Jewish cantors are occupational voice users and are at high risk for developing voice disorders. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Data analysis of response interruption and redirection as a treatment for vocal stereotypy.

PubMed

Wunderlich, Kara L; Vollmer, Timothy R

2015-12-01

Vocal stereotypy, or repetitive, noncontextual vocalizations, is a problematic form of behavior exhibited by many individuals with autism spectrum disorder (ASD). Recent research has evaluated the efficacy of response interruption and redirection (RIRD) in the reduction of vocal stereotypy. Research has indicated that RIRD often results in reductions in the level of vocal stereotypy; however, many previous studies have only presented data on vocal stereotypy that occurred outside RIRD implementation. The current study replicated the procedures of previous studies that have evaluated the efficacy of RIRD and compared 2 data-presentation methods: inclusion of only data collected outside RIRD implementation and inclusion of all vocal stereotypy data from the entirety of each session. Subjects were 7 children who had been diagnosed with ASD. Results indicated that RIRD appeared to be effective when we evaluated the level of vocal stereotypy outside RIRD implementation, but either no reductions or more modest reductions in the level of vocal stereotypy during the entirety of sessions were obtained for all subjects. Results suggest that data-analysis methods used in previous research may overestimate the efficacy of RIRD. © Society for the Experimental Analysis of Behavior.
Characterizing the graded structure of false killer whale (Pseudorca crassidens) vocalizations.

PubMed

Murray, S O; Mercado, E; Roitblat, H L

1998-09-01

The vocalizations from two, captive false killer whales (Pseudorca crassidens) were analyzed. The structure of the vocalizations was best modeled as lying along a continuum with trains of discrete, exponentially damped sinusoidal pulses at one end and continuous sinusoidal signals at the other end. Pulse trains were graded as a function of the interval between pulses where the minimum interval between pulses could be zero milliseconds. The transition from a pulse train with no inter-pulse interval to a whistle could be modeled by gradations in the degree of damping. There were many examples of vocalizations that were gradually modulated from pulse trains to whistles. There were also vocalizations that showed rapid shifts in signal type--for example, switching immediately from a whistle to a pulse train. These data have implications when considering both the possible function(s) of the vocalizations and the potential sound production mechanism(s). A short-time duty cycle measure was developed to characterize the graded structure of the vocalizations. A random sample of 500 vocalizations was characterized by combining the duty cycle measure with peak frequency measurements. The analysis method proved to be an effective metric for describing the graded structure of false killer whale vocalizations.
Low-frequency vocalizations in the Florida manatee (Trichechus manatus latirostris)

NASA Astrophysics Data System (ADS)

Frisch, Katherine; Frisch, Stefan

2003-10-01

Vocalizations produced by Florida manatees (Trichechus manatus latirostris) have been characterized as being of relatively high frequency, with fundamental tones ranging from 2500-5000 Hz. These sounds have been variously described as squeaks, squeals, and chirps. Vocalizations below 500 Hz have not been previously reported. Two captive-born Florida manatees were recorded at Mote Marine Laboratory in Sarasota, Florida. The analysis of these vocalizations provides evidence of a new category of low-frequency sounds produced by manatees. These sounds are often heard in conjunction with higher-frequency vocalizations. The low-frequency vocalizations are relatively brief and of low amplitude. These vocalizations are perceived as a series of impulses rather than a low-frequency periodic tone. Knowledge of these low-frequency vocalizations could be useful to those developing future management strategies. Interest has recently increased in the development of acoustic detection and deterrence devices to reduce the number of manatee watercraft interactions. The design of appropriate devices must take into account the apparent ability of manatees to perceive and produce sounds of both high and low frequency. It is also important to consider the possibility that acoustic deterrence devices may disrupt the potentially communicative frequencies of manatee vocalizations.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.