Lexical frequency and voice assimilation in complex words in Dutch
NASA Astrophysics Data System (ADS)
Ernestus, Mirjam; Lahey, Mybeth; Verhees, Femke; Baayen, Harald
2004-05-01
Words with higher token frequencies tend to have more reduced acoustic realizations than lower frequency words (e.g., Hay, 2000; Bybee, 2001; Jurafsky et al., 2001). This study documents frequency effects for regressive voice assimilation (obstruents are voiced before voiced plosives) in Dutch morphologically complex words in the subcorpus of read-aloud novels in the corpus of spoken Dutch (Oostdijk et al., 2002). As expected, the initial obstruent of the cluster tends to be absent more often as lexical frequency increases. More importantly, as frequency increases, the duration of vocal-fold vibration in the cluster decreases, and the duration of the bursts in the cluster increases, after partialing out cluster duration. This suggests that there is less voicing for higher-frequency words. In fact, phonetic transcriptions show regressive voice assimilation for only half of the words and progressive voice assimilation for one third. Interestingly, the progressive voice assimilation observed for higher-frequency complex words renders these complex words more similar to monomorphemic words: Dutch monomorphemic words typically contain voiceless obstruent clusters (Zonneveld, 1983). Such high-frequency complex words may therefore be less easily parsed into their constituent morphemes (cf. Hay, 2000), favoring whole word lexical access (Bertram et al., 2000).
Human voice quality measurement in noisy environments.
Ueng, Shyh-Kuang; Luo, Cheng-Ming; Tsai, Tsung-Yu; Yeh, Hsuan-Chen
2015-01-01
Computerized acoustic voice measurement is essential for the diagnosis of vocal pathologies. Previous studies showed that ambient noises have significant influences on the accuracy of voice quality assessment. This paper presents a voice quality assessment system that can accurately measure qualities of voice signals, even though the input voice data are contaminated by low-frequency noises. The ambient noises in our living rooms and laboratories are collected and the frequencies of these noises are analyzed. Based on the analysis, a filter is designed to reduce noise level of the input voice signal. Then, improved numerical algorithms are employed to extract voice parameters from the voice signal to reveal the health of the voice signal. Compared with MDVP and Praat, the proposed method outperforms these two widely used programs in measuring fundamental frequency and harmonic-to-noise ratio, and its performance is comparable to these two famous programs in computing jitter and shimmer. The proposed voice quality assessment method is resistant to low-frequency noises and it can measure human voice quality in environments filled with noises from air-conditioners, ceiling fans and cooling fans of computers.
Perception of the fundamental frequencies of children's voices by trained and untrained listeners.
Wilson, F B; Wellen, C J; Kimbarow, M L
1983-10-01
This study was designed to determine if trained voice clinicians were better than untrained listeners in judging differences in the fundamental frequencies of children's voices. We also attempted to determine the degree of difference in fundamental frequency necessary for accurate judgments. Finally, ability to perceive pitch differences in speaking voices was correlated with ability to judge puretone stimuli. Results indicated that trained clinicians were no better at judging average fundamental frequency than were untrained listeners. Both groups performed at chance level until differences in vocal fundamental frequency exceeded 20 Hz. Finally, there was no correlation between subjects' success on standardized puretone pitch tests and ability to judge average pitch in the speaking voice.
Voice characteristics in the progression of Parkinson's disease.
Holmes, R J; Oates, J M; Phyland, D J; Hughes, A J
2000-01-01
This study examined the acoustic and perceptual voice characteristics of patients with Parkinson's disease according to disease severity. The perceptual and acoustic voice characteristics of 30 patients with early stage PD and 30 patients with later stage PD were compared with data from 30 normal control subjects. Voice recordings consisted of prolongation of the vowel /a/, scale singing, and a 1-min monologue. In comparison with controls and previously published normative data, both early and later stage PD patients' voices were characterized perceptually by limited pitch and loudness variability, breathiness, harshness and reduced loudness. High modal pitch levels also characterized the voices of males in both early and later stages of PD. Acoustically, the voices of both groups of PD patients demonstrated lower mean intensity levels and reduced maximum phonational frequency ranges in comparison with normative data. Although less clear, the present data also suggested that the PD patients' voices were characterized by excess jitter, a high-speaking fundamental frequency for males and a reduced fundamental frequency variability for females. While several of these voice features did not appear to deteriorate with disease progression (i.e. harshness, high modal pitch and speaking fundamental frequency in males, fundamental frequency variability in females, low intensity and jitter), breathiness, monopitch and monoloudness, low loudness and reduced maximum phonational frequency range were all worse in the later stages of PD. Tremor was the sole voice feature which was associated only with later stage PD.
Quantitative analysis of professionally trained versus untrained voices.
Siupsinskiene, Nora
2003-01-01
The aim of this study was to compare healthy trained and untrained voices as well as healthy and dysphonic trained voices in adults using combined voice range profile and aerodynamic tests, to define the normal range limiting values of quantitative voice parameters and to select the most informative quantitative voice parameters for separation between healthy and dysphonic trained voices. Three groups of persons were evaluated. One hundred eighty six healthy volunteers were divided into two groups according to voice training: non-professional speakers group consisted of 106 untrained voices persons (36 males and 70 females) and professional speakers group--of 80 trained voices persons (21 males and 59 females). Clinical group consisted of 103 dysphonic professional speakers (23 males and 80 females) with various voice disorders. Eighteen quantitative voice parameters from combined voice range profile (VRP) test were analyzed: 8 of voice range profile, 8 of speaking voice, overall vocal dysfunction degree and coefficient of sound, and aerodynamic maximum phonation time. Analysis showed that healthy professional speakers demonstrated expanded vocal abilities in comparison to healthy non-professional speakers. Quantitative voice range profile parameters- pitch range, high frequency limit, area of high frequencies and coefficient of sound differed significantly between healthy professional and non-professional voices, and were more informative than speaking voice or aerodynamic parameters in showing the voice training. Logistic stepwise regression revealed that VRP area in high frequencies was sufficient to discriminate between healthy and dysphonic professional speakers for male subjects (overall discrimination accuracy--81.8%) and combination of three quantitative parameters (VRP high frequency limit, maximum voice intensity and slope of speaking curve) for female subjects (overall model discrimination accuracy--75.4%). We concluded that quantitative voice assessment with selected parameters might be useful for evaluation of voice education for healthy professional speakers as well as for detection of vocal dysfunction and evaluation of rehabilitation effect in dysphonic professionals.
F0 Characteristics of Newsreaders on Varied Emotional Texts in Tamil Language.
Gunasekaran, Nishanthi; Boominathan, Prakash; Seethapathy, Jayashree
2017-12-26
The objective of this study was to profile speaking F 0 and its variations in newsreaders on varied emotional texts. This study has a prospective, case-control study design. Fifteen professional newsreaders and 15 non-newsreaders were the participants. The participants read the news bulletin that conveyed different emotions (shock, neutral, happy, and sad) in a habitual and "newsreading" voice. Speaking fundamental frequency (SFF) and F 0 variations were extracted from 1620 tokens using Praat software (version 5.2.32) on the opening lines, headlines, news stories, and closing lines of each news item. Paired t test, independent t test, and Friedman test were used for statistical analysis. Both male and female newsreaders had significantly (P ≤ 0.05) higher SFFs and standard deviations (SDs) of SFF in newsreading voice than speaking voice. Female non-newsreaders demonstrated significantly higher SFF and SD of SFF in newsreading voice, whereas no significant differences were noticed in the frequency parameters for male non-newsreaders. No significant difference was noted in the frequency parameters of speaking and newsreading voice between male newsreaders and male non-newsreaders. A significant difference in the SD of SFF was noticed between female newsreaders and female non-newsreaders in newsreading voice. Female newsreaders had a higher frequency range in both speaking voice and newsreading voice when compared with non-newsreaders. F 0 characteristics and frequency range determine the amount of frequency changes exercised by newsreaders while reading bulletins. This information is highly pedagogic for training voices in this profession. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Schultz-Coulon, H J
1975-07-01
The applicability of a newly developed fundamental frequency analyzer to diagnosis in phoniatrics is reviewed. During routine voice examination, the analyzer allows a quick and accurate measurement of fundamental frequency and sound level of the speaking voice, and of vocal range and maximum phonation time. By computing fundamental frequency histograms, the median fundamental frequency and the total pitch range can be better determined and compared. Objective studies of certain technical faculties of the singing voice, which usually are estimated subjectively by the speech therapist, may now be done by means of this analyzer. Several examples demonstrate the differences between correct and incorrect phonation. These studies compare the pitch perturbations during the crescendo and decrescendo of a swell-tone, and show typical traces of staccato, thrill and yodel. Conclusions of the study indicate that fundamental frequency analysis is a valuable supplemental method for objective voice examination.
Shao, Xu; Milner, Ben
2005-08-01
This work proposes a method to reconstruct an acoustic speech signal solely from a stream of mel-frequency cepstral coefficients (MFCCs) as may be encountered in a distributed speech recognition (DSR) system. Previous methods for speech reconstruction have required, in addition to the MFCC vectors, fundamental frequency and voicing components. In this work the voicing classification and fundamental frequency are predicted from the MFCC vectors themselves using two maximum a posteriori (MAP) methods. The first method enables fundamental frequency prediction by modeling the joint density of MFCCs and fundamental frequency using a single Gaussian mixture model (GMM). The second scheme uses a set of hidden Markov models (HMMs) to link together a set of state-dependent GMMs, which enables a more localized modeling of the joint density of MFCCs and fundamental frequency. Experimental results on speaker-independent male and female speech show that accurate voicing classification and fundamental frequency prediction is attained when compared to hand-corrected reference fundamental frequency measurements. The use of the predicted fundamental frequency and voicing for speech reconstruction is shown to give very similar speech quality to that obtained using the reference fundamental frequency and voicing.
Correlational Analysis of Speech Intelligibility Tests and Metrics for Speech Transmission
2017-12-04
frequency scale (male voice; normal voice effort) ............................... 4 Fig. 2 Diagram of a speech communication system (Letowski...languages. Consonants contain mostly high frequency (above 1500 Hz) speech energy, but this energy is relatively small in comparison to that of the whole...voices (Letowski et al. 1993). Since the mid- frequency spectral region contains mostly vowel energy while consonants are high frequency sounds, an
Acoustic analysis of speech variables during depression and after improvement.
Nilsonne, A
1987-09-01
Speech recordings were made of 16 depressed patients during depression and after clinical improvement. The recordings were analyzed using a computer program which extracts acoustic parameters from the fundamental frequency contour of the voice. The percent pause time, the standard deviation of the voice fundamental frequency distribution, the standard deviation of the rate of change of the voice fundamental frequency and the average speed of voice change were found to correlate to the clinical state of the patient. The mean fundamental frequency, the total reading time and the average rate of change of the voice fundamental frequency did not differ between the depressed and the improved group. The acoustic measures were more strongly correlated to the clinical state of the patient as measured by global depression scores than to single depressive symptoms such as retardation or agitation.
Siupsinskiene, Nora; Lycke, Hugo
2011-07-01
This prospective cross-sectional study examines the effects of voice training on vocal capabilities in vocally healthy age and gender differentiated groups measured by voice range profile (VRP) and speech range profile (SRP). Frequency and intensity measurements of the VRP and SRP using standard singing and speaking voice protocols were derived from 161 trained choir singers (21 males, 59 females, and 81 prepubescent children) and from 188 nonsingers (38 males, 89 females, and 61 children). When compared with nonsingers, both genders of trained adult and child singers exhibited increased mean pitch range, highest frequency, and VRP area in high frequencies (P<0.05). Female singers and child singers also showed significantly increased mean maximum voice intensity, intensity range, and total VRP area. The logistic regression analysis showed that VRP pitch range, highest frequency, maximum voice intensity, and maximum-minimum intensity range, and SRP slope of speaking curve were the key predictors of voice training. Age, gender, and voice training differentiated norms of VRP and SRP parameters are presented. Significant positive effect of voice training on vocal capabilities, mostly singing voice, was confirmed. The presented norms for trained singers, with key parameters differentiated by gender and age, are suggested for clinical practice of otolaryngologists and speech-language pathologists. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Sex hormones and the elderly male voice.
Gugatschka, Markus; Kiesler, Karl; Obermayer-Pietsch, Barbara; Schoekler, Bernadette; Schmid, Christoph; Groselj-Strele, Andrea; Friedrich, Gerhard
2010-05-01
The objective was to describe influences of sex hormones on the male voice in an elderly cohort. Sixty-three elderly males were recruited to undergo assessment of voice parameters, stroboscopy, voice-related questionnaires, a blood draw, and an ultrasound examination of the laryngeal skeleton. The group was divided into men with normal hormonal status and men with lowered levels of sex hormones, called hypogonades. Depending on the level of androgens, voice parameters did not differ. In subjects with decreased levels of estrogens, a significant increase in mean fundamental frequency, as well as changes of highest and lowest frequency plus a shift of the frequency range could be detected. We could detect significant changes of voice parameters depending on status of estrogens in elderly males. Androgens appear to have no impact on the elderly male voice. To our knowledge, this is the first prospective study that correlates sex hormones with voice parameters in elderly men. (c) 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
ERIC Educational Resources Information Center
Stepp, Cara E.; Merchant, Gabrielle R.; Heaton, James T.; Hillman, Robert E.
2011-01-01
Purpose: The purpose of this study was to determine whether the relative fundamental frequency (RFF) surrounding a voiceless consonant in patients with hyperfunctionally related voice disorders would normalize after a successful course of voice therapy. Method: Pre- and posttherapy measurements of RFF were compared in 16 subjects undergoing voice…
NASA Astrophysics Data System (ADS)
Freeman, Allison
This research examined the fundamental frequency and perturbation (jitter % and shimmer %) measures in young adult (20-30 year-old) and middle-aged adult (40-55 year-old) smokers and non-smokers; there were 36 smokers and 36 non-smokers. Acoustic analysis was carried out utilizing one task: production of sustained /a/. These voice samples were analyzed utilizing Multi-Dimensional Voice Program (MDVP) software, which provided values for fundamental frequency, jitter %, and shimmer %.These values were analyzed for trends regarding smoking status, age, and gender. Statistical significance was found regarding the fundamental frequency, jitter %, and shimmer % for smokers as compared to non-smokers; smokers were found to have significantly lower fundamental frequency values, and significantly higher jitter % and shimmer % values. Statistical significance was not found regarding fundamental frequency, jitter %, and shimmer % for age group comparisons. With regard to gender, statistical significance was found regarding fundamental frequency; females were found to have statistically higher fundamental frequencies as compared to males. However, the relationships between gender and jitter % and shimmer % lacked statistical significance. These results indicate that smoking negatively affects voice quality. This study also examined the ability of untrained listeners to identify smokers and non-smokers based on their voices. Results of this voice perception task suggest that listeners are not accurately able to identify smokers and non-smokers, as statistical significance was not reached. However, despite a lack of significance, trends in data suggest that listeners are able to utilize voice quality to identify smokers and non-smokers.
ERIC Educational Resources Information Center
Van Stan, Jarrad H.; Mehta, Daryush D.; Sternad, Dagmar; Petit, Robert; Hillman, Robert E.
2017-01-01
Purpose: Ambulatory voice biofeedback has the potential to significantly improve voice therapy effectiveness by targeting carryover of desired behaviors outside the therapy session (i.e., retention). This study applies motor learning concepts (reduced frequency and delayed, summary feedback) that demonstrate increased retention to ambulatory voice…
Electroglottogram waveform types.
Painter, C
1988-01-01
Electroglottography is a useful, non-invasive technique that can assist in the assessment of vocal fold dysfunction. However, if it is to become a useful clinical tool, there is a need for normative studies of the electroglottogram waveform types that characterize trained professional voice users, untrained non-professional speakers and patients with voice disorders and for a way of quantifying and objectively comparing similarities and differences. This report describes our methodology and an investigation into the waveform types characterizing one trained professional voice user phonating in 15 experimental sessions under various fundamental frequency, intensity and voice quality conditions. A number of strong tendencies were noted. In normal voice the lower frequencies and intensities represent one pole of a scale of a mode of phonation, while the higher frequencies and intensities depict the other pole. In these studies breathy voice data overlapped the lower end of the scale and tense voice data overlapped the upper end.
High-frequency energy in singing and speech
NASA Astrophysics Data System (ADS)
Monson, Brian Bruce
While human speech and the human voice generate acoustical energy up to (and beyond) 20 kHz, the energy above approximately 5 kHz has been largely neglected. Evidence is accruing that this high-frequency energy contains perceptual information relevant to speech and voice, including percepts of quality, localization, and intelligibility. The present research was an initial step in the long-range goal of characterizing high-frequency energy in singing voice and speech, with particular regard for its perceptual role and its potential for modification during voice and speech production. In this study, a database of high-fidelity recordings of talkers was created and used for a broad acoustical analysis and general characterization of high-frequency energy, as well as specific characterization of phoneme category, voice and speech intensity level, and mode of production (speech versus singing) by high-frequency energy content. Directionality of radiation of high-frequency energy from the mouth was also examined. The recordings were used for perceptual experiments wherein listeners were asked to discriminate between speech and voice samples that differed only in high-frequency energy content. Listeners were also subjected to gender discrimination tasks, mode-of-production discrimination tasks, and transcription tasks with samples of speech and singing that contained only high-frequency content. The combination of these experiments has revealed that (1) human listeners are able to detect very subtle level changes in high-frequency energy, and (2) human listeners are able to extract significant perceptual information from high-frequency energy.
Guidelines for Selecting Microphones for Human Voice Production Research
ERIC Educational Resources Information Center
Svec, Jan G.; Granqvist, Svante
2010-01-01
Purpose: This tutorial addresses fundamental characteristics of microphones (frequency response, frequency range, dynamic range, and directionality), which are important for accurate measurements of voice and speech. Method: Technical and voice literature was reviewed and analyzed. The following recommendations on desirable microphone…
Peters, E R; Williams, S L; Cooke, M A; Kuipers, E
2012-07-01
Previous studies have suggested that beliefs about voices mediate the relationship between actual voice experience and behavioural and affective response. We investigated beliefs about voice power (omnipotence), voice intent (malevolence/benevolence) and emotional and behavioural response (resistance/engagement) using the Beliefs About Voices Questionnaire - Revised (BAVQ-R) in 46 voice hearers. Distress was assessed using a wide range of measures: voice-related distress, depression, anxiety, self-esteem and suicidal ideation. Voice topography was assessed using measures of voice severity, frequency and intensity. We predicted that beliefs about voices would show a stronger association with distress than voice topography. Omnipotence had the strongest associations with all measures of distress included in the study whereas malevolence was related to resistance, and benevolence to engagement. As predicted, voice severity, frequency and intensity were not related to distress once beliefs were accounted for. These results concur with previous findings that beliefs about voice power are key determinants of distress in voice hearers, and should be targeted specifically in psychological interventions.
Gender differences in children's voice use in a day care environment.
Nygren, Mariana; Tyboni, Mikaela; Lindström, Fredric; McAllister, Anita; van Doorn, Jan
2012-11-01
The prevalence of dysphonia is higher in boys than in girls before puberty. This could be because of the differences in boys' and girls' voice use. Previous research on gender differences in prepubescent children's voice parameters has been contradictory. Most studies have focused on examining fundamental frequency. The purpose of this study was to investigate voice use in boys and girls in a day care environment based on the voice parameters fundamental frequency (Hz), vocal intensity (dB SPL), and phonation time (%) and to ascertain whether there were any significant gender differences. Prospective comparative design. The study was conducted in a day care environment where 30 children (17 boys and 13 girls aged 4-5 years) participated. The participants' voices were measured continuously for 4 hours with a voice accumulator that registered fundamental frequency, vocal intensity level, phonation time, and background noise. Mean (standard deviation) fundamental frequency was 310 (22) and 321 (16) Hz, vocal intensity was 93 (4) and 91 (3) dB SPL, and phonation time was 7.7 (2.0)% and 7.6 (2.5)% for boys and girls, respectively. No differences between genders were statistically significant. The finding of no statistically significant gender differences for measurements of voice parameters in a group of children aged 4-5 years in a day care environment is an important finding that contributes to increased knowledge about young boys' and girls' voice use. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Pitch Elevation in Male-to-female Transgender Persons-the Würzburg Approach.
Meister, Jonas; Hagen, Rudolf; Shehata-Dieler, Wafaa; Kühn, Heike; Kraus, Fabian; Kleinsasser, Norbert
2017-03-01
The present study reports objective and subjective voice results of Wendler's glottoplasty modified by Hagen. This is an outcomes research study. A total of 21 patients underwent Wendler's glottoplasty modified by Hagen. Parameters in the follow-up session were laryngoscopy, voice range profile, Voice Handicap Index, Life Satisfaction Questionnaire, and a visual analog scale for individual satisfaction with the voice. The fundamental frequency was elevated into the typical female fundamental frequency range. Furthermore, an elevation of the lower frequency limit was shown without a reduction of the frequency range. About one third of the population feels affected by the restricted dynamic range. This change of the vocal pitch is seen as part of the voice feminization by some of the patients. The Dysphonia Severity Index as a marker for voice quality was unchanged. Subjective satisfaction with the voice showed a strong correlation with the individual elevation of the pitch. Wendler's glottoplasty modified by Hagen is an effective and low-risk method of raising the vocal pitch of male-to-female transgender persons. However, elevated Scores of the Voice Handicap Index indicated that in everyday life, transgender persons continue to feel handicapped because of their voice. Another indicator for the lack of social acceptance and integration is the reduced general life satisfaction in the Life Satisfaction Questionnaire especially in the domain "friends, acquaintances, relatives." Therefore, a better multidisciplinary therapy concept for voice feminization is necessary. Copyright © 2017. Published by Elsevier Inc.
Voice-stress measure of mental workload
NASA Technical Reports Server (NTRS)
Alpert, Murray; Schneider, Sid J.
1988-01-01
In a planned experiment, male subjects between the age of 18 and 50 will be required to produce speech while performing various tasks. Analysis of the speech produced should reveal which aspects of voice prosody are associated with increased workloads. Preliminary results with two female subjects suggest a possible trend for voice frequency and amplitude to be higher and the variance of the voice frequency to be lower in the high workload condition.
Real time analysis of voiced sounds
NASA Technical Reports Server (NTRS)
Hong, J. P. (Inventor)
1976-01-01
A power spectrum analysis of the harmonic content of a voiced sound signal is conducted in real time by phase-lock-loop tracking of the fundamental frequency, (f sub 0) of the signal and successive harmonics (h sub 1 through h sub n) of the fundamental frequency. The analysis also includes measuring the quadrature power and phase of each frequency tracked, differentiating the power measurements of the harmonics in adjacent pairs, and analyzing successive differentials to determine peak power points in the power spectrum for display or use in analysis of voiced sound, such as for voice recognition.
Voice measures of workload in the advanced flight deck
NASA Technical Reports Server (NTRS)
Schneider, Sid J.; Alpert, Murray; Odonnell, Richard
1989-01-01
Voice samples were obtained from 14 male subjects under high and low workload conditions. Acoustical analysis of the voice suggested that high workload conditions can be revealed by their effects on the voice over time. Aircrews in the advanced flight deck will be voicing short, imperative sentences repeatedly. A drop in the energy of the voice, as reflected by reductions in amplitude and frequency over time, and the failure to achieve old amplitude and frequency levels after rest periods, can signal that the workload demands of the situation are straining the speaker. This kind of measurement would be relatively unaffected by individual differences in acoustical measures.
Voice Relative Fundamental Frequency via Neck-Skin Acceleration in Individuals with Voice Disorders
ERIC Educational Resources Information Center
Lien, Yu-An S.; Calabrese, Carolyn R.; Michener, Carolyn M.; Murray, Elizabeth Heller; Van Stan, Jarrad H.; Mehta, Daryush D.; Hillman, Robert E.; Noordzij, J. Pieter; Stepp, Cara E.
2015-01-01
Purpose: This study investigated the use of neck-skin acceleration for relative fundamental frequency (RFF) analysis. Method: Forty individuals with voice disorders associated with vocal hyperfunction and 20 age- and sex-matched control participants were recorded with a subglottal neck-surface accelerometer and a microphone while producing speech…
Volitional exaggeration of body size through fundamental and formant frequency modulation in humans
Pisanski, Katarzyna; Mora, Emanuel C.; Pisanski, Annette; Reby, David; Sorokowski, Piotr; Frackowiak, Tomasz; Feinberg, David R.
2016-01-01
Several mammalian species scale their voice fundamental frequency (F0) and formant frequencies in competitive and mating contexts, reducing vocal tract and laryngeal allometry thereby exaggerating apparent body size. Although humans’ rare capacity to volitionally modulate these same frequencies is thought to subserve articulated speech, the potential function of voice frequency modulation in human nonverbal communication remains largely unexplored. Here, the voices of 167 men and women from Canada, Cuba, and Poland were recorded in a baseline condition and while volitionally imitating a physically small and large body size. Modulation of F0, formant spacing (∆F), and apparent vocal tract length (VTL) were measured using Praat. Our results indicate that men and women spontaneously and systemically increased VTL and decreased F0 to imitate a large body size, and reduced VTL and increased F0 to imitate small size. These voice modulations did not differ substantially across cultures, indicating potentially universal sound-size correspondences or anatomical and biomechanical constraints on voice modulation. In each culture, men generally modulated their voices (particularly formants) more than did women. This latter finding could help to explain sexual dimorphism in F0 and formants that is currently unaccounted for by sexual dimorphism in human vocal anatomy and body size. PMID:27687571
Electromyographic activity of strap and cricothyroid muscles in pitch change.
Roubeau, B; Chevrie-Muller, C; Lacau Saint Guily, J
1997-05-01
The EMG activity of the cricothyroid muscle (CT) and the three extrinsic laryngeal muscles (thyohyoid, TH; sternothyroid, ST, and sternohyoid, SH) were recorded throughout the voice range of one female and one male subject, both untrained singers. The voice range was examined using rising and falling glissandos (production of a sustained sound with progressive and continuous variation of fundamental frequency). Muscle activity was observed at various pitches during the glissandos. The strap muscle activity during the production of glissandos appears to be synergistic. At the lowest frequency, the CT is inactive but strap muscles (TH, ST, SH) are active. As frequency increases, strap muscle activity decreases while the CT controls frequency in the middle of the range. At higher frequencies the strap muscles once again become active. This activity might depend on the vocal vibratory mechanism involved. The role of the strap muscles at high pitches is a widely debated point but it seems that in some way they control the phenomena relevant to the rising pitch. The phasic-type strap muscle activity contrasts with the tonic-type activity of the CT. The CT closely controls the frequency, while the straps are not directly linked to the pitch but rather to the evolution of the frequency of voice production (speaking voice, singing voice, held notes, glissandos, trillo, vibrato, etc.).
ERIC Educational Resources Information Center
Skuk, Verena G.; Schweinberger, Stefan R.
2014-01-01
Purpose: To determine the relative importance of acoustic parameters (fundamental frequency [F0], formant frequencies [FFs], aperiodicity, and spectrum level [SL]) on voice gender perception, the authors used a novel parameter-morphing approach that, unlike spectral envelope shifting, allows the application of nonuniform scale factors to transform…
Vocal impact of a prolonged reading task in dysphonic versus normophonic female teachers.
Remacle, Angélique; Morsomme, Dominique; Berrué, Elise; Finck, Camille
2012-11-01
This study evaluates the effect of a 2-hour reading task between 70 and 75 dB(A) in 16 normophonic and 16 dysphonic female teachers with vocal nodules. Objective measurements (acoustic analysis, voice range measurements, and aerodynamic measurements) and subjective self-ratings were collected before and every 30 minutes during the reading to determine the voice evolution in both groups. Fundamental frequency, lowest frequency, highest frequency (F-High), highest intensity, and intensity range increase through the reading, whereas shimmer decreases. Maximum phonation time decreases after 30 minutes. Estimated subglottal pressure (ESP) and sound pressure level increase during the first hour. Afterward, ESP decreases. Self-ratings worsen through time. When comparing the normophonic and the dysphonic teachers, self-ratings reveal more complaints in the dysphonic group. Few differences in objective measurements are found between both groups: normophonic teachers show lower ESP, higher F-High, and greater frequency range. Frequency modifications from acoustic analysis and voice range measurements suggest an increased laryngeal tension during vocal load, while subjects perceive a worsening of voice. Aerodynamic parameters depict first a deterioration of voice efficiency and then an adaptation to the prolonged reading. The comparison between both groups shows a discrepancy between objective measurements and self-ratings, suggesting that both approaches are necessary to have a complete view of vocal load effects. Surprisingly, both groups behave similarly through vocal load, without more or quicker deterioration of voice in the dysphonic group. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
[The speaking fundamental frequency and the singing voice type].
Chernobel'skiĭ, S I
2010-01-01
The objective of this study was to determine the speaking fundamental frequency (SFO) in professional opera singers and its dependence on their voice type, if any. A total of 75 persons were available for observation using a special computer clinical program. Male voices were categorized into three groups (viz, tenor, baritone, and bass), female ones into 2 groups (soprano and mezzo-soprano). It was shown that borderlines between SFO types varied within a wide range in all study groups. Significant differences in SFO were documented between tenors, baritones, and basses and between sopranos and mezzo-sopranos; the differences were insignificant between baritones and basses. It is concluded that the speaking fundamental frequency depends on the type of the singing voice; however this characteristic may serve only as an auxiliary tool but can not be used for the classification of singing voices.
Voice similarity in identical twins.
Van Gysel, W D; Vercammen, J; Debruyne, F
2001-01-01
If people are asked to discriminate visually the two individuals of a monozygotic twin (MT), they mostly get into trouble. Does this problem also exist when listening to twin voices? Twenty female and 10 male MT voices were randomly assembled with one "strange" voice to get voice trios. The listeners (10 female students in Speech and Language Pathology) were asked to label the twins (voices 1-2, 1-3 or 2-3) in two conditions: two standard sentences read aloud and a 2.5-second midsection of a sustained /a/. The proportion correctly labelled twins was for female voices 82% and 63% and for male voices 74% and 52% for the sentences and the sustained /a/ respectively, both being significantly greater than chance (33%). The acoustic analysis revealed a high intra-twin correlation for the speaking fundamental frequency (SFF) of the sentences and the fundamental frequency (F0) of the sustained /a/. So the voice pitch could have been a useful characteristic in the perceptual identification of the twins. We conclude that there is a greater perceptual resemblance between the voices of identical twins than between voices without genetic relationship. The identification however is not perfect. The voice pitch possibly contributes to the correct twin identifications.
Mehta, Daryush D.; Sternad, Dagmar; Petit, Robert; Hillman, Robert E.
2017-01-01
Purpose Ambulatory voice biofeedback has the potential to significantly improve voice therapy effectiveness by targeting carryover of desired behaviors outside the therapy session (i.e., retention). This study applies motor learning concepts (reduced frequency and delayed, summary feedback) that demonstrate increased retention to ambulatory voice monitoring for training nurses to talk softer during work hours. Method Forty-eight nurses with normal voices wore the Voice Health Monitor (Mehta, Zañartu, Feng, Cheyne, & Hillman, 2012) for 6 days: 3 baseline days, 1 biofeedback day, 1 short-term retention day, and 1 long-term retention day. Participants were block-randomized into 3 different biofeedback groups: 100%, 25%, and Summary. Performance was measured in terms of compliance time below a participant-specific vocal intensity threshold. Results All participants exhibited a significant increase in compliance time (Cohen's d = 4.5) during biofeedback days compared with baseline days. The Summary feedback group exhibited statistically smaller performance reduction during both short-term (d = 1.14) and long-term (d = 1.04) retention days compared with the 100% feedback group. Conclusions These findings suggest that modifications in feedback frequency and timing affect retention of a modified vocal behavior in daily life. Future work calls for studying the potential beneficial impact of ambulatory voice biofeedback in participants with behaviorally based voice disorders. PMID:28329366
Relationship between Activity Noise, Voice Parameters, and Voice Symptoms among Female Teachers.
Pirilä, Sirpa; Pirilä, Paula; Ansamaa, Terhi; Yliherva, Anneli; Sonning, Samuel; Rantala, Leena
2017-01-01
Our interest was in how teachers' voices behave during the delivery of lessons in core subjects (e.g., mathematics, science, etc.). We sought to evaluate the relationship between voice sound pressure level (SPL), vocal fundamental frequency (F0), voice symptoms, activity noise, and differences therein during the first and the last lessons in core subjects of the day. The participants were 24 female elementary school teachers. Voice symptoms were evaluated by questionnaire. The data were recorded on 2 portable voice accumulators (VoxLog) from the first and last lessons of the day. The versions of accumulators differed by frequency weighting; therefore, the analysis and the results of noise and voice SPL were treated separately: unweighted (group 1) and A-weighted (group 2). Difference in voice SPL followed difference in activity noise. F0 increased between the first and last lessons. Correlations were found between differences in the noise and the voice symptoms of tiredness and dryness. Irritating mucus was associated with high F0 during the first lesson. An apparent increase in voice loading due to the activity noise was observed during lessons in core subjects. Collaboration between specialists in voice and acoustics and teachers and pupils is needed to reduce this voice loading. © 2017 S. Karger AG, Basel.
Double Fourier analysis for Emotion Identification in Voiced Speech
NASA Astrophysics Data System (ADS)
Sierra-Sosa, D.; Bastidas, M.; Ortiz P., D.; Quintero, O. L.
2016-04-01
We propose a novel analysis alternative, based on two Fourier Transforms for emotion recognition from speech. Fourier analysis allows for display and synthesizes different signals, in terms of power spectral density distributions. A spectrogram of the voice signal is obtained performing a short time Fourier Transform with Gaussian windows, this spectrogram portraits frequency related features, such as vocal tract resonances and quasi-periodic excitations during voiced sounds. Emotions induce such characteristics in speech, which become apparent in spectrogram time-frequency distributions. Later, the signal time-frequency representation from spectrogram is considered an image, and processed through a 2-dimensional Fourier Transform in order to perform the spatial Fourier analysis from it. Finally features related with emotions in voiced speech are extracted and presented.
Brockmann, Meike; Drinnan, Michael J; Storck, Claudio; Carding, Paul N
2011-01-01
The aims of this study were to examine vowel and gender effects on jitter and shimmer in a typical clinical voice task while correcting for the confounding effects of voice sound pressure level (SPL) and fundamental frequency (F(0)). Furthermore the relative effect sizes of vowel, gender, voice SPL, and F(0) were assessed, and recommendations for clinical measurements were derived. With this cross-sectional single cohort study, 57 healthy adults (28 women, 29 men) aged 20-40 years were investigated. Three phonations of /a/, /o/, and /i/ at "normal" voice loudness were analyzed using Praat (software). The effects of vowel, gender, voice SPL, and F(0) on jitter and shimmer were assessed using descriptive and inferential (analysis of covariance) statistics. The effect sizes were determined with the eta-squared statistic. Vowels, gender, voice SPL, and F(0), each had significant effects either on jitter or on shimmer, or both. Voice SPL was the most important factor, whereas vowel, gender, and F(0) effects were comparatively small. Because men had systematically higher voice SPL, the gender effects on jitter and shimmer were smaller when correcting for SPL and F(0). Surprisingly, in clinical assessments, voice SPL has the single biggest impact on jitter and shimmer. Vowel and gender effects were clinically important, whereas fundamental frequency had a relatively small influence. Phonations at a predefined voice SPL (80 dB minimum) and vowel (/a/) would enhance measurement reliability. Furthermore, gender-specific thresholds applying these guidelines should be established. However, the efficiency of these measures should be verified and tested with patients. Copyright © 2011 The Voice Foundation. All rights reserved.
Szabo Portela, Annika; Granqvist, Svante; Ternström, Sten; Södersten, Maria
2018-01-01
This study aimed to assess vocal behavior in women with voice-intensive occupations to investigate differences between patients and controls and between work and leisure conditions with environmental noise level as an experimental factor. Patients with work-related voice disorders, 10 with phonasthenia and 10 with vocal nodules, were matched regarding age, profession, and workplace with 20 vocally healthy colleagues. The sound pressure level of environmental noise and the speakers' voice, fundamental frequency, and phonation ratio were registered from morning to night during 1 week with a voice accumulator. Voice data were assessed in low (≤55 dBA), moderate, and high (>70 dBA) environmental noise levels. The average environmental noise level was significantly higher during the work condition for patients with vocal nodules (73.9 dBA) and their controls (73.0 dBA) compared with patients with phonasthenia (68.3 dBA) and their controls (67.1 dBA). The average voice level and the fundamental frequency were also significantly higher during work for the patients with vocal nodules and their controls. During the leisure condition, there were no significant differences in average noise and voice level nor fundamental frequency between the groups. The patients with vocal nodules and their controls spent significantly more time and used their voices significantly more in high-environmental noise levels. High noise levels during work and demands from the occupation impact vocal behavior. Thus, assessment of voice ergonomics should be part of the work environmental management. To reduce environmental noise levels is important to improve voice ergonomic conditions in communication-intensive and vocally demanding workplaces. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Perceptual adaptation of voice gender discrimination with spectrally shifted vowels.
Li, Tianhao; Fu, Qian-Jie
2011-08-01
To determine whether perceptual adaptation improves voice gender discrimination of spectrally shifted vowels and, if so, which acoustic cues contribute to the improvement. Voice gender discrimination was measured for 10 normal-hearing subjects, during 5 days of adaptation to spectrally shifted vowels, produced by processing the speech of 5 male and 5 female talkers with 16-channel sine-wave vocoders. The subjects were randomly divided into 2 groups; one subjected to 50-Hz, and the other to 200-Hz, temporal envelope cutoff frequencies. No preview or feedback was provided. There was significant adaptation in voice gender discrimination with the 200-Hz cutoff frequency, but significant improvement was observed only for 3 female talkers with F(0) > 180 Hz and 3 male talkers with F(0) < 170 Hz. There was no significant adaptation with the 50-Hz cutoff frequency. Temporal envelope cues are important for voice gender discrimination under spectral shift conditions with perceptual adaptation, but spectral shift may limit the exclusive use of spectral information and/or the use of formant structure on voice gender discrimination. The results have implications for cochlear implant users and for understanding voice gender discrimination.
Perceptual Adaptation of Voice Gender Discrimination with Spectrally Shifted Vowels
Li, Tianhao; Fu, Qian-Jie
2013-01-01
Purpose To determine whether perceptual adaptation improves voice gender discrimination of spectrally shifted vowels and, if so, which acoustic cues contribute to the improvement. Method Voice gender discrimination was measured for 10 normal-hearing subjects, during 5 days of adaptation to spectrally shifted vowels, produced by processing the speech of 5 male and 5 female talkers with 16-channel sine-wave vocoders. The subjects were randomly divided into 2 groups; one subjected to 50-Hz, and the other to 200-Hz, temporal envelope cutoff frequencies. No preview or feedback was provided. Results: There was significant adaptation in voice gender discrimination with the 200-Hz cutoff frequency, but significant improvement was observed only for 3 female talkers with F0 > 180 Hz and 3 male talkers with F0 < 170 Hz. There was no significant adaptation with the 50-Hz cutoff frequency. Conclusions Temporal envelope cues are important for voice gender discrimination under spectral shift conditions with perceptual adaptation, but spectral shift may limit the exclusive use of spectral information and/or the use of formant structure on voice gender discrimination. The results have implications for cochlear implant users and for understanding voice gender discrimination. PMID:21173392
Nilsonne, A; Sundberg, J; Ternström, S; Askenfelt, A
1988-02-01
A method of measuring the rate of change of fundamental frequency has been developed in an effort to find acoustic voice parameters that could be useful in psychiatric research. A minicomputer program was used to extract seven parameters from the fundamental frequency contour of tape-recorded speech samples: (1) the average rate of change of the fundamental frequency and (2) its standard deviation, (3) the absolute rate of fundamental frequency change, (4) the total reading time, (5) the percent pause time of the total reading time, (6) the mean, and (7) the standard deviation of the fundamental frequency distribution. The method is demonstrated on (a) a material consisting of synthetic speech and (b) voice recordings of depressed patients who were examined during depression and after improvement.
McCormick, Michael; Seta, John J
2012-01-01
An attribute framing effect occurs when positive or negative associations produced by positive or negative frames are mapped onto evaluations resulting in a more favourable evaluation for the positively framed attribute. We used a new voice frequency manipulation to differentially enhance right versus left hemisphere processing. In doing so we found a strong attribute framing effect when a speaker with a low-frequency voice enhanced the contextual processing style of the right hemisphere. However, a framing effect was not obtained when a speaker with a high-frequency voice enhanced the inferential/analytical processing style of the left hemisphere. At the theoretical level our results provide evidence that the contextual processing style of the right hemisphere is especially susceptible to associative implications, such as those found in attribute framing manipulations. At the applied level we provide a simple method for altering the effectiveness of persuasion messages.
Voice Tremor in Parkinson's Disease: An Acoustic Study.
Gillivan-Murphy, Patricia; Miller, Nick; Carding, Paul
2018-01-30
Voice tremor associated with Parkinson disease (PD) has not been characterized. Its relationship with voice disability and disease variables is unknown. This study aimed to evaluate voice tremor in people with PD (pwPD) and a matched control group using acoustic analysis, and to examine correlations with voice disability and disease variables. Acoustic voice tremor analysis was completed on 30 pwPD and 28 age-gender matched controls. Voice disability (Voice Handicap Index), and disease variables of disease duration, Activities of Daily Living (Unified Parkinson's Disease Rating Scale [UPDRS II]), and motor symptoms related to PD (UPDRS III) were examined for relationship with voice tremor measures. Voice tremor was detected acoustically in pwPD and controls with similar frequency. PwPD had a statistically significantly higher rate of amplitude tremor (Hz) than controls (P = 0.001). Rate of amplitude tremor was negatively and significantly correlated with UPDRS III total score (rho -0.509). For pwPD, the magnitude and periodicity of acoustic tremor was higher than for controls without statistical significance. The magnitude of frequency tremor (Mftr%) was positively and significantly correlated with disease duration (rho 0.463). PwPD had higher Voice Handicap Index total, functional, emotional, and physical subscale scores than matched controls (P < 0.001). Voice disability did not correlate significantly with acoustic voice tremor measures. Acoustic analysis enhances understanding of PD voice tremor characteristics, its pathophysiology, and its relationship with voice disability and disease symptomatology. Copyright © 2018 The Voice Foundation. All rights reserved.
Freddie Mercury-acoustic analysis of speaking fundamental frequency, vibrato, and subharmonics.
Herbst, Christian T; Hertegard, Stellan; Zangger-Borch, Daniel; Lindestad, Per-Åke
2017-04-01
Freddie Mercury was one of the twentieth century's best-known singers of commercial contemporary music. This study presents an acoustical analysis of his voice production and singing style, based on perceptual and quantitative analysis of publicly available sound recordings. Analysis of six interviews revealed a median speaking fundamental frequency of 117.3 Hz, which is typically found for a baritone voice. Analysis of voice tracks isolated from full band recordings suggested that the singing voice range was 37 semitones within the pitch range of F#2 (about 92.2 Hz) to G5 (about 784 Hz). Evidence for higher phonations up to a fundamental frequency of 1,347 Hz was not deemed reliable. Analysis of 240 sustained notes from 21 a-cappella recordings revealed a surprisingly high mean fundamental frequency modulation rate (vibrato) of 7.0 Hz, reaching the range of vocal tremor. Quantitative analysis utilizing a newly introduced parameter to assess the regularity of vocal vibrato corroborated its perceptually irregular nature, suggesting that vibrato (ir)regularity is a distinctive feature of the singing voice. Imitation of subharmonic phonation samples by a professional rock singer, documented by endoscopic high-speed video at 4,132 frames per second, revealed a 3:1 frequency locked vibratory pattern of vocal folds and ventricular folds.
[Mechanism of neoglottic adjustment for voice variation in tracheoesophageal speech].
Fujimoto, T; Kinishi, M; Mohri, M; Amatsu, M
1994-06-01
Over the past 17 years, we have been performing tracheoesophageal (TE) fistulization for voice restoration following total laryngectomy. The purpose of this technique is to divert the exhaled air through the TE fistula into the hypopharynx where the inferior constrictor muscle forms the retropharyngeal prominence on which the neoglottis is located. It is generally accepted that both pulmonary power and laryngeal adjustment control voice frequency and intensity change in laryngeal phonation. Regularity at various pitches and voice intensities was seen in TE phonation, despite laryngeal adjustment being lost. Regular voice production with various pitches and intensities requires a regulatory mechanism for both pulmonary power and the neoglottis. This study was designed to clarify the mechanism of neoglottic adjustment in TE phonation. Ten speakers with TE fistula were subjected to aerodynamic and electrophysiological investigations. Tracheal pressure, fundamental frequency, intensity, and airflow rate were measured for easy phonation, a high-pitched voice, and a loud voice. Resistance and efficiency of the neoglottis were calculated from the data obtained. Electromyograms of the inferior constrictor muscle and tracheal pressure were simultaneously recorded when the pitch or intensity of the voice increased. Six of the ten subjects examined were able to produce a high-pitched voice. Tracheal pressure increased in all six, the airflow rate in four, and neoglottal resistance in five, as compared with the data obtained during easy phonation. Nine of the ten subjects examined were able to produce a loud voice. In all nine, both tracheal pressure and the airflow rate increased as compared with the values measured during easy phonation. Neoglottal resistance had no definite pattern in relation to voice intensity changes. Electrophysiological study demonstrated that the activity of the inferior constrictor muscle increased as tracheal pressure increased so as to raise the pitch or increase the intensity of the voice. These results indicate that the adjustment of neoglottic closure and stiffness produced by the inferior constrictor muscle has the role of varying the frequency or intensity of the voice.
Formant frequencies in country singers' speech and singing.
Stone, R E; Cleveland, T F; Sundberg, J
1999-06-01
In previous investigations breathing kinematics, subglottal pressures, and voice source characteristics of a group of premier country singers have been analyzed. The present study complements the description of these singers' voice properties by examining the formant frequencies in five of these country singers' spoken and sung versions of the national anthem and of a song of their own choosing. The formant frequencies were measured for identical phonemes under both conditions. Comparisons revealed that the singers used the same or slightly higher formant frequencies when they were singing than when they were speaking. The differences may be related to the higher fundamental frequency in singing. These findings are in good agreement with previous observations regarding breathing, subglottal pressures, and voice source, but are in marked contrast to what has been found for classically trained singers.
Voice Quality and Gender Stereotypes: A Study of Lebanese Women With Reinke's Edema.
Matar, Nayla; Portes, Cristel; Lancia, Leonardo; Legou, Thierry; Baider, Fabienne
2016-12-01
Women with Reinke's edema (RW) report being mistaken for men during telephone conversations. For this reason, their masculine-sounding voices are interesting for the study of gender stereotypes. The study's objective is to verify their complaint and to understand the cues used in gender identification. Using a self-evaluation study, we verified RW's perception of their own voices. We compared the acoustic parameters of vowels produced by 10 RW to those produced by 10 men and 10 women with healthy voices (hereafter referred to as NW) in Lebanese Arabic. We conducted a perception study for the evaluation of RW, healthy men's, and NW voices by naïve listeners. RW self-evaluated their voices as masculine and their gender identities as feminine. The acoustic parameters that distinguish RW from NW voices concern fundamental frequency, spectral slope, harmonicity of the voicing signal, and complexity of the spectral envelope. Naïve listeners very often rate RW as surely masculine. Listeners may rate RW's gender incorrectly. These incorrect gender ratings are correlated with acoustic measures of fundamental frequency and voice quality. Further investigations will reveal the contribution of each of these parameters to gender perception and guide the treatment plan of patients complaining of a gender ambiguous voice.
Maxillary arch dimensions associated with acoustic parameters in prepubertal children.
Hamdan, Abdul-Latif; Khandakji, Mohannad; Macari, Anthony Tannous
2018-04-18
To evaluate the association between maxillary arch dimensions and fundamental frequency and formants of voice in prepubertal subjects. Thirty-five consecutive prepubertal patients seeking orthodontic treatment were recruited (mean age = 11.41 ± 1.46 years; range, 8 to 13.7 years). Participants with a history of respiratory infection, laryngeal manipulation, dysphonia, congenital facial malformations, or history of orthodontic treatment were excluded. Dental measurements included maxillary arch length, perimeter, depth, and width. Voice parameters comprising fundamental frequency (f0_sustained), Habitual pitch (f0_count), Jitter, Shimmer, and different formant frequencies (F1, F2, F3, and F4) were measured using acoustic analysis prior to initiation of any orthodontic treatment. Pearson's correlation coefficients were used to measure the strength of associations between different dental and voice parameters. Multiple linear regressions were computed for the predictions of different dental measurements. Arch width and arch depth had moderate significant negative correlations with f0 ( r = -0.52; P = .001 and r = -0.39; P = .022, respectively) and with habitual frequency ( r = -0.51; P = .0014 and r = -0.34; P = .04, respectively). Arch depth and arch length were significantly correlated with formant F3 and formant F4, respectively. Predictors of arch depth included frequencies of F3 vowels, with a significant regression equation ( P-value < .001; R 2 = 0.49). Similarly, fundamental frequency f0 and frequencies of formant F3 vowels were predictors of arch width, with a significant regression equation ( P-value < .001; R 2 = 0.37). There is a significant association between arch dimensions, particularly arch length and depth, and voice parameters. The formant most predictive of arch depth and width is the third formant, along with fundamental frequency of voice.
Voice Range Profiles of Middle School and High School Choral Directors
ERIC Educational Resources Information Center
Schwartz, Sandra M.
2009-01-01
Vocal demands of teaching are significant, and this challenge is compounded for choral directors who depend on the voice for communicating information or demonstrating music concepts. The purpose of this study is to examine the frequency and intensity of middle and high school choral directors' voices and to compare choral directors' voices with…
Measurements of the Acoustic Speaking Voice After Vocal Warm-up and Cooldown in Choir Singers.
Onofre, Fernanda; Prado, Yuka de Almeida; Rojas, Gleidy Vannesa E; Garcia, Denny Marco; Aguiar-Ricz, Lílian
2017-01-01
The aim of this study was to evaluate the acoustic measurements of the vowel /a/ in modal recording before and after a singing voice resistance test and after 30 minutes of absolute rest in female choir singers. This is a prospective cohort study. A total of 13 soprano choir singers with experience in choir singing were evaluated through analysis of acoustic voice parameters at three points in time: before continuous use of the voice, after vocal warm-up and a singing test 60 minutes in duration respecting the pauses for breathing, and after vocal cooldown and an absolute voice rest for 30 minutes. The fundamental frequency increased after the voice resistance test (P = 0.012) and remained elevated after the 30 minutes of voice rest (P = 0.01). The jitter decreased after the voice resistance test (P = 0.02) and after the 30 minutes of voice rest. A significant difference was detected for the acoustic voice parameters relative average perturbation (RAP), (P = 0.05), and pitch perturbation quotient (PPQ), (P = 0.04), compared with the initial time point. The fundamental frequency increased after 60 minutes of singing and remained elevated after vocal cooldown and absolute rest for 30 minutes, proving an efficient parameter for identifying the changes inherent to voice demand during singing. Copyright © 2017. Published by Elsevier Inc.
The singer's voice range profile: female professional opera soloists.
Lamarche, Anick; Ternström, Sten; Pabon, Peter
2010-07-01
This work concerns the collection of 30 voice range profiles (VRPs) of female operatic voice. We address the questions: Is there a need for a singer's protocol in VRP acquisition? Are physiological measurements sufficient or should the measurement of performance capabilities also be included? Can we address the female singing voice in general or is there a case for categorizing voices when studying phonetographic data? Subjects performed a series of structured tasks involving both standard speech voice protocols and additional singing tasks. Singers also completed an extensive questionnaire. Physiological VRPs differ from performance VRPs. Two new VRP metrics, the voice area above a defined level threshold and the dynamic range independent from the fundamental frequency (F(0)), were found to be useful in the analysis of singer VRPs. Task design had no effect on performance VRP outcomes. Voice category differences were mainly attributable to phonation frequency-based information. Results support the clinical importance of addressing the vocal instrument as it is used in performance. Equally important is the elaboration of a protocol suitable for the singing voice. The given context and instructions can be more important than task design for performance VRPs. Yet, for physiological VRP recordings, task design remains critical. Both types of VRPs are suggested for a singer's voice evaluation. Copyright (c) 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Physiological characteristics of the supported singing voice. A preliminary study.
Griffin, B; Woo, P; Colton, R; Casper, J; Brewer, D
1995-03-01
The purpose of this study was to develop a definition of the supported singing voice based on physiological characteristics by comparing the subjects' concepts of a supported voice with objective measurements of their supported and unsupported voice. This preliminary report presents findings based on data from eight classically trained singers. Subjects answered questions about their concepts of the characteristics of the supported singing voice and how it is produced. Samples of the supported and unsupported singing voice produced at low, medium, and high pitches at a comfortable loudness level were collected for acoustic, spectral, airflow, electroglottographic, air volume, and stroboscopic analyses. Significant differences between the supported and unsupported voice were found for sound pressure level (SPL), peak airflow, subglottal pressure (Ps), glottal open time, and frequency of the fourth formant (F4). Mean flow and F2 frequency differences were sex and pitch related. Males adjusted laryngeal configuration to produce supported voice, whereas glottal configuration differences were greater in females. Breathing patterns were variable and not significantly different between supported and unsupported voice. Subjects in this study believe that the supported singing voice is resonant, clear, and easy to manage and is produced by correct breath management. Results of data analysis show that the supported singing voice has different spectral characteristics from and higher SPL, peak airflow, and Ps than the unsupported voice. Singers adjust laryngeal and/or glottal configuration to account for these changes, but no significant differences in breathing activity were found.
Fundamental frequency, phonation maximum time and vocal complaints in morbidly obese women
de SOUZA, Lourdes Bernadete Rocha; PEREIRA, Rayane Medeiros; dos SANTOS, Marquiony Marques; GODOY, Cynthia Meida de Almeida
2014-01-01
Background Obese people have abnormal deposition of fat in the vocal tract that can interfere with the acoustic voice. Aim To relate the fundamental frequency, the maximum phonation time and voice complaints from a group of morbidly obese women. Methods Observational, cross-sectional and descriptive study that included 44 morbidly obese women, mean age of 42.45 (±10.31) years old, observational group and 30 women without obesity, control group, with 33.79 (±4.51)years old. The voice recording was done in a quiet environment, on a laptop using the program ANAGRAF acoustic analysis of speech sounds. To extract the values of fundamental frequency the subjects were asked to produce vowel [a] at usual intensity for a period in average of three seconds. After the voice recording, participants were prompted to produce sustained vowel [ a] , [ i] and [ u] at usual intensity and height, using a stopwatch to measure the time that each participant could hold each vowel. Results The majority, 31(70.5%), had vocal complaints, with a higher percentage for complaints of vocal fatigue 20(64.51%) and voice failures 19(61.29%) followed by dryness of the throat in 15 (48.38%) and effort to speak 13(41.93%). There was no statistically significant difference regarding the mean fundamental frequency of the voice in both groups, but there was significance between the two groups regarding maximum phonation. Conclusion Increased adipose tissue in the vocal tract interfered in the vocal parameters. PMID:24676298
Nonlinear dynamic mechanism of vocal tremor from voice analysis and model simulations
NASA Astrophysics Data System (ADS)
Zhang, Yu; Jiang, Jack J.
2008-09-01
Nonlinear dynamic analysis and model simulations are used to study the nonlinear dynamic characteristics of vocal folds with vocal tremor, which can typically be characterized by low-frequency modulation and aperiodicity. Tremor voices from patients with disorders such as paresis, Parkinson's disease, hyperfunction, and adductor spasmodic dysphonia show low-dimensional characteristics, differing from random noise. Correlation dimension analysis statistically distinguishes tremor voices from normal voices. Furthermore, a nonlinear tremor model is proposed to study the vibrations of the vocal folds with vocal tremor. Fractal dimensions and positive Lyapunov exponents demonstrate the evidence of chaos in the tremor model, where amplitude and frequency play important roles in governing vocal fold dynamics. Nonlinear dynamic voice analysis and vocal fold modeling may provide a useful set of tools for understanding the dynamic mechanism of vocal tremor in patients with laryngeal diseases.
NASA Astrophysics Data System (ADS)
Rendall, Drew; Kollias, Sophie; Ney, Christina; Lloyd, Peter
2005-02-01
Key voice features-fundamental frequency (F0) and formant frequencies-can vary extensively between individuals. Much of the variation can be traced to differences in the size of the larynx and vocal-tract cavities, but whether these differences in turn simply reflect differences in speaker body size (i.e., neutral vocal allometry) remains unclear. Quantitative analyses were therefore undertaken to test the relationship between speaker body size and voice F0 and formant frequencies for human vowels. To test the taxonomic generality of the relationships, the same analyses were conducted on the vowel-like grunts of baboons, whose phylogenetic proximity to humans and similar vocal production biology and voice acoustic patterns recommend them for such comparative research. For adults of both species, males were larger than females and had lower mean voice F0 and formant frequencies. However, beyond this, F0 variation did not track body-size variation between the sexes in either species, nor within sexes in humans. In humans, formant variation correlated significantly with speaker height but only in males and not in females. Implications for general vocal allometry are discussed as are implications for speech origins theories, and challenges to them, related to laryngeal position and vocal tract length. .
Fundamental voice frequence during normal and abnormal growth, and after androgen treatment.
Vuorenkoski, V; Lenko, H L; Tjernlund, P; Vuorenkoski, L; Perheentupa, J
1978-01-01
A simple treatment was shown to be suitable for clinical measurement of fundamental voice frequency. Basal frequency (SFF) and lowest frequency (LF) were determined in 374 normal subjects aged 6 years to adulthood. SFF fell between ages 8 and 10 years in boys (from 259 to 247 Hz), but not in girls (253 Hz). LF fell between ages 6 and 10 years in boys (from 234 to 203 Hz) and girls (from 230 to 218 Hz), and a sex difference appeared. In puberty, parallel to pubic hair (PH) development, a gradual fall of SFF and LF occurred in both boys (to 100 and 90 Hz, respectively) and girls (to 213 and 180 Hz). As a group, young hypopituitary children and girls with Turner's syndrome had a high SFF, and prepubertal boys with delayed maturation a low SFF. In some children with prenatal growth failure, SFF was abnormally high. The girls with Turner's syndrome exhibited a high, though individually variable, sensitivity of voice to androgen; their voices became lower before the appearance of any other masculinising effects. The instrument is useful for characterisation of growth failure syndromes and stages of puberty. It is particularly recommended for monitoring an undesirable effect on the voice during androgen treatment. Images Fig. 1 p202-b PMID:646429
Deguchi, Shinji; Kawashima, Kazutaka; Washio, Seiichi
2008-12-01
The effect of artificially altered transglottal pressures on the voice fundamental frequency (F0) is known to be associated with vocal fold stiffness. Its measurement, though useful as a potential diagnostic tool for noncontact assessment of vocal fold stiffness, often requires manual and painstaking determination of an unstable F0 of voice. Here, we provide a computer-aided technique that enables one to carry out the determination easily and accurately. Human subjects vocalized in accordance with a series of reference sounds from a speaker controlled by a computer. Transglottal pressures were altered by means of a valve embedded in a mouthpiece. Time-varying vocal F0 was extracted, without manual procedures, from a specific range of the voice spectrum determined on the basis of the controlled reference sounds. The validity of the proposed technique was assessed for 11 healthy subjects. Fluctuating voice F0 was tracked automatically during experiments, providing the relationship between transglottal pressure change and F0 on the computer. The proposed technique overcomes the difficulty in automatic determination of the voice F0, which tends to be transient both in normal voice and in some types of pathological voice.
System Design Plan for a DCS (Defense Communications System) Data Transmission Network.
1981-07-01
modems , FDO group modems , and Voice Frequency Carrier Telegraph (VFCT) networks. The DTN will be a synchronous network and its implementation must coincide...Frequency (VF) modems and Voice Frequency Carrier Telegraph (VFCT) networks. Further, data circuits can be extended over present analog FDM facilities using...VF or group data modems . In addition to the availability of terrestrial and satellite digital transmission facilities, the implementation of the DTN
Podsakoff, Nathan P; Maynes, Timothy D; Whiting, Steven W; Podsakoff, Philip M
2015-07-01
This article reports an investigation into how individuals form perceptions of overall voice behavior in group contexts. More specifically, the authors examine the effect of the proportion of group members exhibiting voice behavior in the group, the frequency of voice events in the group, and the measurement item referent (group vs. individual) on an individual's ratings of group voice behavior. In addition, the authors examine the effect that measurement item referent has on the magnitude of the relationship observed between an individual's ratings of group voice behavior and perceptions of group performance. Consistent with hypotheses, the results from 1 field study (N = 220) and 1 laboratory experiment (N = 366) indicate that: (a) When group referents were used, raters relied on the frequency of voice events (and not the proportion of group members exhibiting voice) to inform their ratings of voice behavior, whereas the opposite was true when individual-referent items were used, and (b) the magnitude of the relationship between observers' ratings of group voice behavior and their perceptions of group performance was higher when raters used group-referent, as opposed to an individual-referent, items. The authors discuss the implications of their findings for scholars interested in studying behavioral phenomena occurring in teams, groups, and work units in organizational behavior research. (c) 2015 APA, all rights reserved).
The Impact of Vocal Hyperfunction on Relative Fundamental Frequency during Voicing Offset and Onset
ERIC Educational Resources Information Center
Stepp, Cara E.; Hillman, Robert E.; Heaton, James T.
2010-01-01
Purpose: This study tested the hypothesis that individuals with vocal hyperfunction would show decreases in relative fundamental frequency (RFF) surrounding a voiceless consonant. Method: This retrospective study of 2 clinical databases used speech samples from 15 control participants and women with hyperfunction-related voice disorders: 82 prior…
Uncertainty quantification of voice signal production mechanical model and experimental updating
NASA Astrophysics Data System (ADS)
Cataldo, E.; Soize, C.; Sampaio, R.
2013-11-01
The aim of this paper is to analyze the uncertainty quantification in a voice production mechanical model and update the probability density function corresponding to the tension parameter using the Bayes method and experimental data. Three parameters are considered uncertain in the voice production mechanical model used: the tension parameter, the neutral glottal area and the subglottal pressure. The tension parameter of the vocal folds is mainly responsible for the changing of the fundamental frequency of a voice signal, generated by a mechanical/mathematical model for producing voiced sounds. The three uncertain parameters are modeled by random variables. The probability density function related to the tension parameter is considered uniform and the probability density functions related to the neutral glottal area and the subglottal pressure are constructed using the Maximum Entropy Principle. The output of the stochastic computational model is the random voice signal and the Monte Carlo method is used to solve the stochastic equations allowing realizations of the random voice signals to be generated. For each realization of the random voice signal, the corresponding realization of the random fundamental frequency is calculated and the prior pdf of this random fundamental frequency is then estimated. Experimental data are available for the fundamental frequency and the posterior probability density function of the random tension parameter is then estimated using the Bayes method. In addition, an application is performed considering a case with a pathology in the vocal folds. The strategy developed here is important mainly due to two things. The first one is related to the possibility of updating the probability density function of a parameter, the tension parameter of the vocal folds, which cannot be measured direct and the second one is related to the construction of the likelihood function. In general, it is predefined using the known pdf. Here, it is constructed in a new and different manner, using the own system considered.
Control of voice fundamental frequency in speaking versus singing
NASA Astrophysics Data System (ADS)
Natke, Ulrich; Donath, Thomas M.; Kalveram, Karl Th.
2003-03-01
In order to investigate control of voice fundamental frequency (F0) in speaking and singing, 24 adults had to utter the nonsense word ['ta:tatas] repeatedly, while in selected trials their auditory feedback was frequency-shifted by 100 cents downwards. In the speaking condition the target speech rate and prosodic pattern were indicated by a rhythmic sequence made of white noise. In the singing condition the sequence consisted of piano notes, and subjects were instructed to match the pitch of the notes. In both conditions a response in voice F0 begins with a latency of about 150 ms. As predicted, response magnitude is greater in the singing condition (66 cents) than in the speaking condition (47 cents). Furthermore the singing condition seems to prolong the after-effect which is a continuation of the response in trials after the frequency shift. In the singing condition, response magnitude and the ability to match the target F0 correlate significantly. Results support the view that in speaking voice F0 is monitored mainly supra-segmentally and controlled less tightly than in singing.
Control of voice fundamental frequency in speaking versus singing.
Natke, Ulrich; Donath, Thomas M; Kalveram, Karl Th
2003-03-01
In order to investigate control of voice fundamental frequency (F0) in speaking and singing, 24 adults had to utter the nonsense word ['ta:tatas] repeatedly, while in selected trials their auditory feedback was frequency-shifted by 100 cents downwards. In the speaking condition the target speech rate and prosodic pattern were indicated by a rhythmic sequence made of white noise. In the singing condition the sequence consisted of piano notes, and subjects were instructed to match the pitch of the notes. In both conditions a response in voice F0 begins with a latency of about 150 ms. As predicted, response magnitude is greater in the singing condition (66 cents) than in the speaking condition (47 cents). Furthermore the singing condition seems to prolong the after-effect which is a continuation of the response in trials after the frequency shift. In the singing condition, response magnitude and the ability to match the target F0 correlate significantly. Results support the view that in speaking voice F0 is monitored mainly supra-segmentally and controlled less tightly than in singing.
Speech waveform perturbation analysis: a perceptual-acoustical comparison of seven measures.
Askenfelt, A G; Hammarberg, B
1986-03-01
The performance of seven acoustic measures of cycle-to-cycle variations (perturbations) in the speech waveform was compared. All measures were calculated automatically and applied on running speech. Three of the measures refer to the frequency of occurrence and severity of waveform perturbations in special selected parts of the speech, identified by means of the rate of change in the fundamental frequency. Three other measures refer to statistical properties of the distribution of the relative frequency differences between adjacent pitch periods. One perturbation measure refers to the percentage of consecutive pitch period differences with alternating signs. The acoustic measures were tested on tape recorded speech samples from 41 voice patients, before and after successful therapy. Scattergrams of acoustic waveform perturbation data versus an average of perceived deviant voice qualities, as rated by voice clinicians, are presented. The perturbation measures were compared with regard to the acoustic-perceptual correlation and their ability to discriminate between normal and pathological voice status. The standard deviation of the distribution of the relative frequency differences was suggested as the most useful acoustic measure of waveform perturbations for clinical applications.
Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach.
Fang, Shih-Hau; Tsao, Yu; Hsiao, Min-Jing; Chen, Ji-Ying; Lai, Ying-Hui; Lin, Feng-Chuan; Wang, Chi-Te
2018-03-19
Computerized detection of voice disorders has attracted considerable academic and clinical interest in the hope of providing an effective screening method for voice diseases before endoscopic confirmation. This study proposes a deep-learning-based approach to detect pathological voice and examines its performance and utility compared with other automatic classification algorithms. This study retrospectively collected 60 normal voice samples and 402 pathological voice samples of 8 common clinical voice disorders in a voice clinic of a tertiary teaching hospital. We extracted Mel frequency cepstral coefficients from 3-second samples of a sustained vowel. The performances of three machine learning algorithms, namely, deep neural network (DNN), support vector machine, and Gaussian mixture model, were evaluated based on a fivefold cross-validation. Collective cases from the voice disorder database of MEEI (Massachusetts Eye and Ear Infirmary) were used to verify the performance of the classification mechanisms. The experimental results demonstrated that DNN outperforms Gaussian mixture model and support vector machine. Its accuracy in detecting voice pathologies reached 94.26% and 90.52% in male and female subjects, based on three representative Mel frequency cepstral coefficient features. When applied to the MEEI database for validation, the DNN also achieved a higher accuracy (99.32%) than the other two classification algorithms. By stacking several layers of neurons with optimized weights, the proposed DNN algorithm can fully utilize the acoustic features and efficiently differentiate between normal and pathological voice samples. Based on this pilot study, future research may proceed to explore more application of DNN from laboratory and clinical perspectives. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Changes After Voice Therapy in Acoustic Voice Analysis of Chinese Patients With Voice Disorders.
Lu, Dan; Chen, Fei; Yang, Hui; Yu, Rong; Zhou, Qi; Zhang, Xinyuan; Ren, Jia; Zheng, Yitao; Zhang, Xiaoyan; Zou, Jian; Wang, Haiyang; Liu, Jun
2018-05-01
This study aimed to evaluate the effects of voice therapy on patients with voice disorders by comparing the acoustic parameter changes before and after treatment. This is a retrospective study. Forty-five female patients with early-stage vocal nodules or polyps, postoperative patients, and patients with chronic laryngitis were divided into three subgroups. Videostroboscopic, acoustic analysis (fundamental frequency, jitter, shimmer, mean harmonics-to-noise ratio), and maximum phonation time (MPT) were measured before and after treatment. Fifty healthy female volunteers were the control group. After treatment, 24.4% of nodules or polyps had decreased in size, 11.1% of patients with chronic laryngitis and postoperative patients had reduced edema, and the mucosal wave of vocal folds had different degrees of recovery in postoperative patients. All acoustic analysis values and MPT in the patient group were statistically worse than in the control group, except for fundamental frequency before treatment (P > 0.05). After treatment, the acoustic analysis and MPT values were improved. However, the jitter, mean harmonics-to-noise ratio, and MPT values in the patient group were still worse after voice therapy than in the control group (P < 0.05). Most of acoustic analysis values can be useful as a complementary tool in diagnosis and assessment of voice disorders; however, it is not recommended to use a single parameter to assess voice quality. Voice therapy can improve voice quality in patients with voice disorders, but a period longer than 8 weeks is recommended for these patients. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Type and severity of pain during phonation in professional voice users and nonvocal professionals.
Van Lierde, Kristiane M; Dijckmans, Joke; Scheffel, Lara; Behlau, Mara
2012-09-01
The purpose of this study was to determine the presence, frequency, and intensity of pain during speaking in professional voice users and nonvocal professionals and to determine if the presence of pain is significantly related with the profile of the professional voice user. Based on the available literature, significantly more pain symptoms in professional voice users can be hypothesized. Sample survey. To characterize the presence, type, and degree of pain symptoms during speaking, a questionnaire was used. Pain severity was measured by means of a numerical rating scale. Fifty-five (176/320) percent of the nonvocal professionals and 84% (698/832) of the professional voice users mentioned the presence of one or more pain symptoms during speaking. Throat pain was mentioned as the most common pain in both the professional and nonvocal professional voice users. The professional voice users showed significantly more throat, neck, shoulder, headache, ear, and back pain. Moreover, the intensity of throat pain was significantly increased in the professional voice users. This study showed evidence that several types of pain are present with significantly greater frequency in professional voice users. Vocal screening strategies, diagnostic, and treatment protocols should include the assessment of the type and severity of pain. Currently, the voice clinic is working on improving the diagnostic protocol with the objective of defining the combination of tests, which best diagnose voice problems and related complaints and which evaluate progress in vocal characteristics and pain after rehabilitation. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Detection of high-frequency energy changes in sustained vowels produced by singers
Monson, Brian B.; Lotto, Andrew J.; Ternström, Sten
2011-01-01
The human voice spectrum above 5 kHz receives little attention. However, there are reasons to believe that this high-frequency energy (HFE) may play a role in perceived quality of voice in singing and speech. To fulfill this role, differences in HFE must first be detectable. To determine human ability to detect differences in HFE, the levels of the 8- and 16-kHz center-frequency octave bands were individually attenuated in sustained vowel sounds produced by singers and presented to listeners. Relatively small changes in HFE were in fact detectable, suggesting that this frequency range potentially contributes to the perception of especially the singing voice. Detection ability was greater in the 8-kHz octave than in the 16-kHz octave and varied with band energy level. PMID:21476681
Mechanics of human voice production and control
Zhang, Zhaoyan
2016-01-01
As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed. PMID:27794319
Mechanics of human voice production and control.
Zhang, Zhaoyan
2016-10-01
As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed.
Functional outcome of vocal fold medialization thyroplasty with a hydroxyapatite implant.
Storck, Claudio; Brockmann, Meike; Schnellmann, Elvira; Stoeckli, Sandro J; Schmid, Stephan
2007-06-01
Unilateral vocal fold paralysis can cause a persistent incomplete glottal closure during phonation, resulting in impaired voice function. The aim of this study was to evaluate functional results of medialization thyroplasty using a hydroxyapatite implant (VoCoM). Prospective observational cohort study. Between 1999 and 2003, a total of 26 patients (19 men, 7 women) undergoing medialization thyroplasty using a hydroxyapatite implant because of unilateral vocal fold paralysis were enrolled in the study. To evaluate voice function, the following parameters were measured preoperatively and postoperatively: mean fundamental frequency, mean sound pressure level, frequency and amplitude range (voice range profile), and maximum phonation time. A perceptual assessment of hoarseness was conducted using the Roughness, Breathiness, Hoarseness scale. Furthermore, the magnitude of voice related impairment of the patient's communication skills was rated on a 7-point scale. A combined parameter called the Voice Dysfunction Index (VDI) was used to rate vocal performance. All patients showed a statistically significant improvement in the VDI, in perceptual voice analysis, in maximum phonation time, and in the dynamic range of voice. One patient experienced a postoperative wound hemorrhage as a minor complication. No further complications or implant extrusions were observed. Medialization thyroplasty using a hydroxyapatite implant is a secure and efficient phonosurgical procedure. Voice quality and patient satisfaction improve significantly after treatment.
Yu, Chengzhu; Hansen, John H L
2017-03-01
Human physiology has evolved to accommodate environmental conditions, including temperature, pressure, and air chemistry unique to Earth. However, the environment in space varies significantly compared to that on Earth and, therefore, variability is expected in astronauts' speech production mechanism. In this study, the variations of astronaut voice characteristics during the NASA Apollo 11 mission are analyzed. Specifically, acoustical features such as fundamental frequency and phoneme formant structure that are closely related to the speech production system are studied. For a further understanding of astronauts' vocal tract spectrum variation in space, a maximum likelihood frequency warping based analysis is proposed to detect the vocal tract spectrum displacement during space conditions. The results from fundamental frequency, formant structure, as well as vocal spectrum displacement indicate that astronauts change their speech production mechanism when in space. Moreover, the experimental results for astronaut voice identification tasks indicate that current speaker recognition solutions are highly vulnerable to astronaut voice production variations in space conditions. Future recommendations from this study suggest that successful applications of speaker recognition during extended space missions require robust speaker modeling techniques that could effectively adapt to voice production variation caused by diverse space conditions.
Adductor spasmodic dysphonia: Relationships between acoustic indices and perceptual judgments
NASA Astrophysics Data System (ADS)
Cannito, Michael P.; Sapienza, Christine M.; Woodson, Gayle; Murry, Thomas
2003-04-01
This study investigated relationships between acoustical indices of spasmodic dysphonia and perceptual scaling judgments of voice attributes made by expert listeners. Audio-recordings of The Rainbow Passage were obtained from thirty one speakers with spasmodic dysphonia before and after a BOTOX injection of the vocal folds. Six temporal acoustic measures were obtained across 15 words excerpted from each reading sample, including both frequency of occurrence and percent time for (1) aperiodic phonation, (2) phonation breaks, and (3) fundamental frequency shifts. Visual analog scaling judgments were also obtained from six voice experts using an interactive computer interface to quantify four voice attributes (i.e., overall quality, roughness, brokenness, breathiness) in a carefully psychoacoustically controlled environment, using the same reading passages as stimuli. Number and percent aperiodicity and phonation breaks correlated significanly with perceived overall voice quality, roughness, and brokenness before and after the BOTOX injection. Breathiness was correlated with aperidocity only prior to injection, while roughness also correlated with frequency shifts following injection. Factor analysis reduced perceived attributes to two principal components: glottal squeezing and breathiness. The acoustic measures demonstrated a strong regression relationship with perceived glottal squeezing, but no regression relationship with breathiness was observed. Implications for an analysis of pathologic voices will be discussed.
Vocal Responses to Perturbations in Voice Auditory Feedback in Individuals with Parkinson's Disease
Liu, Hanjun; Wang, Emily Q.; Metman, Leo Verhagen; Larson, Charles R.
2012-01-01
Background One of the most common symptoms of speech deficits in individuals with Parkinson's disease (PD) is significantly reduced vocal loudness and pitch range. The present study investigated whether abnormal vocalizations in individuals with PD are related to sensory processing of voice auditory feedback. Perturbations in loudness or pitch of voice auditory feedback are known to elicit short latency, compensatory responses in voice amplitude or fundamental frequency. Methodology/Principal Findings Twelve individuals with Parkinson's disease and 13 age- and sex- matched healthy control subjects sustained a vowel sound (/α/) and received unexpected, brief (200 ms) perturbations in voice loudness (±3 or 6 dB) or pitch (±100 cents) auditory feedback. Results showed that, while all subjects produced compensatory responses in their voice amplitude or fundamental frequency, individuals with PD exhibited larger response magnitudes than the control subjects. Furthermore, for loudness-shifted feedback, upward stimuli resulted in shorter response latencies than downward stimuli in the control subjects but not in individuals with PD. Conclusions/Significance The larger response magnitudes in individuals with PD compared with the control subjects suggest that processing of voice auditory feedback is abnormal in PD. Although the precise mechanisms of the voice feedback processing are unknown, results of this study suggest that abnormal voice control in individuals with PD may be related to dysfunctional mechanisms of error detection or correction in sensory feedback processing. PMID:22448258
Acoustic and phonatory characterization of the Fado voice.
Mendes, Ana P; Rodrigues, Aira F; Guerreiro, David Michael
2013-09-01
Fado is a Portuguese musical genre, instrumentally accompanied by a Portuguese and an acoustic guitar. Fado singers' voice is perceptually characterized by a low pitch, hoarse, and strained voice. The present research study sketches the acoustic and phonatory profile of the Fado singers' voice. Fifteen Fado singers produced spoken and sung phonatory tasks. For the spoken voice measures, the maximum phonation time and s/z ratio of Fado singers were near the inefficient physiological threshold. Fundamental frequency was higher than that found in nonsingers and lower than that found in Western Classical singers. Jitter and shimmer mean values were higher compared with nonsingers. Harmonic-to-noise ratio (HNR) was similar to the mean values for nonsingers. For the sung voice, jitter was higher compared with Country, Musical Theater, Soul, Jazz, and Western Classical singers and lower than Pop singers. Shimmer mean values were lower than Country, Musical Theater, Pop, Soul, and Jazz singers and higher than Western Classical singers. HNR was similar for Western Classical singers. Maximum phonational frequency range of Fado singers indicated that male and female subjects had a lower range compared with Western Classical singers. Additionally, Fado singers produced vibrato, but singer's formant was rarely produced. These sung voice characteristics could be related with life habits, less/lack of singing training, or could be just a Fado voice characteristic. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Lopes, Leonardo Wanderley; de Oliveira Florencio, Vanessa; Silva, Priscila Oliveira Costa; da Nóbrega E Ugulino, Ana Celiane; Almeida, Anna Alice
2018-01-04
We aimed to correlate the Vocal Tract Discomfort Scale (VTDS) with the Voice Symptom Scale (VoiSS) for evaluation of patients with dysphonia. In addition, we aimed to compare vocal tract discomfort symptoms in patients with and without self-reported voice problem. This is a descriptive, cross-sectional, and retrospective study. We analyzed 143 women and 62 men with voice disorders, as confirmed by endoscopic larynx examination. All patients completed the VTDS and VoiSS at vocal evaluation. Descriptive statistics and the Spearman correlation test were applied to all variables. The degree of covariance of variables was noted. The Mann-Whitney U test was used to compare the average number of discomfort symptoms among patients with and without self-reported voice problems. A weak to moderate positive correlation was observed between the average number, frequency, and intensity of comfort symptom and the total score, physical domain score, and limitation domain score of the VoiSS. The vocal tract discomfort symptoms and the emotional domain score of the VoiSS were weakly correlated. Patients with self-reported voice problems had a higher number, frequency, and intensity of vocal tract discomfort symptoms. There is correlation between the VTDS and VoiSS scales, with greater references to vocal tract discomfort symptom in patients with self-reported voice problems. Therefore, the discomfort symptoms seem to influence the perception of the impact of a voice problem. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Van Stan, Jarrad H; Mehta, Daryush D; Petit, Robert J; Sternad, Dagmar; Muise, Jason; Burns, James A; Hillman, Robert E
2017-02-01
Ambulatory voice biofeedback (AVB) has the potential to significantly improve voice therapy effectiveness by targeting one of the most challenging aspects of rehabilitation: carryover of desired behaviors outside of the therapy session. Although initial evidence indicates that AVB can alter vocal behavior in daily life, retention of the new behavior after biofeedback has not been demonstrated. Motor learning studies repeatedly have shown retention-related benefits when reducing feedback frequency or providing summary statistics. Therefore, novel AVB settings that are based on these concepts are developed and implemented. The underlying theoretical framework and resultant implementation of innovative AVB settings on a smartphone-based voice monitor are described. A clinical case study demonstrates the functionality of the new relative frequency feedback capabilities. With new technical capabilities, 2 aspects of feedback are directly modifiable for AVB: relative frequency and summary feedback. Although reduced-frequency AVB was associated with improved carryover of a therapeutic vocal behavior (i.e., reduced vocal intensity) in a patient post-excision of vocal fold nodules, causation cannot be assumed. Timing and frequency of AVB schedules can be manipulated to empirically assess generalization of motor learning principles to vocal behavior modification and test the clinical effectiveness of AVB with various feedback schedules.
Mehta, Daryush D.; Petit, Robert J.; Sternad, Dagmar; Muise, Jason; Burns, James A.; Hillman, Robert E.
2017-01-01
Purpose Ambulatory voice biofeedback (AVB) has the potential to significantly improve voice therapy effectiveness by targeting one of the most challenging aspects of rehabilitation: carryover of desired behaviors outside of the therapy session. Although initial evidence indicates that AVB can alter vocal behavior in daily life, retention of the new behavior after biofeedback has not been demonstrated. Motor learning studies repeatedly have shown retention-related benefits when reducing feedback frequency or providing summary statistics. Therefore, novel AVB settings that are based on these concepts are developed and implemented. Method The underlying theoretical framework and resultant implementation of innovative AVB settings on a smartphone-based voice monitor are described. A clinical case study demonstrates the functionality of the new relative frequency feedback capabilities. Results With new technical capabilities, 2 aspects of feedback are directly modifiable for AVB: relative frequency and summary feedback. Although reduced-frequency AVB was associated with improved carryover of a therapeutic vocal behavior (i.e., reduced vocal intensity) in a patient post-excision of vocal fold nodules, causation cannot be assumed. Conclusions Timing and frequency of AVB schedules can be manipulated to empirically assess generalization of motor learning principles to vocal behavior modification and test the clinical effectiveness of AVB with various feedback schedules. PMID:28124070
Artificially intelligent recognition of Arabic speaker using voice print-based local features
NASA Astrophysics Data System (ADS)
Mahmood, Awais; Alsulaiman, Mansour; Muhammad, Ghulam; Akram, Sheeraz
2016-11-01
Local features for any pattern recognition system are based on the information extracted locally. In this paper, a local feature extraction technique was developed. This feature was extracted in the time-frequency plain by taking the moving average on the diagonal directions of the time-frequency plane. This feature captured the time-frequency events producing a unique pattern for each speaker that can be viewed as a voice print of the speaker. Hence, we referred to this technique as voice print-based local feature. The proposed feature was compared to other features including mel-frequency cepstral coefficient (MFCC) for speaker recognition using two different databases. One of the databases used in the comparison is a subset of an LDC database that consisted of two short sentences uttered by 182 speakers. The proposed feature attained 98.35% recognition rate compared to 96.7% for MFCC using the LDC subset.
Clinical voice analysis of Carnatic singers.
Arunachalam, Ravikumar; Boominathan, Prakash; Mahalingam, Shenbagavalli
2014-01-01
Carnatic singing is a classical South Indian style of music that involves rigorous training to produce an "open throated" loud, predominantly low-pitched singing, embedded with vocal nuances in higher pitches. Voice problems in singers are not uncommon. The objective was to report the nature of voice problems and apply a routine protocol to assess the voice. Forty-five trained performing singers (females: 36 and males: 9) who reported to a tertiary care hospital with voice problems underwent voice assessment. The study analyzed their problems and the clinical findings. Voice change, difficulty in singing higher pitches, and voice fatigue were major complaints. Most of the singers suffered laryngopharyngeal reflux that coexisted with muscle tension dysphonia and chronic laryngitis. Speaking voices were rated predominantly as "moderate deviation" on GRBAS (Grade, Rough, Breathy, Asthenia, and Strain). Maximum phonation time ranged from 4 to 29 seconds (females: 10.2, standard deviation [SD]: 5.28 and males: 15.7, SD: 5.79). Singing frequency range was reduced (females: 21.3 Semitones and males: 23.99 Semitones). Dysphonia severity index (DSI) scores ranged from -3.5 to 4.91 (females: 0.075 and males: 0.64). Singing frequency range and DSI did not show significant difference between sex and across clinical diagnosis. Self-perception using voice disorder outcome profile revealed overall severity score of 5.1 (SD: 2.7). Findings are discussed from a clinical intervention perspective. Study highlighted the nature of voice problems (hyperfunctional) and required modifications in assessment protocol for Carnatic singers. Need for regular assessments and vocal hygiene education to maintain good vocal health are emphasized as outcomes. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Two-voice fundamental frequency estimation
NASA Astrophysics Data System (ADS)
de Cheveigné, Alain
2002-05-01
An algorithm is presented that estimates the fundamental frequencies of two concurrent voices or instruments. The algorithm models each voice as a periodic function of time, and jointly estimates both periods by cancellation according to a previously proposed method [de Cheveigné and Kawahara, Speech Commun. 27, 175-185 (1999)]. The new algorithm improves on the old in several respects; it allows an unrestricted search range, effectively avoids harmonic and subharmonic errors, is more accurate (it uses two-dimensional parabolic interpolation), and is computationally less costly. It remains subject to unavoidable errors when periods are in certain simple ratios and the task is inherently ambiguous. The algorithm is evaluated on a small database including speech, singing voice, and instrumental sounds. It can be extended in several ways; to decide the number of voices, to handle amplitude variations, and to estimate more than two voices (at the expense of increased processing cost and decreased reliability). It makes no use of instrument models, learned or otherwise, although it could usefully be combined with such models. [Work supported by the Cognitique programme of the French Ministry of Research and Technology.
Quantitative Analysis of Voice in Parkinson Disease Compared to Motor Performance: A Pilot Study.
Silbergleit, Alice K; LeWitt, Peter A; Peterson, Edward L; Gardner, Glendon M
2015-01-01
Characteristic features of hypokinetic dysarthria develop in Parkinson disease (PD). We hypothesized that quantified acoustic changes of voice might provide a correlate of disease severity. To determine if there are significant differences in acoustic measures of voice between mild and moderate PD; 2) To evaluate correlations between acoustic parameters of voice and subtests of the UPDRS in mild and moderate PD. Twenty six participants with PD underwent vocal acoustic testing while off PD medication, for comparison to 22 healthy controls. Participants with PD were divided into two groups based upon UPDRS activities of daily living (ADL) ratings: summed scores were used to define mild and moderate PD. Participants voiced /i/ ("ee") at comfort, high, and low pitch (3 trials/pitch). The CSpeech Waveform Analysis Program was used to analyze cycle-to-cycle frequency ("jitter") and amplitude ("shimmer") irregularities of the vocal signal, signal-to-noise ratio, and maximum phonation frequency range converted to semitones. Sections of UPDRS scores were correlated to acoustic variables of voice. Key findings included a significant difference between the semitone range of the control subjects and the moderate PD group (p = 0.036). Further analyses revealed significant differences in semitone range for males between the controls vs. mild PD (p = 0.014), and controls vs. moderate PD (p = 0.005). Significant correlations were also found between acoustic findings and both the ADL and motor portions of the UPDRS. Acoustic analysis of voice, particularly frequency range, may provide a quantifiable correlate of disease progression in PD.
2015-01-01
The goal of this study was to analyse perceptually and acoustically the voices of patients with Unilateral Vocal Fold Paralysis (UVFP) and compare them to the voices of normal subjects. These voices were analysed perceptually with the GRBAS scale and acoustically using the following parameters: mean fundamental frequency (F0), standard-deviation of F0, jitter (ppq5), shimmer (apq11), mean harmonics-to-noise ratio (HNR), mean first (F1) and second (F2) formants frequency, and standard-deviation of F1 and F2 frequencies. Statistically significant differences were found in all of the perceptual parameters. Also the jitter, shimmer, HNR, standard-deviation of F0, and standard-deviation of the frequency of F2 were statistically different between groups, for both genders. In the male data differences were also found in F1 and F2 frequencies values and in the standard-deviation of the frequency of F1. This study allowed the documentation of the alterations resulting from UVFP and addressed the exploration of parameters with limited information for this pathology. PMID:26557690
Soul and Musical Theater: A Comparison of Two Vocal Styles.
Hallqvist, Hanna; Lã, Filipa M B; Sundberg, Johan
2017-03-01
The phonatory and resonatory characteristics of nonclassical styles of singing have been rarely analyzed in voice research. Six professional singers volunteered to sing excerpts from two songs pertaining to the musical theater and to the soul styles of singing. Voice source parameters and formant frequencies were analyzed by inverse filtering tones, sung at the same fundamental frequencies in both excerpts. As compared with musical theater, the soul style was characterized by significantly higher subglottal pressure and maximum flow declination rate. Yet sound pressure level was lower, suggesting higher glottal resistance. The differences would be the effects of firmer glottal adduction and a greater frequency separation between the first formant and its closest spectrum partial in soul than in musical theater. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Quantitative evaluation of the voice range profile in patients with voice disorder.
Ikeda, Y; Masuda, T; Manako, H; Yamashita, H; Yamamoto, T; Komiyama, S
1999-01-01
In 1953, Calvet first displayed the fundamental frequency (pitch) and sound pressure level (intensity) of a voice on a two-dimensional plane and created a voice range profile. This profile has been used to evaluate clinically various vocal disorders, although such evaluations to date have been subjective without quantitative assessment. In the present study, a quantitative system was developed to evaluate the voice range profile utilizing a personal computer. The area of the voice range profile was defined as the voice volume. This volume was analyzed in 137 males and 175 females who were treated for various dysphonias at Kyushu University between 1984 and 1990. Ten normal subjects served as controls. The voice volume in cases with voice disorders significantly decreased irrespective of the disease and sex. Furthermore, cases having better improvement after treatment showed a tendency for the voice volume to increase. These findings illustrated the voice volume as a useful clinical test for evaluating voice control in cases with vocal disorders.
Acoustic analysis of voice in children with cleft palate and velopharyngeal insufficiency.
Villafuerte-Gonzalez, Rocio; Valadez-Jimenez, Victor M; Hernandez-Lopez, Xochiquetzal; Ysunza, Pablo Antonio
2015-07-01
Acoustic analysis of voice can provide instrumental data concerning vocal abnormalities. These findings can be used for monitoring clinical course in cases of voice disorders. Cleft palate severely affects the structure of the vocal tract. Hence, voice quality can also be also affected. To study whether the main acoustic parameters of voice, including fundamental frequency, shimmer and jitter are significantly different in patients with a repaired cleft palate, as compared with normal children without speech, language and voice disorders. Fourteen patients with repaired unilateral cleft lip and palate and persistent or residual velopharyngeal insufficiency (VPI) were studied. A control group was assembled with healthy volunteer subjects matched by age and gender. Hypernasality and nasal emission were perceptually assessed in patients with VPI. Size of the gap as assessed by videonasopharyngoscopy was classified in patients with VPI. Acoustic analysis of voice including Fundamental frequency (F0), shimmer and jitter were compared between patients with VPI and control subjects. F0 was significantly higher in male patients as compared with male controls. Shimmer was significantly higher in patients with VPI regardless of gender. Moreover, patients with moderate VPI showed a significantly higher shimmer perturbation, regardless of gender. Although future research regarding voice disorders in patients with VPI is needed, at the present time it seems reasonable to include strategies for voice therapy in the speech and language pathology intervention plan for patients with VPI. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Digital signal processing algorithms for automatic voice recognition
NASA Technical Reports Server (NTRS)
Botros, Nazeih M.
1987-01-01
The current digital signal analysis algorithms are investigated that are implemented in automatic voice recognition algorithms. Automatic voice recognition means, the capability of a computer to recognize and interact with verbal commands. The digital signal is focused on, rather than the linguistic, analysis of speech signal. Several digital signal processing algorithms are available for voice recognition. Some of these algorithms are: Linear Predictive Coding (LPC), Short-time Fourier Analysis, and Cepstrum Analysis. Among these algorithms, the LPC is the most widely used. This algorithm has short execution time and do not require large memory storage. However, it has several limitations due to the assumptions used to develop it. The other 2 algorithms are frequency domain algorithms with not many assumptions, but they are not widely implemented or investigated. However, with the recent advances in the digital technology, namely signal processors, these 2 frequency domain algorithms may be investigated in order to implement them in voice recognition. This research is concerned with real time, microprocessor based recognition algorithms.
Effects of Masking Noise on Laryngeal Resistance for Breathy, Normal, and Pressed Voice
ERIC Educational Resources Information Center
Grillo, Elizabeth U.; Abbott, Katherine Verdolini; Lee, Timothy D.
2010-01-01
Purpose: The purpose of the present study was to explore the effects of masking noise on laryngeal resistance for breathy, normal, and pressed voice in vocally trained women. Method: Eighteen vocally trained women produced breathy, normal, and pressed voice across 7 fundamental frequencies during a repeated CV utterance of /pi/ under normal and…
The Effect of Hydration on the Voice Quality of Future Professional Vocal Performers.
van Wyk, Liezl; Cloete, Mariaan; Hattingh, Danel; van der Linde, Jeannie; Geertsema, Salome
2017-01-01
The application of systemic hydration as an instrument for optimal voice quality has been a common practice by several professional voice users over the years. Although the physiological action has been determined, the benefits on acoustic and perceptual characteristics are relatively unknown. The present study aimed to determine whether systemic hydration has beneficial outcomes on the voice quality of future professional voice users. A within-subject, pretest posttest design is applied to determine quantitative research results of female singing students between 18 and 32 years of age without a history of voice pathology. Acoustic and perceptual data were collected before and after a 2-hour singing rehearsal. The difference between the hypohydrated condition (controlled) and the hydrated condition (experimental) and the relationship between adequate hydration and acoustic and perceptual parameters of voice was then investigated. A statistical significant (P = 0.041) increase in jitter values were obtained for the hypohydrated condition. Increased maximum phonation time (MPT/z/) and higher maximum frequency for hydration indicated further statistical significant changes in voice quality (P = 0.028 and P = 0.015, respectively). Systemic hydration has positive outcomes on perceptual and acoustic parameters of voice quality for future professional singers. The singer's ability to sustain notes for longer and reach higher frequencies may reflect well in performances. Any positive change in voice quality may benefit the singer's occupational success and subsequently their social, emotional, and vocational well-being. More research evidence is needed to determine the parameters for implementing adequate hydration in vocal hygiene programs. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Vocal Tract Discomfort and Voice-Related Quality of Life in Wind Instrumentalists.
Cappellaro, Juliane; Beber, Bárbara Costa
2018-05-01
This study aimed to investigate vocal tract discomfort and quality of life in the voice of wind instrumentalists. It is a cross-sectional study. The sample was composed of 37 musicians of the orchestra of Caxias do Sul city, RS, Brazil. The participants answered a nonstandard questionnaire about demographic and professional information, the Voice-Related Quality of Life (V-RQOL), the Vocal Tract Discomfort (VTD) scale, and additional items about fatigue after playing the instrument and pain in the cervical muscles. Correlation analyses were performed using Spearman correlation test. The most frequent symptoms mentioned by musicians in the VTD, for both frequency and intensity of occurrence, were dryness, ache, irritability, and cervical muscle pain, in addition to the frequency of occurrence of fatigue after playing. The musicians showed high scores in the V-RQOL survey. Several symptoms evaluated by the VTD had a negative correlation with the musicians' years of orchestra membership and with V-RQOL scores. Symptoms of vocal tract discomfort are present in wind instrumentalists in low frequency and intensity of occurrence. However, these symptoms affect the musicians' voice-related quality of life, and they occur more in musicians with fewer years of orchestra membership. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Hacki, T
1996-01-01
The Voice Range Profile (VRP) measurement offers a method for the investigation of voice modalities i.e. speaking voice, shouting voice and singing voice in their mutual pitch and intensity relations. The parameters FO and SPL are evaluated by means of automatic pitch and SPL measurements from (1) sustained phonation /a:/ in the speaker's natural pitch and intensity range, (2) the continuous speaking voice beginning with Pianissimo up to Fortissimo, (3) the shouting voice. Vocal intensity is plotted vertically, vocal pitch horizontally. The displays of the vocal intensity versus fundamental frequency are defined as singing voice range profile (VRP), speaking VRP and shouting VRP. The VRPs are superimposed on the same plot. Their form, their shape and their position to each other are analysed. The physiological relationships between the VRPs of the different voice modalities to each other are defined. The pathological relationships between the VRPs (i.e. reduction, shifting) give information about etiology and pathomechanism of voice disorders.
D'haeseleer, Evelien; Claeys, Sofie; Bettens, Kim; Leemans, Laura; Van Calster, Ann-Sophie; Van Damme, Nina; Thijs, Zoë; Daelman, Julie; Leyns, Clara; Van Lierde, Kristiane
2017-07-01
The purpose of this study was to measure the objective and subjective vocal quality in women aged between 60 and 75 years. Secondly, the impact of a teaching or singing career on the vocal quality was investigated by comparing the vocal quality of retired women with different careers. This is a case-control study. Seventy-three retired women between 60 and 75 years (mean age: 67 years, standard deviation: 4.49) participated in the study and were divided into three groups: women with a teaching career (n = 21), choir singers with a singing career (n = 12), and women with a non-vocal career (n = 40). All subjects underwent the same assessment protocol consisting of objective (aerodynamic, maximum performance, vocal range, acoustic measurements, and the Dysphonia Severity Index) and subjective (the Voice Handicap Index, auditory-perceptual evaluations by three listeners) voice measurements. In all three groups, objective and perceptual voice analysis showed a mild dysphonia. No differences in the Dysphonia Severity Index were found between the three groups. The voices of choir singers with a singing career were perceived significantly less rough than voices of the women with a non-vocal career. Additionally, the lowest frequency of the frequency range was significantly lower in the retired teachers and choir singers than in the controls. The results of this study prudently suggest that a singing or a teaching career compared with a non-vocal career has a positive impact on the vocal frequency range, and that singing has a positive impact on the perceptual vocal quality of the older female voice. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Herzel, Hanspeter; Reuter, Robert
1996-06-01
Irregularities in voiced speech are often observed as a consequence of vocal fold lesions, paralyses, and other pathological conditions. Many of these instabilities are related to the intrinsic nonlinearities in the vibrations of the vocal folds. In this paper, a specific nonlinear phenomenon is discussed: The appearance of two independent fundamental frequencies termed biphonation. Several narrow-band spectrograms are presented showing biphonation in signals from voice patients, a newborn cry, a singer, and excised larynx experiments. Finally, possible physiological mechanisms of instabilities of the voice source are discussed.
Park, Kyihwan; Choi, Dongyoub; Ozer, Abdullah; Kim, Sangyoo; Lee, Yongkwan; Joo, Dongik
2008-06-01
We develop a four-mount active vibration isolation system (AVIS) using voice coil actuators. The flexible body modes in the upper plate of the AVIS can cause an instability problem due to control signal whose frequency is close to the resonant frequency of the flexible modes. The loop shaping technique is applied to reduce the amplitude of the control signal. We investigate the performances of the active vibration isolation system proposed in the word in the time domain and frequency domain by comparing to the passive isolation system.
A Novel Fast and Secure Approach for Voice Encryption Based on DNA Computing
NASA Astrophysics Data System (ADS)
Kakaei Kate, Hamidreza; Razmara, Jafar; Isazadeh, Ayaz
2018-06-01
Today, in the world of information communication, voice information has a particular importance. One way to preserve voice data from attacks is voice encryption. The encryption algorithms use various techniques such as hashing, chaotic, mixing, and many others. In this paper, an algorithm is proposed for voice encryption based on three different schemes to increase flexibility and strength of the algorithm. The proposed algorithm uses an innovative encoding scheme, the DNA encryption technique and a permutation function to provide a secure and fast solution for voice encryption. The algorithm is evaluated based on various measures including signal to noise ratio, peak signal to noise ratio, correlation coefficient, signal similarity and signal frequency content. The results demonstrate applicability of the proposed method in secure and fast encryption of voice files
Casado, Juan C; O'Connor, Carlos; Angulo, María S; Adrián, José A
2016-01-01
With the development of new ENT techniques, many male transsexuals who wish to become women usually request a surgical procedure to raise the fundamental frequency of the voice (feminization). The ENT specialist and the voice-therapist have to use an interdisciplinary approach to this growing social demand. The aim of this study was to show the results in a group of transsexual patients after Wendler's anterior synechiae, with additional voice-therapy treatment. Ten male transexulas who wish to become women patients who had Wendler glottoplasty and voice-therapy were assessed. The surgical procedure consisted of a de-epithelialization of the anterior third of both vocal folds; this area was sutured and the surface of both vocal folds was vaporised with laser diode. Pre- and postsurgery voice assessment consisted of measuring fundamental frequency (Fo) and maximum phonation time, administering the transgender self-assessment questionnaire (TSEQ) and obtaining perceptual voice assessment by inter-rater agreement. All the male transsexuals who wish to become women patients significantly increased their Fo (106 Hz on average) after the treatment. Furthermore, significant improvements were shown in self-reported satisfaction and in the degree of voice feminization. No improvements in the maximum phonation time were observed. Wendler glottoplasty is a surgical procedure to contribute to feminising the voice, with good medium-term results and without noteworthy medical complications. The increase in vocal tone was observed using several pre- and post-surgery control measures and voice therapy. Copyright © 2014 Elsevier España, S.L.U. and Sociedad Española de Otorrinolaringología y Patología Cérvico-Facial. All rights reserved.
A Pitch Extraction Method with High Frequency Resolution for Singing Evaluation
NASA Astrophysics Data System (ADS)
Takeuchi, Hideyo; Hoguro, Masahiro; Umezaki, Taizo
This paper proposes a pitch estimation method suitable for singing evaluation incorporable in KARAOKE machines. Professional singers and musicians have sharp hearing for music and singing voice. They recognize that singer's voice pitch is “a little off key” or “be in tune”. In the same way, the pitch estimation method that has high frequency resolution is necessary in order to evaluate singing. This paper proposes a pitch estimation method with high frequency resolution utilizing harmonic characteristic of autocorrelation function. The proposed method can estimate a fundamental frequency in the range 50 ∼ 1700[Hz] with resolution less than 3.6 cents in light processing.
Mechanism of and Threshold Biomechanical Conditions for Falsetto Voice Onset
Deguchi, Shinji
2011-01-01
The sound source of a voice is produced by the self-excited oscillation of the vocal folds. In modal voice production, a drastic increase in transglottal pressure after vocal fold closure works as a driving force that develops self-excitation. Another type of vocal fold oscillation with less pronounced glottal closure observed in falsetto voice production has been accounted for by the mucosal wave theory. The classical theory assumes a quasi-steady flow, and the expected driving force onto the vocal folds under wavelike motion is derived from the Bernoulli effect. However, wavelike motion is not always observed during falsetto voice production. More importantly, the application of the quasi-steady assumption to a falsetto voice with a fundamental frequency of several hundred hertz is unsupported by experiments. These considerations suggested that the mechanism of falsetto voice onset may be essentially different from that explained by the mucosal wave theory. In this paper, an alternative mechanism is submitted that explains how self-excitation reminiscent of the falsetto voice could be produced independent of the glottal closure and wavelike motion. This new explanation is derived through analytical procedures by employing only general unsteady equations of motion for flow and solids. The analysis demonstrated that a convective acceleration of a flow induced by rapid wall movement functions as a negative damping force, leading to the self-excitation of the vocal folds. The critical subglottal pressure and volume flow are expressed as functions of vocal fold biomechanical properties, geometry, and voice fundamental frequency. The analytically derived conditions are qualitatively and quantitatively reasonable in view of reported measurement data of the thresholds required for falsetto voice onset. Understanding of the voice onset mechanism and the explicit mathematical descriptions of thresholds would be beneficial for the diagnosis and treatment of voice diseases and the development of artificial vocal folds. PMID:21408178
Hunter, Eric J.; Titze, Ingo R.
2012-01-01
Purpose This study creates a more concise picture of the vocal demands placed on teachers by comparing occupational voice use with non-occupational voice use. Methods The National Center for Voice and Speech voice dosimetry databank was used to calculate voicing percentage per hour, as well as average dB SPL and F0. Occupational voice use (9am-3 PM, weekdays) and non-occupational voice use (4 PM-10 PM, weekends) were compared (57 teachers, two weeks each). Results Five key findings were uncovered: [1] similar to previous studies, occupational voicing percentage per hour is more than twice that of non-occupational; [2] teachers experienced a wide range of occupational voicing percentages per hour (30±11%/hr); [3] average occupational voice was about 1 dB SPL louder than the non-occupational voice and remained constant throughout the day; [4] occupational voice exhibited an increased pitch and trended upward throughout the day; [5] some apparent gender differences were shown. Conclusions Data regarding voicing percentages, F0 and dB SPL provide critical insight into teachers’ vocal health. Further, because non-occupational voice use is added to an already overloaded voice, it may add key insights into recovery patterns, and should be the focus of future studies. PMID:20689046
Ilomaki, Irma; Laukkanen, Anne-Maria; Leppanen, Kirsti; Vilkman, Erkki
2008-01-01
Voice education programs may help in optimizing teachers' voice use. This study compared effects of voice training (VT) and voice hygiene lecture (VHL) in 60 randomly assigned female teachers. All 60 attended the lecture, and 30 completed a short training course in addition. Text reading was recorded in working environments and analyzed for fundamental frequency (F0), equivalent sound level (Leq), alpha ratio, jitter, shimmer, and perceptual quality. Self-reports of vocal well-being were registered. In the VHL group, increased F0 and difficulty of phonation and in the VT group decreased perturbation, increased alpha ratio, easier phonation, and improved perceptual and self-reported voice quality were found. Both groups equally self-reported increase of voice care knowledge. Results seem to indicate improved vocal well-being after training.
Mazzetto de Menezes, Keyla S; Master, Suely; Guzman, Marco; Bortnem, Cori; Ramos, Luiz Roberto
2014-01-01
The present study aimed to compare elderly and young female voices in habitual and high intensity. The effect of increased intensity on the acoustic and perceptual parameters was assessed. Sound pressure level, fundamental frequency, jitter, shimmer, and harmonic to noise ratio were obtained at habitual and high intensity voice in a group of 30 elderly women and 30 young women. Perceptual assessment was also performed. Both groups demonstrated an increase in sound pressure level and fundamental frequency from habitual voice to high intensity voice. No differences were found between groups in any acoustic variables on samples recorded with habitual intensity level. No significant differences between groups were found in habitual intensity level for pitch, hoarseness, roughness, and breathiness. Asthenia and instability obtained significant higher values in elderly than young participants, whereas, the elderly demonstrated lower values for perceived tension and loudness than young subjects. Acoustic and perceptual measures do not demonstrate evident differences between elderly and young speakers in habitual intensity level. The parameters analyzed may lack the sensitivity necessary to detect differences in subjects with normal voices. Phonation with high intensity highlights differences between groups, especially in perceptual parameters. Therefore, high intensity should be included to compare elderly and young voice. Copyright © 2013 Elsevier España, S.L. All rights reserved.
Van Stan, Jarrad H.; Mehta, Daryush D.; Zeitels, Steven M.; Burns, James A.; Barbu, Anca M.; Hillman, Robert E.
2015-01-01
Objectives Clinical management of phonotraumatic vocal fold lesions (nodules, polyps) is based largely on assumptions that abnormalities in habitual levels of sound pressure level (SPL), fundamental frequency (f0), and/or amount of voice use play a major role in lesion development and chronic persistence. This study used ambulatory voice monitoring to evaluate if significant differences in voice use exist between patients with phonotraumatic lesions and normal matched controls. Methods Subjects were 70 adult females: 35 with vocal fold nodules or polyps and 35 age-, sex-, and occupation-matched normal individuals. Weeklong summary statistics of voice use were computed from anterior neck surface acceleration recorded using a smartphone-based ambulatory voice monitor. Results Paired t-tests and Kolmogorov-Smirnov tests resulted in no statistically significant differences between patients and matched controls regarding average measures of SPL, f0, vocal dose measures, and voicing/voice rest periods. Paired t-tests comparing f0 variability between the groups resulted in statistically significant differences with moderate effect sizes. Conclusions Individuals with phonotraumatic lesions did not exhibit differences in average ambulatory measures of vocal behavior when compared with matched controls. More refined characterizations of underlying phonatory mechanisms and other potentially contributing causes are warranted to better understand risk factors associated with phonotraumatic lesions. PMID:26024911
Fu, Qian-Jie; Chinchilla, Sherol; Galvin, John J
2004-09-01
The present study investigated the relative importance of temporal and spectral cues in voice gender discrimination and vowel recognition by normal-hearing subjects listening to an acoustic simulation of cochlear implant speech processing and by cochlear implant users. In the simulation, the number of speech processing channels ranged from 4 to 32, thereby varying the spectral resolution; the cutoff frequencies of the channels' envelope filters ranged from 20 to 320 Hz, thereby manipulating the available temporal cues. For normal-hearing subjects, results showed that both voice gender discrimination and vowel recognition scores improved as the number of spectral channels was increased. When only 4 spectral channels were available, voice gender discrimination significantly improved as the envelope filter cutoff frequency was increased from 20 to 320 Hz. For all spectral conditions, increasing the amount of temporal information had no significant effect on vowel recognition. Both voice gender discrimination and vowel recognition scores were highly variable among implant users. The performance of cochlear implant listeners was similar to that of normal-hearing subjects listening to comparable speech processing (4-8 spectral channels). The results suggest that both spectral and temporal cues contribute to voice gender discrimination and that temporal cues are especially important for cochlear implant users to identify the voice gender when there is reduced spectral resolution.
NASA Astrophysics Data System (ADS)
Ghoraani, Behnaz; Krishnan, Sridhar
2009-12-01
The number of people affected by speech problems is increasing as the modern world places increasing demands on the human voice via mobile telephones, voice recognition software, and interpersonal verbal communications. In this paper, we propose a novel methodology for automatic pattern classification of pathological voices. The main contribution of this paper is extraction of meaningful and unique features using Adaptive time-frequency distribution (TFD) and nonnegative matrix factorization (NMF). We construct Adaptive TFD as an effective signal analysis domain to dynamically track the nonstationarity in the speech and utilize NMF as a matrix decomposition (MD) technique to quantify the constructed TFD. The proposed method extracts meaningful and unique features from the joint TFD of the speech, and automatically identifies and measures the abnormality of the signal. Depending on the abnormality measure of each signal, we classify the signal into normal or pathological. The proposed method is applied on the Massachusetts Eye and Ear Infirmary (MEEI) voice disorders database which consists of 161 pathological and 51 normal speakers, and an overall classification accuracy of 98.6% was achieved.
2009-03-23
Multitalker speech perception with ideal time-frequency segregation: Effects of voice characteristics and number of talkers Douglas S. Brungarta Air...INTRODUCTION Speech perception in multitalker listening environments is limited by two very different types of masking. The first is energetic...06 MAR 2009 2. REPORT TYPE 3. DATES COVERED 00-00-2009 to 00-00-2009 4. TITLE AND SUBTITLE Multitalker speech perception with ideal time
Drew, R; Sapir, S
1995-06-01
Nineteen trained soprano singers aged 18-30 years vocalized tasks designed to assess average speaking fundamental frequency (SFF) during spontaneous speaking and reading. Vocal range and perceptual characteristics while singing with low intensity and high frequency were also assessed, and subjects completed a survey of vocal habits/symptoms. Recorded signals were digitized prior to being analyzed for SFF using the Kay Computerized Speech Lab program. Subjects were assigned to a normal voice or impaired voice group based on ratings of perceptual tasks and survey results. Data analysis showed group differences in mean SFF, no differences in vocal range, higher mean SFF values for reading than speaking, and 58% ability to perceive speaking in low pitch. The role of speaking in too low pitch as causal for vocal symptoms and need for voice classification differentiation in vocal performance studies are discussed.
[Voice assessment and demographic data of applicants for a school of speech therapists].
Reiter, R; Brosch, S
2008-05-01
Demographic data, subjective und objective voice analysis as well as self-assessment of voice quality from applicants for a school of speech therapists were investigated. Demographic data from 116 applicants were collected and their voice quality assessed by three independent judges. An objective evaluation was done by maximum phonation time, average fundamental frequency, dynamic range and percent of jitter and shimmer by means of Goettinger Hoarseness diagram. Self-assessment of voice quality was done by "voice handicap index questionnaire". The twenty successful applicants had a physiological voice in 95 %, they were all musical and had university entrance qualifications. Subjective voice assessment showed in 16 % of the applicants a hoarse voice. In this subgroup an unphysiological vocal use was observed in 72 % and a reduced articulation in 45 %. The objective voice parameters did not show a significant difference between the 3 groups. Self-assessment of the voice was inconspicuous in all applicants. Applicants with general qualification for university entrance, musicality and a physiological voice were more likely to be successful. There were main differences between self assessment of voice and quantitative analysis or subjective assessment by three independent judges.
Connections between voice ergonomic risk factors in classrooms and teachers' voice production.
Rantala, Leena M; Hakala, Suvi; Holmqvist, Sofia; Sala, Eeva
2012-01-01
The aim of the study was to investigate if voice ergonomic risk factors in classrooms correlated with acoustic parameters of teachers' voice production. The voice ergonomic risk factors in the fields of working culture, working postures and indoor air quality were assessed in 40 classrooms using the Voice Ergonomic Assessment in Work Environment - Handbook and Checklist. Teachers (32 females, 8 males) from the above-mentioned classrooms recorded text readings before and after a working day. Fundamental frequency, sound pressure level (SPL) and the slope of the spectrum (alpha ratio) were analyzed. The higher the number of the risk factors in the classrooms, the higher SPL the teachers used and the more strained the males' voices (increased alpha ratio) were. The SPL was already higher before the working day in the teachers with higher risk than in those with lower risk. In the working environment with many voice ergonomic risk factors, speakers increase voice loudness and use more strained voice quality (males). A practical implication of the results is that voice ergonomic assessments are needed in schools. Copyright © 2013 S. Karger AG, Basel.
Effect on LTAS of vocal loudness variation.
Nordenberg, Maria; Sundberg, Johan
2004-01-01
Long-term-average spectrum (LTAS) is an efficient method for voice analysis, revealing both voice source and formant characteristics. However, the LTAS contour is non-uniformly affected by vocal loudness. This variation was analyzed in 15 male and 16 female untrained voices reading a text 7 times at different degrees of vocal loudness, mean change in overall equivalent sound level (Leq) amounting to 27.9 dB and 28.4 dB for the female and male subjects. For all frequency values up to 4 kHz, spectrum level was strongly and linearly correlated with Leq for each subject. The gain factor, that is to say, the rate of level increase, varied with frequency, from about 0.5 at low frequencies to about 1.5 in the frequency range 1.5-3 kHz. Using the gain factors for a subject, LTAS contours could be predicted at any Leq within the measured range, with an average accuracy of 2-3 dB below 4 kHz. Mean LTAS calculated for an Leq of 70 dB for each subject showed considerable individual variation for both males and females, SD of the level varying between 7 dB and 4 dB depending on frequency. On the other hand, the results also suggest that meaningful comparisons of LTAS, recorded for example before and after voice therapy, can be made, provided that the documentation includes a set of recordings at different loudness levels from one recording session.
NASA Astrophysics Data System (ADS)
Goad, Pamela Joy
The fusion of musical voices is an important aspect of musical blend, or the mixing of individual sounds. Yet, little research has been done to explicitly determine the factors involved in fusion. In this study, the similarity of timbre and modulation were examined for their contribution to the fusion of sounds. It is hypothesized that similar timbres will fuse better than dissimilar timbres, and, voices with the same kind of modulation will fuse better than voices of different modulations. A perceptually-based measure, known as sharpness was investigated as a measure of timbre. The advantages of using sharpness are that it is based on hearing sensitivities and masking phenomena of inner ear processing. Five musical instrument families were digitally recorded in performances across a typical playing range at two extreme dynamic levels. Analyses reveal that sharpness is capable of uncovering subtle changes in timbre including those found in musical dynamics, instrument design, and performer-specific variations. While these analyses alone are insufficient to address fusion, preliminary calculations of timbral combinations indicate that sharpness has the potential to predict the fusion of sounds used in musical composition. Three experiments investigated the effects of modulation on the fusion of a harmonic major sixth interval. In the first experiment using frequency modulation, stimuli varied in deviation about a mean fundamental frequency and relative modulation phase between the two tones. Results showed smaller frequency deviations promoted fusion and relative phase differences had a minimal effect. In a second experiment using amplitude modulation, stimuli varied in deviation about a mean amplitude level and relative phase of modulation. Results showed smaller amplitude deviations promoted better fusion, but unlike frequency modulation, relative phase differences were also important. In a third experiment, frequency modulation, amplitude modulation and mixed modulation were arranged in all possible voicings. Results showed frequency modulation in the lower voice and less variance in amplitude envelopes contributed to an increase in fusion. The theory that similar modulations would promote better fusion was only marginally supported. For these experiments, results revealed differences depending on modulation type and that a lesser amount of modulation fosters greater fusion.
Personal and Professional Characteristics of Music Educators: One Size Does Not Fit All.
Doherty, Mary Lynn; van Mersbergen, Miriam
2017-01-01
The prevalence of voice disorders among various educator groups is well known, and voice disorders among music educators are higher than the general classroom educators. Music educators vary with respect to behavioral and personality factors, personal characteristics, type of music taught, job-specific environment, and governmental professional expectations. This study aims to identify risk factors for voice disorders in a heterogeneous population of music educators. An online survey was conducted with 213 respondents. Survey questions addressed demographics, level of education, years of music teaching experience, specialty training, primary teaching assignments and instrument, vocal health behaviors, and diagnoses of voice disorders. Summary statistics and group comparisons are reported. Those whose primary instrument was voice reported a greater frequency of voice disorders. Female and older music educators also had a higher prevalence of voice disorders. Music educators are a heterogeneous group of individuals who require more careful consideration in the prevention and treatment of occupational voice problems. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Tutorial and Guidelines on Measurement of Sound Pressure Level in Voice and Speech.
Švec, Jan G; Granqvist, Svante
2018-03-15
Sound pressure level (SPL) measurement of voice and speech is often considered a trivial matter, but the measured levels are often reported incorrectly or incompletely, making them difficult to compare among various studies. This article aims at explaining the fundamental principles behind these measurements and providing guidelines to improve their accuracy and reproducibility. Basic information is put together from standards, technical, voice and speech literature, and practical experience of the authors and is explained for nontechnical readers. Variation of SPL with distance, sound level meters and their accuracy, frequency and time weightings, and background noise topics are reviewed. Several calibration procedures for SPL measurements are described for stand-mounted and head-mounted microphones. SPL of voice and speech should be reported together with the mouth-to-microphone distance so that the levels can be related to vocal power. Sound level measurement settings (i.e., frequency weighting and time weighting/averaging) should always be specified. Classified sound level meters should be used to assure measurement accuracy. Head-mounted microphones placed at the proximity of the mouth improve signal-to-noise ratio and can be taken advantage of for voice SPL measurements when calibrated. Background noise levels should be reported besides the sound levels of voice and speech.
Niebudek-Bogusz, Ewa; Sliwińska-Kowalska, Mariola
2006-01-01
An assessment of the vocal system, as a part of the medical certification of occupational diseases, should be objective and reliable. Therefore, interest in the method of acoustic voice analysis enabling objective assessment of voice parameters is still growing. The aim of the present study was to evaluate the applicability of acoustic analysis with vocal loading test to the diagnostics of occupational voice disorders. The results of acoustic voice analysis were compared using IRIS software for phoniatrics, before and after a 30-min vocal loading test in 35 female teachers with diagnosed occupational voice disorders (group I) and in 31 female teachers with functional dysphonia (group II). In group I, vocal effort produced significant abnormalities in voice acoustic parameters, compared to group II. These included significantly increased mean fundamental frequency (Fo) value (by 11 Hz) and worsened jitter, shimmer and NHR parameters. Also, the percentage of subjects showing abnormalities in voice acoustic analysis was higher in this group. Conducting voice acoustic analysis before and after the vocal loading test makes it possible to objectively confirm irreversible voice impairments in persons with work-related pathologies of the larynx, which is essential for medical certification of occupational voice diseases.
An adaptive narrow band frequency modulation voice communication system
NASA Technical Reports Server (NTRS)
Wishna, S.
1972-01-01
A narrow band frequency modulation communication system is described which provides for the reception of good quality voice at low carrier-to-noise ratios. The high level of performance is obtained by designing a limiter and phase lock loop combination as a demodulator, so that the bandwidth of the phase lock loop decreases as the carrier level decreases. The system was built for the position location and aircraft communication equipment experiment of the ATS 6 program.
Acoustic and Auditory Perception Effects of the Voice Therapy Technique Finger Kazoo in Adult Women.
Christmann, Mara Keli; Cielo, Carla Aparecida
2017-05-01
This study aimed to verify and to correlate acoustic and auditory-perceptual measures of glottic source after the performance of finger kazoo (FK) technique. This is an experimental, cross-sectional, and qualitative study. We made an analysis of the vowel [a:] in 46 adult women with neither vocal complaints nor laryngeal alterations, through the Multi-Dimensional Voice Program Advanced and RASATI scale, before and immediately after performing three series of FK and 5 minutes after a period of silence. Kappa, Friedman, Wilcoxon, and Spearman tests were used. We found significant increase in fundamental frequency, reduction of amplitude variation, and degree of sub-harmonics immediately after performing FK. Positive correlations were measures of frequency and its perturbation, measures of amplitude, of soft phonation index, of degree and number of unvoiced segments with aspects of RASATI. Negative correlations were voice turbulence index, measures of frequency and its perturbation, and measures of soft phonation index with aspects of RASATI. There was fundamental frequency increase, within normal limits, and reduction of acoustic measures related to presence of noise and instability. In general, acoustic measures, suggestive of noise and instability, were reduced according to the decrease of perceptive-auditory aspects of vocal alteration. It shows that both instruments are complementary and that the acoustic vocal effect was positive. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Turner, Nick; Tucker, Sean; Kelloway, E Kevin
2015-06-01
The present study examines the self-reported frequency of non-lost work time workplace injuries ("microaccidents") and the frequency of three types of work-related safety behaviors (i.e., safety voice, safety compliance, and safety neglect) recalled over a four-week period. We analyzed data on microaccidents and safety behaviors from 19,547 young workers (aged 15-25years, Mdn=18years; 55% male) from multiple Canadian provinces. Approximately one-third of all young workers recalled experiencing at least one microaccident at work in the last four weeks. Comparisons across three age groups revealed that younger workers, particularly between the ages of 15-18, reported more frequent microaccidents, less safety voice, less safety compliance, and more safety neglect than workers aged 19-22. This pattern of results also held for comparisons between workers in 19-22 and 23-25 age groups, except for safety voice which did not differ between these two older age groups. In terms of gender, males and females reported the same frequency of microaccidents, but males reported more safety voice, more safety compliance, and more safety neglect than females did. The results and limitations of the present study are discussed. Frequency of microaccidents and safety behavior vary among young worker age sub-groups. Copyright © 2015 Elsevier Ltd. and National Safety Council. Published by Elsevier Ltd. All rights reserved.
The influence of pitch and loudness changes on the acoustics of vocal tremor.
Dromey, Christopher; Warrick, Paul; Irish, Jonathan
2002-10-01
The effect of tremor on phonation is to modulate an otherwise steady sound source in its amplitude, fundamental frequency, or both. The severity of untreated vocal tremor has been reported to change under certain conditions that may be related to muscle tension. In order to better understand the phenomenon of vocal tremor, its acoustic properties were examined as individuals volitionally altered their pitch and loudness. These voice conditions were anticipated to alter the tension of the intrinsic laryngeal muscles. The voices of 10 individuals with a diagnosis of vocal tremor were recorded before participating in a longitudinal treatment study. They produced vowels at low and high pitch and loudness levels as well as in a comfortable voice condition. Acoustic analyses quantified the amplitude and frequency modulations of the speakers' voices across the various conditions. Individual speakers varied in the way the pitch and loudness changes affected their tremor, but the following statistically significant effects for the speakers as a group were observed: Higher pitch phonation was associated with a more rapid rate for both amplitude and frequency modulations. Amplitude modulation become faster for louder phonation. Low-pitched phonotion led to decreases in the extent of amplitude tremor. Varying pitch led to dramatic changes in the phase relationship between amplitude and frequency modulation in some of the speakers, whereas this effect was not apparent in other speakers.
Smartphones Offer New Opportunities in Clinical Voice Research.
Manfredi, C; Lebacq, J; Cantarella, G; Schoentgen, J; Orlandi, S; Bandini, A; DeJonckere, P H
2017-01-01
Smartphone technology provides new opportunities for recording standardized voice samples of patients and sending the files by e-mail to the voice laboratory. This drastically improves the collection of baseline data, as used in research on efficiency of voice treatments. However, the basic requirement is the suitability of smartphones for recording and digitizing pathologic voices (mainly characterized by period perturbations and noise) without significant distortion. In this experiment, two smartphones (a very inexpensive one and a high-level one) were tested and compared with direct microphone recordings in a soundproof room. The voice stimuli consisted in synthesized deviant voice samples (median of fundamental frequency: 120 and 200 Hz) with three levels of jitter and three levels of added noise. All voice samples were analyzed using PRAAT software. The results show high correlations between jitter, shimmer, and noise-to-harmonics ratio measured on the recordings via both smartphones, the microphone, and measured directly on the sound files from the synthesizer. Smartphones thus appear adequate for reliable recording and digitizing of pathologic voices. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
[Acoustic voice analysis using the Praat program: comparative study with the Dr. Speech program].
Núñez Batalla, Faustino; González Márquez, Rocío; Peláez González, M Belén; González Laborda, Irene; Fernández Fernández, María; Morato Galán, Marta
2014-01-01
The European Laryngological Society (ELS) basic protocol for functional assessment of voice pathology includes 5 different approaches: perception, videostroboscopy, acoustics, aerodynamics and subjective rating by the patient. In this study we focused on acoustic voice analysis. The purpose of the present study was to correlate the results obtained by the commercial software Dr. Speech and the free software Praat in 2 fields: 1. Narrow-band spectrogram (the presence of noise according to Yanagihara, and the presence of subharmonics) (semi-quantitative). 2. Voice acoustic parameters (jitter, shimmer, harmonics-to-noise ratio, fundamental frequency) (quantitative). We studied a total of 99 voice samples from individuals with Reinke's oedema diagnosed using videostroboscopy. One independent observer used Dr. Speech 3.0 and a second one used the Praat program (Phonetic Sciences, University of Amsterdam). The spectrographic analysis consisted of obtaining a narrow-band spectrogram from the previous digitalised voice samples by the 2 independent observers. They then determined the presence of noise in the spectrogram, using the Yanagihara grades, as well as the presence of subharmonics. As a final result, the acoustic parameters of jitter, shimmer, harmonics-to-noise ratio and fundamental frequency were obtained from the 2 acoustic analysis programs. The results indicated that the sound spectrogram and the numerical values obtained for shimmer and jitter were similar for both computer programs, even though types 1, 2 and 3 voice samples were analysed. The Praat and Dr. Speech programs provide similar results in the acoustic analysis of pathological voices. Copyright © 2013 Elsevier España, S.L. All rights reserved.
Sex hormones and the female voice.
Abitbol, J; Abitbol, P; Abitbol, B
1999-09-01
In the following, the authors examine the relationship between hormonal climate and the female voice through discussion of hormonal biochemistry and physiology and informal reporting on a study of 197 women with either premenstrual or menopausal voice syndrome. These facts are placed in a larger historical and cultural context, which is inextricably bound to the understanding of the female voice. The female voice evolves from childhood to menopause, under the varied influences of estrogens, progesterone, and testosterone. These hormones are the dominant factor in determining voice changes throughout life. For example, a woman's voice always develops masculine characteristics after an injection of testosterone. Such a change is irreversible. Conversely, male castrati had feminine voices because they lacked the physiologic changes associated with testosterone. The vocal instrument is comprised of the vibratory body, the respiratory power source and the oropharyngeal resonating chambers. Voice is characterized by its intensity, frequency, and harmonics. The harmonics are hormonally dependent. This is illustrated by the changes that occur during male and female puberty: In the female, the impact of estrogens at puberty, in concert with progesterone, produces the characteristics of the female voice, with a fundamental frequency one third lower than that of a child. In the male, androgens released at puberty are responsible for the male vocal frequency, an octave lower than that of a child. Premenstrual vocal syndrome is characterized by vocal fatigue, decreased range, a loss of power and loss of certain harmonics. The syndrome usually starts some 4-5 days before menstruation in some 33% of women. Vocal professionals are particularly affected. Dynamic vocal exploration by televideoendoscopy shows congestion, microvarices, edema of the posterior third of the vocal folds and a loss of its vibratory amplitude. The authors studied 97 premenstrual women who were prescribed a treatment of multivitamins, venous tone stimulants (phlebotonics), and anti-edematous drugs. We obtained symptomatic improvement in 84 patients. The menopausal vocal syndrome is characterized by lowered vocal intensity, vocal fatigue, a decreased range with loss of the high tones and a loss of vocal quality. In a study of 100 menopausal women, 17 presented with a menopausal vocal syndrome. To rehabilitate their voices, and thus their professional lives, patients were prescribed hormone replacement therapy and multi-vitamins. All 97 women showed signs of vocal muscle atrophy, reduction in the thickness of the mucosa and reduced mobility in the cricoarytenoid joint. Multi-factorial therapy (hormone replacement therapy and multi-vitamins) has to be individually adjusted to each case depending on body type, vocal needs, and other factors.
Satellite voice broadcase system study. Volume 1: Executive summary
NASA Technical Reports Server (NTRS)
Horstein, M.
1985-01-01
The feasibility of providing Voice of America (VOA) broadcasts by satellite relay was investigated. Satellite voice broadcast systems are described for three different frequency bands: HF, FHV, and L-band. Geostationary satellite configurations are considered for both frequency bands. A system of subsynchronous, circular satellites with an orbit period of 8 hours was developed for the HF band. The VHF broadcasts are provided by a system of Molniya satellites. The satellite designs are limited in size and weight to the capability of the STS/Centaur launch vehicle combination. At L-band, only four geostationary satellites are needed to meet the requirements of the complete broadcast schedule. These satellites are comparable in size and weight to current satellites designed for the direct broadcast of video program material.
Predicting mutational change in the speaking voice of boys.
Fuchs, Michael; Fröehlich, Matthias; Hentschel, Bettina; Stuermer, Ingo W; Kruse, Eberhard; Knauft, Daniel
2007-03-01
The authors investigated whether acoustic speaking voice analyses can be used to predict the beginning of mutation in 21 male members of a professional boys' choir. Over a period of 3 years before mutation, children were examined every 3 months by ear, nose, and throat (ENT) and phoniatric specialists. At the same time, the voice was evaluated acoustically using analysis features of the Goettingen Hoarseness Diagram (GHD). Irregularity component and noise component, jitter, shimmer, mean waveform correlation coefficient, and fundamental frequency were determined from recordings of the speaking voice. Significant changes of acoustic features appeared 7 and 5 months before mutation onset, which indicates that vocal function is already restricted 6 months before mutation onset. This acoustic voice analysis is therefore suitable to support the care of the professional singing voice.
Mares Prefer the Voices of Highly Fertile Stallions
Lemasson, Alban; Remeuf, Kévin; Trabalon, Marie; Cuir, Frédérique; Hausberger, Martine
2015-01-01
We investigated the possibility that stallion whinnies, known to encode caller size, also encoded information about caller arousal and fertility, and the reactions of mares in relation to type of voice. Voice acoustic features are correlated with arousal and reproduction success, the lower-pitched the stallion’s voice, the slower his heart beat and the higher his fertility. Females from three study groups preferred playbacks of low-pitched voices. Hence, females are attracted by frequencies encoding for large male size, calmness and high fertility. More work is needed to explore the relative importance of morpho-physiological features. Assortative mating may be involved as large females preferred voices of larger stallions. Our study contributes to basic and applied ongoing research on mammal reproduction, and questions the mechanisms used by females to detect males’ fertility. PMID:25714814
Kunduk, Melda; Vansant, Mathew B; Ikuma, Takeshi; McWhorter, Andrew
2017-03-01
This study investigated the effect of menstrual cycle on vocal fold vibratory characteristics in young women using high-speed digital imaging. This study examined the menstrual phase effect on five objective high-speed imaging parameters and two self-rated perceptual parameters. The effects of oral birth control use were also investigated. Thirteen subjects with no prior voice complaints were included in this study. All data were collected at three different time periods (premenses, postmenses, ovulation) over the course of one menstrual cycle. For five of the 13 subjects, data were collected for two consecutive cycles. Six of 13 subjects were oral birth control users. From high-speed imaging data, five objective parameters were computed: fundamental frequency, fundamental frequency deviation, harmonics-to-noise ratio, harmonic richness factor, and ratio of first and second harmonics. They were supplemented by two self-rated parameters: Reflux Severity Index and perceptual voice quality rating. Analysis included mixed model linear analysis with repeated measures. Results indicated no significant main effects for menstrual phase, between-cycle, or birth control use in the analysis for mean fundamental frequency, fundamental frequency deviation, harmonics-to-noise ratio, harmonic richness factor, first and second harmonics, Reflux Severity Index, and perceptual voice quality rating. Additionally, there were no interaction effects. Hormone fluctuations observed across the menstrual cycle do not appear to have direct effect on vocal fold vibratory characteristics in young women with no voice concerns. Birth control use, on the other hand, may have influence on spectral richness of vocal fold vibration. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Darawsheh, Wesam B; Natour, Yaser S; Sada, Eve G
2018-07-01
This pilot study aimed to evaluate the internal consistency, convergent construct validity and criterion validity of Arabic version of the Vocal Tract Discomfort Scale (VTDS), and to investigate the correlation between the scores of the VTDS, the VHI and the acoustic measures of fundamental frequency (F0), shimmer, jitter and signal-to-noise ratio (SNR). A cross-sectional study where 97 participants participated (47 males and 50 females) (mean age 20.5 ± 2.1 years) (31 student singers and 66 other non-professional voice user students). Participants were without self-perceived voice disorders who completed the VTDS-Arab scale and the Voice Handicap Index (VHI-Arab), and recorded a vocal sample of/a:/at a comfortable level. A positive internal consistency that signifies reliability was confirmed by Cronbach's α = .884 and 0.874 for the VTDS-Arab frequency and severity subscales, respectively. A moderate positive correlation was found between the VTDS-Arab (frequency, severity, total) and the VHI-Arab total where values of Pearson's correlation coefficient were r= 0.459, 0.430 and 0.451, respectively. Weak correlations were found between all of the acoustic measures and the scores of the VTDS-Arab and VHI-Arab (total and subscales). The area under curve for the VTDS was AUC= 0.824, 0.804 and 0.817 for the VTDS frequency, VTDS severity and VTDS total, respectively. The VTDS-Arab is a valid and reliable tool in measuring vocal tract sensations and predicting the perception of vocal handicap in student singers and can be used to predict the vocal load among professional voice users.
Lundeborg, Inger; Hultcrantz, Elisabeth; Ericsson, Elisabeth; McAllister, Anita
2012-07-01
To evaluate outcome of two types of tonsil surgery (tonsillectomy [TE]+adenoidectomy or tonsillotomy [TT]+adenoidectomy) on vocal function perceptually and acoustically. Sixty-seven children, aged 50-65 months, on waiting list for tonsil surgery were randomized to TE (n=33) or TT (n=34). Fifty-seven age- and gender-matched healthy preschool children were controls. Twenty-eight of them, aged 48-59 months, served as control group before surgery, and 29, aged 60-71 months, served as control group after surgery. Before surgery and 6 months postoperatively, the children were recorded producing three sustained vowels (/ɑ/, /u/, and /i/) and 14 words. The control groups were recorded only once. Three trained speech and language pathologists performed the perceptual analysis using visual analog scale for eight voice quality parameters. Acoustic analysis from sustained vowels included average fundamental frequency, jitter percent, shimmer percent, noise-to-harmonic ratio, and the center frequencies of formants 1-3. Before surgery, the children were rated to have more hyponasality and compressed/throaty voice (P<0.05) and lower mean pitch (P<0.01) in comparison to the control group. They also had higher perturbation measures and lower frequencies of the second and third formants. After surgery, there were no differences perceptually. Perturbation measures decreased but were still higher compared with those of control group (P<0.05). Differences in formant frequencies for /i/ and /u/ remained. No differences were found between the two surgical methods. Voice quality is affected perceptually and acoustically by adenotonsillar hypertrophy. After surgery, the voice is perceptually normalized but acoustic differences remain. Outcome was equal for both surgical methods. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
The effect of voice communications latency in high density, communications-intensive airspace.
DOT National Transportation Integrated Search
2003-01-01
The Federal Aviation Administration (FAA) Next Generation Air-Ground Communications program plans to replace aging analog radio equipment with the Very High Frequency Digital Link Mode 3 (VDL3) system. VDL3 will implement both digital voice and data ...
Bauer, Jay J; Mittal, Jay; Larson, Charles R; Hain, Timothy C
2006-04-01
The present study tested whether subjects respond to unanticipated short perturbations in voice loudness feedback with compensatory responses in voice amplitude. The role of stimulus magnitude (+/- 1,3 vs 6 dB SPL), stimulus direction (up vs down), and the ongoing voice amplitude level (normal vs soft) were compared across compensations. Subjects responded to perturbations in voice loudness feedback with a compensatory change in voice amplitude 76% of the time. Mean latency of amplitude compensation was 157 ms. Mean response magnitudes were smallest for 1-dB stimulus perturbations (0.75 dB) and greatest for 6-dB conditions (0.98 dB). However, expressed as gain, responses for 1-dB perturbations were largest and almost approached 1.0. Response magnitudes were larger for the soft voice amplitude condition compared to the normal voice amplitude condition. A mathematical model of the audio-vocal system captured the main features of the compensations. Previous research has demonstrated that subjects can respond to an unanticipated perturbation in voice pitch feedback with an automatic compensatory response in voice fundamental frequency. Data from the present study suggest that voice loudness feedback can be used in a similar manner to monitor and stabilize voice amplitude around a desired loudness level.
Mobile Communication Devices, Ambient Noise, and Acoustic Voice Measures.
Maryn, Youri; Ysenbaert, Femke; Zarowski, Andrzej; Vanspauwen, Robby
2017-03-01
The ability to move with mobile communication devices (MCDs; ie, smartphones and tablet computers) may induce differences in microphone-to-mouth positioning and use in noise-packed environments, and thus influence reliability of acoustic voice measurements. This study investigated differences in various acoustic voice measures between six recording equipments in backgrounds with low and increasing noise levels. One chain of continuous speech and sustained vowel from 50 subjects with voice disorders (all separated by silence intervals) was radiated and re-recorded in an anechoic chamber with five MCDs and one high-quality recording system. These recordings were acquired in one condition without ambient noise and in four conditions with increased ambient noise. A total of 10 acoustic voice markers were obtained in the program Praat. Differences between MCDs and noise condition were assessed with Friedman repeated-measures test and posthoc Wilcoxon signed-rank tests, both for related samples, after Bonferroni correction. (1) Except median fundamental frequency and seven nonsignificant differences, MCD samples have significantly higher acoustic markers than clinical reference samples in minimal environmental noise. (2) Except median fundamental frequency, jitter local, and jitter rap, all acoustic measures on samples recorded with the reference system experienced significant influence from room noise levels. Fundamental frequency is resistant to recording system, environmental noise, and their combination. All other measures, however, were impacted by both recording system and noise condition, and especially by their combination, often already in the reference/baseline condition without added ambient noise. Caution is therefore warranted regarding implementation of MCDs as clinical recording tools, particularly when applied for treatment outcomes assessments. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Voice quality change in future professional voice users after 9 months of voice training.
Timmermans, Bernadette; De Bodt, Marc; Wuyts, Floris; Van de Heyning, Paul
2004-01-01
Sixty-eight students of a school for audiovisual communication participated in this study. A part of them, 49 students, received voice training for 9 months (the trained group); 19 subjects received no specific voice training (the untrained group). A multidimensional test battery containing the GRBAS scale, videolaryngostroboscopy, Maximum Phonation Time (MPT), jitter, lowest intensity (IL), highest frequency (FoH), Dysphonia Severity Index (DSI) and Voice Handicap Index (VHI) was applied before and after training to evaluate training outcome. The voice training is made up of technical workshops in small groups (five to eight subjects) and vocal coaching in the ateliers. In the technical workshops, basic skills are trained (posture, breathing technique, articulation and diction), and in the ateliers, the speech and language pathologist assists the subjects in the practice of their voice work. This study revealed a significant amelioration over time for the objective measurements [Dysphonia Severity Index: from 2.3 to 4.5 ( P<0.001)] and the self-evaluation [Voice Handicap Index, from 23 to 18.4 ( P=0.016)] for the trained group only. This outcome favors the systematic introduction of voice training during the schooling of professional voice users.
Acoustic markers to differentiate gender in prepubescent children's speaking and singing voice.
Guzman, Marco; Muñoz, Daniel; Vivero, Martin; Marín, Natalia; Ramírez, Mirta; Rivera, María Trinidad; Vidal, Carla; Gerhard, Julia; González, Catalina
2014-10-01
Investigation sought to determine whether there is any acoustic variable to objectively differentiate gender in children with normal voices. A total of 30 children, 15 boys and 15 girls, with perceptually normal voices were examined. They were between 7 and 10 years old (mean: 8.1, SD: 0.7 years). Subjects were required to perform the following phonatory tasks: (1) to phonate sustained vowels [a:], [i:], [u:], (2) to read a phonetically balanced text, and (3) to sing a song. Acoustic analysis included long-term average spectrum (LTAS), fundamental frequency (F0), speaking fundamental frequency (SFF), equivalent continuous sound level (Leq), linear predictive code (LPC) to obtain formant frequencies, perturbation measures, harmonic to noise ratio (HNR), and Cepstral peak prominence (CPP). Auditory perceptual analysis was performed by four blinded judges to determine gender. No significant gender-related differences were found for most acoustic variables. Perceptual assessment showed good intra and inter rater reliability for gender. Cepstrum for [a:], alpha ratio in text, shimmer for [i:], F3 in [a:], and F3 in [i:], were the parameters that composed the multivariate logistic regression model to best differentiate male and female children's voices. Since perceptual assessment reliably detected gender, it is likely that other acoustic markers (not evaluated in the present study) are able to make clearer gender differences. For example, gender-specific patterns of intonation may be a more accurate feature for differentiating gender in children's voices. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Lee, Shao-Hsuan; Fang, Tuan-Jen; Yu, Jen-Fang; Lee, Guo-She
2017-09-01
Auditory feedback can make reflexive responses on sustained vocalizations. Among them, the middle-frequency power of F0 (MFP) may provide a sensitive index to access the subtle changes in different auditory feedback conditions. Phonatory airflow temperature was obtained from 20 healthy adults at two vocal intensity ranges under four auditory feedback conditions: (1) natural auditory feedback (NO); (2) binaural speech noise masking (SN); (3) bone-conducted feedback of self-generated voice (BAF); and (4) SN and BAF simultaneously. The modulations of F0 in low-frequency (0.2 Hz-3 Hz), middle-frequency (3 Hz-8 Hz), and high-frequency (8 Hz-25 Hz) bands were acquired using power spectral analysis of F0. Acoustic and aerodynamic analyses were used to acquire vocal intensity, maximum phonation time (MPT), phonatory airflow, and MFP-based vocal efficiency (MBVE). SN and high vocal intensity decreased MFP and raised MBVE and MPT significantly. BAF showed no effect on MFP but significantly lowered MBVE. Moreover, BAF significantly increased the perception of voice feedback and the sensation of vocal effort. Altered auditory feedback significantly changed the middle-frequency modulations of F0. MFP and MBVE could well detect these subtle responses of audio-vocal feedback. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Vocal warm-up and breathing training for teachers: randomized clinical trial
Pereira, Lílian Paternostro de Pina; Masson, Maria Lúcia Vaz; Carvalho, Fernando Martins
2015-01-01
OBJECTIVE To compare the effectiveness of two speech therapy interventions, vocal warm-up and breathing training, focusing on teachers’ voice quality. METHODS A single-blind, randomized, parallel clinical trial was conducted. The research included 31 20 to 60-year old teachers from a public school in Salvador, BA, Northeasatern Brazil, with minimum workloads of 20 hours a week, who have or have not reported having vocal alterations. The exclusion criteria were the following: being a smoker, excessive alcohol consumption, receiving additional speech therapy assistance while taking part in the study, being affected by upper respiratory tract infections, professional use of the voice in another activity, neurological disorders, and history of cardiopulmonary pathologies. The subjects were distributed through simple randomization in groups vocal warm-up (n = 14) and breathing training (n = 17). The teachers’ voice quality was subjectively evaluated through the Voice Handicap Index (Índice de Desvantagem Vocal, in the Brazilian version) and computerized voice analysis (average fundamental frequency, jitter, shimmer, noise, and glottal-to-noise excitation ratio) by speech therapists. RESULTS Before the interventions, the groups were similar regarding sociodemographic characteristics, teaching activities, and vocal quality. The variations before and after the intervention in self-assessment and acoustic voice indicators have not significantly differed between the groups. In the comparison between groups before and after the six-week interventions, significant reductions in the Voice Handicap Index of subjects in both groups were observed, as wells as reduced average fundamental frequencies in the vocal warm-up group and increased shimmer in the breathing training group. Subjects from the vocal warm-up group reported speaking more easily and having their voices more improved in a general way as compared to the breathing training group. CONCLUSIONS Both interventions were similar regarding their effects on the teachers’ voice quality. However, each contribution has individually contributed to improve the teachers’ voice quality, especially the vocal warm-up. PMID:26465664
Schneider, Berit; Zumtobel, Michaela; Prettenhofer, Walter; Aichstill, Birgitta; Jocher, Werner
2010-03-01
Only limited data on normal vocal constitution and vocal capabilities in school-aged children are available. To take better care of children's voices, it might be helpful to know voice ranges and limits of not only vocally trained but also vocally untrained children. Goal of this study was the evaluation of singing voice capabilities of vocally healthy children with different social and vocal/musical backgrounds using voice range profile measurements (VRP). VRP percentiles that reflect constitutional aspects were suggested. In this cross-sectional study, 186 children (aged between seven and 10 years), attending five schools, were included. VRP measurements were performed under field conditions. Interviews and questionnaires regarding vocal strain and vocal training were applied; the answers were used for classification of singing activity and vocal training (KLASAK). All children reached a mean singing voice range of at least two octaves. By using the answers of interviews and questionnaires, the children could be classified according to vocal strain and vocal training. The groups showed no significant differences regarding VRP measurements. In the following step, percentiles were calculated. Twenty-five percent of all children (P25) reached a minimum voice range of almost two octaves, namely, 22 semitones (ST) from 220 to 784 Hz with soft and loud singing. Half of the children (P50) had a voice range of 24 ST (2 octaves), while soft singing and a larger voice range of 26 ST while loud singing. The measurements of third quartile (P75) revealed that 25% of children have even a larger voice range than 29 dB (from 196 Hz/g to 1047 Hz/c3) and can sing at most frequencies louder than 90 dB. P90 demonstrated that 10% of the children can sing even lower or higher than the frequency range between 196 Hz/g and 1319 Hz/e3 analyzed. The voice range seems not to be constrained by social but by voice/musical background: children of vocally/musically encouraged schools had wider voice ranges. This underlines the necessity of regular singing lessons already in primary schools. The percentile VRP introduced might help to evaluate the vocal constitution and vocal capabilities of a child. Copyright (c) 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Leino, Timo
2009-11-01
Voice quality has mainly been studied in trained speakers, singers, and dysphonic patients. Few studies have concerned ordinary untrained university students' voices. In light of earlier studies of professional voice users, it was hypothesized that good, poor, and intermediate voices would be distinguishable on the basis of long-term average spectrum characteristics. In the present study, voice quality of 50 Finnish vocally untrained male university students was studied perceptually and using long-term average spectrum analysis of text reading samples of one minute duration. Equivalent sound level (Leq) of text reading was also measured. According to the results, the good and ordinary voices differed from the poor ones in their relatively higher sound level in the frequency range of 1-3 kHz and a prominent peak at 3-4 kHz. Good voices, however, did not differ from the ordinary voices in terms of the characteristics of the long-term average spectrum (LTAS). The strength of the peak at 3-4 kHz and the voice-quality scores correlated weakly but significantly. Voice quality and alpha ratio (level difference above and below 1 kHz) correlated likewise. Leq was significantly higher in the students with good and ordinary voices than in those with poor voices. The connections between Leq, voice quality, and the formation of the peak at 3-4 kHz warrant further studies.
Changes of the speaking and singing voice after thyroid or parathyroid surgery.
Musholt, Thomas J; Musholt, Petra B; Garm, Jens; Napiontek, Ulrike; Keilmann, Annerose
2006-12-01
While permanent dysphonia is a rare complication of thyroid or parathyroid surgery, postoperative changes of the speaking and/or singing voice often remain unrecognized. In a prospective 4-arm study, vocal fold videolaryngostroboscopy and functional assessment of pre- and postoperative vocal performance was used to evaluate voice disturbances in 120 patients undergoing extended cervical surgery and in 19 patients with limited interventions for thyroid and/or parathyroid pathology. Impairments, especially of the singing voice, were predominantly observed after extended endocrine neck surgery. In women, the highest pitch of the singing voice (HPS) dropped from 651 Hz to 563 Hz (E5 to Csharp5, P < .001). In men, the HPS decreased to a lesser extent (423 Hz to 374 Hz, (Gsharp4 to Fsharp4, P = .009). Covariant analysis of influencing factors revealed the preoperative maximum frequency range and the HPS as predictors of the postoperative voice outcome. While alterations of the speaking voice after thyroid and parathyroid surgery usually remain subclinical, transient changes of the singing voice will matter to voice professionals.
Roy, Nelson; Merrill, Ray M; Thibeault, Susan; Gray, Steven D; Smith, Elaine M
2004-06-01
To examine the frequency and adverse effects of voice disorders on job performance and attendance in teachers and the general population, 2,401 participants from Iowa and Utah (n1 = 1,243 teachers and n2 = 1,279 nonteachers) were randomly selected and were interviewed by telephone using a voice disorder questionnaire. Teachers were significantly more likely than nonteachers to have experienced multiple voice symptoms and signs including hoarseness, discomfort, and increased effort while using their voice, tiring or experiencing a change in voice quality after short use, difficulty projecting their voice, trouble speaking or singing softly, and a loss of their singing range (all odds ratios [ORs] p <.05). Furthermore, teachers consistently attributed these voice symptoms to their occupation and were significantly more likely to indicate that their voice limited their ability to perform certain tasks at work, and had reduced activities or interactions as a result. Teachers, as compared with nonteachers, had missed more workdays over the preceding year because of voice problems and were more likely to consider changing occupations because of their voice (all comparisons p <.05). These findings strongly suggest that occupationally related voice dysfunction in teachers can have significant adverse effects on job performance, attendance, and future career choices.
Voice responses to changes in pitch of voice or tone auditory feedback
NASA Astrophysics Data System (ADS)
Sivasankar, Mahalakshmi; Bauer, Jay J.; Babu, Tara; Larson, Charles R.
2005-02-01
The present study was undertaken to examine if a subject's voice F0 responded not only to perturbations in pitch of voice feedback but also to changes in pitch of a side tone presented congruent with voice feedback. Small magnitude brief duration perturbations in pitch of voice or tone auditory feedback were randomly introduced during sustained vowel phonations. Results demonstrated a higher rate and larger magnitude of voice F0 responses to changes in pitch of the voice compared with a triangular-shaped tone (experiment 1) or a pure tone (experiment 2). However, response latencies did not differ across voice or tone conditions. Data suggest that subjects responded to the change in F0 rather than harmonic frequencies of auditory feedback because voice F0 response prevalence, magnitude, or latency did not statistically differ across triangular-shaped tone or pure-tone feedback. Results indicate the audio-vocal system is sensitive to the change in pitch of a variety of sounds, which may represent a flexible system capable of adapting to changes in the subject's voice. However, lower prevalence and smaller responses to tone pitch-shifted signals suggest that the audio-vocal system may resist changes to the pitch of other environmental sounds when voice feedback is present. .
Baker, F; Wigram, T; Gold, C
2005-07-01
To examine changes in the relationship between intonation, voice range and mood following music therapy programmes in people with traumatic brain injury. Data from four case studies were pooled and effect size, ANOVA and correlation calculations were performed to evaluate the effectiveness of treatment. Subjects sang three self-selected songs for 15 sessions. Speaking fundamental frequency, fundamental frequency variability, slope, voice range and mood were analysed pre- and post-session. Immediate treatment effects were not found. Long-term improvements in affective intonation were found in three subjects, especially in fundamental frequency. Voice range improved over time and was positively correlated with the three intonation components. Mood scale data showed that immediate effects were in the negative direction whereas there weres increases in positive mood state in the longer-term. Findings suggest that, in the long-term, song singing can improve vocal range and mood and enhance the affective intonation styles of people with TBI.
A hybrid voice/data modulation for the VHF aeronautical channels
NASA Technical Reports Server (NTRS)
Akos, Dennis M.
1993-01-01
A method of improving the spectral efficiency of the existing Very High Frequency (VHF) Amplitude Modulation (AM) voice communication channels is proposed. The technique is to phase modulate the existing voice amplitude modulated carrier with digital data. This allows the transmission of digital information over an existing AM voice channel with no change to the existing AM signal format. There is no modification to the existing AM receiver to demodulate the voice signal and an additional receiver module can be added for processing of the digital data. The existing VHF AM transmitter requires only a slight modification for the addition of the digital data signal. The past work in the area is summarized and presented together with an improved system design and the proposed implementation.
[Approach to the Development of Mind and Persona].
Sawaguchi, Toshiko
2018-01-01
To access medical specialists by health specialists working in the regional health field, the possibility of utilizing the voice approach for dissociative identity disorder (DID) patients as a health assessment for medical access (HAMA) was investigated. The first step is to investigate whether the plural personae in a single DID patient can be discriminated by voice analysis. Voices of DID patients including these with different personae were extracted from YouTube and were analysed using the software PRAAT with basic frequency, oral factors, chin factors and tongue factors. In addition, RAKUGO story teller voices made artificially and dramatically were analysed in the same manner. Quantitive and qualitative analysis method were carried out and nested logistic regression and a nested generalized linear model was developed. The voice from different personae in one DID patient could be visually and easily distinquished using basic frequency curve, cluster analysis and factor analysis. In the canonical analysis, only Roy's maximum root was <0.01. In the nested generalized linear model, the model using a standard deviation (SD) indicator fit best and some other possibilities are shown here. In DID patients, the short transition time among plural personae could guide to the risky situation such as suicide. So if the voice approach can show the time threshold of changes between the different personae, it would be useful as an Access Assessment in the form of a simple HAMA.
Gelfer, Marylou Pausewang; Tice, Ruthanne M
2013-05-01
The present study examined how effectively listeners' perceptions of gender could be changed from male to female for male-to-female (MTF) transgender (TG) clients based on the voice signal alone, immediately after voice therapy and at long-term follow-up. Short- and long-term changes in masculinity and femininity ratings and acoustic measures of speaking fundamental frequency (SFF) and vowel formant frequencies were also investigated. Prospective treatment study. Five MTF TG clients, five control female speakers, and five control male speakers provided a variety of speech samples for later analysis. The TG clients then underwent 8 weeks of voice therapy. Voice samples were collected immediately at the termination of therapy and again 15 months later. Two groups of listeners were recruited to evaluate gender and provide masculinity and femininity ratings. Perceptual results revealed that TG subjects were perceived as female 1.9% of the time in the pretest, 50.8% of the time in the immediate posttest, and 33.1% of the time in the long-term posttest. The TG speakers were also perceived as significantly less masculine and more feminine in the immediate posttest and the long-term posttest compared with the pre-test. Some acoustic measures showed significant differences between the pretest and the immediate posttest and long-term posttest. It appeared that 8 weeks of voice therapy could result in vocal changes in MTF TG individuals that persist at least partially for up to 15 months. However, some TG subjects were more successful with voice feminization than others. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Conde, Mariana de Cásisa Macedo; Siqueira, Larissa Thaís Donalonso; Vendramini, José Eduardo; Brasolotto, Alcione Ghedini; Guirro, Rinaldo Roberto de Jesus; Silverio, Kelly Cristina Alves
2018-05-01
This study aimed to verify the immediate effect of low-frequency transcutaneous electrical nerve stimulation (TENS) and laryngeal manual therapy (LMT) in musculoskeletal pain, voice quality, and self-reported signs in women with dysphonia. Thirty women with behavioral dysphonia were randomly divided into the TENS group and the LMT group. All participants fulfilled the pain survey and had their voices recorded to posterior perceptual and acoustic analysis before and after intervention. The TENS group received a unique low-frequency TENS session (20 minutes). The LMT group received LMT (20 minutes) with soft and superficial massage in the sternocleidomastoid muscle, suprahyoid muscles, and larynx. Afterward, the volunteers reported their voice, larynx, breathing, and articulatory signs. Pre and post data were compared by parametric and nonparametric tests. After TENS, a decrease in pain intensity in the posterior or anterior region of the neck, shoulders, upper or lower back, and masseter was observed. After LMT, a decrease in pain intensity in the neck anterior region, shoulders, lower back, and temporal region was observed. Also, after TENS, there was an improvement in vowel /a/ instability; after LMT, there was a general improvement in voice quality, decrease in tension, and decrease in breathiness in speech. Positive voice and laryngeal signs were reported after TENS, and positive laryngeal signs and articulation were reported after LMT. TENS and LMT may be used in voice treatment of women with behavioral dysphonia, and both may be considered important therapy resources that reduce musculoskeletal pain and cause positive laryngeal signs. Both TENS and LMT are able to partially improve voice quality, but TENS presented better results. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Szabo Portela, Annika; Hammarberg, Britta; Södersten, Maria
2013-01-01
More knowledge is needed about preschool teachers' voice use to identify voice behaviours related to work demands that increase the risk for vocal dysfunction. The purpose of this study was to: (1) determine if speaking fundamental frequency (F0) and phonation time differ between work and leisure time and (2) describe variation in F0 and phonation time across the workday in preschool teachers with healthy voices. A portable voice accumulator was used to collect data on F0 and phonation time. Twelve vocally healthy female preschool teachers participated in recordings during both work and leisure time for 2 successive days. Their mean age was 35 years (range 21-53 years). Mean F0 was high during the working day (266 Hz) and decreased significantly after work (p < 0.0001). F0 was high also during leisure time (246 Hz) as compared to reference F0 values for Swedish females based on laboratory recordings. Phonation time at work varied widely among the participants, with an average of 12.0%, and decreased significantly to 5.5% during leisure time (p < 0.0001). Most participants had few opportunities for voice rest during work. Swedish preschool teachers use high levels of F0 and phonation time during work compared to leisure time indicating high vocal load caused by work. To clarify the role of daily voice use in the causation of vocal dysfunction in this profession, recordings over several days are needed. In addition to F0 and phonation time, recordings of voice sound pressure level and background noise level seem important. © 2013 S. Karger AG, Basel.
Perturbation of voice signals in register transitions on sustained frequency in professional tenors.
Echternach, Matthias; Traser, Louisa; Richter, Bernhard
2012-09-01
Vocal register transitions in the passaggio region remain an unclarified field in classically trained male singers. We examined the acoustic and electroglottographic signals of seven tenors' transitions from voix mixte to falsetto on a sustained pitch F4 (349Hz) on the vowels /a, e, i, o, u, and æ/. It was found that in many of the tested subjects, register transitions between voix mixte and falsetto were performed very continuously without clear register transition events. However, an increase of frequency and amplitude perturbation (jitter, relative average perturbation, and shimmer) was observed during register transitions. These data suggest that professional tenors are able to avoid sudden registration events frequently observed in untrained voices. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Formant characteristics of human laughter.
Szameitat, Diana P; Darwin, Chris J; Szameitat, André J; Wildgruber, Dirk; Alter, Kai
2011-01-01
Although laughter is an important aspect of nonverbal vocalization, its acoustic properties are still not fully understood. Extreme articulation during laughter production, such as wide jaw opening, suggests that laughter can have very high first formant (F(1)) frequencies. We measured fundamental frequency and formant frequencies of the vowels produced in the vocalic segments of laughter. Vocalic segments showed higher average F(1) frequencies than those previously reported and individual values could be as high as 1100 Hz for male speakers and 1500 Hz for female speakers. To our knowledge, these are the highest F(1) frequencies reported to date for human vocalizations, exceeding even the F(1) frequencies reported for trained soprano singers. These exceptionally high F(1) values are likely to be based on the extreme positions adopted by the vocal tract during laughter in combination with physiological constraints accompanying the production of a "pressed" voice. Copyright © 2011 The Voice Foundation. All rights reserved.
47 CFR 90.235 - Secondary fixed signaling operations.
Code of Federal Regulations, 2013 CFR
2013-10-01
... SERVICES PRIVATE LAND MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.235 Secondary... for the primary operations on the frequency concerned. (b) The output power shall not exceed 30 watts... those systems covered under paragraph (e) of this section, the maximum duration of any non-voice...
47 CFR 90.235 - Secondary fixed signaling operations.
Code of Federal Regulations, 2011 CFR
2011-10-01
... SERVICES PRIVATE LAND MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.235 Secondary... for the primary operations on the frequency concerned. (b) The output power shall not exceed 30 watts... those systems covered under paragraph (e) of this section, the maximum duration of any non-voice...
47 CFR 90.235 - Secondary fixed signaling operations.
Code of Federal Regulations, 2012 CFR
2012-10-01
... SERVICES PRIVATE LAND MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.235 Secondary... for the primary operations on the frequency concerned. (b) The output power shall not exceed 30 watts... those systems covered under paragraph (e) of this section, the maximum duration of any non-voice...
47 CFR 90.235 - Secondary fixed signaling operations.
Code of Federal Regulations, 2014 CFR
2014-10-01
... SERVICES PRIVATE LAND MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.235 Secondary... for the primary operations on the frequency concerned. (b) The output power shall not exceed 30 watts... those systems covered under paragraph (e) of this section, the maximum duration of any non-voice...
47 CFR 90.235 - Secondary fixed signaling operations.
Code of Federal Regulations, 2010 CFR
2010-10-01
... SERVICES PRIVATE LAND MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.235 Secondary... for the primary operations on the frequency concerned. (b) The output power shall not exceed 30 watts... those systems covered under paragraph (e) of this section, the maximum duration of any non-voice...
Voice Range Profiles of Singing Students: The Effects of Training Duration and Institution.
Lycke, Hugo; Siupsinskiene, Nora
2016-01-01
The aim of the study was to assess differences in voice parameters measured by the physiological voice range profile (VRP) in groups of vocally healthy subjects differentiated by the duration of vocal training and the training institution. Six basic frequency- and intensity-related VRP parameters and the frequency dip of the register transition zone were determined from VRP recordings of 162 females studying in individual singing lessons (1st-5th level) in Dutch, Belgian, English, and French public or private training facilities. Sixty-seven nonsinging female students served as controls. Singing students in more advanced singing classes demonstrated a significantly greater frequency range, particularly at high frequencies, than did first-year students. Students with private training showed a significantly increased mean intensity range in comparison to those in group classes, while students with musical theater training exhibited significantly increased frequency- and intensity-related VRP parameters in comparison to the students with classical training. When compared to nonsingers, all singing student subgroups showed significant increases in all basic VRP parameters. However, the register transition parameter was not influenced by training duration or institution. Our study suggests that the extension of physiological vocal limits might depend on training duration and institution. © 2016 S. Karger AG, Basel.
The Effect of Hydration on Voice Quality in Adults: A Systematic Review.
Alves, Maxine; Krüger, Esedra; Pillay, Bhavani; van Lierde, Kristiane; van der Linde, Jeannie
2017-11-06
We aimed to critically appraise scientific, peer-reviewed articles, published in the past 10 years on the effects of hydration on voice quality in adults. This is a systematic review. Five databases were searched using the key words "vocal fold hydration", "voice quality", "vocal fold dehydration", and "hygienic voice therapy". The Preferred Reporting Items for Systematic Review and Meta-Analyses (PRISMA) guidelines were followed. The included studies were scored based on American Speech-Language-Hearing Association's levels of evidence and quality indicators, as well as the Cochrane Collaboration's risk of bias tool. Systemic dehydration as a result of fasting and not ingesting fluids significantly negatively affected the parameters of noise-to-harmonics ratio (NHR), shimmer, jitter, frequency, and the s/z ratio. Water ingestion led to significant improvements in shimmer, jitter, frequency, and maximum phonation time values. Caffeine intake does not appear to negatively affect voice production. Laryngeal desiccation challenges by oral breathing led to surface dehydration which negatively affected jitter, shimmer, NHR, phonation threshold pressure, and perceived phonatory effort. Steam inhalation significantly improved NHR, shimmer, and jitter. Only nebulization of isotonic solution decreased phonation threshold pressure and showed some indication of a potential positive effect of nebulization substances. Treatments in high humidity environments prove to be effective and adaptations of low humidity environments should be encouraged. Recent literature regarding vocal hydration is high quality evidence. Systemic hydration is the easiest and most cost-effective solution to improve voice quality. Recent evidence therefore supports the inclusion of hydration in a vocal hygiene program. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Effects of singing training on the speaking voice of voice majors.
Mendes, Ana P; Brown, W S; Rothman, Howard B; Sapienza, Christine
2004-03-01
This longitudinal study gathered data with regard to the question: Does singing training have an effect on the speaking voice? Fourteen voice majors (12 females and two males; age range 17 to 20 years) were recorded once a semester for four consecutive semesters, while sustaining vowels and reading the "Rainbow Passage." Acoustic measures included speaking fundamental frequency (SFF) and sound pressure level (SLP). Perturbation measures included jitter, shimmer, and harmonic-to-noise ratio. Temporal measures included sentence, consonant, and diphthong durations. Results revealed that, as the number of semesters increased, the SFF increased while jitter and shimmer slightly decreased. Repeated measure analysis, however, indicated that none of the acoustic, temporal, or perturbation differences were statistically significant. These results confirm earlier cross-sectional studies that compared singers with nonsingers, in that singing training mostly affects the singing voice and rarely the speaking voice.
Godino-Llorente, J I; Gómez-Vilda, P
2004-02-01
It is well known that vocal and voice diseases do not necessarily cause perceptible changes in the acoustic voice signal. Acoustic analysis is a useful tool to diagnose voice diseases being a complementary technique to other methods based on direct observation of the vocal folds by laryngoscopy. Through the present paper two neural-network based classification approaches applied to the automatic detection of voice disorders will be studied. Structures studied are multilayer perceptron and learning vector quantization fed using short-term vectors calculated accordingly to the well-known Mel Frequency Coefficient cepstral parameterization. The paper shows that these architectures allow the detection of voice disorders--including glottic cancer--under highly reliable conditions. Within this context, the Learning Vector quantization methodology demonstrated to be more reliable than the multilayer perceptron architecture yielding 96% frame accuracy under similar working conditions.
NASA Astrophysics Data System (ADS)
DeRosa, Angela
The present study analyzed the acoustic and perceptual differences in non-singer's singing voice before and after a vocal warm-up. Experiments were conducted with 12 females who had no singing experience and considered themselves to be non-singers. Participants were recorded performing 3 tasks: a musical scale stretching to their most comfortable high and low pitches, sustained productions of the vowels /a/ and /i/, and singing performance of the "Star Spangled Banner." Participants were recorded performing these three tasks before a vocal warm-up, after a vocal warm-up, and then again 2-3 weeks later after 2-3 weeks of practice. Acoustical analysis consisted of formant frequency analysis, singer's formant/singing power ratio analysis, maximum phonation frequency range analysis, and an analysis of jitter, noise to harmonic ratio (NHR), relative average perturbation (RAP), and voice turbulence index (VTI). A perceptual analysis was also conducted with 12 listeners rating comparison performances of before vs. after the vocal warm-up, before vs. after the second vocal warm-up, and after both vocal warm-ups. There were no significant findings for the formant frequency analysis of the vowel /a/, but there was significance for the 1st formant frequency analysis of the vowel /i/. Singer's formant analyzed via Singing Power Ratio analysis showed significance only for the vowel /i/. Maximum phonation frequency range analysis showed a significant increase after the vocal warm-ups. There were no significant findings for the acoustic measures of jitter, NHR, RAP, and VTI. Perceptual analysis showed a significant difference after a vocal warm-up. The results indicate that a singing vocal warm-up can have a significant positive influence on the singing voice of non-singers.
Comparison of FDMA and CDMA for second generation land-mobile satellite communications
NASA Technical Reports Server (NTRS)
Yongacoglu, A.; Lyons, R. G.; Mazur, B. A.
1990-01-01
Code Division Multiple Access (CDMA) and Frequency Division Multiple Access (FDMA) (both analog and digital) systems capacities are compared on the basis of identical link availabilities and physical propagation models. Parameters are optimized for a bandwidth limited, multibeam environment. For CDMA, the benefits of voice activated carriers, antenna discrimination, polarization reuse, return link power control and multipath suppression are included in the analysis. For FDMA, the advantages of bandwidth efficient modulation/coding combinations, voice activated carriers, polarization reuse, beam placement, and frequency staggering were taken into account.
Color and texture associations in voice-induced synesthesia
Moos, Anja; Simmons, David; Simner, Julia; Smith, Rachel
2013-01-01
Voice-induced synesthesia, a form of synesthesia in which synesthetic perceptions are induced by the sounds of people's voices, appears to be relatively rare and has not been systematically studied. In this study we investigated the synesthetic color and visual texture perceptions experienced in response to different types of “voice quality” (e.g., nasal, whisper, falsetto). Experiences of three different groups—self-reported voice synesthetes, phoneticians, and controls—were compared using both qualitative and quantitative analysis in a study conducted online. Whilst, in the qualitative analysis, synesthetes used more color and texture terms to describe voices than either phoneticians or controls, only weak differences, and many similarities, between groups were found in the quantitative analysis. Notable consistent results between groups were the matching of higher speech fundamental frequencies with lighter and redder colors, the matching of “whispery” voices with smoke-like textures, and the matching of “harsh” and “creaky” voices with textures resembling dry cracked soil. These data are discussed in the light of current thinking about definitions and categorizations of synesthesia, especially in cases where individuals apparently have a range of different synesthetic inducers. PMID:24032023
Reproducibility of Automated Voice Range Profiles, a Systematic Literature Review.
Printz, Trine; Rosenberg, Tine; Godballe, Christian; Dyrvig, Anne-Kirstine; Grøntved, Ågot Møller
2018-05-01
Reliable voice range profiles are of great importance when measuring effects and side effects from surgery affecting voice capacity. Automated recording systems are increasingly used, but the reproducibility of results is uncertain. Our objective was to identify and review the existing literature on test-retest accuracy of the automated voice range profile assessment. Systematic review. PubMed, Scopus, Cochrane Library, ComDisDome, Embase, and CINAHL (EBSCO). We conducted a systematic literature search of six databases from 1983 to 2016. The following keywords were used: phonetogram, voice range profile, and acoustic voice analysis. Inclusion criteria were automated recording procedure, healthy voices, and no intervention between test and retest. Test-retest values concerning fundamental frequency and voice intensity were reviewed. Of 483 abstracts, 231 full-text articles were read, resulting in six articles included in the final results. The studies found high reliability, but data are few and heterogeneous. The reviewed articles generally reported high reliability of the voice range profile, and thus clinical usefulness, but uncertainty remains because of low sample sizes and different procedures for selecting, collecting, and analyzing data. More data are needed, and clinical conclusions must be drawn with caution. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Lindstrom, Fredric; Waye, Kerstin Persson; Södersten, Maria; McAllister, Anita; Ternström, Sten
2011-03-01
Although the relationship between noise exposure and vocal behavior (the Lombard effect) is well established, actual vocal behavior in the workplace is still relatively unexamined. The first purpose of this study was to investigate correlations between noise level and both voice level and voice average fundamental frequency (F₀) for a population of preschool teachers in their normal workplace. The second purpose was to study the vocal behavior of each teacher to investigate whether individual vocal behaviors or certain patterns could be identified. Voice and noise data were obtained for female preschool teachers (n=13) in their workplace, using wearable measurement equipment. Correlations between noise level and voice level, and between voice level and F₀, were calculated for each participant and ranged from 0.07 to 0.87 for voice level and from 0.11 to 0.78 for F₀. The large spread of the correlation coefficients indicates that the teachers react individually to the noise exposure. For example, some teachers increase their voice-to-noise level ratio when the noise is reduced, whereas others do not. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Shih, Ludy C; Piel, Jordan; Warren, Amanda; Kraics, Lauren; Silver, Althea; Vanderhorst, Veronique; Simon, David K; Tarsy, Daniel
2012-06-01
Parkinson's disease related speech and voice impairment have significant impact on quality of life measures. LSVT(®)LOUD voice and speech therapy (Lee Silverman Voice Therapy) has demonstrated scientific efficacy and clinical effectiveness, but musically based voice and speech therapy has been underexplored as a potentially useful method of rehabilitation. We undertook a pilot, open-label study of a group-based singing intervention, consisting of twelve 90-min weekly sessions led by a voice and speech therapist/singing instructor. The primary outcome measure of vocal loudness as measured by sound pressure level (SPL) at 50 cm during connected speech was not significantly different one week after the intervention or at 13 weeks after the intervention. A number of secondary measures reflecting pitch range, phonation time and maximum loudness also were unchanged. Voice related quality of life (VRQOL) and voice handicap index (VHI) also were unchanged. This study suggests that a group singing therapy intervention at this intensity and frequency does not result in significant improvement in objective and subject-rated measures of voice and speech impairment. Copyright © 2012 Elsevier Ltd. All rights reserved.
Uloza, Virgilijus; Padervinskis, Evaldas; Vegiene, Aurelija; Pribuisiene, Ruta; Saferis, Viktoras; Vaiciukynas, Evaldas; Gelzinis, Adas; Verikas, Antanas
2015-11-01
The objective of this study is to evaluate the reliability of acoustic voice parameters obtained using smart phone (SP) microphones and investigate the utility of use of SP voice recordings for voice screening. Voice samples of sustained vowel/a/obtained from 118 subjects (34 normal and 84 pathological voices) were recorded simultaneously through two microphones: oral AKG Perception 220 microphone and SP Samsung Galaxy Note3 microphone. Acoustic voice signal data were measured for fundamental frequency, jitter and shimmer, normalized noise energy (NNE), signal to noise ratio and harmonic to noise ratio using Dr. Speech software. Discriminant analysis-based Correct Classification Rate (CCR) and Random Forest Classifier (RFC) based Equal Error Rate (EER) were used to evaluate the feasibility of acoustic voice parameters classifying normal and pathological voice classes. Lithuanian version of Glottal Function Index (LT_GFI) questionnaire was utilized for self-assessment of the severity of voice disorder. The correlations of acoustic voice parameters obtained with two types of microphones were statistically significant and strong (r = 0.73-1.0) for the entire measurements. When classifying into normal/pathological voice classes, the Oral-NNE revealed the CCR of 73.7% and the pair of SP-NNE and SP-shimmer parameters revealed CCR of 79.5%. However, fusion of the results obtained from SP voice recordings and GFI data provided the CCR of 84.60% and RFC revealed the EER of 7.9%, respectively. In conclusion, measurements of acoustic voice parameters using SP microphone were shown to be reliable in clinical settings demonstrating high CCR and low EER when distinguishing normal and pathological voice classes, and validated the suitability of the SP microphone signal for the task of automatic voice analysis and screening.
Transgender Voice and Communication Treatment: A Retrospective Chart Review of 25 Cases
ERIC Educational Resources Information Center
Hancock, Adrienne B.; Garabedian, Laura M.
2013-01-01
Background: People transitioning from male to female (MTF) gender seek speech-language pathology services when they feel their voice is betraying their genuine self or perhaps is the last obstacle to representing their authentic gender. Speaking fundamental frequency (pitch) and resonance are most often targets in treatment because the combination…
WWV, WWVH HF VOICE (TIME TICK)
Tsunamis 406 EPIRB's National Weather Service Marine Forecasts WWV, WWVH HF VOICE (TIME TICK) Marine of Standards, broadcasts a time and frequency service from stations WWV in Fort Collins, CO and WWVH in Kauai, Hawaii., commonly known to mariners as the "Time Tick", used as an aid in
Kreiman, Jody; Shue, Yen-Liang; Chen, Gang; Iseli, Markus; Gerratt, Bruce R.; Neubauer, Juergen; Alwan, Abeer
2012-01-01
Increases in open quotient are widely assumed to cause changes in the amplitude of the first harmonic relative to the second (H1*–H2*), which in turn correspond to increases in perceived vocal breathiness. Empirical support for these assumptions is rather limited, and reported relationships among these three descriptive levels have been variable. This study examined the empirical relationship among H1*–H2*, the glottal open quotient (OQ), and glottal area waveform skewness, measured synchronously from audio recordings and high-speed video images of the larynges of six phonetically knowledgeable, vocally healthy speakers who varied fundamental frequency and voice qualities quasi-orthogonally. Across speakers and voice qualities, OQ, the asymmetry coefficient, and fundamental frequency accounted for an average of 74% of the variance in H1*–H2*. However, analyses of individual speakers showed large differences in the strategies used to produce the same intended voice qualities. Thus, H1*–H2* can be predicted with good overall accuracy, but its relationship to phonatory characteristics appears to be speaker dependent. PMID:23039455
Design and fabrication of a new electrolarynx and voice amplifier for laryngectomees.
Sundeep Krishna, M; Jayanthy, A K; Divakar, C; Mekhala, R
2005-01-01
A Laryngectomee is a person whose vocal cords i.e. voice box is surgically removed owing to cancer or due to automobile accidents, burns or trauma. The patient, therefore permanently loses the ability to speak normally. An Electrolarynx is an electronic speech aid that enables the Laryngectomee to communicate with other people as quickly as possible after the successful removal of the larynx. A neck type Electrolarynx has been designed. Earlier designs could not alter frequency and intensity simultaneously during conversation. The Electrolarynx developed can control both frequency and intensity simultaneously during conversation. The device has been tested on the patient and found to be very effective. A portable, pocket size, battery powered voice amplifier (PA system) has also been developed which uses an electret condenser microphone as the input. The voice amplifier developed is a two stage amplifier which uses a preamplifier stage and a power amplifier stage. The output of the power amplifier is connected to a speaker. The device is being used by the patient and found to be very useful.
Petrovic-Lazic, Mirjana; Jovanovic, Nadica; Kulic, Milan; Babac, Snezana; Jurisic, Vladimir
2015-03-01
The aim of the study was to assess the effect of endolaryngeal phonomicrosurgery (EPM) and voice therapy in patients with vocal fold polyps using perceptual and acoustic analysis before and after both therapies. The acoustic tests and perceptual evaluation of voice were carried out on 41 female patients with vocal fold polyp before and after EPM and voice therapy. Both therapy strategies were performed. Used acoustic parameters were Jitter percent (Jitt), pitch perturbation quotient (PPQ), shimmer percent (Shim), amplitude perturbation quotient (APQ), fundamental frequency variation (vF0), noise-to-harmonic ratio (NHR), Voice Turbulence Index (VTI). For perceptual evaluation, GRB scale was used. Results indicated higher values of investigated parameters in patients' group than in the control group (P < 0.01). Good correlation between the perceptual hoarseness factors of GRB scale and objective acoustic voice parameters were observed. All analyzed acoustic parameters improved after the phonomicrosurgery and voice therapy and tend to approach to values of the control group. For Jitt percent, Shim percent, vF0, VTI, and NHR, there were statistically significant differences. Perceptual voice evaluation revealed statistically significantly (P < 0.01) decreased rating of G (grade), R (rough) and B (breathy) after surgery and voice therapy. Our data indicated that both acoustic and perceptual characteristic of voice in patients with vocal polyps significantly improved after phonomicrosurgical and voice treatment. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Voice changes after thyroidectomy without recurrent laryngeal nerve injury.
Sinagra, Diego L; Montesinos, Manuel R; Tacchi, Verónica A; Moreno, Julio C; Falco, Jorge E; Mezzadri, Norberto A; Debonis, Daniel L; Curutchet, H Pablo
2004-10-01
Injury of the inferior laryngeal nerve is not the only cause of voice alteration after thyroidectomy; many patients notice minimal changes immediately after operation, without evidence of inferior laryngeal nerve damage. We hypothesized that there may be other causes for voice modification, such as injuries of the superior laryngeal nerve, prethyroid strap muscles, and cricothyroid muscles. We describe voice changes after total thyroidectomy, without inferior laryngeal nerve injury, using a computer program to objectively compare different patterns of voice. Forty-six consecutive patients who underwent total thyroidectomy were studied between March 1997 and December 1999. Acoustic voice analysis was performed preoperatively and at the second, fourth, and sixth postoperative months using a microphone adapted to a personal computer. Parameters measured were intensity of the voice (Shimmer) and fundamental frequency (Fo). No complications occurred during operation or in the postoperative period. Voice fatigue during phonation was the most common symptom after thyroidectomy. Forty patients (87%) stated that their voices had changed since the operation, and common complaints were voice alteration while speaking loudly, changes in voice pitch, and voice disorder while singing. Changes in the Fo and Shimmer values in smokers versus nonsmokers were similar (Fo overall, p = 0.56; Shimmer overall, p = 0.66), as were the same parameters in benign and malignant pathologies (Fo overall, p = 0.66; Shimmer overall, p = 0.67). Voice changes after uncomplicated thyroidectomy occur and can be objectively measured. This is important in the preoperative counseling of patients before thyroidectomy, for ethical and legal purposes.
Schloneger, Matthew; Hunter, Eric
2016-01-01
The multiple social and performance demands placed on college/university singers could put their still developing voices at risk. Previous ambulatory monitoring studies have analyzed the duration, intensity, and frequency (in Hz) of voice use among such students. Nevertheless, no studies to date have incorporated the simultaneous acoustic voice quality measures into the acquisition of these measures to allow for direct comparison during the same voicing period. Such data could provide greater insight into how young singers use their voices, as well as identify potential correlations between vocal dose and acoustic changes in voice quality. The purpose of this study was to assess the voice use and estimated voice quality of college/university singing students (18–24 y/o, N = 19). Ambulatory monitoring was conducted over three full, consecutive weekdays measuring voice from an unprocessed accelerometer signal measured at the neck. From this signal were analyzed traditional vocal dose metrics such as phonation percentage, dose time, cycle dose, and distance dose. Additional acoustic measures included perceived pitch, pitch strength, LTAS slope, alpha ratio, dB SPL 1–3 kHz, and harmonic-to-noise ratio. Major findings from more than 800 hours of recording indicated that among these students (a) higher vocal doses correlated significantly with greater voice intensity, more vocal clarity and less perturbation; and (b) there were significant differences in some acoustic voice quality metrics between non-singing, solo singing and choral singing. PMID:26897545
Glottoplasty for male-to-female transsexualism: voice results.
Remacle, Marc; Matar, Nayla; Morsomme, Dominique; Veduyckt, Ingrid; Lawson, Georges
2011-01-01
The aim of this study was to evaluate the objective voice results of Wendler's glottoplasty in male-to-female transsexuals. We retrospectively reviewed our patients treated with Wendler's technique with minor modifications. Glottoplasty consisted in CO(2)-laser epithelial ablation of the anterior commissure and the two vocal folds in anterior third, suturing of the two vocal folds with two stitches of 3.0 resorbable thread, and application of fibrin sealant to strengthen the suture. Voice assessment was based mainly on fundamental frequency (F(0)), frequency range, jitter, maximum phonation time, phonation quotient, estimated subglottic pressure (ESGP) grade of dysphonia (G), and voice handicap index (VHI). These measures were taken before surgery and on the last follow-up visit. Our series included 15 patients with a mean age of 36 years. The mean follow-up period was 7.2 months. We did not observe any early complications related to the technique. The comparison between the preoperative and the postoperative measurements, using Wilcoxon signed rank test, showed a significant improvement of median F(0) from 139 to 191 Hz (P=0.006) with an increase in the grade of dysphonia (G(pre)=0.2, G(post)=1, P=0.013) and ESGP (ESGP(pre)=8.1 ± 3.2, ESGP(post)=12.0 ± 3.8, P=0.002). Other measurements, including VHI, did not show any significant differences pre- and postoperatively. Wendler's glottoplasty can contribute to feminize the voice. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Bele, Irene Velsvik
2006-12-01
The current study concerns speaking voice quality in two groups of professional voice users, teachers (n = 35) and actors (n = 36), representing trained and untrained voices. The voice quality of text reading at two intensity levels was acoustically analyzed. The central concept was the speaker's formant (SPF), related to the perceptual characteristics "better normal voice quality" (BNQ) and "worse normal voice quality" (WNQ). The purpose of the current study was to get closer to the origin of the phenomenon of the SPF, and to discover the differences in spectral and formant characteristics between the two professional groups and the two voice quality groups. The acoustic analyses were long-term average spectrum (LTAS) and spectrographical measurements of formant frequencies. At very high intensities, the spectral slope was rather quandrangular without a clear SPF peak. The trained voices had a higher energy level in the SPF region compared with the untrained, significantly so in loud phonation. The SPF seemed to be related to both sufficiently strong overtones and a glottal setting, allowing for a lowering of F4 and a closeness of F3 and F4. However, the existence of SPF also in LTAS of the WNQ voices implies that more research is warranted concerning the formation of SPF, and concerning the acoustic correlates of the BNQ voices.
Signal analysis of the female singing voice: Features for perceptual singer identity
NASA Astrophysics Data System (ADS)
Mellody, Maureen
2001-07-01
Individual singing voices tend to be easy for a listener to identify, particularly when compared to the difficulty of identifying the performer of any other musical instrument. What cues does a listener use to identify a particular singing voice? This work seeks to identify a set of features with which one can synthesize notes with the vocal quality of a particular singer. Such analysis and synthesis influences computer music (in the creation of synthetic sounds with different timbre), vocal pedagogy (as a training tool to help singers understand properties of their own voice as well as different professional-quality voices), and vocal health (to identify improper behavior in vocal production). The problem of singer identification is approached in three phases: signal analysis, the development of low- order representations, and perceptual evaluation. To perform the signal analysis, a high-resolution time- frequency distribution is applied to vowel tokens from sopranos and mezzo-sopranos. From these results, low- order representations are created for each singer's notes, which are used to synthesize sounds with the timbral quality of that singer. Finally, these synthesized sounds, along with original recordings, are evaluated by trained listeners in a variety of perceptual experiments to determine the extent to which the vocal quality of the desired singer is captured. Results from the signal analysis show that amplitude and frequency estimates extracted from the time-frequency signal analysis can be used to re-create each signal with little degradation in quality and no loss of perceptual identity. Low-order representations derived from the signal analysis are used in clustering and classification, which successfully clusters signals with corresponding singer identity. Finally, perceptual results indicate that trained listeners are, surprisingly, only modestly successful at correctly identifying the singer of a recording, and find the task to be particularly difficult for certain voices and extremely easy for others. Listeners also indicate that the majority of sounds synthesized with the low-order representations sufficiently capture the desired vocal timbre. Again, the task is easy for certain voices and much more difficult when evaluating other singers, consistent with the results from the original recordings.
"Ring" in the solo child singing voice.
Howard, David M; Williams, Jenevora; Herbst, Christian T
2014-03-01
Listeners often describe the voices of solo child singers as being "pure" or "clear"; these terms would suggest that the voice is not only pleasant but also clearly audible. The audibility or clarity could be attributed to the presence of high-frequency partials in the sound: a "brightness" or "ring." This article aims to investigate spectrally the acoustic nature of this ring phenomenon in children's solo voices, and in particular, relating it to their "nonring" production. Additionally, this is set in the context of establishing to what extent, if any, the spectral characteristics of ring are shared with those of the singer's formant cluster associated with professional adult opera singers in the 2.5-3.5kHz region. A group of child solo singers, acknowledged as outstanding by a singing teacher who specializes in teaching professional child singers, were recorded in a major UK concert hall performing Come unto him, all ye that labour, from the aria He shall feed his flock from The Messiah by GF Handel. Their singing was accompanied by a recording of a piano played through in-ear headphones. Sound pressure recordings were made from well within the critical distance in the hall. The singers were observed to produce notes with and without ring, and these recordings were analyzed in the frequency domain to investigate their spectra. The results indicate that there is evidence to suggest that ring in child solo singers is carried in two areas of the output spectrum: first in the singer's formant cluster region, centered around 4kHz, which is more than 1000Hz higher than what is observed in adults; and second in the region around 7.5-11kHz where a significant strengthening of harmonic presence is observed. A perceptual test has been carried out demonstrating that 94% of 62 listeners label a synthesized version of the calculated overall average ring spectrum for all subjects as having ring when compared with a synthesized version of the calculated overall average nonring spectrum. The notion of ring in the child solo voice manifests itself not only with spectral features in common with the projection peak found in adult singers but also in a higher frequency region. It is suggested that the formant cluster at around 4kHz is the children's equivalent of the singers' formant cluster; the frequency is higher than in the adult, most likely due to the smaller dimensions of the epilaryngeal tube. The frequency cluster observed as a strong peak at about 7.5-11kHz, when added to the children's singers' formant cluster, may be the key to cueing the notion of ring in the child solo voice. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Montero Benavides, Ana; Blanco Murillo, José Luis; Fernández Pozo, Rubén; Espinoza Cuadros, Fernando; Torre Toledano, Doroteo; Alcázar-Ramírez, José D; Hernández Gómez, Luis A
2016-01-01
We investigated whether differences in formants and their bandwidths, previously reported comparing small sample population of healthy individuals and patients with obstructive sleep apnea (OSA), are detected on a larger population representative of a clinical practice scenario. We examine possible indirect or mediated effects of clinical variables, which may shed some light on the connection between speech and OSA. In a retrospective study, 241 male subjects suspected to suffer from OSA were examined. The apnea-hypopnea index (AHI) was obtained for every subject using overnight polysomnography. Furthermore, the clinical variables usually reported as predictors of OSA, body mass index (BMI), cervical perimeter, height, weight, and age, were collected. Voice samples of sustained phonations of the vowels /a/, /e/, /i/, /o/, and /u/ were recorded. Formant frequencies F1, F2, and F3 and bandwidths BW1, BW2, and BW3 of the sustained vowels were determined using spectrographic analysis. Correlations among AHI, clinical parameters, and formants and bandwidths were determined. Correlations between AHI and clinical variables were stronger than those between AHI and voice features. AHI only correlates poorly with BW2 of /a/ and BW3 of /e/. A number of further weak but significant correlations have been detected between voice and clinical variables. Most of them were for height and age, with two higher values for age and F2 of /o/ and F2 of /u/. Only few very weak correlations were detected between voice and BMI, weight and cervical perimeter, wich are the clinical variables more correlated with AHI. No significant correlations were detected between AHI and formant frequencies and bandwidths. Correlations between voice and other clinical factors characterizing OSA are weak but highlight the importance of considering indirect or mediated effects of such clinical variables in any research on speech and OSA. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Damping effects of magnetic fluids of various saturation magnetization (abstract)
NASA Astrophysics Data System (ADS)
Chagnon, Mark
1990-05-01
Magnetic fluids have been widely accepted for use in loudspeaker voice coil gaps as viscous dampers and liquid coolants. When applied properly to a voice coil in manufacturing of the loudspeaker, dramatic improvement in frequency response and power handling is observed. Over the past decade, a great deal of study has been given to the effects of damping as a function of fluid viscosity. It is known that the apparent viscosity of a magnetic fluid increases as a function of applied magnetic field, and that the viscosity versus field relationship approximate that of the magnetization versus applied field. At applied magnetic field strength sufficient to cause magnetic saturation of the fluid, no further increase in viscosity with increased magnetic field is observed. In order to provide a better understanding of the second order magnetoviscous damping effects in magnetic fluids used in voice coils and to provide a better loudspeaker design criterion using magnetic fluids, we have studied the effect on damping of several magnetic fluids of the same O field viscosity and of varying saturation magnetization. Magnetic fluids with saturation magnetization ranging from 50 to 450 G and 100 cps viscosity at O applied field were injected into the voice coil gap of a standard midrange loudspeaker. The frequency response over the entire dynamic range of the speaker was measured. The changes in frequency response versus fluid magnetization are reported.
Quantifying the impact of androgen therapy on the female larynx.
Damrose, Edward J
2009-02-01
To describe the timing of changes in fundamental frequency of the female voice following androgen therapy during female to male gender reassignment. A 33-year-old female semi-professional singer undergoing gender reassignment and intramuscular androgen injections was examined at monthly intervals to monitor the impact of therapy on the voice. Laryngostroboscopy and acoustic analysis were performed simultaneously to monitor for potential laryngeal pathology. Pretreatment mean fundamental frequency (MF(0)) was 228.45 Hz and ranged from 140.26 Hz to 430.64 Hz. Between month 3 and month 4 of treatment, MF(0) declined to 116.52 Hz and ranged from 90.75 Hz to 201.07 Hz. Shimmer increased from 3.4% to 7.8%. Noise to harmonics ratio (NHR) also increased from 0.12 to 0.17. The patient has continued to sing semi-professionally despite these changes in laryngeal function. Androgen therapy exerted a profound change on mean fundamental frequency between the third and fourth months of treatment. In addition, pitch range was reduced in a commensurate fashion. Patients undergoing androgen therapy may undergo a significant change in speaking voice between the third and fourth months of therapy. Moreover, though these changes may exert a profound impact on the singing voice, patients undergoing gender reassignment may still be able to achieve personal and professional success in their singing careers.
A new VOX technique for reducing noise in voice communication systems. [voice operated keying
NASA Technical Reports Server (NTRS)
Morris, C. F.; Morgan, W. C.; Shack, P. E.
1974-01-01
A VOX technique for reducing noise in voice communication systems is described which is based on the separation of voice signals into contiguous frequency-band components with the aid of an adaptive VOX in each band. It is shown that this processing scheme can effectively reduce both wideband and narrowband quasi-periodic noise since the threshold levels readjust themselves to suppress noise that exceeds speech components in each band. Results are reported for tests of the adaptive VOX, and it is noted that improvements can still be made in such areas as the elimination of noise pulses, phoneme reproduction at high-noise levels, and the elimination of distortion introduced by phase delay.
Schloneger, Matthew J; Hunter, Eric J
2017-01-01
The multiple social and performance demands placed on college/university singers could put their still-developing voices at risk. Previous ambulatory monitoring studies have analyzed the duration, intensity, and frequency (in Hertz) of voice use among such students. Nevertheless, no studies to date have incorporated the simultaneous acoustic voice quality measures into the acquisition of these measures to allow for direct comparison during the same voicing period. Such data could provide greater insight into how young singers use their voices, as well as identify potential correlations between vocal dose and acoustic changes in voice quality. The purpose of this study was to assess the voice use and the estimated voice quality of college/university singing students (18-24 years old, N = 19). Ambulatory monitoring was conducted over three full, consecutive weekdays measuring voice from an unprocessed accelerometer signal measured at the neck. From this signal, traditional vocal dose metrics such as phonation percentage, dose time, cycle dose, and distance dose were analyzed. Additional acoustic measures included perceived pitch, pitch strength, long-term average spectrum slope, alpha ratio, dB sound pressure level 1-3 kHz, and harmonic-to-noise ratio. Major findings from more than 800 hours of recording indicated that among these students (a) higher vocal doses correlated significantly with greater voice intensity, more vocal clarity and less perturbation; and (b) there were significant differences in some acoustic voice quality metrics between nonsinging, solo singing, and choral singing. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Most, Tova; Gaon-Sivan, Gal; Shpak, Talma; Luntz, Michal
2012-01-01
Binaural hearing in cochlear implant (CI) users can be achieved either by bilateral implantation or bimodally with a contralateral hearing aid (HA). Binaural-bimodal hearing has the advantage of complementing the high-frequency electric information from the CI by low-frequency acoustic information from the HA. We examined the contribution of a contralateral HA in 25 adult implantees to their perception of fundamental frequency-cued speech characteristics (initial consonant voicing, intonation, and emotions). Testing with CI alone, HA alone, and bimodal hearing showed that all three characteristics were best perceived under the bimodal condition. Significant differences were recorded between bimodal and HA conditions in the initial voicing test, between bimodal and CI conditions in the intonation test, and between both bimodal and CI conditions and between bimodal and HA conditions in the emotion-in-speech test. These findings confirmed that such binaural-bimodal hearing enhances perception of these speech characteristics and suggest that implantees with residual hearing in the contralateral ear may benefit from a HA in that ear.
NASA Astrophysics Data System (ADS)
Edwards, Sharry K.
2005-04-01
Over the past 20+ years the pioneering field of Human Bioacoustics, which includes voice spectral analysis, has begun to model the frequencies and architecture of human vocalizations to identify the innate mathematical templates found within the various system of the human body. Using the idea that the voice is a holographic representation of health and wellness, these non-invasive techniques are being advanced to the extent that a computerized Vocal Profile, using a system of Frequency Equivalents, can be used to accurately quantify, organize, interpret, define, and extrapolate biometric information from the human voice. This information, in turn, provides the opportunity to predict, direct, and maintain intrinsic form and function. This novel approach has provided an accumulation of significant data but until recently has been without an efficient biological framework of reference. The emerging Mathematical Model being assembled through Human Bioacoustic research likely has the potential to allow Vocal Profiling to be used to predict and monitor health issues from the very first cries of a newborn through the frequency foundations of disease and aging.
Effects of HearFones on speaking and singing voice quality.
Laukkanen, Anne-Maria; Mickelson, Nils Peter; Laitala, Marja; Syrjä, Tiina; Salo, Arla; Sihvo, Marketta
2004-12-01
HearFones (HF) have been designed to enhance auditory feedback during phonation. This study investigated the effects of HF (1) on sound perceivable by the subject, (2) on voice quality in reading and singing, and (3) on voice production in speech and singing at the same pitch and sound level. Test 1: Text reading was recorded with two identical microphones in the ears of a subject. One ear was covered with HF, and the other was free. Four subjects attended this test. Tests 2 and 3: A reading sample was recorded from 13 subjects and a song from 12 subjects without and with HF on. Test 4: Six females repeated [pa:p:a] in speaking and singing modes without and with HF on same pitch and sound level. Long-term average spectra were made (Tests 1-3), and formant frequencies, fundamental frequency, and sound level were measured (Tests 2 and 3). Subglottic pressure was estimated from oral pressure in [p], and simultaneously electroglottography (EGG) was registered during voicing on [a:] (Test 4). Voice quality in speech and singing was evaluated by three professional voice trainers (Tests 2-4). HF seemed to enhance sound perceivable at the whole range studied (0-8 kHz), with the greatest enhancement (up to ca 25 dB) being at 1-3 kHz and at 4-7 kHz. The subjects tended to decrease loudness with HF (when sound level was not being monitored). In more than half of the cases, voice quality was evaluated "less strained" and "better controlled" with HF. When pitch and loudness were constant, no clear differences were heard but closed quotient of the EGG signal was higher and the signal more skewed, suggesting a better glottal closure and/or diminished activity of the thyroarytenoid muscle.
Period for Normalization of Voice Acoustic Parameters in Indian Pediatric Cochlear Implantees.
Joy, Jeena V; Deshpande, Shweta; Vaid, Dr Neelam
2017-05-01
The purpose of this study was to investigate the duration required by children with cochlear implants to approximate the norms of voice acoustic parameters. The study design is retrospective. Thirty children with cochlear implants (chronological ages ranging between 4.1 and 6.7 years) were divided into three groups, based on the postimplantation duration. Ten normal-hearing children (chronological ages ranging between 4 and 7 years) were selected as the control group. All implanted children underwent an objective voice analysis using Dr. Speech software (Tiger DRS, Inc., Seattle, WA, USA) at 6 months and at 1 and 2 years of implant use. Voice analysis was done for the children in the control group and means were derived for all the parameters analyzed to obtain the normal values. Habitual fundamental frequency (HFF), jitter (frequency variation), and shimmer (amplitude variation) were the voice acoustic parameters analyzed for the vowels |a|, |i|, and |u|. The obtained values of these parameters were then compared with the norms. HFF for the children with implant use for 6 months and 1 year did significantly differ from the control group. However, there was no significant difference (P > 0.5) observed in the children with implant use for 2 years, thus matching the norms. Jitter and shimmer showed a significant difference (P < 0.5) even at 2 years of implant use when compared with the control group. The findings of the study divulge that children with cochlear implants approximate age-matched normal-hearing kids with respect to the voice acoustic parameter of HFF by 2 years of implant use. However, jitter and shimmer were not found to stabilize for the duration studied. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Voice Conversion Using Pitch Shifting Algorithm by Time Stretching with PSOLA and Re-Sampling
NASA Astrophysics Data System (ADS)
Mousa, Allam
2010-01-01
Voice changing has many applications in the industry and commercial filed. This paper emphasizes voice conversion using a pitch shifting method which depends on detecting the pitch of the signal (fundamental frequency) using Simplified Inverse Filter Tracking (SIFT) and changing it according to the target pitch period using time stretching with Pitch Synchronous Over Lap Add Algorithm (PSOLA), then resampling the signal in order to have the same play rate. The same study was performed to see the effect of voice conversion when some Arabic speech signal is considered. Treatment of certain Arabic voiced vowels and the conversion between male and female speech has shown some expansion or compression in the resulting speech. Comparison in terms of pitch shifting is presented here. Analysis was performed for a single frame and a full segmentation of speech.
Ultrasonic speech translator and communications system
Akerman, M.A.; Ayers, C.W.; Haynes, H.D.
1996-07-23
A wireless communication system undetectable by radio frequency methods for converting audio signals, including human voice, to electronic signals in the ultrasonic frequency range, transmitting the ultrasonic signal by way of acoustical pressure waves across a carrier medium, including gases, liquids, or solids, and reconverting the ultrasonic acoustical pressure waves back to the original audio signal. The ultrasonic speech translator and communication system includes an ultrasonic transmitting device and an ultrasonic receiving device. The ultrasonic transmitting device accepts as input an audio signal such as human voice input from a microphone or tape deck. The ultrasonic transmitting device frequency modulates an ultrasonic carrier signal with the audio signal producing a frequency modulated ultrasonic carrier signal, which is transmitted via acoustical pressure waves across a carrier medium such as gases, liquids or solids. The ultrasonic receiving device converts the frequency modulated ultrasonic acoustical pressure waves to a frequency modulated electronic signal, demodulates the audio signal from the ultrasonic carrier signal, and conditions the demodulated audio signal to reproduce the original audio signal at its output. 7 figs.
Uloza, Virgilijus; Padervinskis, Evaldas; Uloziene, Ingrida; Saferis, Viktoras; Verikas, Antanas
2015-09-01
The aim of the present study was to evaluate the reliability of the measurements of acoustic voice parameters obtained simultaneously using oral and contact (throat) microphones and to investigate utility of combined use of these microphones for voice categorization. Voice samples of sustained vowel /a/ obtained from 157 subjects (105 healthy and 52 pathological voices) were recorded in a soundproof booth simultaneously through two microphones: oral AKG Perception 220 microphone (AKG Acoustics, Vienna, Austria) and contact (throat) Triumph PC microphone (Clearer Communications, Inc, Burnaby, Canada) placed on the lamina of thyroid cartilage. Acoustic voice signal data were measured for fundamental frequency, percent of jitter and shimmer, normalized noise energy, signal-to-noise ratio, and harmonic-to-noise ratio using Dr. Speech software (Tiger Electronics, Seattle, WA). The correlations of acoustic voice parameters in vocal performance were statistically significant and strong (r = 0.71-1.0) for the entire functional measurements obtained for the two microphones. When classifying into healthy-pathological voice classes, the oral-shimmer revealed the correct classification rate (CCR) of 75.2% and the throat-jitter revealed CCR of 70.7%. However, combination of both throat and oral microphones allowed identifying a set of three voice parameters: throat-signal-to-noise ratio, oral-shimmer, and oral-normalized noise energy, which provided the CCR of 80.3%. The measurements of acoustic voice parameters using a combination of oral and throat microphones showed to be reliable in clinical settings and demonstrated high CCRs when distinguishing the healthy and pathological voice patient groups. Our study validates the suitability of the throat microphone signal for the task of automatic voice analysis for the purpose of voice screening. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Valadez, Victor; Ysunza, Antonio; Ocharan-Hernandez, Esther; Garrido-Bustamante, Norma; Sanchez-Valerio, Araceli; Pamplona, Ma C
2012-09-01
Vocal Nodules (VN) are a functional voice disorder associated with voice misuse and abuse in children. There are few reports addressing vocal parameters in children with VN, especially after a period of vocal rehabilitation. The purpose of this study is to describe measurements of vocal parameters including Fundamental Frequency (FF), Shimmer (S), and Jitter (J), videonasolaryngoscopy examination and clinical perceptual assessment, before and after voice therapy in children with VN. Voice therapy was provided using visual support through Speech-Viewer software. Twenty patients with VN were studied. An acoustical analysis of voice was performed and compared with data from subjects from a control group matched by age and gender. Also, clinical perceptual assessment of voice and videonasolaryngoscopy were performed to all patients with VN. After a period of voice therapy, provided with visual support using Speech Viewer-III (SV-III-IBM) software, new acoustical analyses, perceptual assessments and videonasolaryngoscopies were performed. Before the onset of voice therapy, there was a significant difference (p<0.05) in mean FF, S and J, between the patients with VN and subjects from the control group. After the voice therapy period, a significant improvement (p<0.05) was found in all acoustic voice parameters. Moreover, perceptual voice analysis demonstrated improvement in all cases. Finally, videonasolaryngoscopy demonstrated that vocal nodules were no longer discernible on the vocal folds in any of the cases. SV-III software seems to be a safe and reliable method for providing voice therapy in children with VN. Acoustic voice parameters, perceptual data and videonasolaryngoscopy were significantly improved after the speech therapy period was completed. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Trans Male Voice in the First Year of Testosterone Therapy: Make No Assumptions
ERIC Educational Resources Information Center
Hancock, Adrienne B.; Childs, Kayla D.; Irwig, Michael S.
2017-01-01
Purpose: The purpose of this study was to prospectively examine changes in gender-related voice domain of pitch measured by fundamental frequency, function-related domains of vocal quality, range, and habitual pitch level and the self-perceptions of transmasculine people during their first year of testosterone treatment. Method: Seven trans men…
Burns, P
1986-05-01
An acoustical analysis of the speaking and singing voices of two types of professional singers was conducted. The vowels /i/, /a/, and /o/ were spoken and sung ten times each by seven opera and seven country and western singers. Vowel spectra were derived by computer software techniques allowing quantitative assessment of formant structure (F1-F4), relative amplitude of resonance peaks (F1-F4), fundamental frequency, and harmonic high frequency energy. Formant analysis was the most effective parameter differentiating the two groups. Only opera singers lowered their fourth formant creating a wide-band resonance area (approximately 2,800 Hz) corresponding to the well-known "singing formant." Country and western singers revealed similar resonatory voice characteristics for both spoken and sung output. These results implicate faulty vocal technique in country and western singers as a contributory reason for vocal abuse/fatigue.
Voice measures of workload in the advanced flight deck: Additional studies
NASA Technical Reports Server (NTRS)
Schneider, Sid J.; Alpert, Murray
1989-01-01
These studies investigated acoustical analysis of the voice as a measure of workload in individual operators. In the first study, voice samples were recorded from a single operator during high, medium, and low workload conditions. Mean amplitude, frequency, syllable duration, and emphasis all tended to increase as workload increased. In the second study, NASA test pilots performed a laboratory task, and used a flight simulator under differing work conditions. For two of the pilots, high workload in the simulator brought about greater amplitude, peak duration, and stress. In both the laboratory and simulator tasks, high workload tended to be associated with more statistically significant drop-offs in the acoustical measures than were lower workload levels. There was a great deal of intra-subject variability in the acoustical measures. The results suggested that in individual operators, increased workload might be revealed by high initial amplitude and frequency, followed by rapid drop-offs over time.
Kim, Keun Ho; Ku, Boncho; Kang, Namsik; Kim, Young-Su; Jang, Jun-Su; Kim, Jong Yeol
2012-01-01
The voice has been used to classify the four constitution types, and to recognize a subject's health condition by extracting meaningful physical quantities, in traditional Korean medicine. In this paper, we propose a method of selecting the reliable variables from various voice features, such as frequency derivative features, frequency band ratios, and intensity, from vowels and a sentence. Further, we suggest a process to extract independent variables by eliminating explanatory variables and reducing their correlation and remove outlying data to enable reliable discriminant analysis. Moreover, the suitable division of data for analysis, according to the gender and age of subjects, is discussed. Finally, the vocal features are applied to a discriminant analysis to classify each constitution type. This method of voice classification can be widely used in the u-Healthcare system of personalized medicine and for improving diagnostic accuracy. PMID:22529874
Yilmaz, Atilla; Sarac, Elif Tuğba; Aydinli, Fatma Esen; Yildizgoren, Mustafa Turgut; Okuyucu, Emine Esra; Serarslan, Yurdal
2018-06-25
Parkinson's disease (PD) is the second most frequent progressive neuro-degenerative disorder. In addition to motor symptoms, nonmotor symptoms and voice and speech disorders can also develop in 90% of PD patients. The aim of our study was to investigate the effects of DBS and different DBS frequencies on speech acoustics of vowels in PD patients. The study included 16 patients who underwent STN-DBS surgery due to PD. The voice recordings for the vowels including [a], [e], [i], and [o] were performed at frequencies including 230, 130, 90, and 60 Hz and off-stimulation. The voice recordings were gathered and evaluated by the Praat software, and the effects on the first (F1), second (F2), and third formant (F3) frequencies were analyzed. A significant difference was found for the F1 value of the vowel [a] at 130 Hz compared to off-stimulation. However, no significant difference was found between the three formant frequencies with regard to the stimulation frequencies and off-stimulation. In addition, though not statistically significant, stimulation at 60 and 230 Hz led to several differences in the formant frequencies of other three vowels. Our results indicated that STN-DBS stimulation at 130 Hz had a significant positive effect on articulation of [a] compared to off-stimulation. Although there is not any statistical significant stimulation at 60 and 230 Hz may also have an effect on the articulation of [e], [i], and [o] but this effect needs to be investigated in future studies with higher numbers of participants.
Kim, Yongdae; Kim, Sangyoo; Park, Kyihwan
2009-04-01
A six-axis active vibration isolation system (AVIS) is developed using voice coil actuators. Point contact configuration is employed to have an easy assembly of eight voice coil actuators to an upper and a base plates. The velocity sensor, using an electromagnetic principle that is commonly used in the vibration control, is investigated since its phase lead characteristic causes an instability problem for a low frequency vibration. The performances of the AVIS are investigated in the frequency domain and finally validated by comparing with the passive isolation system using the atomic force microscope images.
Subjective and objective voice evaluation in Sjögren's syndrome.
Saltürk, Ziya; Özdemir, Erdi; Kumral, Tolgar Lütfi; Karabacakoğlu, Zeynep; Kumral, Esra; Yildiz, Hatice Elvin; Mersinlioğlu, Gökhan; Atar, Yavuz; Berkiten, Güler; Yildirim, Güven; Uyar, Yavuz
2017-04-01
Objective The aim of this study is to assess the subjective and objective aspects of voice in Sjögren's syndrome. Methods The study enrolled 10 women with Sjögren's syndrome and 12 healthy women. Maximum phonation time, fundamental frequency, jitter, shimmer, and noise-to-harmonics ratio were determined during acoustic voice analysis. The Stroboscopy Evaluation Rating Form was used for the laryngostroboscopic evaluation. A subjective evaluation was performed using the Turkish version of Voice Handicap Index-10. Results The mean age of the Sjögren's syndrome and control groups was 46 ± 13.89 and 41.27 ± 6.99 years, respectively, and did not differ (P = 0.131). In the laryngostroboscopic evaluation, the smoothness and straightness of vocal folds, regularity, and glottal closure differed significantly. In the acoustic and aerodynamic analyses, none of the parameters differed statistically, while the Sjögren's syndrome group had significantly higher Voice Handicap Index-10 scores than the controls. Conclusion Sjögren's syndrome affects the voice and voice quality.
Discriminating male and female voices: differentiating pitch and gender.
Latinus, Marianne; Taylor, Margot J
2012-04-01
Gender is salient, socially critical information obtained from faces and voices, yet the brain processes underlying gender discrimination have not been well studied. We investigated neural correlates of gender processing of voices in two ERP studies. In the first, ERP differences were seen between female and male voices starting at 87 ms, in both spatial-temporal and peak analyses, particularly the fronto-central N1 and P2. As pitch differences may drive gender differences, the second study used normal, high- and low-pitch voices. The results of these studies suggested that differences in pitch produced early effects (27-63 ms). Gender effects were seen on N1 (120 ms) with implicit pitch processing (study 1), but were not seen with manipulations of pitch (study 2), demonstrating that N1 was modulated by attention. P2 (between 170 and 230 ms) discriminated male from female voices, independent of pitch. Thus, these data show that there are two stages in voice gender processing; a very early pitch or frequency discrimination and a later more accurate determination of gender at the P2 latency.
Mobile voice health monitoring using a wearable accelerometer sensor and a smartphone platform.
Mehta, Daryush D; Zañartu, Matías; Feng, Shengran W; Cheyne, Harold A; Hillman, Robert E
2012-11-01
Many common voice disorders are chronic or recurring conditions that are likely to result from faulty and/or abusive patterns of vocal behavior, referred to generically as vocal hyperfunction. An ongoing goal in clinical voice assessment is the development and use of noninvasively derived measures to quantify and track the daily status of vocal hyperfunction so that the diagnosis and treatment of such behaviorally based voice disorders can be improved. This paper reports on the development of a new, versatile, and cost-effective clinical tool for mobile voice monitoring that acquires the high-bandwidth signal from an accelerometer sensor placed on the neck skin above the collarbone. Using a smartphone as the data acquisition platform, the prototype device provides a user-friendly interface for voice use monitoring, daily sensor calibration, and periodic alert capabilities. Pilot data are reported from three vocally normal speakers and three subjects with voice disorders to demonstrate the potential of the device to yield standard measures of fundamental frequency and sound pressure level and model-based glottal airflow properties. The smartphone-based platform enables future clinical studies for the identification of the best set of measures for differentiating between normal and hyperfunctional patterns of voice use.
Mobile voice health monitoring using a wearable accelerometer sensor and a smartphone platform
Mehta, Daryush D.; Zañartu, Matías; Feng, Shengran W.; Cheyne, Harold A.; Hillman, Robert E.
2012-01-01
Many common voice disorders are chronic or recurring conditions that are likely to result from faulty and/or abusive patterns of vocal behavior, referred to generically as vocal hyperfunction. An ongoing goal in clinical voice assessment is the development and use of noninvasively derived measures to quantify and track the daily status of vocal hyperfunction so that the diagnosis and treatment of such behaviorally based voice disorders can be improved. This paper reports on the development of a new, versatile, and cost-effective clinical tool for mobile voice monitoring that acquires the high-bandwidth signal from an accelerometer sensor placed on the neck skin above the collarbone. Using a smartphone as the data acquisition platform, the prototype device provides a user-friendly interface for voice use monitoring, daily sensor calibration, and periodic alert capabilities. Pilot data are reported from three vocally normal speakers and three subjects with voice disorders to demonstrate the potential of the device to yield standard measures of fundamental frequency and sound pressure level and model-based glottal airflow properties. The smartphone-based platform enables future clinical studies for the identification of the best set of measures for differentiating between normal and hyperfunctional patterns of voice use. PMID:22875236
Voice Use Among Music Theory Teachers: A Voice Dosimetry and Self-Assessment Study.
Schiller, Isabel S; Morsomme, Dominique; Remacle, Angélique
2017-07-25
This study aimed (1) to investigate music theory teachers' professional and extra-professional vocal loading and background noise exposure, (2) to determine the correlation between vocal loading and background noise, and (3) to determine the correlation between vocal loading and self-evaluation data. Using voice dosimetry, 13 music theory teachers were monitored for one workweek. The parameters analyzed were voice sound pressure level (SPL), fundamental frequency (F0), phonation time, vocal loading index (VLI), and noise SPL. Spearman correlation was used to correlate vocal loading parameters (voice SPL, F0, and phonation time) and noise SPL. Each day, the subjects self-assessed their voice using visual analog scales. VLI and self-evaluation data were correlated using Spearman correlation. Vocal loading parameters and noise SPL were significantly higher in the professional than in the extra-professional environment. Voice SPL, phonation time, and female subjects' F0 correlated positively with noise SPL. VLI correlated with self-assessed voice quality, vocal fatigue, and amount of singing and speaking voice produced. Teaching music theory is a profession with high vocal demands. More background noise is associated with increased vocal loading and may indirectly increase the risk for voice disorders. Correlations between VLI and self-assessments suggest that these teachers are well aware of their vocal demands and feel their effect on voice quality and vocal fatigue. Visual analog scales seem to represent a useful tool for subjective vocal loading assessment and associated symptoms in these professional voice users. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
van Leer, Eva; Connor, Nadine P.
2012-01-01
Summary Objectives/Hypotheses There are many documented barriers to successful adherence to voice therapy. However, methods for facilitating adherence are not well understood. The purpose of this study was to determine if patient adherence could be improved by providing patients with practice support between sessions using mobile treatment videos. Methods Thirteen voice therapy participants were provided with portable media players containing videos of voice exercises exemplified by their therapists and themselves. A randomized crossover design of two conditions was used: (1) standard of care voice therapy where participants were provided with written homework descriptions; and (2) video-enhanced voice therapy where participants received a portable digital media player with clinician and self-videos. The duration of each condition was 1 week. Results Practice of voice exercises was significantly greater in the video-enhanced voice therapy condition than in the standard of care “written” condition (P < 0.05). Three aspects of participant motivation for practice-overall commitment to practice, importance of practice, and confidence in the ability to practice were also significantly greater after video-enhanced condition than after standard of care condition. Conclusion These results support the use of video examples and portable digital media players in voice therapy for individuals who are comfortable using such technology. PMID:21840169
Perceptual, auditory and acoustic vocal analysis of speech and singing in choir conductors.
Rehder, Maria Inês Beltrati Cornacchioni; Behlau, Mara
2008-01-01
the voice of choir conductors. to evaluate the vocal quality of choir conductors based on the production of a sustained vowel during singing and when speaking in order to observe auditory and acoustic differences. participants of this study were 100 choir conductors, with an equal distribution between genders. Participants were asked to produce the sustained vowel "é" using a singing and speaking voice. Speech samples were analyzed based on auditory-perceptive and acoustic parameters. The auditory-perceptive analysis was carried out by two speech-language pathologist, specialists in this field of knowledge. The acoustic analysis was carried out with the support of the computer software Doctor Speech (Tiger Electronics, SRD, USA, version 4.0), using the Real Analysis module. the auditory-perceptive analysis of the vocal quality indicated that most conductors have adapted voices, presenting more alterations in their speaking voice. The acoustic analysis indicated different values between genders and between the different production modalities. The fundamental frequency was higher in the singing voice, as well as the values for the first formant; the second formant presented lower values in the singing voice, with statistically significant results only for women. the voice of choir conductors is adapted, presenting fewer deviations in the singing voice when compared to the speaking voice. Productions differ based the voice modality, singing or speaking.
NASA Astrophysics Data System (ADS)
Fouquet, Meddy; Pisanski, Katarzyna; Mathevon, Nicolas; Reby, David
2016-10-01
Voice pitch (the perceptual correlate of fundamental frequency, F0) varies considerably even among individuals of the same sex and age, communicating a host of socially and evolutionarily relevant information. However, due to the almost exclusive utilization of cross-sectional designs in previous studies, it remains unknown whether these individual differences in voice pitch emerge before, during or after sexual maturation, and whether voice pitch remains stable into adulthood. Here, we measured the F0 parameters of men who were recorded once every 7 years from age 7 to 56 as they participated in the British television documentary Up Series. Linear mixed models revealed significant effects of age on all F0 parameters, wherein F0 mean, minimum, maximum and the standard deviation of F0 showed sharp pubertal decreases between age 7 and 21, yet remained remarkably stable after age 28. Critically, men's pre-pubertal F0 at age 7 strongly predicted their F0 at every subsequent adult age, explaining up to 64% of the variance in post-pubertal F0. This finding suggests that between-individual differences in voice pitch that are known to play an important role in men's reproductive success are in fact largely determined by age 7, and may therefore be linked to prenatal and/or pre-pubertal androgen exposure.
Dallaston, Katherine; Rumbach, Anna F
2016-01-01
(1) To quantify acute changes in acoustic parameters of the voices of group fitness instructors (GFIs) before and after exercise instruction. (2) To determine whether these changes are discernible perceptually by the instructor. This is a pilot prospective cohort study. Participants were six female GFIs, based in Brisbane, Australia. Participants performed a series of vocal tasks before and after instruction of a 60-minute exercise class. Data were obtained pertaining to fundamental frequency (pitch), intensity (volume), jitter, shimmer, harmonic-to-noise ratio (HNR), maximum duration of sustained phonation (MDSP), and pitch range. Additionally, self-ratings of voice quality were obtained before and after instruction. Data were analyzed using the Wilcoxon signed rank test. Significant increases (P ≤ 0.05) were found in fundamental frequency and intensity after instruction. No significant changes in jitter, shimmer, HNR, or MDSP were found before and after instruction. For the group, no significant change in self-ratings of voice quality occurred before and after instruction. Statistically significant changes in pitch and volume were found on acoustic analysis. However, these subtle changes remained within the limits of what is considered normal and representative of the participant's age and gender. Further research into the effects of exercise instruction on the voice is needed. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Physiologic effects of voice stimuli in conscious and unconscious palliative patients-a pilot study.
Buchholz, Kerstin; Liebl, Patrick; Keinki, Christian; Herth, Natalie; Huebner, Jutta
2018-05-01
Sounds and acoustic stimuli can have an effect on human beings. In medical care, sounds are often used as parts of therapies, e. g., in different types of music therapies. Also, human speech greatly affects the mental status. Although calming sounds and music are widely established in the medical field, clear evidence for the effect of sounds in palliative care is scare, and data about effects of the human voice in general are still missing. Thus, the aim of this study was to evaluate the effects of different voice stimuli on palliative patients. Two different voice stimuli (one calm, the other turbulent) were presented in a randomized sequence, and physiological parameters (blood pressure, heart frequency, oxygen saturation, respiratory rate) were recorded. Twenty patients (14 conscious and 6 unconscious) participated in this study. There was a decrease of heart frequency as well as an increase of oxygen saturation in the group of conscious patients, whereas no significant change of blood pressure or respiratory rate were detected in either group, conscious and unconscious patients. Although our dataset is heterogeneous, it can be concluded that voice stimuli can influence conscious patients. However, in this setting, no effect on unconscious patients was demonstrated. More clinical research on this topic with larger groups and a broader spectrum of parameters is needed.
Perceptual and Acoustic Analyses of Good Voice Quality in Male Radio Performers.
Warhurst, Samantha; Madill, Catherine; McCabe, Patricia; Ternström, Sten; Yiu, Edwin; Heard, Robert
2017-03-01
Good voice quality is an asset to professional voice users, including radio performers. We examined whether (1) voices could be reliably categorized as good for the radio and (2) these categories could be predicted using acoustic measures. Male radio performers (n = 24) and age-matched male controls performed "The Rainbow Passage" as if presenting on the radio. Voice samples were rated using a three-stage paired-comparison paradigm by 51 naive listeners and perceptual categories were identified (Study 1), and then analyzed for fundamental frequency, long-term average spectrum, cepstral peak prominence, and pause or spoken-phrase duration (Study 2). Study 1: Good inter-judge reliability was found for perceptual judgments of the best 15 voices (good for radio category, 14/15 = radio performers), but agreement on the remaining 33 voices (unranked category) was poor. Study 2: Discriminant function analyses showed that the SD standard deviation of sounded portion duration, equivalent sound level, and smoothed cepstral peak prominence predicted membership of categories with moderate accuracy (R 2 = 0.328). Radio performers are heterogeneous for voice quality; good voice quality was judged reliably in only 14 out of 24 radio performers. Current acoustic analyses detected some of the relevant signal properties that were salient in these judgments. More refined perceptual analysis and the use of other perceptual methods might provide more information on the complex nature of judging good voices. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Understanding The Neural Mechanisms Involved In Sensory Control Of Voice Production
Parkinson, Amy L.; Flagmeier, Sabina G.; Manes, Jordan L.; Larson, Charles R.; Rogers, Bill; Robin, Donald A.
2012-01-01
Auditory feedback is important for the control of voice fundamental frequency (F0). In the present study we used neuroimaging to identify regions of the brain responsible for sensory control of the voice. We used a pitch-shift paradigm where subjects respond to an alteration, or shift, of voice pitch auditory feedback with a reflexive change in F0. To determine the neural substrates involved in these audio-vocal responses, subjects underwent fMRI scanning while vocalizing with or without pitch-shifted feedback. The comparison of shifted and unshifted vocalization revealed activation bilaterally in the superior temporal gyrus (STG) in response to the pitch shifted feedback. We hypothesize that the STG activity is related to error detection by auditory error cells located in the superior temporal cortex and efference copy mechanisms whereby this region is responsible for the coding of a mismatch between actual and predicted voice F0. PMID:22406500
ERIC Educational Resources Information Center
Tarone, Elaine; Dwyer, Sharon; Gillette, Susan; Icke, Vincent.
1998-01-01
A study examined frequency of active, passive verb forms in two astrophysics journal articles, finding "we" plus an active voice occurs at least as frequently as the passive. This pattern typifies a previously unidentified type of research article, the logical argument scientific paper, whose characteristics are detailed. Similar pattern…
ERIC Educational Resources Information Center
Guaitella, Isabelle; Santi, Serge; Lagrue, Benoit; Cave, Christian
2009-01-01
Following our work on the relationship between eyebrow movements and the fundamental frequency of the voice, this article presents the results of a study on this phenomenon, and also on the temporal location of rapid eyebrow movements with respect to speaking turns during dialogue. We used an automatic movement-acquisition system coupled with the…
Liu, Hanjun; Wang, Emily Q.; Chen, Zhaocong; Liu, Peng; Larson, Charles R.; Huang, Dongfeng
2010-01-01
The purpose of this cross-language study was to examine whether the online control of voice fundamental frequency (F0) during vowel phonation is influenced by language experience. Native speakers of Cantonese and Mandarin, both tonal languages spoken in China, participated in the experiments. Subjects were asked to vocalize a vowel sound ∕u∕ at their comfortable habitual F0, during which their voice pitch was unexpectedly shifted (±50, ±100, ±200, or ±500 cents, 200 ms duration) and fed back instantaneously to them over headphones. The results showed that Cantonese speakers produced significantly smaller responses than Mandarin speakers when the stimulus magnitude varied from 200 to 500 cents. Further, response magnitudes decreased along with the increase in stimulus magnitude in Cantonese speakers, which was not observed in Mandarin speakers. These findings suggest that online control of voice F0 during vocalization is sensitive to language experience. Further, systematic modulations of vocal responses across stimulus magnitude were observed in Cantonese speakers but not in Mandarin speakers, which indicates that this highly automatic feedback mechanism is sensitive to the specific tonal system of each language. PMID:21218905
Acoustic analysis of speech under stress.
Sondhi, Savita; Khan, Munna; Vijay, Ritu; Salhan, Ashok K; Chouhan, Satish
2015-01-01
When a person is emotionally charged, stress could be discerned in his voice. This paper presents a simplified and a non-invasive approach to detect psycho-physiological stress by monitoring the acoustic modifications during a stressful conversation. Voice database consists of audio clips from eight different popular FM broadcasts wherein the host of the show vexes the subjects who are otherwise unaware of the charade. The audio clips are obtained from real-life stressful conversations (no simulated emotions). Analysis is done using PRAAT software to evaluate mean fundamental frequency (F0) and formant frequencies (F1, F2, F3, F4) both in neutral and stressed state. Results suggest that F0 increases with stress; however, formant frequency decreases with stress. Comparison of Fourier and chirp spectra of short vowel segment shows that for relaxed speech, the two spectra are similar; however, for stressed speech, they differ in the high frequency range due to increased pitch modulation.
Vocal tract length and formant frequency dispersion correlate with body size in rhesus macaques.
Fitch, W T
1997-08-01
Body weight, length, and vocal tract length were measured for 23 rhesus macaques (Macaca mulatta) of various sizes using radiographs and computer graphic techniques. linear predictive coding analysis of tape-recorded threat vocalizations were used to determine vocal tract resonance frequencies ("formants") for the same animals. A new acoustic variable is proposed, "formant dispersion," which should theoretically depend upon vocal tract length. Formant dispersion is the averaged difference between successive formant frequencies, and was found to be closely tied to both vocal tract length and body size. Despite the common claim that voice fundamental frequency (F0) provides an acoustic indication of body size, repeated investigations have failed to support such a relationship in many vertebrate species including humans. Formant dispersion, unlike voice pitch, is proposed to be a reliable predictor of body size in macaques, and probably many other species.
Vocal warm-up increases phonation threshold pressure in soprano singers at high pitch.
Motel, Tamara; Fisher, Kimberly V; Leydon, Ciara
2003-06-01
Vocal warm-up is thought to optimize singing performance. We compared effects of short-term, submaximal, vocal warm-up exercise with those of vocal rest on the soprano voice (n = 10, ages 19-21 years). Dependent variables were the minimum subglottic air pressure required for vocal fold oscillation to occur (phonation threshold pressure, Pth), and the maximum and minimum phonation fundamental frequency. Warm-up increased Pth for high pitch phonation (p = 0.033), but not for comfortable (p = 0.297) or low (p = 0.087) pitch phonation. No significant difference in the maximum phonation frequency (p = 0.193) or minimum frequency (p = 0.222) was observed. An elevated Pth at controlled high pitch, but an unchanging maximum and minimum frequency production suggests that short-term vocal exercise may increase the viscosity of the vocal fold and thus serve to stabilize the high voice.
NASA Technical Reports Server (NTRS)
Brenner, Malcolm; Shipp, Thomas
1988-01-01
In a study of the validity of eight candidate voice measures (fundamental frequency, amplitude, speech rate, frequency jitter, amplitude shimmer, Psychological Stress Evaluator scores, energy distribution, and the derived measure of the above measures) for determining psychological stress, 17 males age 21 to 35 were subjected to a tracking task on a microcomputer CRT while parameters of vocal production as well as heart rate were measured. Findings confirm those of earlier studies that increases in fundamental frequency, amplitude, and speech rate are found in speakers involved in extreme levels of stress. In addition, it was found that the same changes appear to occur in a regular fashion within a more subtle level of stress that may be characteristic, for example, of routine flying situations. None of the individual speech measures performed as robustly as did heart rate.
Development of a mobile satellite communication unit
NASA Technical Reports Server (NTRS)
Suzuki, Ryutaro; Ikegami, Tetsushi; Hamamoto, Naokazu; Taguchi, Tetsu; Endo, Nobuhiro; Yamamoto, Osamu; Ichiyoshi, Osamu
1988-01-01
A compact 210(W) x 280(H) x 330(D) mm mobile terminal capable of transmitting voice and data through L-band mobile satellites is described. The Voice Codec can convert an analog voice to or from digital codes at rates of 9.6, 8 and 4.8 kb/s by an MPC algorithm. The terminal functions with a single 12 V power supplied vehicle battery. The equipment can operate at any L-band frequency allocated for mobile uses in a full duplex mode and will soon be put into a field test via Japans's ETS-V satellite.
Ueno, Sanae; Okumura, Eiichi; Remijn, Gerard B; Yoshimura, Yuko; Kikuchi, Mitsuru; Shitamichi, Kiyomi; Nagao, Kikuko; Mochiduki, Masayuki; Haruta, Yasuhiro; Hayashi, Norio; Munesue, Toshio; Tsubokawa, Tsunehisa; Oi, Manabu; Nakatani, Hideo; Higashida, Haruhiro; Minabe, Yoshio
2012-05-02
Accurate perception of fundamental frequency (F0) contour changes in the human voice is important for understanding a speaker's intonation, and consequently also his/her attitude. In this study, we investigated the neural processes involved in the perception of F0 contour changes in the Japanese one-syllable interjection "ne" in 21 native-Japanese listeners. A passive oddball paradigm was applied in which "ne" with a high falling F0 contour, used when urging a reaction from the listener, was randomly presented as a rare deviant among a frequent "ne" syllable with a flat F0 contour (i.e., meaningless intonation). We applied an adaptive spatial filtering method to the neuromagnetic time course recorded by whole-head magnetoencephalography (MEG) and estimated the spatiotemporal frequency dynamics of event-related cerebral oscillatory changes in the oddball paradigm. Our results demonstrated a significant elevation of beta band event-related desynchronization (ERD) in the right temporal and frontal areas, in time windows from 100 to 300 and from 300 to 500 ms after the onset of deviant stimuli (high falling F0 contour). This is the first study to reveal detailed spatiotemporal frequency characteristics of cerebral oscillations during the perception of intonational (not lexical) F0 contour changes in the human voice. The results further confirmed that the right hemisphere is associated with perception of intonational F0 contour information in the human voice, especially in early time windows. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Zhang, Zhaoyan
2016-01-01
The goal of this study is to better understand the cause-effect relation between vocal fold physiology and the resulting vibration pattern and voice acoustics. Using a three-dimensional continuum model of phonation, the effects of changes in vocal fold stiffness, medial surface thickness in the vertical direction, resting glottal opening, and subglottal pressure on vocal fold vibration and different acoustic measures are investigated. The results show that the medial surface thickness has dominant effects on the vertical phase difference between the upper and lower margins of the medial surface, closed quotient, H1-H2, and higher-order harmonics excitation. The main effects of vocal fold approximation or decreasing resting glottal opening are to lower the phonation threshold pressure, reduce noise production, and increase the fundamental frequency. Increasing subglottal pressure is primarily responsible for vocal intensity increase but also leads to significant increase in noise production and an increased fundamental frequency. Increasing AP stiffness significantly increases the fundamental frequency and slightly reduces noise production. The interaction among vocal fold thickness, stiffness, approximation, and subglottal pressure in the control of F0, vocal intensity, and voice quality is discussed. PMID:27106298
The "Overdrive" Mode in the "Complete Vocal Technique": A Preliminary Study.
Sundberg, Johan; Bitelli, Maddalena; Holmberg, Annika; Laaksonen, Ville
2017-09-01
"Complete Vocal Technique," or CVT, is an internationally widespread method for teaching voice. It classifies voicing into four types, referred to as "vocal modes," one of which is called "Overdrive." The physiological correlates of these types are unclear. This study presents an attempt to analyze its voice source and formant frequency characteristics. A male and a female expert of CVT sang a set of "Overdrive" and falsetto tones on the syllable /pᴂ/. The voice source could be analyzed by inverse filtering in the case of the male subject. Results showed that subglottal pressure, measured as the oral pressure during /p/ occlusion, was low in falsetto and high in "Overdrive", and it was strongly correlated with each of the voice source parameters. These correlations could be described in terms of equations. The deviations from these equations of the different voice source parameters for the various voice samples suggested that "Overdrive" phonation was produced with stronger vocal fold adduction than the falsetto tones. Further, the subject was also found to tune the first formant to the second partial in "Overdrive" tones. The results support the conclusion that the method used, to compensate for the influence of subglottal pressure on the voice source, seems promising to use for analyses of other CVT vocal modes and also for other types of phonation. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Effects of chemoradiotherapy on voice and swallowing
Lazarus, Cathy L.
2009-01-01
Purpose of review Chemotherapy has been found to result in comparable survival rates to surgery for head and neck cancer. However, toxicity can often be worse after chemoradiotherapy, with impairment in voice, swallowing, nutrition, and quality of life. Investigators are attempting to modify radiotherapy treatment regimens to spare organs that have an impact on swallowing. This review will highlight voice and swallowing impairment seen after chemoradiotherapy, as well as treatment for voice and swallowing disorders in this population. Results of newer radiotherapy regimens will also be highlighted. Recent findings Specific oropharyngeal swallowing motility disorders after chemoradiotherapy have been identified. Damage to specific structures has been correlated with specific pharyngeal phase swallow impairment. Swallowing function and quality of life have been examined over time, with improvement seen in both. Preventive/prophylactic swallow exercise programs have been encouraging. Chemoradiotherapy effects on voice have been identified in terms of acoustic, aerodynamic, and patient and clinician-rated perception of function. Improvement in voice has also been observed over time after chemoradiotherapy. Voice therapy has been found to have a positive impact on voice and perceptual measures in this population. Summary Current studies show some improvement in swallow function after swallow and voice therapy in patients treated with chemoradiotherapy. Further, there is a suggestion of improved swallow function with sparing of organs with specific radiotherapy protocols. Future research needs to focus on specific voice and swallow treatment regimens in the head and neck cancer patient treated with chemoradiotherapy, specifically, timing, frequency, duration, and specific treatment types. PMID:19337126
The Human Voice in Speech and Singing
NASA Astrophysics Data System (ADS)
Lindblom, Björn; Sundberg, Johan
This chapter
The Human Voice in Speech and Singing
NASA Astrophysics Data System (ADS)
Lindblom, Björn; Sundberg, Johan
This chapter describes various aspects of the human voice as a means of communication in speech and singing. From the point of view of function, vocal sounds can be regarded as the end result of a three stage process: (1) the compression of air in the respiratory system, which produces an exhalatory airstream, (2) the vibrating vocal folds' transformation of this air stream to an intermittent or pulsating air stream, which is a complex tone, referred to as the voice source, and (3) the filtering of this complex tone in the vocal tract resonator. The main function of the respiratory system is to generate an overpressure of air under the glottis, or a subglottal pressure. Section 16.1 describes different aspects of the respiratory system of significance to speech and singing, including lung volume ranges, subglottal pressures, and how this pressure is affected by the ever-varying recoil forces. The complex tone generated when the air stream from the lungs passes the vibrating vocal folds can be varied in at least three dimensions: fundamental frequency, amplitude and spectrum. Section 16.2 describes how these properties of the voice source are affected by the subglottal pressure, the length and stiffness of the vocal folds and how firmly the vocal folds are adducted. Section 16.3 gives an account of the vocal tract filter, how its form determines the frequencies of its resonances, and Sect. 16.4 gives an account for how these resonance frequencies or formants shape the vocal sounds by imposing spectrum peaks separated by spectrum valleys, and how the frequencies of these peaks determine vowel and voice qualities. The remaining sections of the chapter describe various aspects of the acoustic signals used for vocal communication in speech and singing. The syllable structure is discussed in Sect. 16.5, the closely related aspects of rhythmicity and timing in speech and singing is described in Sect. 16.6, and pitch and rhythm aspects in Sect. 16.7. The impressive control of all these acoustic characteristics of vocal signals is discussed in Sect. 16.8, while Sect. 16.9 considers expressive aspects of vocal communication.
Bolbol, Sarah A; Zalat, Marwa M; Hammam, Rehab A M; Elnakeb, Nasser L
2017-03-01
Even though many studies have explored the problem of voice disorders among teachers worldwide, this problem is still not adequately studied in Egypt. The following study was conducted to investigate the risk factors of voice disorders among an Egyptian sample of school teachers, to measure the effect of a vocal hygiene awareness program on them, and to investigate their vocal cord lesions. One hundred fifty-six teachers working in public schools and 180 administrative workers in the Faculty of Medicine in the same city participated in this study. They completed a self-administered questionnaire investigating voice disorders, and were subjected to a voice awareness program and a clinical examination. Voice-related symptoms and Voice Handicap Index were statistically significantly higher among teachers compared with the control subjects. Work duration and high frequency of classes per week of ≥15 were the most statistically significant indicators influencing a teacher's voice. Three months after application of vocal hygiene awareness program, the teachers who were studied showed a statistically significant increase in their awareness about vocal hygiene tips. Egyptian teachers working in public schools are dealing with classes that include a great number of students per class. They also have to deal with unprofessional facilities and limited assisting resources. Therefore, they are highly exposed to the risk of voice-related disorders. Increasing awareness about healthy behavior with the voice in their occupations will help in improving their quality of work and in minimizing any permanent impairments and/or disability. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
The Moderating Effect of Frequent Singing on Voice Aging.
Lortie, Catherine L; Rivard, Julie; Thibeault, Mélanie; Tremblay, Pascale
2017-01-01
The effects of aging on voice production are well documented, including changes in loudness, pitch, and voice quality. However, one important and clinically relevant question that remains concerns the possibility that the aging of voice can be prevented or at least delayed through noninvasive methods. Indeed, discovering natural means to preserve the integrity of the human voice throughout aging could have a major impact on the quality of life of elderly adults. The objective of this study was therefore to examine the potentially positive effect of singing on voice production. To this aim, a group of 72 healthy nonsmoking adults (20-93 years old) was recruited and separated into three groups based on their singing habits. Several voice parameters were assessed (fundamental frequency [f0] mean, f0 standard deviation [SD], f0 minimum and f0 maximum, mean amplitude and amplitude SD, jitter, shimmer, and harmonic-to-noise ratio) during the sustained production of vowel /a/. Other parameters were assessed during standardized reading passage (speaking f0, speaking f0 SD). As was expected, age effects were found on most acoustic parameters with significant sex differences. Importantly, moderation analyses revealed that frequent singing moderates the effect of aging on most acoustic parameters. Specifically, in frequent singers, there was no decrease in the stability of pitch and amplitude with age, suggesting that the voice of frequent singers remains more stable in aging than the voice of non-singers, and more generally, providing empirical evidence for a positive effect of singing on voice in aging. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Relationship Between Voice and Motor Disabilities of Parkinson's Disease.
Majdinasab, Fatemeh; Karkheiran, Siamak; Soltani, Majid; Moradi, Negin; Shahidi, Gholamali
2016-11-01
To evaluate voice of Iranian patients with Parkinson's disease (PD) and find any relationship between motor disabilities and acoustic voice parameters as speech motor components. We evaluated 27 Farsi-speaking PD patients and 21 age- and sex-matched healthy persons as control. Motor performance was assessed by the Unified Parkinson's Disease Rating Scale part III and Hoehn and Yahr rating scale in the "on" state. Acoustic voice evaluation, including fundamental frequency (f0), standard deviation of f0, minimum of f0, maximum of f0, shimmer, jitter, and harmonic to noise ratio, was done using the Praat software via /a/ prolongation. No difference was seen between the voice of the patients and the voice of the controls. f0 and its variation had a significant correlation with the duration of the disease, but did not have any relationships with the Unified Parkinson's Disease Rating Scale part III. Only limited relationship was observed between voice and motor disabilities. Tremor is an important main feature of PD that affects motor and phonation systems. Females had an older age at onset, more prolonged disease, and more severe motor disabilities (not statistically significant), but phonation disorders were more frequent in males and showed more relationship with severity of motor disabilities. Voice is affected by PD earlier than many other motor components and is more sensitive to disease progression. Tremor is the most effective part of PD that impacts voice. PD has more effect on voice of male versus female patients. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Signal Frequency Spectra with Audacity®
ERIC Educational Resources Information Center
Gailey, Alycia
2015-01-01
The primary objective of the activity presented here is to allow students to explore the frequency components of various simple signals, with the ultimate goal of teaching them how to remove unwanted noise from a voice signal. Analysis of the frequency components of a signal allows students to design filters that remove unwanted components of a…
Ultrasonic speech translator and communications system
DOE Office of Scientific and Technical Information (OSTI.GOV)
Akerman, M.A.; Ayers, C.W.; Haynes, H.D.
1996-07-23
A wireless communication system undetectable by radio frequency methods for converting audio signals, including human voice, to electronic signals in the ultrasonic frequency range, transmitting the ultrasonic signal by way of acoustical pressure waves across a carrier medium, including gases, liquids, or solids, and reconverting the ultrasonic acoustical pressure waves back to the original audio signal. The ultrasonic speech translator and communication system includes an ultrasonic transmitting device and an ultrasonic receiving device. The ultrasonic transmitting device accepts as input an audio signal such as human voice input from a microphone or tape deck. The ultrasonic transmitting device frequency modulatesmore » an ultrasonic carrier signal with the audio signal producing a frequency modulated ultrasonic carrier signal, which is transmitted via acoustical pressure waves across a carrier medium such as gases, liquids or solids. The ultrasonic receiving device converts the frequency modulated ultrasonic acoustical pressure waves to a frequency modulated electronic signal, demodulates the audio signal from the ultrasonic carrier signal, and conditions the demodulated audio signal to reproduce the original audio signal at its output. 7 figs.« less
Ultrasonic speech translator and communications system
Akerman, M. Alfred; Ayers, Curtis W.; Haynes, Howard D.
1996-01-01
A wireless communication system undetectable by radio frequency methods for converting audio signals, including human voice, to electronic signals in the ultrasonic frequency range, transmitting the ultrasonic signal by way of acoustical pressure waves across a carrier medium, including gases, liquids, or solids, and reconverting the ultrasonic acoustical pressure waves back to the original audio signal. The ultrasonic speech translator and communication system (20) includes an ultrasonic transmitting device (100) and an ultrasonic receiving device (200). The ultrasonic transmitting device (100) accepts as input (115) an audio signal such as human voice input from a microphone (114) or tape deck. The ultrasonic transmitting device (100) frequency modulates an ultrasonic carrier signal with the audio signal producing a frequency modulated ultrasonic carrier signal, which is transmitted via acoustical pressure waves across a carrier medium such as gases, liquids or solids. The ultrasonic receiving device (200) converts the frequency modulated ultrasonic acoustical pressure waves to a frequency modulated electronic signal, demodulates the audio signal from the ultrasonic carrier signal, and conditions the demodulated audio signal to reproduce the original audio signal at its output (250).
Full Duplex, Spread Spectrum Radio System
NASA Technical Reports Server (NTRS)
Harvey, Bruce A.
2000-01-01
The goal of this project was to support the development of a full duplex, spread spectrum voice communications system. The assembly and testing of a prototype system consisting of a Harris PRISM spread spectrum radio, a TMS320C54x signal processing development board and a Zilog Z80180 microprocessor was underway at the start of this project. The efforts under this project were the development of multiple access schemes, analysis of full duplex voice feedback delays, and the development and analysis of forward error correction (FEC) algorithms. The multiple access analysis involved the selection between code division multiple access (CDMA), frequency division multiple access (FDMA) and time division multiple access (TDMA). Full duplex voice feedback analysis involved the analysis of packet size and delays associated with full loop voice feedback for confirmation of radio system performance. FEC analysis included studies of the performance under the expected burst error scenario with the relatively short packet lengths, and analysis of implementation in the TMS320C54x digital signal processor. When the capabilities and the limitations of the components used were considered, the multiple access scheme chosen was a combination TDMA/FDMA scheme that will provide up to eight users on each of three separate frequencies. Packets to and from each user will consist of 16 samples at a rate of 8,000 samples per second for a total of 2 ms of voice information. The resulting voice feedback delay will therefore be 4 - 6 ms. The most practical FEC algorithm for implementation was a convolutional code with a Viterbi decoder. Interleaving of the bits of each packet will be required to offset the effects of burst errors.
Do Women's Voices Provide Cues of the Likelihood of Ovulation? The Importance of Sampling Regime
Fischer, Julia; Semple, Stuart; Fickenscher, Gisela; Jürgens, Rebecca; Kruse, Eberhard; Heistermann, Michael; Amir, Ofer
2011-01-01
The human voice provides a rich source of information about individual attributes such as body size, developmental stability and emotional state. Moreover, there is evidence that female voice characteristics change across the menstrual cycle. A previous study reported that women speak with higher fundamental frequency (F0) in the high-fertility compared to the low-fertility phase. To gain further insights into the mechanisms underlying this variation in perceived attractiveness and the relationship between vocal quality and the timing of ovulation, we combined hormone measurements and acoustic analyses, to characterize voice changes on a day-to-day basis throughout the menstrual cycle. Voice characteristics were measured from free speech as well as sustained vowels. In addition, we asked men to rate vocal attractiveness from selected samples. The free speech samples revealed marginally significant variation in F0 with an increase prior to and a distinct drop during ovulation. Overall variation throughout the cycle, however, precluded unequivocal identification of the period with the highest conception risk. The analysis of vowel samples revealed a significant increase in degree of unvoiceness and noise-to-harmonic ratio during menstruation, possibly related to an increase in tissue water content. Neither estrogen nor progestogen levels predicted the observed changes in acoustic characteristics. The perceptual experiments revealed a preference by males for voice samples recorded during the pre-ovulatory period compared to other periods in the cycle. While overall we confirm earlier findings in that women speak with a higher and more variable fundamental frequency just prior to ovulation, the present study highlights the importance of taking the full range of variation into account before drawing conclusions about the value of these cues for the detection of ovulation. PMID:21957453
Satellite switched FDMA advanced communication technology satellite program
NASA Technical Reports Server (NTRS)
Atwood, S.; Higton, G. H.; Wood, K.; Kline, A.; Furiga, A.; Rausch, M.; Jan, Y.
1982-01-01
The satellite switched frequency division multiple access system provided a detailed system architecture that supports a point to point communication system for long haul voice, video and data traffic between small Earth terminals at Ka band frequencies at 30/20 GHz. A detailed system design is presented for the space segment, small terminal/trunking segment at network control segment for domestic traffic model A or B, each totaling 3.8 Gb/s of small terminal traffic and 6.2 Gb/s trunk traffic. The small terminal traffic (3.8 Gb/s) is emphasized, for the satellite router portion of the system design, which is a composite of thousands of Earth stations with digital traffic ranging from a single 32 Kb/s CVSD voice channel to thousands of channels containing voice, video and data with a data rate as high as 33 Mb/s. The system design concept presented, effectively optimizes a unique frequency and channelization plan for both traffic models A and B with minimum reorganization of the satellite payload transponder subsystem hardware design. The unique zoning concept allows multiple beam antennas while maximizing multiple carrier frequency reuse. Detailed hardware design estimates for an FDMA router (part of the satellite transponder subsystem) indicate a weight and dc power budget of 353 lbs, 195 watts for traffic model A and 498 lbs, 244 watts for traffic model B.
Trends in musical theatre voice: an analysis of audition requirements for singers.
Green, Kathryn; Freeman, Warren; Edwards, Matthew; Meyer, David
2014-05-01
The American musical theatre industry is a multibillion dollar business in which the requirements for singers are varied and complex. This study identifies the musical genres and voice requirements that are currently most requested at professional auditions to help voice teachers, pedagogues, and physicians who work with musical theatre singers understand the demands of their clients' business. Frequency count. One thousand two thirty-eight professional musical theatre audition listings were gathered over a 6-month period, and information from each listing was categorized and entered into a spreadsheet for analysis. The results indicate that four main genres of music were requested over a wide variety of styles, with more than half of auditions requesting genre categories that may not be served by traditional or classical voice technique alone. To adequately prepare young musical theatre performers for the current job market and keep the performers healthily making the sounds required by the industry, new singing styles may need to be studied and integrated into voice training that only teaches classical styles. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Prevalence of Hearing Loss in Teachers of Singing and Voice Students.
Isaac, Mitchell J; McBroom, Deanna H; Nguyen, Shaun A; Halstead, Lucinda A
2017-05-01
Singers and voice teachers are exposed to a range of noise levels during a normal working day. This study aimed to assess the hearing thresholds in a large sample of generally healthy professional voice teachers and voice students to determine the prevalence of hearing loss in this population. A cross-sectional study was carried out. Voice teachers and vocal students had the option to volunteer for a hearing screening of six standard frequencies in a quiet room with the Shoebox audiometer (Clearwater Clinical Limited) and to fill out a brief survey. Data were analyzed for the prevalence and severity of hearing loss in teachers and students based on several parameters assessed in the surveys. All data were analyzed using Microsoft Excel (Microsoft Corp.) and SPSS Statistics Software (IBM Corp.). A total of 158 participants were included: 58 self-identified as voice teachers, 106 as voice students, and 6 as both. The 6 participants who identified as both, were included in both categories for statistical purposes. Of the 158 participants, 36 had some level of hearing loss: 51.7% of voice teachers had hearing loss, and 7.5% of voice students had hearing loss. Several parameters of noise exposure were found to positively correlate with hearing loss and tinnitus (P < 0.05). Years as a voice teacher and age were both predictors of hearing loss (P < 0.05). Hearing loss in a cohort of voice teachers appears to be more prevalent and severe than previously thought. There is a significant association between years teaching and hearing loss. Raising awareness in this population may prompt teachers and students to adopt strategies to protect their hearing. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Sielska-Badurek, Ewelina M; Sobol, Maria; Olszowska, Katarzyna; Niemczyk, Kazimierz
2017-10-03
The purpose of this study was to assess the voice quality and the vocal tract function in popular singing students at the beginning of their singing training at the High School of Music. This is a retrospective cross-sectional study. The study consisted of 45 popular singing students (35 females and 10 males, mean age: 19.9 ± 2.8 years). They were assessed in the first 2 months of their 4-year singing training at the High School of Music, between 2013 and 2016. Voice quality and vocal tract function were evaluated using videolaryngostroboscopy, palpation of the vocal tract structures, the perceptual speaking and singing voice assessment, acoustic analysis, maximal phonation time, the Voice Handicap Index, and the Singing Voice Handicap Index (SVHI). Twenty-two percent of Contemporary Commercial Music singing students began their education in the High School, with vocal nodules. Palpation of the vocal tract structure showed in 50% correct motions and tension in speaking and in 39.3% in singing. Perceptual voice assessment showed in 80% proper speaking voice quality and in 82.4% proper singing voice quality. The mean vocal fundamental frequency while speaking in females was 214 Hz and in males was 116 Hz. Dysphonia Severity Index was at the level of 2, and maximum phonation time was 17.7 seconds. The Voice Handicap Index and the SVHI remained within the normal range: 7.5 and 19, respectively. Perceptual singing voice assessment correlated with the SVHI (P = 0.006). Twenty-two percent of the Contemporary Commercial Music singing students began their education in the High School, with organic vocal fold lesions. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Validation of the Voice of America Coverage Analysis Program (VOACAP)
2013-02-01
related to receiver characteristics or man-made noise, for example. Parameters such as Transmitter Frequency, Receiver Latitude /Longitude and Sunspot... Transmitter > <Frequency Unit="MHz">10</Frequency> <Type>0</Type> < Latitude Unit="deg">32</ Latitude ... Transmitter > <Frequency Unit="MHz">10.00</Frequency> <Type>0</Type> < Latitude Unit="deg">32</ Latitude
An integrated voice and data multiple-access scheme for a land-mobile satellite system
NASA Technical Reports Server (NTRS)
Li, V. O. K.; Yan, T.-Y.
1984-01-01
An analytical study is performed of the satellite requirements for a land mobile satellite system (LMSS). The spacecraft (MSAT-X) would be in GEO and would be compatible with multiple access by mobile radios and antennas and fixed stations. The FCC has received a petition from NASA to reserve the 821-825 and 866-870 MHz frequencies for the LMSS, while communications with fixed earth stations would be in the Ku band. MSAT-X transponders would alter the frequencies of signal and do no processing in the original configuration considered. Channel use would be governed by an integrated demand-assigned, multiple access protocol, which would divide channels into reservation and information channels, governed by a network management center. Further analyses will cover tradeoffs between data and voice users, probability of blocking, and the performance impacts of on-board switching and variable bandwidth assignment. Initial calculations indicate that a large traffic volume can be handled with acceptable delays and voice blocking probabilities.
An integrated voice and data multiple-access scheme for a land-mobile satellite system
NASA Astrophysics Data System (ADS)
Li, V. O. K.; Yan, T.-Y.
1984-11-01
An analytical study is performed of the satellite requirements for a land mobile satellite system (LMSS). The spacecraft (MSAT-X) would be in GEO and would be compatible with multiple access by mobile radios and antennas and fixed stations. The FCC has received a petition from NASA to reserve the 821-825 and 866-870 MHz frequencies for the LMSS, while communications with fixed earth stations would be in the Ku band. MSAT-X transponders would alter the frequencies of signal and do no processing in the original configuration considered. Channel use would be governed by an integrated demand-assigned, multiple access protocol, which would divide channels into reservation and information channels, governed by a network management center. Further analyses will cover tradeoffs between data and voice users, probability of blocking, and the performance impacts of on-board switching and variable bandwidth assignment. Initial calculations indicate that a large traffic volume can be handled with acceptable delays and voice blocking probabilities.
NASA Astrophysics Data System (ADS)
Fredouille, Corinne; Pouchoulin, Gilles; Ghio, Alain; Revis, Joana; Bonastre, Jean-François; Giovanni, Antoine
2009-12-01
This paper addresses voice disorder assessment. It proposes an original back-and-forth methodology involving an automatic classification system as well as knowledge of the human experts (machine learning experts, phoneticians, and pathologists). The goal of this methodology is to bring a better understanding of acoustic phenomena related to dysphonia. The automatic system was validated on a dysphonic corpus (80 female voices), rated according to the GRBAS perceptual scale by an expert jury. Firstly, focused on the frequency domain, the classification system showed the interest of 0-3000 Hz frequency band for the classification task based on the GRBAS scale. Later, an automatic phonemic analysis underlined the significance of consonants and more surprisingly of unvoiced consonants for the same classification task. Submitted to the human experts, these observations led to a manual analysis of unvoiced plosives, which highlighted a lengthening of VOT according to the dysphonia severity validated by a preliminary statistical analysis.
Numerical analysis of effects of transglottal pressure change on fundamental frequency of phonation.
Deguchi, Shinji; Matsuzaki, Yuji; Ikeda, Tadashige
2007-02-01
In humans, a decrease in transglottal pressure (Pt) causes an increase in the fundamental frequency of phonation (F0) only at a specific voice pitch within the modal register, the mechanism of which remains unclear. In the present study, numerical analyses were performed to investigate the mechanism of the voice pitch-dependent positive change of F0 due to Pt decrease. The airflow and the airway, including the vocal folds, were modeled in terms of mechanics of fluid and structure. Simulations of phonation using the numerical model indicated that Pt affects both the average position and the average amplitude magnitude of vocal fold self-excited oscillation in a non-monotonous manner. This effect results in voice pitch-dependent responses of F0 to Pt decreases, including the positive response of F0 as actually observed in humans. The findings of the present study highlight the importance of considering self-excited oscillation of the vocal folds in elucidation of the phonation mechanism.
Thermal welding vs. cold knife tonsillectomy: a comparison of voice and speech.
Celebi, Saban; Yelken, Kursat; Celik, Oner; Taskin, Umit; Topak, Murat
2011-01-01
To compare acoustic, aerodynamic and perceptual voice and speech parameters in thermal welding system tonsillectomy and cold knife tonsillectomy patients in order to determine the impact of operation technique on voice and speech. Thirty tonsillectomy patients (22 children, 8 adults) participated in this study. The preferred technique was cold knife tonsillectomy in 15 patients and thermal welding system tonsillectomy in the remaining 15 patients. One week before and 1 month after surgery the following parameters were estimated: average of fundamental frequency, Jitter, Shimmer, harmonic to noise ratio, formant frequency analyses of sustained vowels. Perceptual speech analysis and aerodynamic measurements (maximum phonation time and s/z ratio) were also conducted. There was no significant difference in any of the parameters between cold knife tonsillectomy and thermal welding system tonsillectomy groups (p>0.05). When the groups were contrasted among themselves with regards to preoperative and postoperative rates, fundamental frequency was found to be significantly decreased after tonsillectomy in both of the groups (p<0.001). First formant for the vowel /a/ in the cold knife tonsillectomy group and for the vowel /i/ in the thermal welding system tonsillectomy group, second formant for the vowel /u/ in the thermal welding system tonsillectomy group and third formant for the vowel /u/ in the cold knife tonsillectomy group were found to be significantly decreased (p<0.05). The surgical technique, whether it is cold knife or thermal welding system, does not appear to affect voice and speech in tonsillectomy patients. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
The eye-voice span during reading aloud
Laubrock, Jochen; Kliegl, Reinhold
2015-01-01
Although eye movements during reading are modulated by cognitive processing demands, they also reflect visual sampling of the input, and possibly preparation of output for speech or the inner voice. By simultaneously recording eye movements and the voice during reading aloud, we obtained an output measure that constrains the length of time spent on cognitive processing. Here we investigate the dynamics of the eye-voice span (EVS), the distance between eye and voice. We show that the EVS is regulated immediately during fixation of a word by either increasing fixation duration or programming a regressive eye movement against the reading direction. EVS size at the beginning of a fixation was positively correlated with the likelihood of regressions and refixations. Regression probability was further increased if the EVS was still large at the end of a fixation: if adjustment of fixation duration did not sufficiently reduce the EVS during a fixation, then a regression rather than a refixation followed with high probability. We further show that the EVS can help understand cognitive influences on fixation duration during reading: in mixed model analyses, the EVS was a stronger predictor of fixation durations than either word frequency or word length. The EVS modulated the influence of several other predictors on single fixation durations (SFDs). For example, word-N frequency effects were larger with a large EVS, especially when word N-1 frequency was low. Finally, a comparison of SFDs during oral and silent reading showed that reading is governed by similar principles in both reading modes, although EVS maintenance and articulatory processing also cause some differences. In summary, the EVS is regulated by adjusting fixation duration and/or by programming a regressive eye movement when the EVS gets too large. Overall, the EVS appears to be directly related to updating of the working memory buffer during reading. PMID:26441800
Reliability of human-supervised formant-trajectory measurement for forensic voice comparison.
Zhang, Cuiling; Morrison, Geoffrey Stewart; Ochoa, Felipe; Enzinger, Ewald
2013-01-01
Acoustic-phonetic approaches to forensic voice comparison often include human-supervised measurement of vowel formants, but the reliability of such measurements is a matter of concern. This study assesses the within- and between-supervisor variability of three sets of formant-trajectory measurements made by each of four human supervisors. It also assesses the validity and reliability of forensic-voice-comparison systems based on these measurements. Each supervisor's formant-trajectory system was fused with a baseline mel-frequency cepstral-coefficient system, and performance was assessed relative to the baseline system. Substantial improvements in validity were found for all supervisors' systems, but some supervisors' systems were more reliable than others.
Gender and vocal production mode discrimination using the high frequencies for speech and singing
Monson, Brian B.; Lotto, Andrew J.; Story, Brad H.
2014-01-01
Humans routinely produce acoustical energy at frequencies above 6 kHz during vocalization, but this frequency range is often not represented in communication devices and speech perception research. Recent advancements toward high-definition (HD) voice and extended bandwidth hearing aids have increased the interest in the high frequencies. The potential perceptual information provided by high-frequency energy (HFE) is not well characterized. We found that humans can accomplish tasks of gender discrimination and vocal production mode discrimination (speech vs. singing) when presented with acoustic stimuli containing only HFE at both amplified and normal levels. Performance in these tasks was robust in the presence of low-frequency masking noise. No substantial learning effect was observed. Listeners also were able to identify the sung and spoken text (excerpts from “The Star-Spangled Banner”) with very few exposures. These results add to the increasing evidence that the high frequencies provide at least redundant information about the vocal signal, suggesting that its representation in communication devices (e.g., cell phones, hearing aids, and cochlear implants) and speech/voice synthesizers could improve these devices and benefit normal-hearing and hearing-impaired listeners. PMID:25400613
Broniatowski, Michael; Grundfest-Broniatowski, Sharon; Tucker, Harvey M; Tyler, Dustin J
2007-02-01
We hypothesized that voice may be artificially manipulated to ameliorate dystonias considered to be a failure in dynamic integration between competing neuromuscular systems. Orderly intrinsic laryngeal muscle recruitment by anodal block via the recurrent laryngeal and vagus nerves has allowed us to define specific values based on differential excitabilities, but has precluded voice fluency because of focused breaks during stimulation and the need to treat several neural conduits. Such problems may be obviated by a circuit capable of stimulating some axons while simultaneously blocking others in the recurrent laryngeal nerve, which carries innervation to all intrinsic laryngeal muscles, including the arguably intrinsic cricothyroideus. In 5 dogs, both recurrent laryngeal nerves received 40-Hz quasi-trapezoidal pulses (0 to 2000 microA, 0 to 2000 micros, 0 to 500 micros decay) via tripolar electrodes. Electromyograms were matched with audio intensities and fundamental frequencies recorded under a constant flow of humidified air. Data were digitized and evaluated for potential correlations. Orderly recruitment of the thyroarytenoideus, posterior cricoarytenoideus, and cricothyroideus was correlated with stimulating intensities (p < .001), and posterior cricoarytenoideus opposition to the thyroarytenoideus and cricothyroideus was instrumental in manipulating audio intensities and fundamental frequencies. Manipulation of canine voice parameters appears feasible via the sole recurrent laryngeal nerve within appropriate stimulation envelopes, and offers promise in human laryngeal dystonias.
Vocal fundamental and formant frequencies affect perceptions of speaker cooperativeness.
Knowles, Kristen K; Little, Anthony C
2016-01-01
In recent years, the perception of social traits in faces and voices has received much attention. Facial and vocal masculinity are linked to perceptions of trustworthiness; however, while feminine faces are generally considered to be trustworthy, vocal trustworthiness is associated with masculinized vocal features. Vocal traits such as pitch and formants have previously been associated with perceived social traits such as trustworthiness and dominance, but the link between these measurements and perceptions of cooperativeness have yet to be examined. In Experiment 1, cooperativeness ratings of male and female voices were examined against four vocal measurements: fundamental frequency (F0), pitch variation (F0-SD), formant dispersion (Df), and formant position (Pf). Feminine pitch traits (F0 and F0-SD) and masculine formant traits (Df and Pf) were associated with higher cooperativeness ratings. In Experiment 2, manipulated voices with feminized F0 were found to be more cooperative than voices with masculinized F0(,) among both male and female speakers, confirming our results from Experiment 1. Feminine pitch qualities may indicate an individual who is friendly and non-threatening, while masculine formant qualities may reflect an individual that is socially dominant or prestigious, and the perception of these associated traits may influence the perceived cooperativeness of the speakers.
Acoustic changes in student actors' voices after 12 months of training.
Walzak, Peta; McCabe, Patricia; Madill, Cate; Sheard, Christine
2008-05-01
This study was to evaluate acoustic changes in student actors' voices after 12 months of actor training. The design used was a longitudinal study. Eighteen students enrolled in an Australian tertiary 3-year acting program (nine male and nine female) were assessed at the beginning of their acting course and again 12 months later using a questionnaire, interview, maximum phonation time (MPT), reading, spontaneous speaking, sustained phonation tasks, and a pitch range task. Samples were analyzed for MPT, fundamental frequency across tasks, pitch range for speaking and reading, singing pitch range, noise-to-harmonic ratio, shimmer, and jitter. After training, measures of shimmer significantly increased for both male and female participants. Female participants' pitch range significantly increased after training, with a significantly lower mean frequency for their lowest pitch. The finding of limited or negative changes for some measures indicate that further investigation is required into the long-term effects of actor voice training and which parameters of voicing are most targeted and valued in training. Particular investigation into the relationship between training targets and outcomes could more reliably inform acting programs about changes in teaching methodologies. Further research into the relationship between specific training techniques, physiological changes, and vocal changes may also provide information on implementing more evidence-based training methods.
Voice Habits and Behaviors: Voice Care Among Flamenco Singers.
Garzón García, Marina; Muñoz López, Juana; Y Mendoza Lara, Elvira
2017-03-01
The purpose of this study is to analyze the vocal behavior of flamenco singers, as compared with classical music singers, to establish a differential vocal profile of voice habits and behaviors in flamenco music. Bibliographic review was conducted, and the Singer's Vocal Habits Questionnaire, an experimental tool designed by the authors to gather data regarding hygiene behavior, drinking and smoking habits, type of practice, voice care, and symptomatology perceived in both the singing and the speaking voice, was administered. We interviewed 94 singers, divided into two groups: the flamenco experimental group (FEG, n = 48) and the classical control group (CCG, n = 46). Frequency analysis, a Likert scale, and discriminant and exploratory factor analysis were used to obtain a differential profile for each group. The FEG scored higher than the CCG in speaking voice symptomatology. The FEG scored significantly higher than the CCG in use of "inadequate vocal technique" when singing. Regarding voice habits, the FEG scored higher in "lack of practice and warm-up" and "environmental habits." A total of 92.6% of the subjects classified themselves correctly in each group. The Singer's Vocal Habits Questionnaire has proven effective in differentiating flamenco and classical singers. Flamenco singers are exposed to numerous vocal risk factors that make them more prone to vocal fatigue, mucosa dehydration, phonotrauma, and muscle stiffness than classical singers. Further research is needed in voice training in flamenco music, as a means to strengthen the voice and enable it to meet the requirements of this musical genre. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Flow Glottogram Characteristics and Perceived Degree of Phonatory Pressedness.
Millgård, Moa; Fors, Tobias; Sundberg, Johan
2016-05-01
Phonatory pressedness is a clinically relevant aspect of voice, which generally is analyzed by auditory perception. The present investigation aimed at identifying voice source and formant characteristics related to experts' ratings of phonatory pressedness. Experimental study of the relations between visual analog scale ratings of phonatory pressedness and voice source parameters in healthy voices. Audio, electroglottogram, and subglottal pressure, estimated from oral pressure during /p/ occlusion, were recorded from five female and six male subjects, each of whom deliberately varied phonation type between neutral, flow, and pressed in the syllable /pae/, produced at three loudness levels and three pitches. Speech-language pathologists rated, along a visual analog scale, the degree of perceived phonatory pressedness in these samples. The samples were analyzed by means of inverse filtering with regard to closed quotient, dominance of the voice source fundamental, normalized amplitude quotient, peak-to-peak flow amplitude, as well as formant frequencies and the alpha ratio of spectrum energy above and below 1000 Hz. The results were compared with the rating data, which showed that the ratings were closely related to voice source parameters. Approximately, 70% of the variance of the ratings could be explained by the voice source parameters. A multiple linear regression analysis suggested that perceived phonatory pressedness is related most closely to subglottal pressure, closed quotient, and the two lowest formants. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Role of timbre and fundamental frequency in voice gender adaptation.
Skuk, Verena G; Dammann, Lea M; Schweinberger, Stefan R
2015-08-01
Prior adaptation to male (or female) voices causes androgynous voices to be perceived as more female (or male). Using a selective adaptation paradigm the authors investigate the relative impact of the vocal fold vibration rate (F0) and timbre (operationally in this paper as characteristics that differentiate two voices of the same F0 and loudness) on this basic voice gender aftereffect. TANDEM-STRAIGHT was used to morph between 10 pairs of male and female speakers uttering 2 different vowel-consonant-vowel sequences (20 continua). Adaptor stimuli had one parameter (either F0 or timbre) set at a clearly male or female level, while the other parameter was set at an androgynous level, as determined by an independent set of listeners. Compared to a control adaptation condition (in which both F0 and timbre were clearly male or female), aftereffects were clearly reduced in both F0 and timbre adaptation conditions. Critically, larger aftereffects were found after timbre adaptation (comprising androgynous F0) compared to F0 adaptation (comprising an androgynous timbre). Together these results suggest that timbre plays a larger role than F0 in voice gender adaptation. Finally, the authors found some evidence that individual differences among listeners reflect in part pre-experimental contact to male and female voices.
The interaction of tone with voicing and foot structure: evidence from Kera phonetics and phonology
NASA Astrophysics Data System (ADS)
Pearce, Mary Dorothy
This thesis uses acoustic measurements as a basis for the phonological analysis of the interaction of tone with voicing and foot structure in Kera (a Chadic language). In both tone spreading and vowel harmony, the iambic foot acts as a domain for spreading. Further evidence for the foot comes from measurements of duration, intensity and vowel quality. Kera is unusual in combining a tone system with a partially independent metrical system based on iambs. In words containing more than one foot, the foot is the tone bearing unit (TBU), but in shorter words, the TBU is the syllable. In perception and production experiments, results show that Kera speakers, unlike English and French, use the fundamental frequency as the principle cue to 'Voicing" contrast. Voice onset time (VOT) has only a minor role. Historically, tones probably developed from voicing through a process of tonogenesis, but synchronically, the feature voice is no longer contrastive and VOT is used in an enhancing role. Some linguists have claimed that Kera is a key example for their controversial theory of long-distance voicing spread. But as voice is not part of Kera phonology, this thesis gives counter-evidence to the voice spreading claim. An important finding from the experiments is that the phonological grammars are different between village women, men moving to town and town men. These differences are attributed to French contact. The interaction between Kera tone and voicing and contact with French have produced changes from a 2-way voicing contrast, through a 3-way tonal contrast, to a 2-way voicing contrast plus another contrast with short VOT. These diachronic and synchronic tone/voicing facts are analysed using laryngeal features and Optimality Theory. This thesis provides a body of new data, detailed acoustic measurements, and an analysis incorporating current theoretical issues in phonology, which make it of interest to Africanists and theoreticians alike.
Reetz, Stephanie; Bohlender, Joerg E; Brockmann-Bauser, Meike
2018-01-29
The validity and sensitivity to change of instrumental acoustic measurements in patients with functional dysphonia have been controversially discussed. This work examines combined voice therapy effects on standard acoustic measurements, and if these agree with perceptual and subjective voice outcomes. Retrospective study. Thirty-nine patients (26 women, 13 men) aged 20-70 years (mean: 46.3, standard deviation 12.8) with functional dysphonia were investigated before and after combined voice therapy. Instrumental parameters included mean and range of speaking fundamental frequency (f o ) and intensity (SPL (dBA)); maximum SPL and mean f o of calling voice; minimum, maximum, range of singing voice f o and SPL, jitter (%), and the Dysphonia Severity Index. Voice Handicap Index-9 international was used for subjective and Grading-Roughness-Breathiness-Asthenia-Strain scale for perceptual assessment. Differences were investigated by Wilcoxon signed ranks test and coherences by Spearman rank correlation coefficient. After treatment, the speaking voice f o range (7-8.13 semitones) and SPL range (12.9-14.85 dB(A)) were significantly larger (P < 0.05). Both parameters were highly correlated (P < 0.001). Subjective symptoms were significantly reduced from a mean Voice Handicap Index-9 international of 15.6-8.6, and all perceptual Grading-Roughness-Breathiness-Asthenia-Strain scale parameters were significantly improved (G: 1.05-0.51) after therapy (P < 0.05). These findings were not associated with any acoustic parameter (P > 0.05). Significantly improved subjective and perceptual findings verify positive combined voice therapy effects in patients with functional dysphonia. The larger f o and SPL speaking voice range after treatment indicate an altered voice technique. These instrumental measures may be clinical indicators of therapy success and transfer effects. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
The singing/acting mature adult--singing instruction perspective.
Westerman Gregg, J
1997-06-01
Complete knowledge of anatomy and physiology of the vocal mechanism and tract is essential for the voice teacher to be maximally effective. Possible contributing factors to vocal attrition in the mature singer/actor are outlined: poor posture, inadequate respiratory function, lack of adequate hydration, phonatory hyperfunction, habitual speaking pitch at too low a frequency, lack of resonance, tongue tension affecting phonation, resonation, and articulation. Techniques for rehabilitation of the damaged voice are recommended.
Knez Ambrožič, Mojca; Hočevar Boltežar, Irena; Ihan Hren, Nataša
2015-09-01
Skeletal anterior open bite (AOB) or apertognathism is characterized by the absence of contact of the anterior teeth and affects articulation parameters, chewing, biting and voice quality. The treatment of AOB consists of orthognatic surgical procedures. The aim of this study was to evaluate the effects of treatment on voice quality, articulation and nasality in speech with respect to skeletal changes. The study was prospective; 15 patients with AOB were evaluated before and after surgery. Lateral cephalometric x-ray parameters (facial angle, interincisal distance, Wits appraisal) were measured to determine skeletal changes. Before surgery, nine patients still had articulation disorders despite speech therapy during childhood. The voice quality parameters were determined by acoustic analysis of the vowel sound /a/ (fundamental frequency-F0, jitter, shimmer). Spectral analysis of vowels /a/, /e/, /i/, /o/, /u/ was carried out by determining the mean frequency of the first (F1) and second (F2) formants. Nasality in speech was expressed as the ratio between the nasal and the oral sound energies during speech samples. After surgery, normalizations of facial skeletal parameters were observed in all patients, but no statistically significant changes in articulation and voice quality parameters occurred despite subjective observations of easier articulation. Any deterioration in velopharyngeal insufficiency was absent in all of the patients. In conclusion, the surgical treatment of skeletal AOB does not lead to deterioration in voice, resonance and articulation qualities. Despite surgical correction of the unfavourable skeletal situation of the speech apparatus, the pre-existing articulation disorder cannot improve without professional intervention.
Effects of adventitious acute vocal trauma: Relative fundamental frequency and listener perception
Murray, Elizabeth Heller; Hands, Gabrielle L.; Calabrese, Carolyn R.; Stepp, Cara E.
2015-01-01
Objective High voice users (individuals who demonstrate excessive or loud vocal use) are at risk for developing voice disorders. The objective of this study was to examine, both acoustically and perceptually, vocal changes in healthy speakers following an acute period of high voice use. Methods Members of a university women’s volleyball team (N=12) were recorded a week prior (Pre) and week following (Post) the 10-week spring season; N=6 control speakers were recorded over the same time period for comparison. Speakers read four sentences, which were analyzed for relative fundamental frequency (RFF). Eight naïve listeners participated in an auditory-perceptual visual sort and rate (VSR) task, in which they rated each voice sample’s overall severity and strain. Results No significant differences were found as a function of time point in the VSR ratings for the volleyball group. Onset cycle 1 RFF values were significantly lower (p = 0.04) in the Post recordings of the volleyball participants compared to Pre recordings, but there was no significant difference (p = 0.20) in offset cycle 10 RFF values. Receiver operating characteristic analyses indicated moderate sensitivity and specificity of onset cycle 1 RFF for discrimination between the volleyball and control participants. Changes were not apparent in the control group as a function of time for either, onset cycle 1 RFF, offset cycle 10 FF, or either vocal attribute. Conclusion Onset cycle 1 RFF may be an effective marker for detecting vocal changes over an acute high voice use period of time before perceptual changes are noted. PMID:26028369
Speech adjustments for room acoustics and their effects on vocal effort
Bottalico, Pasquale
2016-01-01
Objectives The aims of the present study are: (1) to analyze the effects of the acoustical environment and the voice style on time dose (Dt_p,) and fundamental frequency (mean fo and standard deviation std_fo), while taking into account the effect of short term vocal fatigue; (2) to predict the self-reported vocal effort from the voice acoustical parameters. Methods Ten male and ten female subjects were recorded while reading a text in normal and loud styles, in three rooms - anechoic, semi-reverberant and reverberant –with and without acrylic glass panels 0.5 m from the mouth, which increased external auditory feedback. Subjects quantified how much effort was required to speak in each condition on a visual analogue scale after each task. Results (Aim1) In the loud style, Dt_p, fo and std_fo increased. The Dt_p was higher in the reverberant room compared to the other two rooms. Both genders tended to increase fo in less reverberant environments, while a more monotonous speech was produced in rooms with greater reverberation. All three voice parameters increased with short-term vocal fatigue. (Aim2) A model of the vocal effort to acoustic vocal parameters is proposed. The SPL (Sound Pressure Level) contributed to 66% of the variance explained by the model, followed by the fundamental frequency (30%) and the modulation in amplitude (4%). Conclusions The results provide insight into how voice acoustical parameters can predict vocal effort. In particular, it increased when SPL and fo increased and when the amplitude voice modulation (std_ΔSPL) decreased. PMID:28029555
Teachers' voice use in teaching environments: a field study using ambulatory phonation monitor.
Lyberg Åhlander, Viveka; Pelegrín García, David; Whitling, Susanna; Rydell, Roland; Löfqvist, Anders
2014-11-01
This case-control designed field study examines the vocal behavior in teachers with self-estimated voice problems (VP) and their age- and school-matched voice healthy (VH) colleagues. It was hypothesized that teachers with and teachers without VP use their voices differently regarding fundamental frequency, sound pressure level (SPL), and in relation to the background noise. Teachers with self-estimated VP (n = 14; two males and 12 females) were age and gender matched to VH school colleagues (n = 14; two males and 12 females). The subjects, recruited from an earlier study, had been examined in laryngeal, vocal, hearing, and psychosocial aspects. The fundamental frequency, SPL, and phonation time were recorded with an Ambulatory Phonation Monitor during one representative workday. The teachers reported their activities in a structured diary. The SPL (including teachers' and students' activity and ambient noise) was recorded with a sound level meter; the room temperature and air quality were measured simultaneously. The acoustic properties of the empty classrooms were measured. Teachers with VP behaved vocally different from their VH peers, in particular during teaching sessions. The phonation time was significantly higher in the group with VP, and the number of vibratory cycles differed between the female teachers. The F0 pattern, related to the vocal SPL and room acoustics, differed between the groups. The results suggest a different vocal behavior in subjects with subjective VP and a higher vocal load with fewer possibilities for vocal recovery. Copyright © 2014 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
When the Eyes No Longer Lead: Familiarity and Length Effects on Eye-Voice Span
Silva, Susana; Reis, Alexandra; Casaca, Luís; Petersson, Karl M.; Faísca, Luís
2016-01-01
During oral reading, the eyes tend to be ahead of the voice (eye-voice span, EVS). It has been hypothesized that the extent to which this happens depends on the automaticity of reading processes, namely on the speed of print-to-sound conversion. We tested whether EVS is affected by another automaticity component – immunity from interference. To that end, we manipulated word familiarity (high-frequency, low-frequency, and pseudowords, PW) and word length as proxies of immunity from interference, and we used linear mixed effects models to measure the effects of both variables on the time interval at which readers do parallel processing by gazing at word N + 1 while not having articulated word N yet (offset EVS). Parallel processing was enhanced by automaticity, as shown by familiarity × length interactions on offset EVS, and it was impeded by lack of automaticity, as shown by the transformation of offset EVS into voice-eye span (voice ahead of the offset of the eyes) in PWs. The relation between parallel processing and automaticity was strengthened by the fact that offset EVS predicted reading velocity. Our findings contribute to understand how the offset EVS, an index that is obtained in oral reading, may tap into different components of automaticity that underlie reading ability, oral or silent. In addition, we compared the duration of the offset EVS with the average reference duration of stages in word production, and we saw that the offset EVS may accommodate for more than the articulatory programming stage of word N. PMID:27853446
Great talent, excellent voices-no problem for pubertal girls?
Decoster, Wivine; Ghesquiere, Sofie; Van Steenberge, Sebastiaan
2008-01-01
This research on 17 girls (aged 9;9 y to 16;11 y) singing in an established choir was focused on two issues: 1) the variety in physical and vocal development using Gackle's model, and 2) the matching of vocal demands and abilities. Developmental and acoustical data on the speaking and singing voice revealed considerable variation between individual girl singers. The model was greatly applicable. However, all girls had a greater total singing range, mainly in favour of the lower tones, and 11 girls used a lower speaking fundamental frequency. A third of the girls met the vocal and developmental features of their stage at a younger age. Next the lower limit of the frequency range of all girls was several semitones below the lowest notes of the pieces being worked on at the time of the experiment. However the upper limit of the pieces coincided with or exceeded their upper frequency limit.
Li, Jianwen; Li, Yan; Zhang, Ming; Ma, Weifang; Ma, Xuezong
2014-01-01
The current use of hearing aids and artificial cochleas for deaf-mute individuals depends on their auditory nerve. Skin-hearing technology, a patented system developed by our group, uses a cutaneous sensory nerve to substitute for the auditory nerve to help deaf-mutes to hear sound. This paper introduces a new solution, multi-channel-array skin-hearing technology, to solve the problem of speech discrimination. Based on the filtering principle of hair cells, external voice signals at different frequencies are converted to current signals at corresponding frequencies using electronic multi-channel bandpass filtering technology. Different positions on the skin can be stimulated by the electrode array, allowing the perception and discrimination of external speech signals to be determined by the skin response to the current signals. Through voice frequency analysis, the frequency range of the band-pass filter can also be determined. These findings demonstrate that the sensory nerves in the skin can help to transfer the voice signal and to distinguish the speech signal, suggesting that the skin sensory nerves are good candidates for the replacement of the auditory nerve in addressing deaf-mutes’ hearing problems. Scientific hearing experiments can be more safely performed on the skin. Compared with the artificial cochlea, multi-channel-array skin-hearing aids have lower operation risk in use, are cheaper and are more easily popularized. PMID:25317171
Physical constraints of cultural evolution of dialects in killer whales.
Filatova, Olga A; Samarra, Filipa I P; Barrett-Lennard, Lance G; Miller, Patrick J O; Ford, John K B; Yurk, Harald; Matkin, Craig O; Hoyt, Erich
2016-11-01
Odontocete sounds are produced by two pairs of phonic lips situated in soft nares below the blowhole; the right pair is larger and is more likely to produce clicks, while the left pair is more likely to produce whistles. This has important implications for the cultural evolution of delphinid sounds: the greater the physical constraints, the greater the probability of random convergence. In this paper the authors examine the call structure of eight killer whale populations to identify structural constraints and to determine if they are consistent among all populations. Constraints were especially pronounced in two-voiced calls. In the calls of all eight populations, the lower component of two-voiced (biphonic) calls was typically centered below 4 kHz, while the upper component was typically above that value. The lower component of two-voiced calls had a narrower frequency range than single-voiced calls in all populations. This may be because some single-voiced calls are homologous to the lower component, while others are homologous to the higher component of two-voiced calls. Physical constraints on the call structure reduce the possible variation and increase the probability of random convergence, producing similar calls in different populations.
Motorcycle Start-stop System based on Intelligent Biometric Voice Recognition
NASA Astrophysics Data System (ADS)
Winda, A.; E Byan, W. R.; Sofyan; Armansyah; Zariantin, D. L.; Josep, B. G.
2017-03-01
Current mechanical key in the motorcycle is prone to bulgary, being stolen or misplaced. Intelligent biometric voice recognition as means to replace this mechanism is proposed as an alternative. The proposed system will decide whether the voice is belong to the user or not and the word utter by the user is ‘On’ or ‘Off’. The decision voice will be sent to Arduino in order to start or stop the engine. The recorded voice is processed in order to get some features which later be used as input to the proposed system. The Mel-Frequency Ceptral Coefficient (MFCC) is adopted as a feature extraction technique. The extracted feature is the used as input to the SVM-based identifier. Experimental results confirm the effectiveness of the proposed intelligent voice recognition and word recognition system. It show that the proposed method produces a good training and testing accuracy, 99.31% and 99.43%, respectively. Moreover, the proposed system shows the performance of false rejection rate (FRR) and false acceptance rate (FAR) accuracy of 0.18% and 17.58%, respectively. In the intelligent word recognition shows that the training and testing accuracy are 100% and 96.3%, respectively.
An evaluation of voice stress analysis techniques in a simulated AWACS environment
NASA Astrophysics Data System (ADS)
Jones, William A., Jr.
1990-08-01
The purpose was to determine if voice analysis algorithms are an effective measure of stress resulting from high workload. Fundamental frequency, frequency jitter, and amplitude shimmer algorithms were employed to measure the effects of stress in crewmember communications data in simulated AWACS mission scenarios. Two independent workload measures were used to identify levels of stress: a predictor model developed by the simulation author based upon scenario generated stimulus events; and the duration of communication for each weapons director, representative of the individual's response to the induced stress. Between eight and eleven speech samples were analyzed for each of the sixteen Air Force officers who participated in the study. Results identified fundamental frequency and frequency jitter as statistically significant vocal indicators of stress, while amplitude shimmer showed no signs of any significant relationship with workload or stress. Consistent with previous research, the frequency algorithm was identified as the most reliable measure. However, the results did not reveal a sensitive discrimination measure between levels of stress, but rather, did distinguish between the presence or absence of stress. The results illustrate a significant relationship between fundamental frequency and the effects of stress and also a significant inverse relationship with jitter, though less dramatic.
Pencina, Karol M.; Coady, Jeffry A.; Beleva, Yusnie M.; Bhasin, Shalender; Basaria, Shehzad
2015-01-01
Objective: To determine dose-dependent effects of T administration on voice changes in women with low T levels. Methods: Seventy-one women who have undergone a hysterectomy with or without oophorectomy with total T < 31 ng/dL and/or free T < 3.5 pg/mL received a standardized transdermal estradiol regimen during the 12-week run-in period and were then randomized to receive weekly im injections of placebo or 3, 6.25, 12.5, or 25 mg T enanthate for 24 weeks. Total and free T levels were measured by liquid chromatography-tandem mass spectrometry and equilibrium dialysis, respectively. Voice handicap was measured by self-report using a validated voice handicap index questionnaire at baseline and 24 weeks after intervention. Functional voice testing was performed using the Kay Elemetrics-Computer Speech Lab to determine voice frequency, volume, and harmonics. Results: Forty-six women with evaluable voice data at baseline and after intervention were included in the analysis. The five groups were similar at baseline. Mean on-treatment nadir total T concentrations were 13, 83, 106, 122, and 250 ng/dL in the placebo, 3-, 6.25-, 12.5-, and 25-mg groups, respectively. Analyses of acoustic voice parameters revealed significant lowering of average pitch in the 12.5- and 25-mg dose groups compared to placebo (P < .05); these changes in pitch were significantly related to increases in T concentrations. No significant dose- or concentration-dependent changes in self-reported voice handicap index scores were observed. Conclusion: Testosterone administration in women with low T levels over 24 weeks was associated with dose- and concentration-dependent decreases in average pitch in the higher dose groups. These changes were seen despite the lack of self-reported changes in voice. PMID:25875779
Vocal Fold Mucus Aggregation in Persons with Voice Disorders
Bonilha, Heather Shaw; White, Lisa; Kuckhahn, Kelsey; Gerlach, Terri Treman; Deliyski, Dimitar D.
2012-01-01
Mucus aggregation on the vocal folds is a common finding from laryngeal endoscopy. Patients with voice disorders report the presence of mucus aggregation. Patients also report that mucus aggregation causes them to clear their throat, a behavior believed to be harmful to vocal fold mucosa. Even though clinicians and patients report and discuss mucus aggregation, we have a limited understanding of mucus aggregation in persons with voice disorders. The primary goal of this study was to provide an initial assessment of the presence and features of mucus aggregation in persons with voice disorders. The secondary goal of this study was to determine if there are differences in mucus aggregation between persons with and without voice disorders. To address these goals, four features of mucus aggregation were judged from laryngeal endoscopy recordings from 54 speakers with voice disorders and compared to judgments of these same features in persons without voice disorders. The results from this study showed: (1) 100% of dysphonic speakers had visible mucus aggregation on their vocal folds. (2) Persons with hyperfunctional voice disorders had different mucus characteristics than persons with hypofunctional voice disorders (p=0.002). (3) Dysphonic speakers did not differ in frequency of mucus identified on the vocal folds than non-dysphonic speakers. However, the two groups had different mucus characteristics (p=0.001). Future studies are warranted to determine if these differences in mucus aggregation between persons with and without voice disorders relate to specific aspects of laryngeal pathology or patient characteristics, such as age and gender. Once we understand these relationships, we may be able to use this information to improve our diagnosis and treatment of patients with atypical laryngeal mucus aggregation. PMID:22510352
Huang, Grace; Pencina, Karol M; Coady, Jeffry A; Beleva, Yusnie M; Bhasin, Shalender; Basaria, Shehzad
2015-06-01
To determine dose-dependent effects of T administration on voice changes in women with low T levels. Seventy-one women who have undergone a hysterectomy with or without oophorectomy with total T < 31 ng/dL and/or free T < 3.5 pg/mL received a standardized transdermal estradiol regimen during the 12-week run-in period and were then randomized to receive weekly im injections of placebo or 3, 6.25, 12.5, or 25 mg T enanthate for 24 weeks. Total and free T levels were measured by liquid chromatography-tandem mass spectrometry and equilibrium dialysis, respectively. Voice handicap was measured by self-report using a validated voice handicap index questionnaire at baseline and 24 weeks after intervention. Functional voice testing was performed using the Kay Elemetrics-Computer Speech Lab to determine voice frequency, volume, and harmonics. Forty-six women with evaluable voice data at baseline and after intervention were included in the analysis. The five groups were similar at baseline. Mean on-treatment nadir total T concentrations were 13, 83, 106, 122, and 250 ng/dL in the placebo, 3-, 6.25-, 12.5-, and 25-mg groups, respectively. Analyses of acoustic voice parameters revealed significant lowering of average pitch in the 12.5- and 25-mg dose groups compared to placebo (P < .05); these changes in pitch were significantly related to increases in T concentrations. No significant dose- or concentration-dependent changes in self-reported voice handicap index scores were observed. Testosterone administration in women with low T levels over 24 weeks was associated with dose- and concentration-dependent decreases in average pitch in the higher dose groups. These changes were seen despite the lack of self-reported changes in voice.
Voice amplification as a means of reducing vocal load for elementary music teachers.
Morrow, Sharon L; Connor, Nadine P
2011-07-01
Music teachers are over four times more likely than classroom teachers to develop voice disorders and greater than eight times more likely to have voice-related problems than the general public. Research has shown that individual voice-use parameters of phonation time, fundamental frequency and vocal intensity, as well as vocal load as calculated by cycle dose and distance dose are significantly higher for music teachers than their classroom teacher counterparts. Finding effective and inexpensive prophylactic measures to decrease vocal load for music teachers is an important aspect for voice preservation for this group of professional voice users. The purpose of this study was to determine the effects of voice amplification on vocal intensity and vocal load in the workplace as measured using a KayPENTAX Ambulatory Phonation Monitor (APM) (KayPENTAX, Lincoln Park, NJ). Seven music teachers were monitored for 1 workweek using an APM to determine average vocal intensity (dB sound pressure level [SPL]) and vocal load as calculated by cycle dose and distance dose. Participants were monitored a second week while using a voice amplification unit (Asyst ChatterVox; Asyst Communications Company, Inc., Indian Creek, IL). Significant decreases in mean vocal intensity of 7.00-dB SPL (P<0.001) were found using amplification, along with significant decreases (P=0.001) in cycle dose and distance dose. In addition, mean phonation time was found to decrease using amplification (P=0.023). These data suggest that voice amplification may be an effective intervention to decrease the potentially damaging vocal loads experienced by elementary music teachers in the classroom. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
[Phonatory airflow in the supraglottal space].
Müsebeck, K; Rosenberg, H
1983-05-01
The phonatory airflow can be measured by means of a hot wire tube placed in the supraglottic space without tying down the tongue. The velocity of airflow above the glottis reaches values around c = 50 to 150 cm/s. The variations in airflow oscillations were recorded. The voice of the person under examination was picked up by a condenser microphone (Bruel & Kjaer No. 2112). According to D'Alembert's wave equation, the sound intensity is related to the velocity of the phonatory air stream. The validity of this statement has been confirmed by repeated testing. The fundamental frequency of voice and of the airflow were analysed synchronously by means of the Nicolet analyser. The air consumption is not utilized for sound production in phonation by breathing. A "hard" or "pressed" voice is associated with diminished or irregular air consumption. The method can be employed in assessing the conditions of phonetic airflow in normal and dysphonic voices.
Baker, J
1999-12-01
Four women aged between 27 and 58 years sought otolaryngological examination due to significant alterations to their voices, the primary concerns being hoarseness in vocal quality, lowering of habitual pitch, difficulty projecting their speaking voices, and loss of control over their singing voices. Otolaryngological examination with a mirror or flexible laryngoscope revealed no apparent abnormality of vocal fold structure or function, and the women were referred for speech pathology with diagnoses of functional dysphonia. Objective acoustic measures using the Kay Visipitch indicated significant lowering of the mean fundamental frequency for each woman, and perceptual analysis of the patients' voices during quiet speaking, projected voice use, and comprehensive singing activities revealed a constellation of features typically noted in the pubescent male. The original diagnoses of a functional dysphonia were queried, prompting further exploration of each woman's medical history, revealing in each case onset of vocal symptoms shortly after commencing treatment for conditions with medications containing virilizing agents (eg, Danocrine (danazol), Deca-Durabolin (nandrolene decanoate), and testosterone). Although some of the vocal symptoms decreased in severity with the influences from 6 months voice therapy and after withdrawal from the drugs, a number of symptoms remained permanent, suggesting each subject had suffered significant alterations in vocal physiology, including muscle tissue changes, muscle coordination dysfunction, and propioceptive dysfunction. This retrospective study is presented in order to illustrate that it was both the projected speaking voice and the singing voice that proved so highly sensitive to the virilization effects. The implications for future prospective research studies and responsible clinical practice are discussed.
Vocal parameters and voice-related quality of life in adult women with and without ovarian function.
Ferraz, Pablo Rodrigo Rocha; Bertoldo, Simão Veras; Costa, Luanne Gabrielle Morais; Serra, Emmeliny Cristini Nogueira; Silva, Eduardo Magalhães; Brito, Luciane Maria Oliveira; Chein, Maria Bethânia da Costa
2013-05-01
To identify the perceptual and acoustic parameters of voice in adult women with and without ovarian function and its impact on quality of life related to voice. Cross-sectional and analytical study with 106 women divided into, two groups: G1, with ovarian function (n=43) and G2, without physiological ovarian function (n=63). The women were instructed to sustain the vowel "a" and the sounds of /s/ and /z/ in habitual pitch and loudness. They were also asked to classify their voices and answer the voice-related quality of life (V-RQOL) questionnaire. The perceptual analysis of the vocal samples was performed by three speech-language pathologists using the GRBASI (G: grade; R: roughness; B: breathness; A: asthenia; S: strain; I: instability) scale. The acoustic analysis was carried out with the software VoxMetria 2.7h (CTS Informatica). The data were analyzed using descriptive statistics. In the perceptual analysis, both groups showed a mild deviation for the parameters roughness, strain, and instability, but only G2 showed a mild impact for the overall degree of dysphonia. The mean of fundamental frequency was significantly lower for the G2, with a difference of 17.41Hz between the two groups. There was no impact on V-RQOL in any of the V-RQOL domains for this group. With the menopause, there is a change in women's voices, impacting on some voice parameters. However, there is no direct impact on their quality of life related to voice. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Casado, Juan C; Rodríguez-Parra, María J; Adrián, José A
2017-04-01
The objective of this study was to evaluate the medium-term results of Wendler's glottoplasty surgery (WG) and the effects of post-operative voice therapy in a group of male-to-female transsexuals. This is a retrospective study of 18 transsexuals who voluntarily underwent WG between 2010 and 2014 at a single hospital. Ten of the subjects underwent an additional voice therapy training. The group was assessed pre- vs. post-treatments with a limited battery of measures consisting of fundamental frequency (Fo), maximum phonation time, the TSEQ transgender self-assessment questionnaire, and perceptual assessment of the voice (Visual Analog Scale and a simplified version of the classical Hirano-GRBAS scale) by inter-rater agreement. The surgical procedure consisted of a de-epithelialization of the anterior third of both vocal folds; this area was sutured, and the surface of both vocal folds was vaporized with a laser diode. The results showed a significant increase in vocal tone and feminization of voice in all participants, including a significant increase in Fo 12 months after treatment. Significant improvements were also shown in other evaluated measures, such as self-reported satisfaction and the degree of feminization of the voice. However, no improvements in maximum phonation time were observed. The use of voice therapy appears decisive for optimal improvement of this class of patients. WG applied appropriately by well-trained hands is thus a very effective and less traumatic procedure than other techniques that aim for an acceptable feminization of the voice in MtoF transgendered clients.
Effects of Phonetic Context on Relative Fundamental Frequency
ERIC Educational Resources Information Center
Lien, Yu-An S.; Gattuccio, Caitlin I.; Stepp, Cara E.
2014-01-01
Purpose: The effect of phonetic context on relative fundamental frequency (RFF) was examined, in order to develop stimuli sets with minimal within-speaker variability that can be implemented in future clinical protocols. Method: Sixteen speakers with healthy voices produced RFF stimuli. Uniform utterances consisted of 3 repetitions of the same…
Influence of complaints and singing style in singers voice handicap.
Moreti, Felipe; Ávila, Maria Emília Barros de; Rocha, Clara; Borrego, Maria Cristina de Menezes; Oliveira, Gisele; Behlau, Mara
2012-01-01
The aim of this research was to verify whether the difference of singing styles and the presence of vocal complaints influence the perception of voice handicap of singers. One hundred eighteen singing voice handicap self-assessment protocols were selected: 17 popular singers with vocal complaints, 42 popular singers without complaints, 17 classic singers with complaints, and 42 classic singers without complaints. The groups were similar regarding age, gender and voice types. Both protocols used--Modern Singing Handicap Index (MSHI) and Classical Singing Handicap Index (CSHI)--have specific questions to their respective singing styles, and consist of 30 items equally divided into three subscales: disability (functional domain), handicap (emotional domain) and impairment (organic domain), answered according to the frequency of occurrence. Each subscale has a maximum of 40 points, and the total score is 120 points. The higher the score, the higher the singing voice handicap perceived. For statistical analysis, we used the ANOVA test, with 5% of significance. Classical and popular singers referred higher impairment, followed by disability and handicap. However, the degree of this perception varied according to the singing style and the presence of vocal complaints. The classical singers with vocal complaints showed higher voice handicap than popular singers with vocal complaints, while the classic singers without complaints reported lower handicap than popular singers without complaints. This evidences that classical singers have higher perception of their own voice, and that vocal disturbances in this group may cause greater voice handicap when compared to popular singers.
Master, Suely; Guzman, Marco; Azócar, Maria Josefina; Muñoz, Daniel; Bortnem, Cori
2015-05-01
The present study aimed to compare actors/actresses's voices and vocally trained subjects through aerodynamic and electroglottographic (EGG) analyses. We hypothesized that glottal and breathing functions would reflect technical and physiological differences between vocally trained and untrained subjects. Forty participants with normal voices participated in this study (20 professional theater actors and 20 untrained participants). In each group, 10 male and 10 female subjects were assessed. All participants underwent aerodynamic and EGG assessment of voice. From the Phonatory Aerodynamic System, three protocols were used: comfortable sustained phonation with EGG, voice efficiency with EGG, and running speech. Contact quotient was calculated from EGG. All phonatory tasks were produced at three different loudness levels. Mean sound pressure level and fundamental frequency were also assessed. Univariate, multivariate, and correlation statistical analyses were performed. Main differences between vocally trained and untrained participants were found in the following variables: mean sound pressure level, phonatory airflow, subglottic pressure, inspiratory airflow duration, inspiratory airflow, and inspiratory volume. These variables were greater for trained participants. Mean pitch was found to be lower for trained voices. The glottal source seemed to have a weak contribution when differentiating the training status in speaking voice. More prominent changes between vocally trained and untrained participants are demonstrated in respiratory-related variables. These findings may be related to better management of breathing function (better breath support). Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Bell, Imogen H; Fielding-Smith, Sarah F; Hayward, Mark; Rossell, Susan L; Lim, Michelle H; Farhall, John; Thomas, Neil
2018-05-02
Smartphone-based ecological momentary assessment and intervention (EMA/I) show promise for enhancing psychological treatments for psychosis. EMA has the potential to improve assessment and formulation of experiences which fluctuate day-to-day, and EMI may be used to prompt use of therapeutic strategies in daily life. The current study is an examination of these capabilities in the context of a brief, coping-focused intervention for distressing voice hearing experiences. This is a rater-blinded, pilot randomised controlled trial comparing a four-session intervention in conjunction with use of smartphone EMA/I between sessions, versus treatment-as-usual. The recruitment target is 34 participants with persisting and distressing voice hearing experiences, recruited through a Voices Clinic based in Melbourne, Australia, and via wider advertising. Allocation will be made using minimisation procedure, balancing of the frequency of voices between groups. Assessments are completed at baseline and 8 weeks post-baseline. The primary outcomes of this trial will focus on feasibility and acceptability of the intervention and trial methodology, with secondary outcomes examining preliminary clinical effects related to overall voice severity, the emotional and functional impact of the voices, and emotional distress. This study offers a highly novel examination of specific smartphone capabilities and their integration with traditional psychological treatment for distressing voices. Such technology has potential to enhance psychological interventions and promote adaptation to distressing experiences. Australian New Zealand Clinical Trial Registry, ACTRN12617000348358 . Registered on 7 March 2017.
Effects of voice-sparing cricotracheal resection on phonation in women.
Tanner, Kristine; Dromey, Christopher; Berardi, Mark L; Mattei, Lisa M; Pierce, Jenny L; Wisco, Jonathan J; Hunter, Eric J; Smith, Marshall E
2017-09-01
Individuals with idiopathic subglottic stenosis (SGS) are at risk for voice disorders prior to and following surgical management. This study examined the nature and severity of voice disorders in patients with SGS before and after a revised cricotracheal resection (CTR) procedure designed to minimize adverse effects on voice function. Eleven women with idiopathic SGS provided presurgical and postsurgical audio recordings. Voice Handicap Index (VHI) scores were also collected. Cepstral, signal-to-noise, periodicity, and fundamental frequency (F 0 ) analyses were undertaken for connected speech and sustained vowel samples. Listeners made auditory-perceptual ratings of overall quality and monotonicity. Paired samples statistical analyses revealed that mean F 0 decreased from 215 Hz (standard deviation [SD] = 40 Hz) to 201 Hz (SD = 65 Hz) following surgery. In general, VHI scores decreased after surgery. Voice disorder severity based on the Cepstral Spectral Index of Dysphonia (KayPentax, Montvale, NJ) for sustained vowels decreased (improved) from 41 (SD = 41) to 25 (SD = 21) points; no change was observed for connected speech. Semitone SD (2.2 semitones) did not change from pre- to posttreatment. Auditory-perceptual ratings demonstrated similar results. These preliminary results indicate that this revised CTR procedure is promising in minimizing adverse voice effects while offering a longer-term surgical outcome for SGS. Further research is needed to determine causal factors for pretreatment voice disorders, as well as to optimize treatments in this population. 4. Laryngoscope, 127:2085-2092, 2017. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Electroglottogram waveform types of untrained speakers.
Painter, C
1990-01-01
Electroglottography is a useful, non-invasive technique that can assist in the assessment of vocal fold dysfunction. However, if it is to become a useful clinical tool, there is a need for normative studies of the electroglottogram waveform types that characterize different groups of speakers. This report compares the electroglottogram waveform types characterizing one trained professional voice user phonating in 15 experimental sessions under various fundamental frequencies, intensities and voice qualities with those obtained from 52 untrained non-professional speakers.
Age-related changes in long-term average spectra of children's voices.
Sergeant, Desmond; Welch, Graham Frederick
2008-11-01
This paper forms part of a larger study into the nature of singing development in children. The focus here is on an investigation of age-related changes in long-term average spectra (LTAS). Three hundred and twenty children in age groups 4-11 years learned a song. Each child was then digitally recorded singing alone. LTAS curves were calculated from the recordings of each voice and perceived age was estimated by a panel of independent judges. Progressive statistically significant changes were observed in the LTAS as a function of increasing age of the children. These took the form of increases in spectral energy in all frequencies below 5.75 kHz, with concomitant reductions of energy in frequency regions above this point. Increases with age were also found in overall intensity levels of the vocal products. Four experienced listeners audited the voice samples and made estimates of the children's ages. The level of accuracy of age-estimates was remarkably high for children in the youngest age groups, but was reduced with voice samples from older children. Maturation and developing competence of the vocal system, both in growth of lung capacity and at a laryngeal level, are implicated in the generation of age-related spectral changes. Perceived child singer age appears to be less closely related to spectral characteristics (as defined within LTAS) with increasing age of children.
Satellite voice broadcast system study, volume 2
NASA Technical Reports Server (NTRS)
Horstein, M.
1985-01-01
This study investigates the feasibility of providing Voice of America (VOA) broadcasts by satellite relay, rather than via terrestrial relay stations. Satellite voice broadcast systems are described for three different frequency bands: HF (26 MHz), VHF (68 MHz), and L-band (1.5 GHz). The geographical areas of interest at HF and L-band include all major land masses worldwide with the exception of the U.S., Canada, and Australia. Geostationary satellite configurations are considered for both frequency bands. In addition, a system of subsynchronous, circular satellites with an orbit period of 8 hours is developed for the HF band. VHF broadcasts, which are confined to the Soviet Union, are provied by a system of Molniya satellites. Satellites intended for HF or VHF broadcastinbg are extremely large and heavy. Satellite designs presented here are limited in size and weight to the capability of the STS/Centaur launch vehicle combination. Even so, at HF it would take 47 geostationary satellites or 20 satellites in 8-hour orbits to fully satisfy the voice-channel requirements of the broadcast schedule provided by VOA. On the other hand, three Molniya satellites suffice for the geographically restricted schedule at VHF. At L-band, only four geostationary satellites are needed to meet the requirements of the complete broadcast schedule. Moreover, these satellites are comparable in size and weight to current satellites designed for direct broadcast of video program material.
Jayakumar, T; Savithri, S R
2012-01-01
Dysphonia Severity Index (DSI) is a widely used multiparametric approach to objectively quantify the voice quality. Few research groups have investigated the test-retest, interobserver variability, and influence of age and gender. They have also verified the application of DSI in various voice rehabilitation conditions. However, all these studies have been conducted on European population. There is a possibility of variation in the basic parameters of DSI across geographical and ethnic groups. Hence, the present study evaluated DSI in Indian population. One hundred twenty voluntary participants (60 males, 60 females) who had G(0) on the Grade, Roughness, Breathiness, Aesthenia, Strain (GRBAS) scale participated in the study (age range of 18-25 years, M=21.8, standard deviation=2.7). Maximum phonation time (MPT), frequency intensity, and jitter measurements were made using CSL 4500 (Kay Elemetrics, Pine Brook, NJ). Results showed noticeable difference between Indian and European population on MPT, Highest frequency (F(0)-High), and DSI values. Significant gender difference was also observed on MPT and F(0)-High. Test-retest reliability showed >95% for all the parameters. The MPT decrement lead to a reduction in the overall DSI value in both the genders. These results of the study caution voice professionals to reinvestigate and establish their own norms for their geographical and ethnic groups. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Vocal Age Disguise: The Role of Fundamental Frequency and Speech Rate and Its Perceived Effects.
Skoog Waller, Sara; Eriksson, Mårten
2016-01-01
The relationship between vocal characteristics and perceived age is of interest in various contexts, as is the possibility to affect age perception through vocal manipulation. A few examples of such situations are when age is staged by actors, when ear witnesses make age assessments based on vocal cues only or when offenders (e.g., online groomers) disguise their voice to appear younger or older. This paper investigates how speakers spontaneously manipulate two age related vocal characteristics ( f 0 and speech rate) in attempt to sound younger versus older than their true age, and if the manipulations correspond to actual age related changes in f 0 and speech rate (Study 1). Further aims of the paper is to determine how successful vocal age disguise is by asking listeners to estimate the age of generated speech samples (Study 2) and to examine whether or not listeners use f 0 and speech rate as cues to perceived age. In Study 1, participants from three age groups (20-25, 40-45, and 60-65 years) agreed to read a short text under three voice conditions. There were 12 speakers in each age group (six women and six men). They used their natural voice in one condition, attempted to sound 20 years younger in another and 20 years older in a third condition. In Study 2, 60 participants (listeners) listened to speech samples from the three voice conditions in Study 1 and estimated the speakers' age. Each listener was exposed to all three voice conditions. The results from Study 1 indicated that the speakers increased fundamental frequency ( f 0 ) and speech rate when attempting to sound younger and decreased f 0 and speech rate when attempting to sound older. Study 2 showed that the voice manipulations had an effect in the sought-after direction, although the achieved mean effect was only 3 years, which is far less than the intended effect of 20 years. Moreover, listeners used speech rate, but not f 0 , as a cue to speaker age. It was concluded that age disguise by voice can be achieved by naïve speakers even though the perceived effect was smaller than intended.
ERIC Educational Resources Information Center
Vatti, Marianna; Santurette, Sébastien; Pontoppidan, Niels Henrik; Dau, Torsten
2014-01-01
Purpose: Frequency fluctuations in human voices can usually be described as coherent frequency modulation (FM). As listeners with hearing impairment (HI listeners) are typically less sensitive to FM than listeners with normal hearing (NH listeners), this study investigated whether hearing loss affects the perception of a sung vowel based on FM…
Instantaneous and Frequency-Warped Signal Processing Techniques for Auditory Source Separation.
NASA Astrophysics Data System (ADS)
Wang, Avery Li-Chun
This thesis summarizes several contributions to the areas of signal processing and auditory source separation. The philosophy of Frequency-Warped Signal Processing is introduced as a means for separating the AM and FM contributions to the bandwidth of a complex-valued, frequency-varying sinusoid p (n), transforming it into a signal with slowly-varying parameters. This transformation facilitates the removal of p (n) from an additive mixture while minimizing the amount of damage done to other signal components. The average winding rate of a complex-valued phasor is explored as an estimate of the instantaneous frequency. Theorems are provided showing the robustness of this measure. To implement frequency tracking, a Frequency-Locked Loop algorithm is introduced which uses the complex winding error to update its frequency estimate. The input signal is dynamically demodulated and filtered to extract the envelope. This envelope may then be remodulated to reconstruct the target partial, which may be subtracted from the original signal mixture to yield a new, quickly-adapting form of notch filtering. Enhancements to the basic tracker are made which, under certain conditions, attain the Cramer -Rao bound for the instantaneous frequency estimate. To improve tracking, the novel idea of Harmonic -Locked Loop tracking, using N harmonically constrained trackers, is introduced for tracking signals, such as voices and certain musical instruments. The estimated fundamental frequency is computed from a maximum-likelihood weighting of the N tracking estimates, making it highly robust. The result is that harmonic signals, such as voices, can be isolated from complex mixtures in the presence of other spectrally overlapping signals. Additionally, since phase information is preserved, the resynthesized harmonic signals may be removed from the original mixtures with relatively little damage to the residual signal. Finally, a new methodology is given for designing linear-phase FIR filters which require a small fraction of the computational power of conventional FIR implementations. This design strategy is based on truncated and stabilized IIR filters. These signal-processing methods have been applied to the problem of auditory source separation, resulting in voice separation from complex music that is significantly better than previous results at far lower computational cost.
The vagal nerve stimulation outcome, and laryngeal effect: Otolaryngologists roles and perspective.
Al Omari, Ahmad I; Alzoubi, Firas Q; Alsalem, Mohammad M; Aburahma, Samah K; Mardini, Diala T; Castellanos, Paul F
Epilepsy is one of the most common neurologic disorders. Vagus nerve stimulation (VNS), first investigated in 1938 and subsequently studied as a potential therapy for epilepsy. The FDA approved the use of VNS in 1997 as an adjunctive non-pharmacologic symptomatic treatment option for refractory epilepsy for adults and adolescents over 12years. VNS can cause laryngeal and voice side effects that can be managed by otolaryngologists safely and effectively. This study is to review the outcomes of vagal nerve stimulator (VNS) implantation in terms of the surgical procedures, complications, seizure frequency, and the clinical effect on larynx and vocal folds motion. Series of thirty consecutive patients who had VNS implantation between 2007 and 2014 were recruited. Seizure-frequency outcome, surgical complications and device adverse effects of VNS were retrospectively reviewed. Additional evaluation included use of the Voice Handicap Index and Maximum Phonation Time (MPT) were conducted before and after the implantation. Videolaryngoscopy was used to evaluate the vocal fold mobility before and after the VNS implantation. Seizure frequency reduction over a minimum of 2years of follow up demonstrated: 100% in seizure frequency reduction in 1 patient, drastic reduction in seizure frequency (70-90%) in 9 patients, a good reduction in terms of seizure frequency (50%) in 8 patients, a 30% reduction in 5 patients, no response in 6 patients, and 1 patient had increased frequency. The most commonly reported adverse effects after VNS activation were coughing and voice changes with pitch breaks, as well as mild intermittent shortness of breath in 33% of patients. For those patients secondary supraglottic muscle tension and hyper function with reduced left vocal fold mobility were noticed on videolaryngoscopy, though none had aspiration problems. Surgical complications included a wound dehiscence in one patient (3%) which was surgically managed, minor intra-operative bleeding 3%; a superficial wound infection in one patient (3%) which was treated conservatively, none of the complications necessitated VNS removal. VNS appears to be an effective non-pharmacologic adjuvant therapy in patients with medically refractory seizures. With the favorable adverse-effect profile previously described, VNS is generally well tolerated and of a great benefit to such patients. Laryngeal side effects, of which hoarseness being of the greatest repetition, are the most common after the VNS implantation. VNS can affect the voice and reduced vocal cord motion on the implantation side with secondary supraglottic muscle tension. Otolaryngologists are not only capable of performing VNS implantation, but can also manage surgical complications, assess laryngeal side effects and treat them as needed. Copyright © 2017 Elsevier Inc. All rights reserved.
Nurse moral distress: A survey identifying predictors and potential interventions.
Rathert, Cheryl; May, Douglas R; Chung, Hye Sook
2016-01-01
Ethical dilemmas and conflicts are inherent in today's health care organizations and may lead to moral distress, which is often associated with physical and psychological symptoms. Although the existence of moral distress has been observed by scholars for decades, most of the research has been descriptive and has examined what types of health care conflicts lead to distress. This study tested a comprehensive model, underpinned by Social Cognitive Theory, that examined work environment and intrapersonal variables that may influence moral distress. We surveyed nursing staff employed in a U.S. acute care hospital (response rate=45%; n=290). More than half of the respondents reported they experience ethical dilemmas and conflicts from several times a month to daily, and nearly half reported they experience moral distress at least several times a month. Structural equation modeling analysis simultaneously examined the effects of five independent variables on moral distress and moral voice: (a) frequency of ethical dilemmas and conflicts; (b) moral efficacy; (c) ethics communication; (d) ethical environment; and (e) organizational ethics support. Results revealed significant independent effects of the frequency of ethics issues and organizational ethics support on moral distress. Bootstrapping analysis indicated that voice fully mediated the relationship between moral efficacy and moral distress, and partially mediated the relationship between organizational ethics support and distress. Supplemental analysis revealed that organizational ethics support moderated the moral efficacy-voice-moral distress relationship such that when organizational support was low, moral efficacy was negatively related to moral distress via voice. Although it may be impossible to eliminate all ethical dilemmas and conflicts, leaders and organizations may wish to help improve nurses' moral efficacy, which appears to give rise to voice, and reduced moral distress. Increasing organizational ethics support may be a key approach. Copyright © 2015 Elsevier Ltd. All rights reserved.
Training outcome in future professional voice users after 18 months of voice training.
Timmermans, Bernadette; De Bodt, Marc S; Wuyts, Floris L; Van de Heyning, Paul H
2004-01-01
The goal of this study is to define the long-term influence of vocal hygiene education and the effectiveness of voice training in 46 students. Half of the subjects, called the trained group (n = 23), received vocal hygiene education during 1 school year and voice training during 2 school years (18 months). The other half, also 23 subjects, received neither vocal hygiene education nor voice training as such (called the untrained group). The voice training is made up of technical workshops (30 h a year in groups of 5-8 subjects) and vocal coaching in the radio and drama projects (30 h whole class). In the lectures (30 h) a theoretical background on breathing, articulation, voicing and vocal hygiene was discussed. A multidimensional test battery containing the GRBAS scale, videolaryngostroboscopy, maximum phonation time, jitter, lowest intensity, highest frequency, Dysphonia Severity Index (DSI) and Voice Handicap Index (VHI) was applied before and after 18 months to evaluate the effect of voice training over time. A questionnaire on daily habits was presented before the lectures, and after 18 months to detect the long-term effect of the lectures. The objectively measured voice quality (DSI) of the trained group improved significantly over time (p < 0.001) due to training (p = 0.008), which was not the case in the untrained group. The self-assessed VHI, on the other hand, changed over time (p < 0.001) in both groups. For the trained group the VHI changed from 18.4 to 14.4 and in the untrained group from 20.1 to 15.3. It is important to note that the VHI scores of both groups remained high. The interpretation of the results of the daily habit questionnaire is disturbing: the initial high degree of smoking, vocal abuse, stress and late meals was not influenced by the lectures or training and remained high. This study proves the positive outcome and emphasizes the need for a well-organized voice training program in future professional voice users. However, the lectures and training on vocal hygiene failed to improve voice-conserving habits. Copyright 2004 S. Karger AG, Basel
Musician's and Physicist's View on Tuning Keyboard Instruments
ERIC Educational Resources Information Center
Lubenow, Martin; Meyn, Jan-Peter
2007-01-01
The simultaneous sound of several voices or instruments requires proper tuning to achieve consonance for certain intervals and chords. Most instruments allow enough frequency variation to enable pure tuning while being played. Keyboard instruments such as organ and piano have given frequencies for individual notes and the tuning must be based on a…
47 CFR 90.355 - LMS operations below 512 MHz.
Code of Federal Regulations, 2010 CFR
2010-10-01
... established, provided that: (a) For transmission between vehicles and base stations, each frequency in a... LMS station and the nearest co-channel base station of another licensee operating a voice system is 75... MHz, 150-170 MHz, and 450-512 MHz bands may use either base-mobile frequencies currently assigned the...
An Integrated Architecture to Support Hastily Formed Network (HFN)
2007-12-01
17 1. Creating Awareness of the Situation (intra-organization).............17 2. Sharing Awareness Among Organizations (inter...Convergence - Sharing a Common Goal to Achieve a Common Outcome...................................................................39 b. Interdependency and...Weaknesses, Opportunities and Threat UC Unclassified UCC Unified Command Center UHF Ultra High Frequency VHF Very High Frequency VoIP Voice over
ERIC Educational Resources Information Center
Murray, Elizabeth S. Heller; Lien, Yu-An S.; Van Stan, Jarrad H.; Mehta, Daryush D.; Hillman, Robert E.; Noordzij, J. Pieter; Stepp, Cara E.
2017-01-01
Purpose: The purpose of this article is to examine the ability of an acoustic measure, relative fundamental frequency (RFF), to distinguish between two subtypes of vocal hyperfunction (VH): phonotraumatic (PVH) and non-phonotraumatic (NPVH). Method: RFF values were compared among control individuals with typical voices (N = 49), individuals with…
77 FR 5406 - Amateur Radio Use of the Allocation at 5 MHz
Federal Register 2010, 2011, 2012, 2013, 2014
2012-02-03
... transmit emission types in addition to those proposed in the NPRM? (3) Would a Voice-Operated Transmit (VOX... carrier frequency is set to the center frequency. 22. VOX Requirement. The Commission requested comment on whether amateur operators should be required to use VOX in the phone emission mode, which ARRL stated...
47 CFR 15.121 - Scanning receivers and frequency converters used with scanning receivers.
Code of Federal Regulations, 2014 CFR
2014-10-01
... transmissions to analog voice audio. (2) Be designed so that the tuning, control and filtering circuitry is inaccessible. The design must be such that any attempts to modify the equipment to receive transmissions from... Radiotelephone Service transmissions. (e) Scanning receivers and frequency converters designed for use with...
47 CFR 15.121 - Scanning receivers and frequency converters used with scanning receivers.
Code of Federal Regulations, 2012 CFR
2012-10-01
... transmissions to analog voice audio. (2) Be designed so that the tuning, control and filtering circuitry is inaccessible. The design must be such that any attempts to modify the equipment to receive transmissions from... Radiotelephone Service transmissions. (e) Scanning receivers and frequency converters designed for use with...
47 CFR 15.121 - Scanning receivers and frequency converters used with scanning receivers.
Code of Federal Regulations, 2011 CFR
2011-10-01
... transmissions to analog voice audio. (2) Be designed so that the tuning, control and filtering circuitry is inaccessible. The design must be such that any attempts to modify the equipment to receive transmissions from... Radiotelephone Service transmissions. (e) Scanning receivers and frequency converters designed for use with...
47 CFR 15.121 - Scanning receivers and frequency converters used with scanning receivers.
Code of Federal Regulations, 2013 CFR
2013-10-01
... transmissions to analog voice audio. (2) Be designed so that the tuning, control and filtering circuitry is inaccessible. The design must be such that any attempts to modify the equipment to receive transmissions from... Radiotelephone Service transmissions. (e) Scanning receivers and frequency converters designed for use with...
Two-dimensional model of vocal fold vibration for sound synthesis of voice and soprano singing
NASA Astrophysics Data System (ADS)
Adachi, Seiji; Yu, Jason
2005-05-01
Voiced sounds were simulated with a computer model of the vocal fold composed of a single mass vibrating both parallel and perpendicular to the airflow. Similarities with the two-mass model are found in the amplitudes of the glottal area and the glottal volume flow velocity, the variation in the volume flow waveform with the vocal tract shape, and the dependence of the oscillation amplitude upon the average opening area of the glottis, among other similar features. A few dissimilarities are also found in the more symmetric glottal and volume flow waveforms in the rising and falling phases. The major improvement of the present model over the two-mass model is that it yields a smooth transition between oscillations with an inductive load and a capacitive load of the vocal tract with no sudden jumps in the vibration frequency. Self-excitation is possible both below and above the first formant frequency of the vocal tract. By taking advantage of the wider continuous frequency range, the two-dimensional model can successfully be applied to the sound synthesis of a high-pitched soprano singing, where the fundamental frequency sometimes exceeds the first formant frequency. .
OVERLAP OF HEARING AND VOICING RANGES IN SINGING
Hunter, Eric J.; Titze, Ingo R.
2008-01-01
Frequency and intensity ranges in voice production by trained and untrained singers were superimposed onto the average normal human hearing range. The vocal output for all subjects was shown both in Voice Range Profiles and Spectral Level Profiles. Trained singers took greater advantage of the dynamic range of the auditory system with harmonic energy (45% of the hearing range compared to 38% for untrained vocalists). This difference seemed to come from the trained singers ablily to exploit the most sensitive part of the hearing range (around 3 to 4 kHz) through the use of the singer’s formant. The trained vocalists’ average maximum third-octave spectral band level was 95 dB SPL, compared to 80 dB SPL for untrained. PMID:19844607
OVERLAP OF HEARING AND VOICING RANGES IN SINGING.
Hunter, Eric J; Titze, Ingo R
2005-04-01
Frequency and intensity ranges in voice production by trained and untrained singers were superimposed onto the average normal human hearing range. The vocal output for all subjects was shown both in Voice Range Profiles and Spectral Level Profiles. Trained singers took greater advantage of the dynamic range of the auditory system with harmonic energy (45% of the hearing range compared to 38% for untrained vocalists). This difference seemed to come from the trained singers ablily to exploit the most sensitive part of the hearing range (around 3 to 4 kHz) through the use of the singer's formant. The trained vocalists' average maximum third-octave spectral band level was 95 dB SPL, compared to 80 dB SPL for untrained.
The perceptual features of vocal fatigue as self-reported by a group of actors and singers.
Kitch, J A; Oates, J
1994-09-01
Performers (10 actors/10 singers) rated via a self-report questionnaire the severity of their voice-related changes when vocally fatigued. Similar frequency patterns and perceptual features of vocal fatigue were found across subjects. Actors rated "power" aspects (e.g., voice projection) and singers rated vocal dynamic aspects (e.g., pitch range) of their voices as most affected when vocally fatigued. Vocal fatigue was evidenced by changes in kinesthetic/proprioceptive sensations and vocal dynamics. The causes and context of vocal fatigue were vocal misuse, being "run down," high performance demands, and using high pitch/volume levels. Further research is needed to delineate the perceptual features of "normal" levels of vocal fatigue and its possible causes.
A laryngographic and laryngoscopic study of Northern Vietnamese tones.
Brunelle, Marc; Nguyên, Duy Duong; Nguyên, Khac Hùng
2010-01-01
A laryngographic and laryngoscopic study of tone production in Northern Vietnamese, a language whose tones combine both fundamental frequency (f0) modulations and voice qualities (phonation types), was conducted with 5 male and 5 female speakers. Results show that the f0 contours of Northern Vietnamese tones are not only attributable to changes in vocal fold length and tension (partly through changes in larynx height), but that f0 drops are also largely caused by the glottal configurations responsible for the contrastive voice qualities associated with some of the tones. We also find that voice quality contrasts are mostly due to glottal constriction: they occasionally involve additional ventricular fold incursion and epiglottal constriction, but these articulations are usually absent. Copyright © 2010 S. Karger AG, Basel.
Phadke, Ketaki Vasant; Abo-Hasseba, Ahmed; Švec, Jan G; Geneid, Ahmed
2018-05-03
Teachers are professional voice users, always at high risk of developing voice disorders due to high vocal demand and unfavorable environmental conditions. This study aimed at identifying possible correlations between teachers' voice symptoms and their perception of noise, the location of schools, as well as the location and conditions of their classrooms. One hundred forty teachers (ages 21-56) from schools in Upper Egypt participated in this study. They filled out a questionnaire including questions about the severity and frequency of their voice symptoms, noise perception, and the location and conditions of their schools and classrooms. Questionnaire responses were statistically analyzed to identify possible correlations. There were significant correlations (P < 0.05) between voice symptoms, teachers' noise perception, and noise resulting from the location and conditions of schools and classrooms. Teachers experienced severe dysphonia, neck pain, and increased vocal effort with weekly or daily recurrence. Among the teachers who participated in the study, 24.2% felt they were always in a noisy environment, with 51.4% of the total participants reporting having to raise their voices. The most common sources of noise were from student activities and talking in the teachers' own classrooms (61.4%), noise from adjacent classrooms (52.9%), and road traffic (40.7%). Adverse effect on teachers' voices due to noise from poor school and classroom conditions necessitates solutions for the future improvement of conditions in Egyptian schools. This study may help future studies that focus on developing guidelines for the better planning of Egyptian schools in terms of improved infrastructure and architecture, thus considering the general and vocal health of teachers. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Fuentes-López, Eduardo; Fuente, Adrian; Contreras, Karem V
2017-12-18
The aim of this study is to determine possible associations between vocal hygiene habits and self-reported vocal symptoms in telemarketers. A cross-sectional study that included 79 operators from call centres in Chile was carried out. Their vocal hygiene habits and self-reported symptoms were investigated using a validated and reliable questionnaire created for the purposes of this study. Forty-five percent of telemarketers reported having one or more vocal symptoms. Among them, 16.46% reported that their voices tense up when talking and 10.13% needed to clear their throat to make their voices clearer. Five percent mentioned that they always talk without taking a break and 40.51% reported using their voices in noisy environments. The number of working hours per day and inadequate vocal hygiene habits were associated with the presence of self-reported symptoms. Additionally, an interaction between the use of the voice in noisy environments and not taking breaks during the day was observed. Finally, the frequency of inadequate vocal hygiene habits was associated with the number of symptoms reported. Using the voice in noisy environments and talking without taking breaks were both associated with the presence of specific vocal symptoms. This study provides some evidence about the interaction between these two inadequate vocal hygiene habits that potentiates vocal symptoms.
Wong, Raymond
2013-01-01
Voice biometrics is one kind of physiological characteristics whose voice is different for each individual person. Due to this uniqueness, voice classification has found useful applications in classifying speakers' gender, mother tongue or ethnicity (accent), emotion states, identity verification, verbal command control, and so forth. In this paper, we adopt a new preprocessing method named Statistical Feature Extraction (SFX) for extracting important features in training a classification model, based on piecewise transformation treating an audio waveform as a time-series. Using SFX we can faithfully remodel statistical characteristics of the time-series; together with spectral analysis, a substantial amount of features are extracted in combination. An ensemble is utilized in selecting only the influential features to be used in classification model induction. We focus on the comparison of effects of various popular data mining algorithms on multiple datasets. Our experiment consists of classification tests over four typical categories of human voice data, namely, Female and Male, Emotional Speech, Speaker Identification, and Language Recognition. The experiments yield encouraging results supporting the fact that heuristically choosing significant features from both time and frequency domains indeed produces better performance in voice classification than traditional signal processing techniques alone, like wavelets and LPC-to-CC. PMID:24288684
Francis, Alexander L.; Kaganovich, Natalya; Driscoll-Huber, Courtney
2008-01-01
In English, voiced and voiceless syllable-initial stop consonants differ in both fundamental frequency at the onset of voicing (onset F0) and voice onset time (VOT). Although both correlates, alone, can cue the voicing contrast, listeners weight VOT more heavily when both are available. Such differential weighting may arise from differences in the perceptual distance between voicing categories along the VOT versus onset F0 dimensions, or it may arise from a bias to pay more attention to VOT than to onset F0. The present experiment examines listeners’ use of these two cues when classifying stimuli in which perceptual distance was artificially equated along the two dimensions. Listeners were also trained to categorize stimuli based on one cue at the expense of another. Equating perceptual distance eliminated the expected bias toward VOT before training, but successfully learning to base decisions more on VOT and less on onset F0 was easier than vice versa. Perceptual distance along both dimensions increased for both groups after training, but only VOT-trained listeners showed a decrease in Garner interference. Results lend qualified support to an attentional model of phonetic learning in which learning involves strategic redeployment of selective attention across integral acoustic cues. PMID:18681610
Vocal indices of stress: a review.
Giddens, Cheryl L; Barron, Kirk W; Byrd-Craven, Jennifer; Clark, Keith F; Winter, A Scott
2013-05-01
Identification of stress patterns in the voice has multiple potential applications. The objective was to review literature pertaining to the effects of various forms of stress upon the healthy voice. Literature review, discussion of results, and direction for further study. This review article offers a model of stress and a review of the historical and recent research into the effects of stress on the voice. Electronic databases were searched using the key words. No studies were excluded on the basis of design; however, an attempt was made to include in the discussion studies which primarily address physiological and acoustic vocal parameters. The results of greater than 50 studies examining the effect of stressors ranging from lie and guilt to high altitude and space flight upon the voice were included in the review. Increase in fundamental frequency is the most commonly reported effect of stress in well-controlled trials. The trend, however, is not universal. A reduction in noise as reflected by the diminished vocal jitter is reported, but less frequently. Stress types, gender, and individual differences in baseline autonomic tone may explain the primarily equivocal findings of effects of stressor exposure or perceived stress on voice; and as such, the article concludes with a discussion of directions for future study. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Naval Research Laboratory 1983 Review.
1983-01-01
Kennedy A realistic earth model improves radio wave propagation theory 50 Design of High - Frequency Networks for Tactical Communication Dennis J. Baker...This signal is ing approximately a 3 kHz bandwidth) such as a pulse train if the speech is voiced (vowel high frequency (HF) channels and public tele...developed a design concept for a high quickly and reliably broadcast messages to all frequency (HF) intratask force (ITF) communi- cluster members
Sapienza, C M; Crandell, C C; Curtis, B
1999-09-01
Voice problems are a frequent difficulty that teachers experience. Common complaints by teachers include vocal fatigue and hoarseness. One possible explanation for these symptoms is prolonged elevations in vocal loudness within the classroom. This investigation examined the effectiveness of sound-field frequency modulation (FM) amplification on reducing the sound pressure level (SPL) of the teacher's voice during classroom instruction. Specifically, SPL was examined during speech produced in a classroom lecture by 10 teachers with and without the use of sound-field amplification. Results indicated a significant 2.42-dB decrease in SPL with the use of sound-field FM amplification. These data support the use of sound-field amplification in the vocal hygiene regimen recommended to teachers by speech-language pathologists.
Vowel selection and its effects on perturbation and nonlinear dynamic measures.
Maccallum, Julia K; Zhang, Yu; Jiang, Jack J
2011-01-01
Acoustic analysis of voice is typically conducted on recordings of sustained vowel phonation. This study applied perturbation and nonlinear dynamic analyses to the vowels /a/, /i/, and /u/ in order to determine vowel selection effects on analysis. Forty subjects (20 males and 20 females) with normal voices participated in recording. Traditional parameters of fundamental frequency, signal-to-noise ratio, percent jitter, and percent shimmer were calculated for the signals using CSpeech. Nonlinear dynamic parameters of correlation dimension and second-order entropy were also calculated. Perturbation analysis results were largely incongruous in this study and in previous research. Fundamental frequency results corroborated previous work, indicating higher fundamental frequency for /i/ and /u/ and lower fundamental frequency for /a/. Signal-to-noise ratio results showed that /i/ and /u/ have greater harmonic levels than /a/. Results of nonlinear dynamic analysis suggested that more complex activity may be evident in /a/ than in /i/ or /u/. Percent jitter and percent shimmer may not be useful for description of acoustic differences between vowels. Fundamental frequency, signal-to-noise ratio, and nonlinear dynamic parameters may be applied to characterize /a/ as having lower frequency, higher noise, and greater nonlinear components than /i/ and /u/. Copyright © 2010 S. Karger AG, Basel.
The perceptual significance of high-frequency energy in the human voice.
Monson, Brian B; Hunter, Eric J; Lotto, Andrew J; Story, Brad H
2014-01-01
While human vocalizations generate acoustical energy at frequencies up to (and beyond) 20 kHz, the energy at frequencies above about 5 kHz has traditionally been neglected in speech perception research. The intent of this paper is to review (1) the historical reasons for this research trend and (2) the work that continues to elucidate the perceptual significance of high-frequency energy (HFE) in speech and singing. The historical and physical factors reveal that, while HFE was believed to be unnecessary and/or impractical for applications of interest, it was never shown to be perceptually insignificant. Rather, the main causes for focus on low-frequency energy appear to be because the low-frequency portion of the speech spectrum was seen to be sufficient (from a perceptual standpoint), or the difficulty of HFE research was too great to be justifiable (from a technological standpoint). The advancement of technology continues to overcome concerns stemming from the latter reason. Likewise, advances in our understanding of the perceptual effects of HFE now cast doubt on the first cause. Emerging evidence indicates that HFE plays a more significant role than previously believed, and should thus be considered in speech and voice perception research, especially in research involving children and the hearing impaired.
The perceptual significance of high-frequency energy in the human voice
Monson, Brian B.; Hunter, Eric J.; Lotto, Andrew J.; Story, Brad H.
2014-01-01
While human vocalizations generate acoustical energy at frequencies up to (and beyond) 20 kHz, the energy at frequencies above about 5 kHz has traditionally been neglected in speech perception research. The intent of this paper is to review (1) the historical reasons for this research trend and (2) the work that continues to elucidate the perceptual significance of high-frequency energy (HFE) in speech and singing. The historical and physical factors reveal that, while HFE was believed to be unnecessary and/or impractical for applications of interest, it was never shown to be perceptually insignificant. Rather, the main causes for focus on low-frequency energy appear to be because the low-frequency portion of the speech spectrum was seen to be sufficient (from a perceptual standpoint), or the difficulty of HFE research was too great to be justifiable (from a technological standpoint). The advancement of technology continues to overcome concerns stemming from the latter reason. Likewise, advances in our understanding of the perceptual effects of HFE now cast doubt on the first cause. Emerging evidence indicates that HFE plays a more significant role than previously believed, and should thus be considered in speech and voice perception research, especially in research involving children and the hearing impaired. PMID:24982643
A population-based study on the association between rheumatoid arthritis and voice problems.
Hah, J Hun; An, Soo-Youn; Sim, Songyong; Kim, So Young; Oh, Dong Jun; Park, Bumjung; Kim, Sung-Gyun; Choi, Hyo Geun
2016-07-01
The objective of this study was to investigate whether rheumatoid arthritis increases the frequency of organic laryngeal lesions and the subjective voice complaint rate in those with no organic laryngeal lesion. We performed a cross-sectional study using the data from 19,368 participants (418 rheumatoid arthritis patients and 18,950 controls) of the 2008-2011 Korea National Health and Nutrition Examination Survey. The associations between rheumatoid arthritis and organic laryngeal lesions/subjective voice complaints were analyzed using simple/multiple logistic regression analysis with complex sample adjusting for confounding factors, including age, sex, smoking status, stress level, and body mass index, which could provoke voice problems. Vocal nodules, vocal polyp, and vocal palsy were not associated with rheumatoid arthritis in a multiple regression analysis, and only laryngitis showed a positive association (adjusted odds ratio, 1.59; 95 % confidence interval, 1.01-2.52; P = 0.047). Rheumatoid arthritis was associated with subjective voice discomfort in a simple regression analysis, but not in a multiple regression analysis. Participants with rheumatoid arthritis were older, more often female, and had higher stress levels than those without rheumatoid arthritis. These factors were associated with subjective voice complaints in both simple and multiple regression analyses. Rheumatoid arthritis was not associated with organic laryngeal diseases except laryngitis. Rheumatoid arthritis did not increase the odds ratio for subjective voice complaints. Voice problems in participants with rheumatoid arthritis originated from the characteristics of the rheumatoid arthritis group (higher mean age, female sex, and stress level) rather than rheumatoid arthritis itself.
Ben-David, Boaz M; Icht, Michal
2016-03-01
Occupational-related vocal load is an increasing global problem with adverse personal and economic implications. We examined voice changes in real speaking situations during a single day, with and without vocal loading, aiming to identify an objective acoustic index for vocal load over a day. Call center operators (CCOs, n = 27) and age- and gender-matched students (n = 25) were recorded at the beginning and at the end of a day, with (CCOs) and without (students) vocal load. Speaking and reading voice samples were analyzed for fundamental frequency (F0), sound pressure level (SPL), and their variance (F0 coefficient of variation [F0 CV], SPL CV). The impact of lifestyle habits on voice changes was also estimated. The main findings revealed an interaction, with F0 rise at the end of the day for the students but not for the CCOs. We suggest that F0 rise is a typical phenomenon of a day of normal vocal use, whereas vocal loading interferes with this mechanism. In addition, different lifestyle profiles of CCOs and controls were observed, as the CCOs reported higher incidence of dehydrating behaviors (eg, smoking, caffeine). Yet, this profile was not linked with voice changes. In sum, we suggest that F0 rise over a day can potentially serve as an index for typical voice use. Its lack thereof can hint on consequent voice symptoms and complaints. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Basic Technical Data on Transmission Systems and Equipment Using Communications Lines. Part 1.
1978-08-01
without noticeable degradation of the speech quality. - 219 - The maximum number of repeater sections: For the KNK-6s For the KNK-6t for multiquad...power circuit 1]; 15. Low frequency amplifier for direction B - Aj 16. Low frequency amplifier; 17. KNN [initial slope network]; 18. LVN-2...frequency Voice frequency ringing at 3,800 Hz with a level 0.4 - 0.8 Np lower than the speech channel level. The system for service
Chiropractic Care for a Patient with Spasmodic Dysphonia Associated with Cervical Spine Trauma
Waddell, Roger K.
2005-01-01
Abstract Objective To discuss the diagnosis and response to treatment of spasmodic dysphonia in a 25-year-old female vocalist following an auto accident. Clinical Features The voice disorder and neck pain appeared after the traumatic incident. Examination of the cervical spine revealed moderate pain, muscle spasm and restricted joint motion at C-1 and C-5 on the left side. Cervical range of motion was reduced on left rotation. Bilateral manual muscle testing of the trapezius and sternocleidomastoid muscles, which share innervation with the laryngeal muscles by way of the spinal accessory nerve, were weak on the left side. Pre and post accident voice range profiles (phonetograms) that measure singing voice quality were examined. The pre- and post-accident phonetograms revealed significant reduction in voice intensity and fundamental frequency as measured in decibels and hertz. Intervention and Outcome Low-force chiropractic spinal manipulative therapy to C-1 and C-5 was employed. Following a course of care, the patient's singing voice returned to normal, as well as a resolution of her musculo- skeletal complaints. Conclusion It appears that in certain cases, the singing voice can be adversely affected if neck or head trauma is severe enough. This case proposes that trauma with irritation to the cervical spine nerve roots as they communicate with the spinal accessory, and in turn the laryngeal nerves, may be contributory in some functional voice disorders or muscle tension dysphonia. PMID:19674642
Lower Vocal Tract Morphologic Adjustments Are Relevant for Voice Timbre in Singing.
Mainka, Alexander; Poznyakovskiy, Anton; Platzek, Ivan; Fleischer, Mario; Sundberg, Johan; Mürbe, Dirk
2015-01-01
The vocal tract shape is crucial to voice production. Its lower part seems particularly relevant for voice timbre. This study analyzes the detailed morphology of parts of the epilaryngeal tube and the hypopharynx for the sustained German vowels /a/, /e/, /i/, /o/, and /u/ by thirteen male singer subjects who were at the beginning of their academic singing studies. Analysis was based on two different phonatory conditions: a natural, speech-like phonation and a singing phonation, like in classical singing. 3D models of the vocal tract were derived from magnetic resonance imaging and compared with long-term average spectrum analysis of audio recordings from the same subjects. Comparison of singing to the speech-like phonation, which served as reference, showed significant adjustments of the lower vocal tract: an average lowering of the larynx by 8 mm and an increase of the hypopharyngeal cross-sectional area (+ 21:9%) and volume (+ 16:8%). Changes in the analyzed epilaryngeal portion of the vocal tract were not significant. Consequently, lower larynx-to-hypopharynx area and volume ratios were found in singing compared to the speech-like phonation. All evaluated measures of the lower vocal tract varied significantly with vowel quality. Acoustically, an increase of high frequency energy in singing correlated with a wider hypopharyngeal area. The findings offer an explanation how classical male singers might succeed in producing a voice timbre with increased high frequency energy, creating a singer`s formant cluster.
Lower Vocal Tract Morphologic Adjustments Are Relevant for Voice Timbre in Singing
Mainka, Alexander; Poznyakovskiy, Anton; Platzek, Ivan; Fleischer, Mario; Sundberg, Johan; Mürbe, Dirk
2015-01-01
The vocal tract shape is crucial to voice production. Its lower part seems particularly relevant for voice timbre. This study analyzes the detailed morphology of parts of the epilaryngeal tube and the hypopharynx for the sustained German vowels /a/, /e/, /i/, /o/, and /u/ by thirteen male singer subjects who were at the beginning of their academic singing studies. Analysis was based on two different phonatory conditions: a natural, speech-like phonation and a singing phonation, like in classical singing. 3D models of the vocal tract were derived from magnetic resonance imaging and compared with long-term average spectrum analysis of audio recordings from the same subjects. Comparison of singing to the speech-like phonation, which served as reference, showed significant adjustments of the lower vocal tract: an average lowering of the larynx by 8 mm and an increase of the hypopharyngeal cross-sectional area (+ 21.9%) and volume (+ 16.8%). Changes in the analyzed epilaryngeal portion of the vocal tract were not significant. Consequently, lower larynx-to-hypopharynx area and volume ratios were found in singing compared to the speech-like phonation. All evaluated measures of the lower vocal tract varied significantly with vowel quality. Acoustically, an increase of high frequency energy in singing correlated with a wider hypopharyngeal area. The findings offer an explanation how classical male singers might succeed in producing a voice timbre with increased high frequency energy, creating a singer‘s formant cluster. PMID:26186691
Major depressive disorder discrimination using vocal acoustic features.
Taguchi, Takaya; Tachikawa, Hirokazu; Nemoto, Kiyotaka; Suzuki, Masayuki; Nagano, Toru; Tachibana, Ryuki; Nishimura, Masafumi; Arai, Tetsuaki
2018-01-01
The voice carries various information produced by vibrations of the vocal cords and the vocal tract. Though many studies have reported a relationship between vocal acoustic features and depression, including mel-frequency cepstrum coefficients (MFCCs) which applied to speech recognition, there have been few studies in which acoustic features allowed discrimination of patients with depressive disorder. Vocal acoustic features as biomarker of depression could make differential diagnosis of patients with depressive state. In order to achieve differential diagnosis of depression, in this preliminary study, we examined whether vocal acoustic features could allow discrimination between depressive patients and healthy controls. Subjects were 36 patients who met the criteria for major depressive disorder and 36 healthy controls with no current or past psychiatric disorders. Voices of reading out digits before and after verbal fluency task were recorded. Voices were analyzed using OpenSMILE. The extracted acoustic features, including MFCCs, were used for group comparison and discriminant analysis between patients and controls. The second dimension of MFCC (MFCC 2) was significantly different between groups and allowed the discrimination between patients and controls with a sensitivity of 77.8% and a specificity of 86.1%. The difference in MFCC 2 between the two groups reflected an energy difference of frequency around 2000-3000Hz. The MFCC 2 was significantly different between depressive patients and controls. This feature could be a useful biomarker to detect major depressive disorder. Sample size was relatively small. Psychotropics could have a confounding effect on voice. Copyright © 2017 Elsevier B.V. All rights reserved.
Dietrich, Maria; Verdolini Abbott, Katherine; Gartner-Schmidt, Jackie; Rosen, Clark A
2008-07-01
The study's objectives were to investigate (1) the frequency of perceived stress, anxiety, and depression for patients with common voice disorders, (2) the distribution of these variables by diagnosis, and (3) the distribution of the variables by gender. Retrospective data were derived from self-report questionnaires assessing recent stress (Perceived Stress Scale-10), anxiety, and depression (Hospital Anxiety and Depression Scale) in a cohort of new patients presenting to a voice clinic. Data are presented on 160 patients with muscle tension dysphonia (MTD), benign vocal fold lesions, paradoxical vocal fold movement disorder (PVFMD), or glottal insufficiency. Pooled data indicated that average stress, anxiety, and depression scores were similar to those found for the healthy population. However, 25.0%, 36.9%, and 31.2% of patients showed elevated stress, anxiety, and depression scores, respectively, compared to norms. Patients with PVFMD had the most frequent occurrence-and patients with glottal insufficiency had the least frequent occurrence of elevated stress, anxiety, and depression. Stress and depression were more common with MTD than with lesions, whereas reverse results were obtained for anxiety. More females than males had elevated stress, anxiety, and depression scores. The data are consistent with suggestions that stress, anxiety, and depression may be common among some patients with PVFMD, MTD, and vocal fold lesions and more common for women than men. However, individual variability in the data set was large. Further studies should evaluate the specific role of these conditions for selected categories of voice disorders in susceptible individuals.
Bioengineered vocal fold mucosa for voice restoration*
Ling, Changying; Li, Qiyao; Brown, Matthew E.; Kishimoto, Yo; Toya, Yutaka; Devine, Erin E.; Choi, Kyeong-Ok; Nishimoto, Kohei; Norman, Ian G.; Tsegyal, Tenzin; Jiang, Jack J.; Burlingham, William J.; Gunasekaran, Sundaram; Smith, Lloyd M.; Frey, Brian L.; Welham, Nathan V.
2015-01-01
Patients with voice impairment caused by advanced vocal fold (VF) fibrosis or tissue loss have few treatment options. A transplantable, bioengineered VF mucosa would address the individual and societal costs of voice-related communication loss. Such a tissue must be biomechanically capable of aerodynamic-to-acoustic energy transfer and high-frequency vibration, and physiologically capable of maintaining a barrier against the airway lumen. Here, we isolated primary human VF fibroblasts and epithelial cells and cocultured them under organotypic conditions. The resulting engineered mucosae showed morphologic features of native tissue, proteome-level evidence of mucosal morphogenesis and emerging extracellular matrix complexity, and rudimentary barrier function in vitro. When grafted into canine larynges ex vivo, the mucosae generated vibratory behavior and acoustic output that were indistinguishable from those of native VF tissue. When grafted into humanized mice in vivo, the mucosae survived and were well tolerated by the human adaptive immune system. This tissue engineering approach has the potential to restore voice function in patients with otherwise untreatable VF mucosal disease. PMID:26582902
Hearing history influences voice gender perceptual performance in cochlear implant users.
Kovačić, Damir; Balaban, Evan
2010-12-01
The study was carried out to assess the role that five hearing history variables (chronological age, age at onset of deafness, age of first cochlear implant [CI] activation, duration of CI use, and duration of known deafness) play in the ability of CI users to identify speaker gender. Forty-one juvenile CI users participated in two voice gender identification tasks. In a fixed, single-interval task, subjects listened to a single speech item from one of 20 adult male or 20 adult female speakers and had to identify speaker gender. In an adaptive speech-based voice gender discrimination task with the fundamental frequency difference between the voices as the adaptive parameter, subjects listened to a pair of speech items presented in sequential order, one of which was always spoken by an adult female and the other by an adult male. Subjects had to identify the speech item spoken by the female voice. Correlation and regression analyses between perceptual scores in the two tasks and the hearing history variables were performed. Subjects fell into three performance groups: (1) those who could distinguish voice gender in both tasks, (2) those who could distinguish voice gender in the adaptive but not the fixed task, and (3) those who could not distinguish voice gender in either task. Gender identification performance for single voices in the fixed task was significantly and negatively related to the duration of deafness before cochlear implantation (shorter deafness yielded better performance), whereas performance in the adaptive task was weakly but significantly related to age at first activation of the CI device, with earlier activations yielding better scores. The existence of a group of subjects able to perform adaptive discrimination but unable to identify the gender of singly presented voices demonstrates the potential dissociability of the skills required for these two tasks, suggesting that duration of deafness and age of cochlear implantation could have dissociable effects on the development of different skills required by CI users to identify speaker gender.
ERIC Educational Resources Information Center
Lien, Yu-An S.; Michener, Carolyn M.; Eadie, Tanya L.; Stepp, Cara E.
2015-01-01
Purpose: The acoustic measure relative fundamental frequency (RFF) was investigated as a potential objective measure to track variations in vocal effort within and across individuals. Method: Twelve speakers with healthy voices created purposeful modulations in their vocal effort during speech tasks. RFF and an aerodynamic measure of vocal effort,…
Voice gender identification by cochlear implant users: The role of spectral and temporal resolution
NASA Astrophysics Data System (ADS)
Fu, Qian-Jie; Chinchilla, Sherol; Nogaki, Geraldine; Galvin, John J.
2005-09-01
The present study explored the relative contributions of spectral and temporal information to voice gender identification by cochlear implant users and normal-hearing subjects. Cochlear implant listeners were tested using their everyday speech processors, while normal-hearing subjects were tested under speech processing conditions that simulated various degrees of spectral resolution, temporal resolution, and spectral mismatch. Voice gender identification was tested for two talker sets. In Talker Set 1, the mean fundamental frequency values of the male and female talkers differed by 100 Hz while in Talker Set 2, the mean values differed by 10 Hz. Cochlear implant listeners achieved higher levels of performance with Talker Set 1, while performance was significantly reduced for Talker Set 2. For normal-hearing listeners, performance was significantly affected by the spectral resolution, for both Talker Sets. With matched speech, temporal cues contributed to voice gender identification only for Talker Set 1 while spectral mismatch significantly reduced performance for both Talker Sets. The performance of cochlear implant listeners was similar to that of normal-hearing subjects listening to 4-8 spectral channels. The results suggest that, because of the reduced spectral resolution, cochlear implant patients may attend strongly to periodicity cues to distinguish voice gender.
The management of vocal fold nodules in children: a national survey of speech-language pathologists.
Signorelli, Monique E; Madill, Catherine J; McCabe, Patricia
2011-06-01
The purpose of this study was to determine the management options and voice therapy techniques currently being used by practicing speech-language pathologists (SLPs) to treat vocal fold nodules (VFNs) in children. The sources used by SLPs to inform and guide their clinical decisions when managing VFNs in children were also explored. Sixty-two SLPs completed a 23-item web-based survey. Data was analysed using frequency counts, content analyses, and supplementary analyses. SLPs reported using a range of management options and voice therapy techniques to treat VFNs in children. Voice therapy was reportedly the most frequently used management option across all respondents, with the majority of SLPs using a combination of indirect and direct voice therapy techniques. When selecting voice therapy techniques, the majority of SLPs reported that they did not use the limited external evidence available to guide their clinical decisions. Additionally, the majority of SLPs reported that they frequently relied on lower levels of evidence or non-evidence-based sources to guide clinical practice both in the presence and absence of higher quality evidence. Further research needs to investigate strategies to remove the barriers that impede SLPs use of external evidence when managing VFNs in children.
Evaluation of speaker de-identification based on voice gender and age conversion
NASA Astrophysics Data System (ADS)
Přibil, Jiří; Přibilová, Anna; Matoušek, Jindřich
2018-03-01
Two basic tasks are covered in this paper. The first one consists in the design and practical testing of a new method for voice de-identification that changes the apparent age and/or gender of a speaker by multi-segmental frequency scale transformation combined with prosody modification. The second task is aimed at verification of applicability of a classifier based on Gaussian mixture models (GMM) to detect the original Czech and Slovak speakers after applied voice deidentification. The performed experiments confirm functionality of the developed gender and age conversion for all selected types of de-identification which can be objectively evaluated by the GMM-based open-set classifier. The original speaker detection accuracy was compared also for sentences uttered by German and English speakers showing language independence of the proposed method.
Low-Bit Rate Speech Encoders Based on Line-Spectrum Frequencies (LSFs)
1985-01-24
Bitsa 31 Bitsb 33 Bitsc , Voicing 90.6 91.7 91.7 90.6 "" Nasality 95.3 93.7 90.4 93.2 % Sustention 81.0 79.7 79.2 83.9 Sibilation 90.1 89.6 91.9 91.4...r: Voicing 90.6 89.6 90.1 Nasality 95.3 94.8 93.2 Sustention 81.0 74.7 77.6 Sibilation 90.1 90.1 83.3 Graveness 87.0 75.0 74.2 SCompactness 94.8 94.5...0 1 2 3 Voicing 90.6 95.3 91.4 93.0 Nasality 97.7 96.9 95.3 93.0 Sustention 76.3 79.7 78.7 77.1 Sibilation 89.8 89.1 91.4 93.0 Graveness 83.3 76.1
Comparison of hearing and voicing ranges in singing
NASA Astrophysics Data System (ADS)
Hunter, Eric J.; Titze, Ingo R.
2003-04-01
The spectral and dynamic ranges of the human voice of professional and nonprofessional vocalists were compared to the auditory hearing and feeling thresholds at a distance of one meter. In order to compare these, an analysis was done in true dB SPL, not just relative dB as is usually done in speech analysis. The methodology of converting the recorded acoustic signal to absolute pressure units was described. The human voice range of a professional vocalist appeared to match the dynamic range of the auditory system at some frequencies. In particular, it was demonstrated that professional vocalists were able to make use of the most sensitive part of the hearing thresholds (around 4 kHz) through the use of a learned vocal ring or singer's formant. [Work sponsored by NIDCD.
Nass, C; Lee, K M
2001-09-01
Would people exhibit similarity-attraction and consistency-attraction toward unambiguously computer-generated speech even when personality is clearly not relevant? In Experiment 1, participants (extrovert or introvert) heard a synthesized voice (extrovert or introvert) on a book-buying Web site. Participants accurately recognized personality cues in text to speech and showed similarity-attraction in their evaluation of the computer voice, the book reviews, and the reviewer. Experiment 2, in a Web auction context, added personality of the text to the previous design. The results replicated Experiment 1 and demonstrated consistency (voice and text personality)-attraction. To maximize liking and trust, designers should set parameters, for example, words per minute or frequency range, that create a personality that is consistent with the user and the content being presented.
Auditory word recognition: extrinsic and intrinsic effects of word frequency.
Connine, C M; Titone, D; Wang, J
1993-01-01
Two experiments investigated the influence of word frequency in a phoneme identification task. Speech voicing continua were constructed so that one endpoint was a high-frequency word and the other endpoint was a low-frequency word (e.g., best-pest). Experiment 1 demonstrated that ambiguous tokens were labeled such that a high-frequency word was formed (intrinsic frequency effect). Experiment 2 manipulated the frequency composition of the list (extrinsic frequency effect). A high-frequency list bias produced an exaggerated influence of frequency; a low-frequency list bias showed a reverse frequency effect. Reaction time effects were discussed in terms of activation and postaccess decision models of frequency coding. The results support a late use of frequency in auditory word recognition.
Vocal Age Disguise: The Role of Fundamental Frequency and Speech Rate and Its Perceived Effects
Skoog Waller, Sara; Eriksson, Mårten
2016-01-01
The relationship between vocal characteristics and perceived age is of interest in various contexts, as is the possibility to affect age perception through vocal manipulation. A few examples of such situations are when age is staged by actors, when ear witnesses make age assessments based on vocal cues only or when offenders (e.g., online groomers) disguise their voice to appear younger or older. This paper investigates how speakers spontaneously manipulate two age related vocal characteristics (f0 and speech rate) in attempt to sound younger versus older than their true age, and if the manipulations correspond to actual age related changes in f0 and speech rate (Study 1). Further aims of the paper is to determine how successful vocal age disguise is by asking listeners to estimate the age of generated speech samples (Study 2) and to examine whether or not listeners use f0 and speech rate as cues to perceived age. In Study 1, participants from three age groups (20–25, 40–45, and 60–65 years) agreed to read a short text under three voice conditions. There were 12 speakers in each age group (six women and six men). They used their natural voice in one condition, attempted to sound 20 years younger in another and 20 years older in a third condition. In Study 2, 60 participants (listeners) listened to speech samples from the three voice conditions in Study 1 and estimated the speakers’ age. Each listener was exposed to all three voice conditions. The results from Study 1 indicated that the speakers increased fundamental frequency (f0) and speech rate when attempting to sound younger and decreased f0 and speech rate when attempting to sound older. Study 2 showed that the voice manipulations had an effect in the sought-after direction, although the achieved mean effect was only 3 years, which is far less than the intended effect of 20 years. Moreover, listeners used speech rate, but not f0, as a cue to speaker age. It was concluded that age disguise by voice can be achieved by naïve speakers even though the perceived effect was smaller than intended. PMID:27917144
Lechner, William J; MacGlashan, James; Wray, Tyler B; Littman, Michael L
2017-01-01
Background Computer-delivered interventions have been shown to be effective in reducing alcohol consumption in heavy drinking college students. However, these computer-delivered interventions rely on mouse, keyboard, or touchscreen responses for interactions between the users and the computer-delivered intervention. The principles of motivational interviewing suggest that in-person interventions may be effective, in part, because they encourage individuals to think through and speak aloud their motivations for changing a health behavior, which current computer-delivered interventions do not allow. Objective The objective of this study was to take the initial steps toward development of a voice-based computer-delivered intervention that can ask open-ended questions and respond appropriately to users’ verbal responses, more closely mirroring a human-delivered motivational intervention. Methods We developed (1) a voice-based computer-delivered intervention that was run by a human controller and that allowed participants to speak their responses to scripted prompts delivered by speech generation software and (2) a text-based computer-delivered intervention that relied on the mouse, keyboard, and computer screen for all interactions. We randomized 60 heavy drinking college students to interact with the voice-based computer-delivered intervention and 30 to interact with the text-based computer-delivered intervention and compared their ratings of the systems as well as their motivation to change drinking and their drinking behavior at 1-month follow-up. Results Participants reported that the voice-based computer-delivered intervention engaged positively with them in the session and delivered content in a manner consistent with motivational interviewing principles. At 1-month follow-up, participants in the voice-based computer-delivered intervention condition reported significant decreases in quantity, frequency, and problems associated with drinking, and increased perceived importance of changing drinking behaviors. In comparison to the text-based computer-delivered intervention condition, those assigned to voice-based computer-delivered intervention reported significantly fewer alcohol-related problems at the 1-month follow-up (incident rate ratio 0.60, 95% CI 0.44-0.83, P=.002). The conditions did not differ significantly on perceived importance of changing drinking or on measures of drinking quantity and frequency of heavy drinking. Conclusions Results indicate that it is feasible to construct a series of open-ended questions and a bank of responses and follow-up prompts that can be used in a future fully automated voice-based computer-delivered intervention that may mirror more closely human-delivered motivational interventions to reduce drinking. Such efforts will require using advanced speech recognition capabilities and machine-learning approaches to train a program to mirror the decisions made by human controllers in the voice-based computer-delivered intervention used in this study. In addition, future studies should examine enhancements that can increase the perceived warmth and empathy of voice-based computer-delivered intervention, possibly through greater personalization, improvements in the speech generation software, and embodying the computer-delivered intervention in a physical form. PMID:28659259
Ali, Zulfiqar; Alsulaiman, Mansour; Muhammad, Ghulam; Elamvazuthi, Irraivan; Al-Nasheri, Ahmed; Mesallam, Tamer A; Farahat, Mohamed; Malki, Khalid H
2017-05-01
A large population around the world has voice complications. Various approaches for subjective and objective evaluations have been suggested in the literature. The subjective approach strongly depends on the experience and area of expertise of a clinician, and human error cannot be neglected. On the other hand, the objective or automatic approach is noninvasive. Automatic developed systems can provide complementary information that may be helpful for a clinician in the early screening of a voice disorder. At the same time, automatic systems can be deployed in remote areas where a general practitioner can use them and may refer the patient to a specialist to avoid complications that may be life threatening. Many automatic systems for disorder detection have been developed by applying different types of conventional speech features such as the linear prediction coefficients, linear prediction cepstral coefficients, and Mel-frequency cepstral coefficients (MFCCs). This study aims to ascertain whether conventional speech features detect voice pathology reliably, and whether they can be correlated with voice quality. To investigate this, an automatic detection system based on MFCC was developed, and three different voice disorder databases were used in this study. The experimental results suggest that the accuracy of the MFCC-based system varies from database to database. The detection rate for the intra-database ranges from 72% to 95%, and that for the inter-database is from 47% to 82%. The results conclude that conventional speech features are not correlated with voice, and hence are not reliable in pathology detection. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Barsics, Catherine; Brédart, Serge
2010-11-01
Autonoetic consciousness is a fundamental property of human memory, enabling us to experience mental time travel, to recollect past events with a feeling of self-involvement, and to project ourselves in the future. Autonoetic consciousness is a characteristic of episodic memory. By contrast, awareness of the past associated with a mere feeling of familiarity or knowing relies on noetic consciousness, depending on semantic memory integrity. Present research was aimed at evaluating whether conscious recollection of episodic memories is more likely to occur following the recognition of a familiar face than following the recognition of a familiar voice. Recall of semantic information (biographical information) was also assessed. Previous studies that investigated the recall of biographical information following person recognition used faces and voices of famous people as stimuli. In this study, the participants were presented with personally familiar people's voices and faces, thus avoiding the presence of identity cues in the spoken extracts and allowing a stricter control of frequency exposure with both types of stimuli (voices and faces). In the present study, the rate of retrieved episodic memories, associated with autonoetic awareness, was significantly higher from familiar faces than familiar voices even though the level of overall recognition was similar for both these stimuli domains. The same pattern was observed regarding semantic information retrieval. These results and their implications for current Interactive Activation and Competition person recognition models are discussed.
Perrodin, Catherine; Kayser, Christoph; Logothetis, Nikos K; Petkov, Christopher I
2015-01-06
When social animals communicate, the onset of informative content in one modality varies considerably relative to the other, such as when visual orofacial movements precede a vocalization. These naturally occurring asynchronies do not disrupt intelligibility or perceptual coherence. However, they occur on time scales where they likely affect integrative neuronal activity in ways that have remained unclear, especially for hierarchically downstream regions in which neurons exhibit temporally imprecise but highly selective responses to communication signals. To address this, we exploited naturally occurring face- and voice-onset asynchronies in primate vocalizations. Using these as stimuli we recorded cortical oscillations and neuronal spiking responses from functional MRI (fMRI)-localized voice-sensitive cortex in the anterior temporal lobe of macaques. We show that the onset of the visual face stimulus resets the phase of low-frequency oscillations, and that the face-voice asynchrony affects the prominence of two key types of neuronal multisensory responses: enhancement or suppression. Our findings show a three-way association between temporal delays in audiovisual communication signals, phase-resetting of ongoing oscillations, and the sign of multisensory responses. The results reveal how natural onset asynchronies in cross-sensory inputs regulate network oscillations and neuronal excitability in the voice-sensitive cortex of macaques, a suggested animal model for human voice areas. These findings also advance predictions on the impact of multisensory input on neuronal processes in face areas and other brain regions.
Doppler compensation by shifting transmitted object frequency within limits
NASA Technical Reports Server (NTRS)
Laughlin, C. R., Jr.; Hollenbaugh, R. C.; Allen, W. K. (Inventor)
1973-01-01
A system and method are disclosed for position locating, deriving centralized air traffic control data, and communicating via voice and digital signals between a multiplicity of remote aircraft, including supersonic transports, and a central station. Such communication takes place through a synchronous satellite relay station. Side tone ranging patterns, as well as the digital and voice signals, are modulated on a carrier transmitted from the central station and received on all of the supersonic transports. Each aircraft communicates with the ground stations via a different frequency multiplexed spectrum. Supersonic transport position is derived from a computer at the central station and supplied to a local air traffic controller. Position is determined in response to variable phase information imposed on the side tones at the aircrafts. Common to all of the side tone techniques is Doppler compensation for the supersonic transport velocity.
Body mass index and acoustic voice parameters: is there a relationship.
Souza, Lourdes Bernadete Rocha de; Santos, Marquiony Marques Dos
2017-05-06
Specific elements such as weight and body volume can interfere in voice production and consequently in its acoustic parameters, which is why it is important for the clinician to be aware of these relationships. To investigate the relationship between body mass index and the average acoustic voice parameters. Observational, cross-sectional descriptive study. The sample consisted of 84 women, aged between 18 and 40years, an average of 26.83 (±6.88). The subjects were grouped according to body mass index: 19 underweight; 23 normal ranges, 20 overweight and 22 obese and evaluated the fundamental frequency of the sustained vowel [a] and the maximum phonation time of the vowels [a], [i], [u], using PRAAT software. The data were submitted to the Kruskal-Wallis test to verify if there were differences between the groups regarding the study variables. All variables showed statistically significant results and were subjected to non-parametric test Mann-Whitney. Regarding to the average of the fundamental frequency, there was statistically significant difference between groups with underweight and overweight and obese; normal range and overweight and obese. The average maximum phonation time revealed statistically significant difference between underweight and obese individuals; normal range and obese; overweight and obese. Body mass index influenced the average fundamental frequency of overweight and obese individuals evaluated in this study. Obesity influenced in reducing maximum phonation time average. Copyright © 2017 Associação Brasileira de Otorrinolaringologia e Cirurgia Cérvico-Facial. Published by Elsevier Editora Ltda. All rights reserved.
NASA Technical Reports Server (NTRS)
Schilling, D. L.
1971-01-01
The conclusions of the design research of the song adaptive delta modulator are presented for source encoding voice signals. The variation of output SNR vs input signal power/when 8, 9, and 10 bit internal arithmetic is employed. Voice intelligibility tapes to test the 10-bit system are used. An analysis of a delta modulator is also presented designed to minimize the in-band rms error. This is accomplished by frequency shaping the error signal in the modulator prior to hard limiting. The result is a significant increase in the output SNR measured after low pass filtering.
Acoustic, respiratory kinematic and electromyographic effects of vocal training
NASA Astrophysics Data System (ADS)
Mendes, Ana Paula De Brito Garcia
The longitudinal effects of vocal training on the respiratory, phonatory and articulatory systems were investigated in this study. During four semesters, fourteen voice major students were recorded while speaking and singing. Acoustic, temporal, respiratory kinematic and electromyographic parameters were measured to determine changes in the three systems as a function of vocal training. Acoustic measures of the speaking voice included fundamental frequency, sound pressure level (SPL), percent jitter and shimmer, and harmonic-to-noise ratio. Temporal measures included duration of sentences, diphthongs and the closure durations of stop consonants. Acoustic measures of the singing voice included fundamental frequency and sound pressure level of the phonational range, vibrato pulses per second, vibrato amplitude variation and the presence of the singer's formant. Analysis of the data revealed that vocal training had a significant effect on the singing voice. Fundamental frequency and SPL of the 90% level and 90--10% of the phonational range increased significantly during four semesters of vocal training. Physiological data was collected from four subjects during three semesters of vocal training. Respiratory kinematic measures included lung volume, rib cage and abdominal excursions extracted from spoken sung samples. Descriptive statistics revealed that rib cage and abdominal excursions increased from the 1st to the 2nd semester and decrease from the 2nd to the 3rd semester of vocal training. Electromyographic measures of the pectoralis major, rectus abdominis and external obliques muscles revealed that burst duration means decreased from the 1st to the 2nd semester and increased from the 2nd to the 3rd semester. Peak amplitude means increased from the 1st to the 2nd and decreased from the 2nd to the 3rd semester of vocal training. Chest wall excursions and muscle force generation of the three muscles increased as the demanding level and the length of the phonatory tasks increased.
Code of Federal Regulations, 2011 CFR
2011-01-01
... (Hz) High Group Frequencies (Hz) 1209 1336 1477 1633 697 1 2 3 Spare 770 4 5 6 Spare 852 7 8 9 Spare..._regulations/ibr_locations.html. (5) Bell Communications Research (Bellcore) document SR-TSV-002275, BOC Notes... Extender Voice Frequency Repeater Combinations, December 1973, is incorporated by reference by RUS. This...
Code of Federal Regulations, 2014 CFR
2014-01-01
... (Hz) High Group Frequencies (Hz) 1209 1336 1477 1633 697 1 2 3 Spare 770 4 5 6 Spare 852 7 8 9 Spare..._regulations/ibr_locations.html. (5) Bell Communications Research (Bellcore) document SR-TSV-002275, BOC Notes... Extender Voice Frequency Repeater Combinations, December 1973, is incorporated by reference by RUS. This...
Code of Federal Regulations, 2013 CFR
2013-01-01
... (Hz) High Group Frequencies (Hz) 1209 1336 1477 1633 697 1 2 3 Spare 770 4 5 6 Spare 852 7 8 9 Spare..._regulations/ibr_locations.html. (5) Bell Communications Research (Bellcore) document SR-TSV-002275, BOC Notes... Extender Voice Frequency Repeater Combinations, December 1973, is incorporated by reference by RUS. This...
Code of Federal Regulations, 2012 CFR
2012-01-01
... (Hz) High Group Frequencies (Hz) 1209 1336 1477 1633 697 1 2 3 Spare 770 4 5 6 Spare 852 7 8 9 Spare..._regulations/ibr_locations.html. (5) Bell Communications Research (Bellcore) document SR-TSV-002275, BOC Notes... Extender Voice Frequency Repeater Combinations, December 1973, is incorporated by reference by RUS. This...
ERIC Educational Resources Information Center
Most, Tova; Gaon-Sivan, Gal; Shpak, Talma; Luntz, Michal
2012-01-01
Binaural hearing in cochlear implant (CI) users can be achieved either by bilateral implantation or bimodally with a contralateral hearing aid (HA). Binaural-bimodal hearing has the advantage of complementing the high-frequency electric information from the CI by low-frequency acoustic information from the HA. We examined the contribution of a…
Perceptual aspects of singing.
Sundberg, J
1994-06-01
The relations between acoustic and perceived characteristics of vowel sounds are demonstrated with respect to timbre, loudness, pitch, and expressive time patterns. The conditions for perceiving an ensemble of sine tones as one tone or several tones are reviewed. There are two aspects of timbre of voice sounds: vowel quality and voice quality. Although vowel quality depends mainly on the frequencies of the lowest two formants. In particular, the center frequency of the so-called singer's formant seems perceptually relevant. Vocal loudness, generally assumed to correspond closely to the sound pressure level, depends rather on the amplitude balance between the lower and the higher spectrum partials. The perceived pitch corresponds to the fundamental frequency, or for vibrato tones, the mean of this frequency. In rapid passages, such as coloratura singing, special patterns are used. Pitch and duration differences are categorically perceived in music. This means that small variations in tuning or duration do not affect the musical interval and the note value perceived. Categorical perception is used extensively in music performance for the purpose of musical expression because without violating the score, the singer may sharpen or flatten and lengthen or shorten the tones, thereby creating musical expression.
Fundamental frequency estimation of singing voice
NASA Astrophysics Data System (ADS)
de Cheveigné, Alain; Henrich, Nathalie
2002-05-01
A method of fundamental frequency (F0) estimation recently developped for speech [de Cheveigné and Kawahara, J. Acoust. Soc. Am. (to be published)] was applied to singing voice. An electroglottograph signal recorded together with the microphone provided a reference by which estimates could be validated. Using standard parameter settings as for speech, error rates were low despite the wide range of F0s (about 100 to 1600 Hz). Most ``errors'' were due to irregular vibration of the vocal folds, a sharp formant resonance that reduced the waveform to a single harmonic, or fast F0 changes such as in high-amplitude vibrato. Our database (18 singers from baritone to soprano) included examples of diphonic singing for which melody is carried by variations of the frequency of a narrow formant rather than F0. Varying a parameter (ratio of inharmonic to total power) the algorithm could be tuned to follow either frequency. Although the method has not been formally tested on a wide range of instruments, it seems appropriate for musical applications because it is accurate, accepts a wide range of F0s, and can be implemented with low latency for interactive applications. [Work supported by the Cognitique programme of the French Ministry of Research and Technology.
[Objective study of the voice quality following partial laryngectomy].
Remacle, M; Millet, B
1991-01-01
The high resolution frequency analyzer is used for the study of the vocal quality after partial laryngectomy. The post-operative plot after speech therapy is of good quality when respecting one vocal fold. On the contrary, the heard vocal sound does not correspond to the harmonics of the fundamental frequency but to intense noise from irregular vibrations of the residual laryngeal mucosa (ventricular folds, arytenoids). High resolution frequency analysis contributes to the follow-up of the partial laryngectomy.
The effect of change in spectral slope and formant frequencies on the perception of loudness.
Duvvuru, Sirisha; Erickson, Molly
2013-11-01
This study attempts to understand how changes in spectral slope and formant frequency influence changes in perceived loudness. It was hypothesized that voices synthesized with steeper spectral slopes will be perceived as less loud than voices synthesized with less steep spectral slopes, in spite of the fact that they are of equal root mean square (RMS) amplitude. It was also hypothesized that stimuli with higher formant patterns will be perceived as louder than those with lower formant patterns, in spite of the fact that they are of equal RMS amplitude. Repeated measures factorial design. For the pitches A3, C4, B4, and F5, three different source signals were synthesized with varying slopes of -9, -12, and -15 dB/octave using a frequency vibrato rate of 5.6 Hz and a frequency vibrato extent of 50 cents. Each of the three source signals were filtered using two formant patterns, a lower formant pattern typical of a mezzo-soprano (pattern A) and a higher formant pattern typical of a soprano (pattern B) for the vowel /a/. For each pitch, the six stimuli were combined into all possible pairs and normalized to equal RMS amplitude. Listeners were presented with 120 paired stimuli (60 pairs repeated twice). The listener's task was to indicate whether the first or second stimulus in the pair was louder. Generally, as the spectral slope decreased, perceived loudness increased, with the magnitude of the perceived difference in loudness being related to the degree of difference in spectral slope. Likewise, at all pitches except A3, perceived loudness increased as formant frequency increased. RMS amplitude is an important predictor of loudness perception, but many other factors also affect the perception of this important vocal parameter. Spectral composition is one such factor and must be considered when using loudness perception in the process of clinical diagnostics. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Arruda, Polyanna; Diniz da Rosa, Marine Raquel; Almeida, Larissa Nadjara Alves; de Araujo Pernambuco, Leandro; Almeida, Anna Alice
2018-03-07
Estradiol production varies cyclically, changes in levels are hypothesized to affect the voice. The main objective of this study was to investigate vocal acoustic and auditory-perceptual characteristics during fluctuations in the levels of the hormone estradiol during the menstrual cycle. A total of 44 volunteers aged between 18 and 45 were selected. Of these, 27 women with regular menstrual cycles comprised the test group (TG) and 17 combined oral contraceptive users comprised the control group (CG). The study was performed in two phases. In phase 1, anamnesis was performed. Subsequently, the TG underwent blood sample collection for measurement of estradiol levels and voice recording for later acoustic and auditory-perceptual analysis. The CG underwent only voice recording. Phase 2 involved the same measurements as phase 1 for each group. Variables were evaluated using descriptive and inferential analysis to compare groups and phases and to determine relationships between variables. Voice changes were found during the menstrual cycle, and such changes were determined to be related to variations in estradiol levels. Impaired voice quality was observed to be associated with decreased levels of estradiol. The CG did not demonstrate significant vocal changes during phases 1 and 2. The TG showed significant increases in vocal parameters of roughness, tension, and instability during phase 2 (the period of low estradiol levels) when compared with the CG. Low estradiol levels were also found to be negatively correlated with the parameters of tension, instability, and jitter and positively correlated with fundamental voice frequency. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Voice Signals Produced With Jitter Through a Stochastic One-mass Mechanical Model.
Cataldo, Edson; Soize, Christian
2017-01-01
The quasiperiodic oscillation of the vocal folds causes perturbations in the length of the glottal cycles, which are known as jitter. The observation of the glottal cycles variations suggests that jitter is a random phenomenon described by random deviations of the glottal cycle lengths in relation to a corresponding mean value and, in general, its values are expressed as a percentage of the duration of the glottal pulse. The objective of this paper is the construction of a stochastic model for jitter using a one-mass mechanical model of the vocal folds, which assumes complete right-left symmetry of the vocal folds, and which considers motions of the vocal folds only in the horizontal direction. The jitter has been the subject for researchers due to its important applications such as the identification of pathological voices (nodules in the vocal folds, paralysis of the vocal folds, or even, the vocal aging, among others). Large values for jitter variations can indicate a pathological characteristic of the voice. The corresponding stiffness of each vocal fold is considered as a stochastic process, and its modeling is proposed. The probability density function of the fundamental frequency related to the voice signals produced are constructed and compared for different levels of jitter. Some samples of synthesized voices in these cases are obtained. It is showed that jitter could be obtained using the model proposed. The Praat software was also used to verify the measures of jitter in the synthesized voice signals. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Hunter, Eric J.
2009-01-01
Objectives Building on the concept that task type may influence fundamental frequency (F0) values, the purpose of this case study was to investigate the difference in a child’s F0 during structured, elicited tasks and long-term, unstructured activities. It also explores the possibility that the distribution in children’s F0 may make the standard statistical measures of mean and standard deviation less than ideal metrics. Methods A healthy male child (5 years, 7 months) was evaluated. The child completed four voice tasks used in a previous study of the influence of task type on F0 values: (1) sustaining the vowel /a/; (2) sustaining the vowel, /a/, embedded in a word at the end of a phrase; (3) repeating a sentence; and (4) counting from 1 to 10. The child also wore a National Center for Voice and Speech voice dosimeter, a device that collects voice data over the course of an entire day, during all activities for 34 hours over 4 days. Results Throughout the structured vocal tasks within the clinical environment, the child’s F0, as measured by both the dosimeter and acoustic analysis of microphone data, was similar for all four tasks, with the counting task the most dissimilar. The mean F0 (~257 Hz) matched very closely to the average task results in the literature given for the child’s age group. However, the child’s mean fundamental frequency during the unstructured activities was significantly higher (~376 Hz). Finally, the mode and median of the structured vocal tasks were respectively 260 Hz and 259 Hz (both near the mean), while the unstructured mode and median were respectively 290 Hz and 355 Hz. Conclusions The results of this study suggest that children may produce a notably different voice pattern during clinical observations compared to routine daily activities. In addition, the child’s long-term F0 distribution is not normal. If this distribution is consistent in long-term, unstructured natural vocalization patterns of children, statistical mean would not be a valid measure. Mode and median are suggested as two parameters which convey more accurate information about typical F0 usage. Finally, future research avenues, including further exploration of how children may adapt their F0 to various environments, conversation partners, and activity, are suggested. PMID:19185926
Multidimensional vocal assessment after laser treatment for recurrent respiratory papillomatosis.
Kono, Takeyuki; Yabe, Haruna; Uno, Kosuke; Saito, Koichiro; Ogawa, Kaoru
2017-03-01
Recurrent respiratory papillomatosis (RRP) is a benign epithelial tumor that exhibits a high frequency of recurrence. This study assesses the vocal function after laser treatment for RRP, particularly in relation to the frequency of surgery. Retrospective study. Thirty RRP patients who underwent laser surgery that controlled the tumor were included. Preoperative and postoperative Grade, Roughness, Breathiness, Asthenia, and Strain Scale, videostroboscopic findings, aerodynamic and acoustic parameters, and self-assessment questionnaires were measured and compared with an age- and sex-matched control group. Subsequently, to evaluate the association between postoperative voice quality and the number of surgeries, the patients were divided into three groups (group 1: single surgery, group 2: 2-5 surgeries, group3: >6 surgeries), and comparative multidimensional vocal assessments were performed. The mean number of surgeries was 3.4 (range, 1-8). Although all patients exhibited poorer vocal function than the control group preoperatively, they showed improvement in postoperative subjective and objective parameters. However, four patients who underwent one surgery with relatively aggressive ablation exhibited vocal cord scarring and deteriorated objective parameters. All remaining patients showed voice quality that was on par with the control group. Subgroup analysis proved no association between post-therapeutic voice quality and the patient characteristics, including preoperative staging and the number of surgical treatments performed. RRP patients can achieve a close to normal voice with high satisfaction even after recurrent surgical treatment when ablation of a subepithelial lesion using sufficient laser energy is adequate. 3b Laryngoscope, 127:679-684, 2017. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
Vogel, Adam P; Fletcher, Janet; Snyder, Peter J; Fredrickson, Amy; Maruff, Paul
2011-03-01
Assessment of the voice for supporting classifications of central nervous system (CNS) impairment requires a different practical, methodological, and statistical framework compared with assessment of the voice to guide decisions about change in the CNS. In experimental terms, an understanding of the stability and sensitivity to change of an assessment protocol is required to guide decisions about CNS change. Five experiments (N = 70) were conducted using a set of commonly used stimuli (eg, sustained vowel, reading, extemporaneous speech) and easily acquired measures (eg, f₀-f₄, percent pause). Stability of these measures was examined through their repeated application in healthy adults over brief and intermediate retest intervals (ie, 30 seconds, 2 hours, and 1 week). Those measures found to be stable were then challenged using an experimental model that reliably changes voice acoustic properties (ie, the Lombard effect). Finally, adults with an established CNS-related motor speech disorder (dysarthria) were compared with healthy controls. Of the 61 acoustic variables studied, 36 showed good stability over all three stability experiments (eg, number of pauses, total speech time, speech rate, f₀-f₄. Of the measures with good stability, a number of frequency measures showed a change in response to increased vocal effort resulting from the Lombard effect challenge. Furthermore, several timing measures significantly separated the control and motor speech impairment groups. Measures with high levels of stability within healthy adults, and those that show sensitivity to change and impairment may prove effective for monitoring changes in CNS functioning. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
[Acoustic and aerodynamic characteristics of the oesophageal voice].
Vázquez de la Iglesia, F; Fernández González, S
2005-12-01
The aim of the study is to determine the physiology and pathophisiology of esophageal voice according to objective aerodynamic and acoustic parameters (quantitative and qualitative parameters). Our subjects were comprised of 33 laryngectomized patients (all male) that underwent aerodynamic, acoustic and perceptual protocol. There is a statistical association between acoustic and aerodynamic qualitative parameters (phonation flow chart type, sound spectrum, perceptual analysis) among quantitative parameters (neoglotic pressure, phonation flow, phonation time, fundamental frequency, maximum intensity sound level, speech rate). Nevertheles, not always such observations bring practical resources to clinical practice. We consider that the facts studied may enable us to add, pragmatically, new resources to the more effective vocal rehabilitation to these patients. The physiology of esophageal voice is well understood by the method we have applied, also seeking for rehabilitation, improving oral communication skills in the laryngectomee population.
Bele, Irene; Laukkanen, Anne-Maria; Sipilä, Laura
2010-12-01
Nine broadcast journalism students attended 10 hours in Kuukka vocal exercises, which aims at producing a ringing vocal quality. Nine control subjects received no training. A text was read at habitual loudness before and after the course. Five speech specialists evaluated the text samples for perceptual voice quality and analyzed mean fundamental frequency (F0), equivalent sound level (Leq), and long-term average spectrum (LTAS). For the Training Group, voice quality improved and correlated negatively with firmness and timbre (less firm and darker qualities being considered more desirable), and F0 increased slightly. Leq increased significantly in both groups. The results show positive and perceivable differences after the course. However, the aimed ring was not reached, may be due to too short time.
Garcia Perez, Alejandro; Hernández López, Xochiquetzal; Valadez Jiménez, Víctor Manuel; Minor Martínez, Arturo; Ysunza, Pablo Antonio
2014-07-01
Although electrical stimulation of the larynx has been widely studied for treating voice disorders, its effectiveness has not been assessed under safety and comfortable conditions. This article describes design, theoretical issues, and preliminary evaluation of an innovative system for transdermal electrical stimulation of the larynx. The proposed design includes synchronization of electrical stimuli with laryngeal neuromuscular activity. To study whether synchronous electrical stimulation of the larynx could be helpful for improving voice quality in patients with dysphonia due to unilateral recurrent laryngeal nerve paralysis (URLNP). A 3-year prospective study was carried out at the Instituto Nacional de Rehabilitacion in the Mexico City. Ten patients were subjected to transdermal current electrical stimulation synchronized with the fundamental frequency of the vibration of the vocal folds during phonation. The stimulation was triggered during the phase of maximum glottal occlusion. A complete acoustic voice analysis was performed before and after the period of electrical stimulation. Acoustic analysis revealed significant improvements in all parameters after the stimulation period. Transdermal synchronous electrical stimulation of vocal folds seems to be a safe and reliable procedure for enhancing voice quality in patients with (URLNP). Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Acoustic correlate of vocal effort in spasmodic dysphonia.
Eadie, Tanya L; Stepp, Cara E
2013-03-01
This study characterized the relationship between relative fundamental frequency (RFF) and listeners' perceptions of vocal effort and overall spasmodic dysphonia severity in the voices of 19 individuals with adductor spasmodic dysphonia. Twenty inexperienced listeners evaluated the vocal effort and overall severity of voices using visual analog scales. The squared correlation coefficients (R2) between average vocal effort and overall severity and RFF measures were calculated as a function of the number of acoustic instances used for the RFF estimate (from 1 to 9, of a total of 9 voiced-voiceless-voiced instances). Increases in the number of acoustic instances used for the RFF average led to increases in the variance predicted by the RFF at the first cycle of voicing onset (onset RFF) in the perceptual measures; the use of 6 or more instances resulted in a stable estimate. The variance predicted by the onset RFF for vocal effort (R2 range, 0.06 to 0.43) was higher than that for overall severity (R2 range, 0.06 to 0.35). The offset RFF was not related to the perceptual measures, irrespective of the sample size. This study indicates that onset RFF measures are related to perceived vocal effort in patients with adductor spasmodic dysphonia. These results have implications for measuring outcomes in this population.
Soltis, Joseph; Blowers, Tracy E; Savage, Anne
2011-02-01
As in other mammals, there is evidence that the African elephant voice reflects affect intensity, but it is less clear if positive and negative affective states are differentially reflected in the voice. An acoustic comparison was made between African elephant "rumble" vocalizations produced in negative social contexts (dominance interactions), neutral social contexts (minimal social activity), and positive social contexts (affiliative interactions) by four adult females housed at Disney's Animal Kingdom®. Rumbles produced in the negative social context exhibited higher and more variable fundamental frequencies (F(0)) and amplitudes, longer durations, increased voice roughness, and higher first formant locations (F1), compared to the neutral social context. Rumbles produced in the positive social context exhibited similar shifts in most variables (F(0 )variation, amplitude, amplitude variation, duration, and F1), but the magnitude of response was generally less than that observed in the negative context. Voice roughness and F(0) observed in the positive social context remained similar to that observed in the neutral context. These results are most consistent with the vocal expression of affect intensity, in which the negative social context elicited higher intensity levels than the positive context, but differential vocal expression of positive and negative affect cannot be ruled out.
Pützer, Manfred; Wokurek, Wolfgang; Moringlane, Jean Richard
2017-07-01
The effect of deep brain stimulation (DBS) on phonatory behavior and voice quality in eight patients with multiple sclerosis (MS) was examined instrumentally and perceptually. The acoustic signals of vowel productions obtained from patients (produced with and without stimulation) and from a group of 16 healthy control speakers were analyzed to prove statistically the changes of phonatory behavior and voice quality. This is a randomized study. Firstly, a new parametrization was used to determine phonatory behavior. Secondly, a perceptual evaluation of voice quality of the same speech material was performed. With stimulation, phonation has a greater tendency to be strained. The results of perceptual evaluation support this strained phonation behavior under stimulation, resulting in a smaller degree of breathiness ratings of all raters. Without stimulation, an impaired and partly disturbed adduction of the vocal folds can be shown. These findings are also supported in the perceptual experiment providing a higher degree of hoarseness ratings of all raters for these signals. High-frequency electrical impulses to the thalamus in patients with MS influence the phonatory behavior of their vocal folds. The results suggest the need for long-term monitoring of phonatory behavior during DBS to initiate adequate treatments without delay. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Compensation for pitch-shifted auditory feedback during the production of Mandarin tone sequences
NASA Astrophysics Data System (ADS)
Xu, Yi; Larson, Charles R.; Bauer, Jay J.; Hain, Timothy C.
2004-08-01
Recent research has found that while speaking, subjects react to perturbations in pitch of voice auditory feedback by changing their voice fundamental frequency (F0) to compensate for the perceived pitch-shift. The long response latencies (150-200 ms) suggest they may be too slow to assist in on-line control of the local pitch contour patterns associated with lexical tones on a syllable-to-syllable basis. In the present study, we introduced pitch-shifted auditory feedback to native speakers of Mandarin Chinese while they produced disyllabic sequences /ma ma/ with different tonal combinations at a natural speaking rate. Voice F0 response latencies (100-150 ms) to the pitch perturbations were shorter than syllable durations reported elsewhere. Response magnitudes increased from 50 cents during static tone to 85 cents during dynamic tone productions. Response latencies and peak times decreased in phrases involving a dynamic change in F0. The larger response magnitudes and shorter latency and peak times in tasks requiring accurate, dynamic control of F0, indicate this automatic system for regulation of voice F0 may be task-dependent. These findings suggest that auditory feedback may be used to help regulate voice F0 during production of bi-tonal Mandarin phrases.
47 CFR 2.1047 - Measurements required: Modulation characteristics.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 47 Telecommunication 1 2010-10-01 2010-10-01 false Measurements required: Modulation characteristics. 2.1047 Section 2.1047 Telecommunication FEDERAL COMMUNICATIONS COMMISSION GENERAL FREQUENCY... Certification § 2.1047 Measurements required: Modulation characteristics. (a) Voice modulated communication...
47 CFR 2.1047 - Measurements required: Modulation characteristics.
Code of Federal Regulations, 2011 CFR
2011-10-01
... 47 Telecommunication 1 2011-10-01 2011-10-01 false Measurements required: Modulation characteristics. 2.1047 Section 2.1047 Telecommunication FEDERAL COMMUNICATIONS COMMISSION GENERAL FREQUENCY... Certification § 2.1047 Measurements required: Modulation characteristics. (a) Voice modulated communication...
Non-invasive distress evaluation in preterm newborn infants.
Manfredi, C; Bocchi, L; Orlandi, S; Calisti, M; Spaccaterra, L; Donzelli, G P
2008-01-01
With the increased survival of very preterm infants, there is a growing concern for their developmental outcomes. Infant cry characteristics reflect the development and possibly the integrity of the central nervous system. In this paper, relationships between fundamental frequency (F(0)) and vocal tract resonance frequencies (F(1)-F(3)) are investigated for a set of preterm newborns, by means of a multi-purpose voice analysis tool (BioVoice), characterised by high-resolution and tracking capabilities. Also, first results about possible distress occurring during cry in preterm newborn infants, as related to the decrease of central blood oxygenation, are presented. To this aim, a recording system (Newborn Recorder) has been developed, that allows synchronised, non-invasive monitoring of blood oxygenation and audio recordings of newborn infant's cry. The method has been applied to preterm newborns at the Intensive Care Unit, A.Meyer Children Hospital, Firenze, Italy.
Doppler-multipath tolerant voice communication
NASA Astrophysics Data System (ADS)
Harris, R. M.
Line of sight communication between high performance aircraft has been found to be subject to a peculiar form of multipath radio wave propagation - Doppler multipath. It degrades analogue voice reception on the standard fit ultrahigh frequency radio, producing low frequency random noise and warbling. Various modifications were carried out on the aircraft's communications system, but the problem remained. All the evidence points to a natural phenomenon. The reported observations are corroborated by theoretical studies and laboratory simulations of multipath radio wave propagation between two points moving relative to a diffusely scattering reflector. Theoretical predictions of Rician fading have explained the disruption of speech transmitted using conventional dsb(am) modulation. This also indicated suppressing the carrier as a radical cure. Double sideband suppressed carrier radios have been developed for airborne evaluation in comparison with standard dsb(am). The air to air flying trials proved the superior performance of the suppressed carrier system under conditions of Doppler multipath.
Correlation between vocal tract symptoms and modern singing handicap index in church gospel singers.
Pinheiro, Joel; Silverio, Kelly Cristina Alves; Siqueira, Larissa Thaís Donalonso; Ramos, Janine Santos; Brasolotto, Alcione Ghedini; Zambon, Fabiana; Behlau, Mara
2017-08-24
To verify the correlation between vocal tract discomfort symptoms and perceived voice handicaps in gospel singers, analyzing possible differences according to gender. 100 gospel singers volunteered, 50 male and 50 female. All participants answered two questionnaires: Vocal Tract Discomfort (VTD) scale and the Modern Singing Handicap Index (MSHI) that investigates the vocal handicap perceived by singers, linking the results of both instruments (p<0.05). Women presented more perceived handicaps and also more frequent and higher intensity vocal tract discomfort. Furthermore, the more frequent and intense the vocal tract symptoms, the higher the vocal handicap for singing. Female gospel singers present higher frequency and intensity of vocal tract discomfort symptoms, as well as higher voice handicap for singing than male gospel singers. The higher the frequency and intensity of the laryngeal symptoms, the higher the vocal handicap will be.
On noninvasive assessment of acoustic fields acting on the fetus
NASA Astrophysics Data System (ADS)
Antonets, V. A.; Kazakov, V. V.
2014-05-01
The aim of this study is to verify a noninvasive technique for assessing the characteristics of acoustic fields in the audible range arising in the uterus under the action of maternal voice, external sounds, and vibrations. This problem is very important in view of actively developed methods for delivery of external sounds to the uterus: music, maternal voice recordings, sounds from outside the mother's body, etc., that supposedly support development of the fetus at the prenatal stage psychologically and cognitively. However, the parameters of acoustic signals have been neither measured nor normalized, which may be dangerous for the fetus and hinder actual assessment of their impact on fetal development. The authors show that at frequencies below 1 kHz, acoustic pressure in the uterus may be measured noninvasively using a hydrophone placed in a soft capsule filled with liquid. It was found that the acoustic field at frequencies up to 1 kHz arising in the uterus under the action of an external sound field has amplitude-frequency parameters close to those of the external field; i.e., the external field penetrates the uterus with hardly any difficulty.
Space-to-Space Communications System
NASA Technical Reports Server (NTRS)
Tu, Kwei; Gaylor, Kent; Vitalpur, Sharada; Sham, Cathy
1999-01-01
The Space-to-Space Communications System (SSCS) is an Ultra High Frequency (UHF) Time-Division-Multiple Access (TDMA) system that is designed, developed, and deployed by the NASA Johnson Space Center (JSC) to provide voice, commands, telemetry and data services in close proximity among three space elements: International Space Station (ISS), Space Shuttle Orbiter, and Extravehicular Mobility Units (EMU). The SSCS consists of a family of three radios which are, Space-to-Space Station Radio (SSSR), Space-to-Space Orbiter Radio (SSOR), and Space-to-Space Extravehicular Mobility Radio (SSER). The SSCS can support up to five such radios at a time. Each user has its own time slot within which to transmit voice and data. Continuous Phase Frequency Shift Keying (CPFSK) carrier modulation with a burst data rate of 695 kbps and a frequency deviation of 486.5 kHz is employed by the system. Reed-Solomon (R-S) coding is also adopted to ensure data quality. In this paper, the SSCS system requirements, operational scenario, detailed system architecture and parameters, link acquisition strategy, and link performance analysis will be presented and discussed
Construction site Voice Operated Information System (VOIS) test
NASA Astrophysics Data System (ADS)
Lawrence, Debbie J.; Hettchen, William
1991-01-01
The Voice Activated Information System (VAIS), developed by USACERL, allows inspectors to verbally log on-site inspection reports on a hand held tape recorder. The tape is later processed by the VAIS, which enters the information into the system's database and produces a written report. The Voice Operated Information System (VOIS), developed by USACERL and Automated Sciences Group, through a ESACERL cooperative research and development agreement (CRDA), is an improved voice recognition system based on the concepts and function of the VAIS. To determine the applicability of the VOIS to Corps of Engineers construction projects, Technology Transfer Test Bad (T3B) funds were provided to the Corps of Engineers National Security Agency (NSA) Area Office (Fort Meade) to procure and implement the VOIS, and to train personnel in its use. This report summarizes the NSA application of the VOIS to quality assurance inspection of radio frequency shielding and to progress payment logs, and concludes that the VOIS is an easily implemented system that can offer improvements when applied to repetitive inspection procedures. Use of VOIS can save time during inspection, improve documentation storage, and provide flexible retrieval of stored information.
Davidow, Jason H; Bothe, Anne K; Richardson, Jessica D; Andreatta, Richard D
2010-12-01
This study introduces a series of systematic investigations intended to clarify the parameters of the fluency-inducing conditions (FICs) in stuttering. Participants included 11 adults, aged 20-63 years, with typical speech-production skills. A repeated measures design was used to examine the relationships between several speech production variables (vowel duration, voice onset time, fundamental frequency, intraoral pressure, pressure rise time, transglottal airflow, and phonated intervals) and speech rate and instatement style during metronome-entrained rhythmic speech. Measures of duration (vowel duration, voice onset time, and pressure rise time) differed across different metronome conditions. When speech rates were matched between the control condition and metronome condition, voice onset time was the only variable that changed. Results confirm that speech rate and instatement style can influence speech production variables during the production of fluency-inducing conditions. Future studies of normally fluent speech and of stuttered speech must control both features and should further explore the importance of voice onset time, which may be influenced by rate during metronome stimulation in a way that the other variables are not.
Computational Modeling of Fluid–Structure–Acoustics Interaction during Voice Production
Jiang, Weili; Zheng, Xudong; Xue, Qian
2017-01-01
The paper presented a three-dimensional, first-principle based fluid–structure–acoustics interaction computer model of voice production, which employed a more realistic human laryngeal and vocal tract geometries. Self-sustained vibrations, important convergent–divergent vibration pattern of the vocal folds, and entrainment of the two dominant vibratory modes were captured. Voice quality-associated parameters including the frequency, open quotient, skewness quotient, and flow rate of the glottal flow waveform were found to be well within the normal physiological ranges. The analogy between the vocal tract and a quarter-wave resonator was demonstrated. The acoustic perturbed flux and pressure inside the glottis were found to be at the same order with their incompressible counterparts, suggesting strong source–filter interactions during voice production. Such high fidelity computational model will be useful for investigating a variety of pathological conditions that involve complex vibrations, such as vocal fold paralysis, vocal nodules, and vocal polyps. The model is also an important step toward a patient-specific surgical planning tool that can serve as a no-risk trial and error platform for different procedures, such as injection of biomaterials and thyroplastic medialization. PMID:28243588
What can vortices tell us about vocal fold vibration and voice production.
Khosla, Sid; Murugappan, Shanmugam; Gutmark, Ephraim
2008-06-01
Much clinical research on laryngeal airflow has assumed that airflow is unidirectional. This review will summarize what additional knowledge can be obtained about vocal fold vibration and voice production by studying rotational motion, or vortices, in laryngeal airflow. Recent work suggests two types of vortices that may strongly contribute to voice quality. The first kind forms just above the vocal folds during glottal closing, and is formed by flow separation in the glottis; these flow separation vortices significantly contribute to rapid closing of the glottis, and hence, to producing loudness and high frequency harmonics in the acoustic spectrum. The second is a group of highly three-dimensional and coherent supraglottal vortices, which can produce sound by interaction with structures in the vocal tract. Present work is also described that suggests that certain laryngeal pathologies, such as asymmetric vocal fold tension, will significantly modify both types of vortices, with adverse impact on sound production: decreased rate of glottal closure, increased broadband noise, and a decreased signal to noise ratio. Recent research supports the hypothesis that glottal airflow contains certain vortical structures that significantly contribute to voice quality.
Rank-frequency distributions of Romanian words
NASA Astrophysics Data System (ADS)
Cocioceanu, Adrian; Raportaru, Carina Mihaela; Nicolin, Alexandru I.; Jakimovski, Dragan
2017-12-01
The calibration of voice biometrics solutions requires detailed analyses of spoken texts and in this context we investigate by computational means the rank-frequency distributions of Romanian words and word series to determine the most common words and word series of the language. To this end, we have constructed a corpus of approximately 2.5 million words and then determined that the rank-frequency distributions of the Romanian words, as well as series of two, and three subsequent words, obey the celebrated Zipf law.
Neural Correlates of Vocal Production and Motor Control in Human Heschl's Gyrus
Oya, Hiroyuki; Nourski, Kirill V.; Kawasaki, Hiroto; Larson, Charles R.; Brugge, John F.; Howard, Matthew A.; Greenlee, Jeremy D.W.
2016-01-01
The present study investigated how pitch frequency, a perceptually relevant aspect of periodicity in natural human vocalizations, is encoded in Heschl's gyrus (HG), and how this information may be used to influence vocal pitch motor control. We recorded local field potentials from multicontact depth electrodes implanted in HG of 14 neurosurgical epilepsy patients as they vocalized vowel sounds and received brief (200 ms) pitch perturbations at 100 Cents in their auditory feedback. Event-related band power responses to vocalizations showed sustained frequency following responses that tracked voice fundamental frequency (F0) and were significantly enhanced in posteromedial HG during speaking compared with when subjects listened to the playback of their own voice. In addition to frequency following responses, a transient response component within the high gamma frequency band (75–150 Hz) was identified. When this response followed the onset of vocalization, the magnitude of the response was the same for the speaking and playback conditions. In contrast, when this response followed a pitch shift, its magnitude was significantly enhanced during speaking compared with playback. We also observed that, in anterolateral HG, the power of high gamma responses to pitch shifts correlated with the magnitude of compensatory vocal responses. These findings demonstrate a functional parcellation of HG with neural activity that encodes pitch in natural human voice, distinguishes between self-generated and passively heard vocalizations, detects discrepancies between the intended and heard vocalization, and contains information about the resulting behavioral vocal compensations in response to auditory feedback pitch perturbations. SIGNIFICANCE STATEMENT The present study is a significant contribution to our understanding of sensor-motor mechanisms of vocal production and motor control. The findings demonstrate distinct functional parcellation of core and noncore areas within human auditory cortex on Heschl's gyrus that process natural human vocalizations and pitch perturbations in the auditory feedback. In addition, our data provide evidence for distinct roles of high gamma neural oscillations and frequency following responses for processing periodicity in human vocalizations during vocal production and motor control. PMID:26888939
Modulation of voice related to tremor and vibrato
NASA Astrophysics Data System (ADS)
Lester, Rosemary Anne
Modulation of voice is a result of physiologic oscillation within one or more components of the vocal system including the breathing apparatus (i.e., pressure supply), the larynx (i.e. sound source), and the vocal tract (i.e., sound filter). These oscillations may be caused by pathological tremor associated with neurological disorders like essential tremor or by volitional production of vibrato in singers. Because the acoustical characteristics of voice modulation specific to each component of the vocal system and the effect of these characteristics on perception are not well-understood, it is difficult to assess individuals with vocal tremor and to determine the most effective interventions for reducing the perceptual severity of the disorder. The purpose of the present studies was to determine how the acoustical characteristics associated with laryngeal-based vocal tremor affect the perception of the magnitude of voice modulation, and to determine if adjustments could be made to the voice source and vocal tract filter to alter the acoustic output and reduce the perception of modulation. This research was carried out using both a computational model of speech production and trained singers producing vibrato to simulate laryngeal-based vocal tremor with different voice source characteristics (i.e., vocal fold length and degree of vocal fold adduction) and different vocal tract filter characteristics (i.e., vowel shapes). It was expected that, by making adjustments to the voice source and vocal tract filter that reduce the amplitude of the higher harmonics, the perception of magnitude of voice modulation would be reduced. The results of this study revealed that listeners' perception of the magnitude of modulation of voice was affected by the degree of vocal fold adduction and the vocal tract shape with the computational model, but only by the vocal quality (corresponding to the degree of vocal fold adduction) with the female singer. Based on regression analyses, listeners' judgments were predicted by modulation information in both low and high frequency bands. The findings from these studies indicate that production of a breathy vocal quality might be a useful compensatory strategy for reducing the perceptual severity of modulation of voice for individuals with tremor affecting the larynx.
[Role of psychological factors in pathogenesis of disturbances of voice caused with vocal nodules].
Ratajczak, Jan; Grzywacz, Krzysztof; Wojdas, Andrzej; Rapiejko, Piotr; Jurkiewicz, Dariusz
2008-01-01
Hoarseness is most frequent complaint notified by ill in phoniatric outpatient clinics. Looking of causes notified of disturbances of voice often we ascertain in larynx existence of vocal nodules. Changes these come into being in consequence of excessive or irregular phonations. Single incident of disturbances of voice caused with oedema changes nascent of in consequence of inappropriate work with voice does not wake of our trouble, instead returns this of type of complaint provoke to other researches coexisting of etiological factors this diseases. Estimation of influence of individual personality trait of ill on formation of vocal nodules. One examined 20 patients with vocal nodules classified to treatments operating and 20 without disturbances of voice. All patients were subjected to otolaryngological and stroboscopic examinations. Character created of voice one examined at help of scale GRBAS, instead influence of disturbances of voice on quality of life ill at help of test VHI. Psychological examinations one executed using questionnaire State-Trait-Anxiety-Inventory (STAI), questionnaire NEO-FFI and questionnaire of aggression Buss-Perry. Obtained results showed, that persons with returning vocal nodules, both during of research as in different situations everyday lives characterizes with higher level of fear and have greater inclination to worry oneself. Ill from groups examined in greater degree are extroverts, show greater activity and more are contagious socially in comparison to persons of comparative group. Attitude this in situations extorting rivalry will be able to be ruthless, are well organized guided, scrupulous and consistently endeavour to aim. Wanting efficiently to treat persons with returning vocal nodules we should subject to ill psychological examination and in once of ascertainment of irregularity to correct it, what at simultaneous correct treatment of organic changes should diminish frequency or to eliminate returns of disease. Skill psychological looks on patient by therapists treating disturbances of voice and speeches in case not large emotional instabilities probably would be able to improve results of treatment ill not only with functional disturbances of voice but also with disturbances of voice caused with organic changes in larynx.
Maciejewska, Barbara; Rajewska-Rager, Aleksandra; Maciejewska-Szaniec, Zofia; Michalak, Michał; Rajewski, Andrzej; Wiskirska-Woźnica, Bożena
2016-06-01
Chronic undernourishment in the course of anorexia nervosa leads to various metabolic and hormonal changes, which translates to the impaired functioning of the majority of systems and internal organs. The impact of eating disorders on the condition of the vocal apparatus has been described in the literature; nevertheless, it concerns mainly bulimia nervosa. assessment of the vocal apparatus in adolescent girls diagnosed with anorexia nervosa from the point of view of possible influence on the function and structure of the larynx, low body mass accompanying anorexia, as well as energy deficiency, hormonal and emotional disturbances. The research included 41 girls aged 12-19 years, diagnosed with anorexia, who were assessed for the condition of the vocal apparatus, using the perceptual assessment of voice according to GRBAS scale, videolarynostroboscopy, acoustic assessment, and voice self-assessment in Jacobson's VHI scale (voice handicap index). The perceptual assessment of voice using the GRBAS scale revealed that changes in voice were mainly weak, asthenic in nature (70.73%) and there was also the feature of puffing perceived in voice (41.46%). In voice self-assessment with the use of VHI, most subjects seemed to point to changes of voice self-perception in emotional subscale (68%). Videolaryngostroboscopy revealed some features of functional disturbances of voice in more than half of subjects, mainly in the form of hyperfunctional dysphonia (31.78%). The maximal phonation time was significantly shorter, in proportion to duration of the primary disease. In the acoustic analysis, the decrease in the basic frequency F0 and narrowing of the voice scale were observed. 55% of older, post-adolescent patients presented with the structure of the larynx that was inappropriate for their age. These results might indicate that anorexia nervosa could have led to the structural and functional changes in the vocal apparatus. Such disturbances may be explained by the hormonal dysfunctions as well as starvation. Hormonal substitution at the appropriate time might be beneficial for the structure and phonation function of the larynx in girls with AN. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Larrouy-Maestri, Pauline; Magis, David; Morsomme, Dominique
2014-05-01
The operatic singing technique is frequently used in classical music. Several acoustical parameters of this specific technique have been studied but how these parameters combine remains unclear. This study aims to further characterize the Western operatic singing technique by observing the effects of melody and technique on acoustical and musical parameters of the singing voice. Fifty professional singers performed two contrasting melodies (popular song and romantic melody) with two vocal techniques (with and without operatic singing technique). The common quality parameters (energy distribution, vibrato rate, and extent), perturbation parameters (standard deviation of the fundamental frequency, signal-to-noise ratio, jitter, and shimmer), and musical features (fundamental frequency of the starting note, average tempo, and sound pressure level) of the 200 sung performances were analyzed. The results regarding the effect of melody and technique on the acoustical and musical parameters show that the choice of melody had a limited impact on the parameters observed, whereas a particular vocal profile appeared depending on the vocal technique used. This study confirms that vocal technique affects most of the parameters examined. In addition, the observation of quality, perturbation, and musical parameters contributes to a better understanding of the Western operatic singing technique. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Fundamental frequency perturbation indicates perceived health and age in male and female speakers
NASA Astrophysics Data System (ADS)
Feinberg, David R.
2004-05-01
There is strong support for the idea that healthy vocal chords are able to produce fundamental frequencies (F0) with minimal perturbation. Measures of F0 perturbation have been shown to discriminate pathological versus healthy populations. In addition to measuring vocal chord health, F0 perturbation is a correlate of real and perceived age. Here, the role of jitter (periodic variation in F0) and shimmer (periodic variation in amplitude of F0) in perceived health and age in a young adult (males aged 18-33, females aged 18-26), nondysphonic population was investigated. Voices were assessed for health and age by peer aged, opposite-sex raters. Jitter and shimmer were measured with Praat software (www.praat.org) using various algorithms (jitter: DDP, local, local absolute, PPQ5, and RAP; shimmer: DDA, local, local absolute, APQ3, APQ5, APQ11) to reduce measurement error, and to ascertain the robustness of the findings. Male and female voices were analyzed separately. In both sexes, ratings of health and age were significantly correlated. Measures of jitter and shimmer correlated negatively with perceived health, and positively with perceived age. Further analysis revealed that these effects were independent in male voices. Implications of this finding are that attributions of vocal health and age may reflect actual underlying condition.
Acoustic correlates of Japanese expressions associated with voice quality of male adults
NASA Astrophysics Data System (ADS)
Kido, Hiroshi; Kasuya, Hideki
2004-05-01
Japanese expressions associated with the voice quality of male adults were extracted by a series of questionnaire surveys and statistical multivariate analysis. One hundred and thirty-seven Japanese expressions were collected through the first questionnaire and careful investigations of well-established Japanese dictionaries and articles. From the second questionnaire about familiarity with each of the expressions and synonymity that were addressed to 249 subjects, 25 expressions were extracted. The third questionnaire was about an evaluation of their own voice quality. By applying a statistical clustering method and a correlation analysis to the results of the questionnaires, eight bipolar expressions and one unipolar expression were obtained. They constituted high-pitched/low-pitched, masculine/feminine, hoarse/clear, calm/excited, powerful/weak, youthful/elderly, thick/thin, tense/lax, and nasal, respectively. Acoustic correlates of each of the eight bipolar expressions were extracted by means of perceptual evaluation experiments that were made with sentence utterances of 36 males and by a statistical decision tree method. They included an average of the fundamental frequency (F0) of the utterance, speaking rate, spectral tilt, formant frequency parameter, standard deviation of F0 values, and glottal noise, when SPL of each of the stimuli was maintained identical in the perceptual experiments.
Evaluation of Different Speech and Touch Interfaces to In-Vehicle Music Retrieval Systems
Garay-Vega, L.; Pradhan, A. K.; Weinberg, G.; Schmidt-Nielsen, B.; Harsham, B.; Shen, Y.; Divekar, G.; Romoser, M.; Knodler, M.; Fisher, D. L.
2010-01-01
In-vehicle music retrieval systems are becoming more and more popular. Previous studies have shown that they pose a real hazard to drivers when the interface is a tactile one which requires multiple entries and a combination of manual control and visual feedback. Voice interfaces exist as an alternative. Such interfaces can require either multiple or single conversational turns. In this study, each of 17 participants between the ages of 18 and 30 years old was asked to use three different music-retrieval systems (one with a multiple entry touch interface, the iPod™, one with a multiple turn voice interface, interface B, and one with a single turn voice interface, interface C) while driving through a virtual world. Measures of secondary task performance, eye behavior, vehicle control, and workload were recorded. When compared with the touch interface, the voice interfaces reduced the total time drivers spent with their eyes off the forward roadway, especially in prolonged glances, as well as both the total number of glances away from the forward roadway and the perceived workload. Furthermore, when compared with driving without a secondary task, both voice interfaces did not significantly impact hazard anticipation, the frequency of long glances away from the forward roadway, or vehicle control. The multiple turn voice interface (B) significantly increased both the time it took drivers to complete the task and the workload. The implications for interface design and safety are discussed. PMID:20380920
McAllister, Anita; Brandt, Signe Kofoed
2012-09-01
A well-controlled recording in a studio is fundamental in most voice rehabilitation. However, this laboratory like recording method has been questioned because voice use in a natural environment may be quite different. In children's natural environment, high background noise levels are common and are an important factor contributing to voice problems. The primary noise source in day-care centers is the children themselves. The aim of the present study was to compare perceptual evaluations of voice quality and acoustic measures from a controlled recording with recordings of spontaneous speech in children's natural environment in a day-care setting. Eleven 5-year-old children were recorded three times during a day at the day care. The controlled speech material consisted of repeated sentences. Matching sentences were selected from the spontaneous speech. All sentences were repeated three times. Recordings were randomized and analyzed acoustically and perceptually. Statistic analyses showed that fundamental frequency was significantly higher in spontaneous speech (P<0.01) as was hyperfunction (P<0.001). The only characteristic the controlled sentences shared with spontaneous speech was degree of hoarseness (Spearman's rho=0.564). When data for boys and girls were analyzed separately, a correlation was found for the parameter breathiness (rho=0.551) for boys, and for girls the correlation for hoarseness remained (rho=0.752). Regarding acoustic data, none of the measures correlated across recording conditions for the whole group. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
[The lombard reflex as a test of vocal function (author's transl)].
Schultz-Coulon, H J; Fues, C P
1976-06-01
Any impairment of audio-phonatory control by background noise is followed by an increase in both the intensity and pitch of the speaking voice (Lombard reflex, 1911), thus increasing vocal strain. As a consequence, it might be anticipated that persons reacting to noise with marked changes in voice might be more liable to develop dysphonia. 22 singers, 34 normal controls, and 22 patients with hyperfunctional dysphonia where studied. In all patients, both ears were gradually masked with white noise. The change of the mean intensity level and of the mean pitch level of the speaking voice were then measured objectively with a special fundamental frequency analyzer (Fedders and Schultz-Coulon, 1975). Results show that the increase of intensity is comparable in all subjects, whereas the elevation of the mean pitch level differs significantly: trained voices (singers) react with the least pitch increment whereas dysphonic patients react with the most. The following conclusions were made from the present investigation: 1. Extreme increments in pitch level can be considered to be a more significant etiological factor of dysphonia than intensity increments; 2. Vocal therapy and voice training may have a favorable effect on the Lombard reflex (probably by improvement of the kinesthetic control mechanism) so that the speaking voice in a noisy environment is raised less with less vocal strain. The study also indicates that measurement of pitch changes during binaural masking can provide important information for the diagnosis, therapy and prophylaxis of dysphonia.
Cartei, Valentina; Bond, Rod; Reby, David
2014-09-01
Men's voices contain acoustic cues to body size and hormonal status, which have been found to affect women's ratings of speaker size, masculinity and attractiveness. However, the extent to which these voice parameters mediate the relationship between speakers' fitness-related features and listener's judgments of their masculinity has not yet been investigated. We audio-recorded 37 adult heterosexual males performing a range of speech tasks and asked 20 adult heterosexual female listeners to rate speakers' masculinity on the basis of their voices only. We then used a two-level (speaker within listener) path analysis to examine the relationships between the physiological (testosterone, height), acoustic (fundamental frequency or F0, and resonances or ΔF) and perceptual dimensions (listeners' ratings) of speakers' masculinity. Overall, results revealed that male speakers who were taller and had higher salivary testosterone levels also had lower F0 and ΔF, and were in turn rated as more masculine. The relationship between testosterone and perceived masculinity was essentially mediated by F0, while that of height and perceived masculinity was partially mediated by both F0 and ΔF. These observations confirm that women listeners attend to sexually dimorphic voice cues to assess the masculinity of unseen male speakers. In turn, variation in these voice features correlate with speakers' variation in stature and hormonal status, highlighting the interdependence of these physiological, acoustic and perceptual dimensions. Copyright © 2014. Published by Elsevier Inc.
[Acoustic characteristics of adductor spasmodic dysphonia].
Yang, Yang; Wang, Li-Ping
2008-06-01
To explore the acoustic characteristics of adductor spasmodic dysphonia. The acoustic characteristics, including acoustic signal of recorded voice, three-dimensional sonogram patterns and subjective assessment of voice, between 10 patients (7 women, 3 men) with adductor spasmodic dysphonia and 10 healthy volunteers (5 women, 5 men), were compared. The main clinical manifestation of adductor spasmodic dysphonia included the disorders of sound quality, rhyme and fluency. It demonstrated the tension dysphonia when reading, acoustic jitter, momentary fluctuation of frequency and volume, voice squeezing, interruption, voice prolongation, and losing normal chime. Among 10 patients, there were 1 mild dysphonia (abnormal syllable number < 25%), 6 moderate dysphonia (abnormal syllable number 25%-49%), 1 severe dysphonia (abnormal syllable number 50%-74%) and 2 extremely severe dysphonia (abnormal syllable number > or = 75%). The average reading time in 10 patients was 49 s, with reading time extension and aphasia area interruption in acoustic signals, whereas the average reading time in health control group was 30 s, without voice interruption. The aphasia ratio averaged 42%. The respective symptom syllable in different patients demonstrated in the three-dimensional sonogram. There were voice onset time prolongation, irregular, interrupted and even absent vowel formants. The consonant of symptom syllables displayed absence or prolongation of friction murmur in the block-friction murmur occasionally. The acoustic characteristics of adductor spasmodic dysphonia is the disorders of sound quality, rhyme and fluency. The three-dimensional sonogram of the symptom syllables show distinctive changes of proportional vowels or consonant phonemes.
Perrodin, Catherine; Kayser, Christoph; Logothetis, Nikos K.; Petkov, Christopher I.
2015-01-01
When social animals communicate, the onset of informative content in one modality varies considerably relative to the other, such as when visual orofacial movements precede a vocalization. These naturally occurring asynchronies do not disrupt intelligibility or perceptual coherence. However, they occur on time scales where they likely affect integrative neuronal activity in ways that have remained unclear, especially for hierarchically downstream regions in which neurons exhibit temporally imprecise but highly selective responses to communication signals. To address this, we exploited naturally occurring face- and voice-onset asynchronies in primate vocalizations. Using these as stimuli we recorded cortical oscillations and neuronal spiking responses from functional MRI (fMRI)-localized voice-sensitive cortex in the anterior temporal lobe of macaques. We show that the onset of the visual face stimulus resets the phase of low-frequency oscillations, and that the face–voice asynchrony affects the prominence of two key types of neuronal multisensory responses: enhancement or suppression. Our findings show a three-way association between temporal delays in audiovisual communication signals, phase-resetting of ongoing oscillations, and the sign of multisensory responses. The results reveal how natural onset asynchronies in cross-sensory inputs regulate network oscillations and neuronal excitability in the voice-sensitive cortex of macaques, a suggested animal model for human voice areas. These findings also advance predictions on the impact of multisensory input on neuronal processes in face areas and other brain regions. PMID:25535356
Comparison of voice-use profiles between elementary classroom and music teachers.
Morrow, Sharon L; Connor, Nadine P
2011-05-01
Among teachers, music teachers are roughly four times more likely than classroom teachers to develop voice-related problems. Although it has been established that music teachers use their voices at high intensities and durations in the course of their workday, voice-use profiles concerning the amount and intensity of vocal use and vocal load have neither been quantified nor has vocal load for music teachers been compared with classroom teachers using these same voice-use parameters. In this study, total phonation time, fundamental frequency (F₀), and vocal intensity (dB SPL [sound pressure level]) were measured or estimated directly using a KayPENTAX Ambulatory Phonation Monitor (KayPENTAX, Lincoln Park, NJ). Vocal load was calculated as cycle and distance dose, as defined by Švec et al (2003), which integrates total phonation time, F₀, and vocal intensity. Twelve participants (n = 7 elementary music teachers and n = 5 elementary classroom teachers) were monitored during five full teaching days of one workweek to determine average vocal load for these two groups of teachers. Statistically significant differences in all measures were found between the two groups (P < 0.05) with large effect sizes for all parameters. These results suggest that typical vocal loads for music teachers are substantially higher than those experienced by classroom teachers (P < 0.01). This study suggests that reducing vocal load may have immediate clinical and educational benefits in vocal health in music teachers. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Audio-vocal system regulation in children with autism spectrum disorders.
Russo, Nicole; Larson, Charles; Kraus, Nina
2008-06-01
Do children with autism spectrum disorders (ASD) respond similarly to perturbations in auditory feedback as typically developing (TD) children? Presentation of pitch-shifted voice auditory feedback to vocalizing participants reveals a close coupling between the processing of auditory feedback and vocal motor control. This paradigm was used to test the hypothesis that abnormalities in the audio-vocal system would negatively impact ASD compensatory responses to perturbed auditory feedback. Voice fundamental frequency (F(0)) was measured while children produced an /a/ sound into a microphone. The voice signal was fed back to the subjects in real time through headphones. During production, the feedback was pitch shifted (-100 cents, 200 ms) at random intervals for 80 trials. Averaged voice F(0) responses to pitch-shifted stimuli were calculated and correlated with both mental and language abilities as tested via standardized tests. A subset of children with ASD produced larger responses to perturbed auditory feedback than TD children, while the other children with ASD produced significantly lower response magnitudes. Furthermore, robust relationships between language ability, response magnitude and time of peak magnitude were identified. Because auditory feedback helps to stabilize voice F(0) (a major acoustic cue of prosody) and individuals with ASD have problems with prosody, this study identified potential mechanisms of dysfunction in the audio-vocal system for voice pitch regulation in some children with ASD. Objectively quantifying this deficit may inform both the assessment of a subgroup of ASD children with prosody deficits, as well as remediation strategies that incorporate pitch training.
Experimental analysis of the characteristics of artificial vocal folds.
Misun, Vojtech; Svancara, Pavel; Vasek, Martin
2011-05-01
Specialized literature presents a number of models describing the function of the vocal folds. In most of those models, an emphasis is placed on the air flowing through the glottis and, further, on the effect of the parameters of the air alone (its mass, speed, and so forth). The article focuses on the constructional definition of artificial vocal folds and their experimental analysis. The analysis is conducted for voiced source voice phonation and for the changing mean value of the subglottal pressure. The article further deals with the analysis of the pressure of the airflow through the vocal folds, which is cut (separated) into individual pulses by the vibrating vocal folds. The analysis results show that air pulse characteristics are relevant to voice generation, as they are produced by the flowing air and vibrating vocal folds. A number of artificial vocal folds have been constructed to date, and the aforementioned view of their phonation is confirmed by their analysis. The experiments have confirmed that man is able to consciously affect only two parameters of the source voice, that is, its fundamental frequency and voice intensity. The main forces acting on the vocal folds during phonation are as follows: subglottal air pressure and elastic and inertia forces of the vocal folds' structure. The correctness of the function of the artificial vocal folds is documented by the experimental verification of the spectra of several types of artificial vocal folds. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Kahler, Christopher W; Lechner, William J; MacGlashan, James; Wray, Tyler B; Littman, Michael L
2017-06-28
Computer-delivered interventions have been shown to be effective in reducing alcohol consumption in heavy drinking college students. However, these computer-delivered interventions rely on mouse, keyboard, or touchscreen responses for interactions between the users and the computer-delivered intervention. The principles of motivational interviewing suggest that in-person interventions may be effective, in part, because they encourage individuals to think through and speak aloud their motivations for changing a health behavior, which current computer-delivered interventions do not allow. The objective of this study was to take the initial steps toward development of a voice-based computer-delivered intervention that can ask open-ended questions and respond appropriately to users' verbal responses, more closely mirroring a human-delivered motivational intervention. We developed (1) a voice-based computer-delivered intervention that was run by a human controller and that allowed participants to speak their responses to scripted prompts delivered by speech generation software and (2) a text-based computer-delivered intervention that relied on the mouse, keyboard, and computer screen for all interactions. We randomized 60 heavy drinking college students to interact with the voice-based computer-delivered intervention and 30 to interact with the text-based computer-delivered intervention and compared their ratings of the systems as well as their motivation to change drinking and their drinking behavior at 1-month follow-up. Participants reported that the voice-based computer-delivered intervention engaged positively with them in the session and delivered content in a manner consistent with motivational interviewing principles. At 1-month follow-up, participants in the voice-based computer-delivered intervention condition reported significant decreases in quantity, frequency, and problems associated with drinking, and increased perceived importance of changing drinking behaviors. In comparison to the text-based computer-delivered intervention condition, those assigned to voice-based computer-delivered intervention reported significantly fewer alcohol-related problems at the 1-month follow-up (incident rate ratio 0.60, 95% CI 0.44-0.83, P=.002). The conditions did not differ significantly on perceived importance of changing drinking or on measures of drinking quantity and frequency of heavy drinking. Results indicate that it is feasible to construct a series of open-ended questions and a bank of responses and follow-up prompts that can be used in a future fully automated voice-based computer-delivered intervention that may mirror more closely human-delivered motivational interventions to reduce drinking. Such efforts will require using advanced speech recognition capabilities and machine-learning approaches to train a program to mirror the decisions made by human controllers in the voice-based computer-delivered intervention used in this study. In addition, future studies should examine enhancements that can increase the perceived warmth and empathy of voice-based computer-delivered intervention, possibly through greater personalization, improvements in the speech generation software, and embodying the computer-delivered intervention in a physical form. ©Christopher W Kahler, William J Lechner, James MacGlashan, Tyler B Wray, Michael L Littman. Originally published in JMIR Mental Health (http://mental.jmir.org), 28.06.2017.
3D simulation of an audible ultrasonic electrolarynx using difference waves.
Mills, Patrick; Zara, Jason
2014-01-01
A total laryngectomy removes the vocal folds which are fundamental in forming voiced sounds that make speech possible. Although implanted prosthetics are commonly used in developed countries, simple handheld vibrating electrolarynxes are still common worldwide. These devices are easy to use but suffer from many drawbacks including dedication of a hand, mechanical sounding voice, and sound leakage. To address some of these drawbacks, we introduce a novel electrolarynx that uses vibro-acoustic interference of dual ultrasonic waves to generate an audible fundamental frequency. A 3D simulation of the principles of the device is presented in this paper.
Wistbacka, Greta; Andrade, Pedro Amarante; Simberg, Susanna; Hammarberg, Britta; Södersten, Maria; Švec, Jan G; Granqvist, Svante
2018-01-01
Resonance tube phonation with tube end in water is a voice therapy method in which the patient phonates through a glass tube, keeping the free end of the tube submerged in water, creating bubbles. The purpose of this experimental study was to determine flow-pressure relationship, flow thresholds between bubble types, and bubble frequency as a function of flow and back volume. A flow-driven vocal tract simulator was used for recording the back pressure produced by resonance tubes with inner diameters of 8 and 9 mm submerged at water depths of 0-7 cm. Visual inspection of bubble types through video recording was also performed. The static back pressure was largely determined by the water depth. The narrower tube provided a slightly higher back pressure for a given flow and depth. The amplitude of the pressure oscillations increased with flow and depth. Depending on flow, the bubbles were emitted from the tube in three distinct types with increasing flow: one by one, pairwise, and in a chaotic manner. The bubble frequency was slightly higher for the narrower tube. An increase in back volume led to a decrease in bubble frequency. This study provides data on the physical properties of resonance tube phonation with the tube end in water. This information will be useful in future research when looking into the possible effects of this type of voice training. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Distinct Acoustic Features and Glottal Changes Define Two Modes of Singing in Peking Opera.
Li, Gelin; Li, Haiqing; Hou, Qian; Jiang, Zhen
2018-04-06
We aimed to delineate the acoustic characteristics of the Laodan and Qingyi role in Peking Opera and define glottis closure states and mucosal wave changes during singing in the two roles. The range of singing in A4 (440 Hz) pitch in seven female Peking Opera singers was determined using two classic pieces of Peking Opera. Glottal changes during singing were examined by stroboscopic laryngoscope. The fundamental frequency of /i/ in the first 15 seconds of the two pieces and the /i/ pitch range were determined. The relative length of the glottis fissure and the relative maximum mucosal amplitude were calculated. Qingyi had significantly higher mean fundamental frequency than Laodan. The long-term average spectrum showed an obvious formant cluster near 3000 Hz in Laodan versus Qingyi. No formant cluster was observed in singing in the regular mode. Strobe laryngoscopy showed complete glottal closure in Laodan and incomplete glottal closure in Qingyi in the maximal glottis closure phase. The relative length of the glottis fissure of Laodan was significantly lower than that of Qingyi in the singing mode. The relative maximum mucosal amplitude of Qingyi was significantly lower than that of Laodan. The Laodan role and the Qingyi role in Peking Opera sing in a fundamental frequency range compatible with the respective use of da sang (big voice) and xiao sang (small voice). The morphological patterns of glottal changes also indicate that the Laodan role and the Qingyi role sing with da sang and xiao sang, respectively. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Very-High-Frequency Aerosat Airborne Terminal
DOT National Transportation Integrated Search
1977-12-01
This report summarizes the result of a study aimed at defining the airborne VHF terminal for the experimental AEROSAT program. The system consists of a 22-channel VHF transceiver for full-duplex operation. Provisions are made for voice, data, and sur...
An Analysis of Tower (Ground) Controller - Pilot Voice Communications
DOT National Transportation Integrated Search
1995-11-01
This report is based on an analysis of over 48 hours of pilot-controller communications recorded from the ground-control : frequency at twelve air traffic control towers. The analysis examined the complexity of controller instructions, that : is, how...
Karasu, Mehmet Fatih; Gundogdu, Ramazan; Cagli, Sedat; Aydin, Mesut; Arli, Turan; Aydemir, Samet; Yuce, Imdat
2014-05-01
To compare the effects on voice of endolaryngeal microsurgery (EMS) with cold instruments and a new method, "diode laser," for vocal fold polyps. Fifty-one patients with vocal fold polyps suffering from dysphonia who were treated in the Erciyes University Department of Otolaryngology were included in the study. Voice analysis was performed in a soundproof room, holding the microphone 15 cm away from the patients' mouth and by recording a sustained [a] vowel for at least 10 seconds. Fundamental frequency (F0), Jitter, Shimmer, and noise-to-harmonic ratio (NHR) parameters were evaluated in terms of vocal analysis. All patients were asked for to fill in a questionnaire, after being informed about the voice handicap index (VHI). EMS was performed with a diode laser and cold knife on 26 and 25 patients, respectively. Patient follow-up was performed 8 weeks after surgery. Changes in F0, Jitter, Shimmer, and NHR values were measured and recorded. VHI was also completed and reassessed. There was a significant difference in each technique's VHI score between the preoperative and postoperative questionnaire (P < 0.001). Postoperatively, there was no significant difference in VHI scores between two groups (P > 0.05). There was a significant difference in voice analysis values measured preoperatively and at the postoperative controls for both groups (P < 0.05). Postoperatively, there was no significant difference in voice analysis values between two groups (P > 0.05). In the treatment of vocal polyps, EMS with both diode laser and traditional cold knife is effective. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Evaluating voice characteristics of first-year acting students in Israel: factor analysis.
Amir, Ofer; Primov-Fever, Adi; Kushnir, Tami; Kandelshine-Waldman, Osnat; Wolf, Michael
2013-01-01
Acting students require diverse, high-quality, and high-intensity vocal performance from early stages of their training. Demanding vocal activities, before developing the appropriate vocal skills, put them in high risk for developing vocal problems. A retrospective analysis of voice characteristics of first-year acting students using several voice evaluation tools. A total of 79 first-year acting students (55 women and 24 men) were assigned into two study groups: laryngeal findings (LFs) and no laryngeal findings, based on stroboscopic findings. Their voice characteristics were evaluated using acoustic analysis, aerodynamic examination, perceptual scales, and self-report questionnaires. Results obtained from each set of measures were examined using a factor analysis approach. Significant differences between the two groups were found for a single fundamental frequency (F(0))-Regularity factor; a single Grade, Roughness, Breathiness, Asthenia, Strain perceptual factor; and the three self-evaluation factors. Gender differences were found for two acoustic analysis factors, which were based on F(0) and its derivatives, namely an aerodynamic factor that represents expiratory volume measurements and a single self-evaluation factor that represents the tendency to seek therapy. Approximately 50% of the first-year acting students had LFs. These students differed from their peers in the control group in a single acoustic analysis factor, as well as perceptual and self-report factors. No group differences, however, were found for the aerodynamic factors. Early laryngeal examination and voice evaluation of future professional voice users could provide a valuable individual baseline, to which later examinations could be compared, and assist in providing personally tailored treatment. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Kluender, K R; Lotto, A J
1994-02-01
When F1-onset frequency is lower, longer F1 cut-back (VOT) is required for human listeners to perceive synthesized stop consonants as voiceless. K. R. Kluender [J. Acoust. Soc. Am. 90, 83-96 (1991)] found comparable effects of F1-onset frequency on the "labeling" of stop consonants by Japanese quail (coturnix coturnix japonica) trained to distinguish stop consonants varying in F1 cut-back. In that study, CVs were synthesized with natural-like rising F1 transitions, and endpoint training stimuli differed in the onset frequency of F1 because a longer cut-back resulted in a higher F1 onset. In order to assess whether earlier results were due to auditory predispositions or due to animals having learned the natural covariance between F1 cut-back and F1-onset frequency, the present experiment was conducted with synthetic continua having either a relatively low (375 Hz) or high (750 Hz) constant-frequency F1. Six birds were trained to respond differentially to endpoint stimuli from three series of synthesized /CV/s varying in duration of F1 cut-back. Second and third formant transitions were appropriate for labial, alveolar, or velar stops. Despite the fact that there was no opportunity for animal subjects to use experienced covariation of F1-onset frequency and F1 cut-back, quail typically exhibited shorter labeling boundaries (more voiceless stops) for intermediate stimuli of the continua when F1 frequency was higher. Responses by human subjects listening to the same stimuli were also collected. Results lend support to the earlier conclusion that part or all of the effect of F1 onset frequency on perception of voicing may be adequately explained by general auditory processes.(ABSTRACT TRUNCATED AT 250 WORDS)
Cross-Linguistic Differences in Bilinguals' Fundamental Frequency Ranges.
Ordin, Mikhail; Mennen, Ineke
2017-06-10
We investigated cross-linguistic differences in fundamental frequency range (FFR) in Welsh-English bilingual speech. This is the first study that reports gender-specific behavior in switching FFRs across languages in bilingual speech. FFR was conceptualized as a behavioral pattern using measures of span (range of fundamental frequency-in semitones-covered by the speaker's voice) and level (overall height of fundamental frequency maxima, minima, and means of speaker's voice) in each language. FFR measures were taken from recordings of 30 Welsh-English bilinguals (14 women and 16 men), who read 70 semantically matched sentences, 35 in each language. Comparisons were made within speakers across languages, separately in male and female speech. Language background and language use information was elicited for qualitative analysis of extralinguistic factors that might affect the FFR. Cross-linguistic differences in FFR were found to be consistent across female bilinguals but random across male bilinguals. Most female bilinguals showed distinct FFRs for each language. Most male bilinguals, however, were found not to change their FFR when switching languages. Those who did change used different strategies than women when differentiating FFRs between languages. Detected cross-linguistic differences in FFR can be explained by sociocultural factors. Therefore, sociolinguistic factors are to be taken into account in any further study of language-specific pitch setting and cross-linguistic differences in FFR.
Event identification by acoustic signature recognition
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dress, W.B.; Kercel, S.W.
1995-07-01
Many events of interest to the security commnnity produce acoustic emissions that are, in principle, identifiable as to cause. Some obvious examples are gunshots, breaking glass, takeoffs and landings of small aircraft, vehicular engine noises, footsteps (high frequencies when on gravel, very low frequencies. when on soil), and voices (whispers to shouts). We are investigating wavelet-based methods to extract unique features of such events for classification and identification. We also discuss methods of classification and pattern recognition specifically tailored for acoustic signatures obtained by wavelet analysis. The paper is divided into three parts: completed work, work in progress, and futuremore » applications. The completed phase has led to the successful recognition of aircraft types on landing and takeoff. Both small aircraft (twin-engine turboprop) and large (commercial airliners) were included in the study. The project considered the design of a small, field-deployable, inexpensive device. The techniques developed during the aircraft identification phase were then adapted to a multispectral electromagnetic interference monitoring device now deployed in a nuclear power plant. This is a general-purpose wavelet analysis engine, spanning 14 octaves, and can be adapted for other specific tasks. Work in progress is focused on applying the methods previously developed to speaker identification. Some of the problems to be overcome include recognition of sounds as voice patterns and as distinct from possible background noises (e.g., music), as well as identification of the speaker from a short-duration voice sample. A generalization of the completed work and the work in progress is a device capable of classifying any number of acoustic events-particularly quasi-stationary events such as engine noises and voices and singular events such as gunshots and breaking glass. We will show examples of both kinds of events and discuss their recognition likelihood.« less
Kendall, Katherine A; Leonard, Rebecca J
2011-01-01
Up to one-third of patients presenting with adductor spasmodic dysphonia will have an associated vocal tremor. These patients may not respond fully to treatment using thyroarytenoid (TA) muscle botulinum toxin (Botox) injection. Treatment failures are attributed to the involvement of multiple muscle groups in the tremor. This study evaluates the results of combined interarytenoid (IA) and TA muscle Botox injection in a group of 27 patients with adductor spasmodic dysphonia and vocal tremor and in four patients with severe vocal tremor alone. Patient-satisfaction data were reviewed retrospectively. Pre- and postinjection acoustic data were collected prospectively. Acoustic measures of fundamental frequency and cycle-by-cycle variability in frequency (jitter) and intensity (shimmer) were obtained from 15 patients' sustained vowel productions. Measures were collected after TA muscle injection, alone, and after combined TA and IA (TA+IA) muscle injections. In addition, two experienced voice clinicians blindly assessed tremor severity from recordings made for each patient in the two conditions. Patients were also queried regarding their satisfaction with the results of the injections and whether they desired to continue receiving TA+IA treatment. Significant improvement in all acoustic measures except for % jitter was observed after the TA+IA muscle injections. Listeners identified voice samples after TA+IA muscle injections as demonstrating less tremor in 73% of the paired comparisons. Sixty-seven percent of the patients with spasmodic dysphonia and vocal tremor wished to continue to receive IA muscle injections. Only one patient with severe vocal tremor wished to continue with injections. The addition of an IA muscle Botox injection to the treatment of patients with a combination adductor spasmodic dysphonia and vocal tremor may improve voice outcomes. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Indoor Air Problems and Hoarseness in Children.
Kallvik, Emma; Putus, Tuula; Simberg, Susanna
2016-01-01
A well-functioning voice is becoming increasingly important because voice-demanding professions are increasing. The largest proportion of voice disorders is caused by factors in the environment. Moisture damage is common and can initiate microbial growth and/or diffusion of chemicals from building materials. Indoor air problems due to moisture damage are associated with a number of health symptoms, for example, rhinitis, cough, and asthma symptoms. The purpose of this study was to investigate if children attending a day care center, preschool, or school with indoor air problems due to moisture damage were hoarse more often than the children in a control group. Information was collected through electronic and paper questionnaires from the parents of 6- to 9-year-old children (n = 1857) attending 57 different day care centers, preschools, or schools with or without indoor air problems due to moisture damage. The results showed a significant correlation between the degree of indoor air problem due to moisture damage and the frequency of hoarseness. Significant predictors for the child being hoarse every week or more often were dry cough, phlegm cough, and nasal congestion. The results indicate that these symptoms and exposure to indoor air problems due to moisture damage should be included in voice anamnesis. Furthermore, efforts should be made to remediate indoor air problems due to moisture damage and to treat health symptoms. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Analysis of Measured and Simulated Supraglottal Acoustic Waves.
Fraile, Rubén; Evdokimova, Vera V; Evgrafova, Karina V; Godino-Llorente, Juan I; Skrelin, Pavel A
2016-09-01
To date, although much attention has been paid to the estimation and modeling of the voice source (ie, the glottal airflow volume velocity), the measurement and characterization of the supraglottal pressure wave have been much less studied. Some previous results have unveiled that the supraglottal pressure wave has some spectral resonances similar to those of the voice pressure wave. This makes the supraglottal wave partially intelligible. Although the explanation for such effect seems to be clearly related to the reflected pressure wave traveling upstream along the vocal tract, the influence that nonlinear source-filter interaction has on it is not as clear. This article provides an insight into this issue by comparing the acoustic analyses of measured and simulated supraglottal and voice waves. Simulations have been performed using a high-dimensional discrete vocal fold model. Results of such comparative analysis indicate that spectral resonances in the supraglottal wave are mainly caused by the regressive pressure wave that travels upstream along the vocal tract and not by source-tract interaction. On the contrary and according to simulation results, source-tract interaction has a role in the loss of intelligibility that happens in the supraglottal wave with respect to the voice wave. This loss of intelligibility mainly corresponds to spectral differences for frequencies above 1500 Hz. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Fowler, Linda P; Gorham-Rowan, Mary; Hapner, Edie R
2011-01-01
The purpose of this study was to determine if measurable changes in fundamental frequency (F(0)) and relative sound level (RSL) occurred in healthy speakers after transcutaneous electrical stimulation (TES) as applied via VitalStim (Chattanooga Group, Chattanooga, TN). A prospective, repeated-measures design. Ten healthy female and 10 healthy male speakers, 20-53 years of age, participated in the study. All participants were nonsmokers and reported negative history for voice disorders. Participants received 1 hour of TES while engaged in eating, drinking, and conversation to simulate a typical dysphagia therapy protocol. Voice recordings were obtained before and immediately after TES. The voice samples consisted of a sustained vowel task and reading of the Rainbow Passage. Measurements of F(0) and RSL were obtained using TF32 (Milenkovic, 2005, University of Wisconsin). The participants also reported any sensations 5 minutes and 24 hours after TES. Measurable changes in F(0) and RSL were found for both tasks but were variable in direction and magnitude. These changes were not statistically significant. Subjective comments ranged from reports of a vocal warm-up feeling to delayed onset muscle soreness. These findings demonstrate that application of TES produces measurable changes in F(0) and RSL. However, the direction and magnitude of these changes are highly variable. Further research is needed to determine factors that may affect the extent to which TES contributes to significant changes in voice. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Noise Source Visualization Using a Digital Voice Recorder and Low-Cost Sensors
Cho, Yong Thung
2018-01-01
Accurate sound visualization of noise sources is required for optimal noise control. Typically, noise measurement systems require microphones, an analog-digital converter, cables, a data acquisition system, etc., which may not be affordable for potential users. Also, many such systems are not highly portable and may not be convenient for travel. Handheld personal electronic devices such as smartphones and digital voice recorders with relatively lower costs and higher performance have become widely available recently. Even though such devices are highly portable, directly implementing them for noise measurement may lead to erroneous results since such equipment was originally designed for voice recording. In this study, external microphones were connected to a digital voice recorder to conduct measurements and the input received was processed for noise visualization. In this way, a low cost, compact sound visualization system was designed and introduced to visualize two actual noise sources for verification with different characteristics: an enclosed loud speaker and a small air compressor. Reasonable accuracy of noise visualization for these two sources was shown over a relatively wide frequency range. This very affordable and compact sound visualization system can be used for many actual noise visualization applications in addition to educational purposes. PMID:29614038
Objective and Subjective Aspects of Voice in Pregnancy.
Saltürk, Ziya; Kumral, Tolgar Lütfi; Bekiten, Güler; Atar, Yavuz; Ataç, Enes; Aydoğdu, İmran; Yıldırım, Güven; Kılıç, Aydın; Uyar, Yavuz
2016-01-01
This study aimed to evaluate vocal changes in pregnancy according to trimesters both objectively and subjectively. Fifty pregnant women and 15 nonpregnant women were included in the study. Eighteen of the 50 pregnant women were in the first trimester, 17 in the second trimester, and 15 in the third trimester of their pregnancies. The fundamental frequency (F0), jitter, shimmer, noise-to-harmonics ratio (NHR), and minimum and maximum pitch were determined during acoustic voice analysis. Laryngologic examination was evaluated via reflux finding score (RFS). Voice Handicap Index 10 (VHI-10) was used for subjective analysis. Maximum phonation time (MPT), VHI-10, and RFS were the parameters that differed significantly. MPT was significantly shorter in the third trimester. Acoustic analysis revealed that F0, jitter, shimmer, NHR, and minimum and maximum pitch values were not significantly different in any groups. RFS was higher in the first and third trimesters than the second trimester and control groups. VHI-10 scores were significantly higher in the third trimester. Our results showed that MPT is decreased during the third trimester, although acoustic parameters did not differ. VHI-10 results deteriorated in the third trimester significantly. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
The effect of voice amplification on occupational vocal dose in elementary school teachers.
Gaskill, Christopher S; O'Brien, Shenendoah G; Tinter, Sara R
2012-09-01
Two elementary school teachers, one with and one without a history of vocal complaints, wore a vocal dosimeter all day at school for a 3-week period. In the second week, each teacher wore a portable voice amplifier. Each teacher showed a reduction in vocal intensity during the week of amplification, with a larger effect for the teacher with vocal difficulties. This teacher also showed a decrease in hourly vocal fold distance dose as measured by the dosimeter despite incurring longer phonation times. Fundamental frequency and vocal fold cycle dose did not appear to be affected by the use of amplification during the teaching day. Both teachers showed evidence of a possible moderate effect of adjusting vocal intensity in the week after amplification, possibly as a means to recalibrate their perceived vocal loudness. This study demonstrates the usefulness of both vocal dosimetry and amplification in monitoring and modifying vocal dose in an occupational setting and reinforces previous data suggesting the effectiveness of amplification in reducing the vocal load in schoolteachers. Implications of the data for future research regarding prevention and treatment of occupational voice disorders are discussed. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Relation between voice disorders and work in a group of Community Health Workers.
Cipriano, Fabiana Gonçalves; Ferreira, Léslie Piccolotto; Servilha, Emilse Aparecida Merlin; Marsiglia, Regina Maria Giffoni
2013-01-01
To analyze the relationship between voice disorders and work in a group of Community Health Agents (CHA). The subjects of this study were 65 CHA working in the city of São Paulo. Thefiinstrument used for data collection was an adaptation of the questionnaire named Conditions of Vocal Production - Teachers (CPV-P). The results were keyed in twice and submitted to statistical analysis, in order to verify: the self-reported frequency of voice disorder frequency of present vocal symptoms, the association among the three most frequently reported present symptoms, and environmental and organizational aspects of work. Of the 65 (100%) CHA in the study, 37 (56.9%) self-reported having present or past vocal disorders. The most frequently reported present symptoms were: dry throat, tiredness when speaking, and burning sensation in the throat. There was significant association between: taking work to home, having personal items stolen, police intervention, violence against employees and vocal symptom dry throat, not having enough time to complete all tasks, difficulty in leaving work, inadequate furniture, intense physical strain, objects stolen from the health unit, racism and vocal symptom tiredness when speaking, dust, job dissatisfaction, work stress, building destruction, drug issues, and vocal symptom burning in throat. Based on the obtained results, the initial hypothesis of association between the development of vocal disorders among the subjects and the adversities present in their work environment and organization was confirmed.
Relationship of the Cricothyroid Space with Vocal Range in Female Singers.
Pullon, Beverley
2017-01-01
This study aims to investigate the relationship between the anterior cricothyroid (CT) space at rest with vocal range in female singers. Potential associations with and between voice categories, age, ethnicity, anthropometric indices, neck dimensions, laryngeal dimensions, vocal data along with habitual speaking fundamental frequency were also explored. This is a cohort study. Laryngeal dimensions anterior CT space and heights of the thyroid and cricoid cartilages were measured using ultrasound in 43 healthy, classically trained, female singers during quiet respiration. Voice categories (soprano and mezzo-soprano), age, ethnicity, weight, height, body mass index, neck circumference and length, anterior thyroid and cricoid cartilage heights, practice and performance vocal range, lowest and highest practice and performance notes along with habitual speaking fundamental frequency were collected. The main finding was that mezzo-sopranos have a significantly wider resting CT space than sopranos (11.6 mm versus 10.4 mm; P = 0.007). Mezzo-sopranos also had significantly lower "lowest and highest" performance notes than sopranos. There was no significant correlation between the magnitudes of the anterior CT space with vocal range. The participants with the narrowest and widest anterior CT space had similar vocal ranges. These results suggest that the CT space is not the major determinant of performance vocal range. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Mishima, Katsuaki; Moritani, Norifumi; Nakano, Hiroyuki; Matsushita, Asuka; Iida, Seiji; Ueyama, Yoshiya
2013-12-01
The purpose of this study was to explore the voice characteristics of patients with mandibular prognathism, and to investigate the effects of mandibular setback surgery on these characteristics using nonlinear dynamics and conventional acoustic analyses. Sixteen patients (8 males and 8 females) who had skeletal 3, class III malocclusion without cleft palate, and who underwent a bilateral sagittal split ramus osteotomy (BSSRO), were enrolled. As controls, 50 healthy adults (25 males and 25 females) were enrolled. The mean first LEs (mLE1) computed for each one-second interval, and the fundamental frequency (F0) and frequencies of the first and second formant (F1, F2) were calculated for each Japanese vowel. The mLE1s for /u/ in males, and /o/ in females and the F2s for /i/ and /u/ in males, changed significantly after BSSRO. Class III voice characteristics were observed in the mLE1s for /i/ in both males and females, in the F0 for /a/, /i/, /u/ and /o/ in females, and in the F1 and F2 for /a/ in males, and the F1 for /u/ and the F2 for /i/ in females. Most of these characteristics were preserved after BSSRO. Copyright © 2013 European Association for Cranio-Maxillo-Facial Surgery. Published by Elsevier Ltd. All rights reserved.
Kokinous, Jenny; Tavano, Alessandro; Kotz, Sonja A; Schröger, Erich
2017-02-01
The role of spatial frequencies (SF) is highly debated in emotion perception, but previous work suggests the importance of low SFs for detecting emotion in faces. Furthermore, emotion perception essentially relies on the rapid integration of multimodal information from faces and voices. We used EEG to test the functional relevance of SFs in the integration of emotional and non-emotional audiovisual stimuli. While viewing dynamic face-voice pairs, participants were asked to identify auditory interjections, and the electroencephalogram (EEG) was recorded. Audiovisual integration was measured as auditory facilitation, indexed by the extent of the auditory N1 amplitude suppression in audiovisual compared to an auditory only condition. We found an interaction of SF filtering and emotion in the auditory response suppression. For neutral faces, larger N1 suppression ensued in the unfiltered and high SF conditions as compared to the low SF condition. Angry face perception led to a larger N1 suppression in the low SF condition. While the results for the neural faces indicate that perceptual quality in terms of SF content plays a major role in audiovisual integration, the results for angry faces suggest that early multisensory integration of emotional information favors low SF neural processing pathways, overruling the predictive value of the visual signal per se. Copyright © 2016 Elsevier B.V. All rights reserved.
Can Inexperienced Listeners Hear Who Is Flat? The Role of Timbre and Vibrato.
Erickson, Molly L
2016-09-01
Research has shown that the distribution of spectral energy and the presence of vibrato in a complex tone can affect pitch perception. This study sought to answer the questions: "Does timbre affect the perception of difference in pitch in complex synthetic stimuli modeled after singing voices?" "Does vibrato affect the perception of difference in pitch in complex synthetic stimuli modeled after singing voices?" and "Does the direction of timbre difference affect the perception of pitch difference?" This is a repeated-measures factorial design. The experiment consisted of three experimental blocks at the pitches A3, G4, and F5, each with a vibrato and no-vibrato subblock. For each block, two reference stimuli (mezzo-soprano and soprano) and six test stimuli (mezzo-soprano at frequencies of -1%, -2%, and -3%, soprano at frequencies of -1%, -2%, and -3%) were synthesized on the vowel /ɑ/. Each reference stimulus was paired with itself, with the other reference stimulus, and with all the test stimuli. Vibrato stimuli had a rate of 5.6 Hz and a frequency vibrato extent of ±50 cents. Listeners indicated the degree to which stimuli differed in pitch. Differences in timbre and vibrato were significant main effects on the perception of pitch difference. The direction of timbre difference was a consistent significant effect on the perception of pitch difference for the pitch G4; however, this was not a consistent effect at the pitches A3 and F5. Numerous factors can affect the perception of pitch including timbre and presence of vibrato. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Effects of Weight Loss on Acoustic Parameters After Bariatric Surgery.
de Souza, Lourdes Bernadete Rocha; Dos Santos, Marquiony Marques; Pernambuco, Leandro Araújo; de Almeida Godoy, Cynthia Meira; da Silva Lima, Deysianne Meire
2018-05-01
Patients with morbid obesity may present vocal alterations, since large accumulation of fat in the vocal tract may interfere with voice production of these individuals. Verify the neck circumference and the acoustic parameters of voice in obese women, before and after the bariatric surgery, and compare the results with a control group, with normal weight. Observational, longitudinal, descriptive study with patients referred to the SCODE (Obesity Surgery and Related Disorders Center) in a university hospital. The sample consisted of 25 morbidly obese women, age range 28-43 years and 23 non-obese women, aged 21-41 years control group. To measure the neck circumference, a tape measure was used and all participants were seated upright with the head positioned in the Frankfort horizontal plane. The fundamental frequency was calculated through the sustained emission of vowel [a] at usual intensity and pitch, to measure the fundamental frequency of the voice, that is, how much the vocal fold vibrates per second. After the recording, participants were prompted to produce vowels [a], [i], and [u] sustained at usual intensity and pitch, and a stopwatch was used to measure the maximum phonation time, to verify the balance between myoelastic and dynamic forces of the larynx. After 8 months post-surgery, the patients were recruited to be re-evaluated using the same pre-surgical data collection procedures. There was an increase in the mean value of f0. The maximum phonation time of all vowels increased after surgery. Obese individuals with post-surgery weight loss may present neck circumference, fundamental frequency, and maximum phonation time values closer to the mean values of normal weight individuals. In this study, weight loss was sufficient to adjust the acoustic parameter measurements.
Fundamental frequency characteristics of Jordanian Arabic speakers.
Natour, Yaser S; Wingate, Judith M
2009-09-01
This study is the first in a series of investigations designed to test the acoustic characteristics of the normal Arabic voice. The subjects were three hundred normal Jordanian Arabic speakers (100 adult males, 100 adult females, and 100 children). The subjects produced a sustained phonation of the vowel /a:/ and stated their complete names (i.e. first, second, third and surname) using a carrier phrase. The samples were analyzed using the Multi Dimensional Voice Program (MDVP). Fundamental frequency (F0) from the /a:/ and speaking fundamental frequency (SF0) from the sentence were analyzed. Results revealed a significant difference of both F0 and SF0 values among adult Jordanian Arabic-speaking males (F0=131.34Hz +/- 18.65, SF0=137.45 +/- 18.93), females (F0=231.13Hz +/- 20.86, SF0=230.84 +/- 16.50) and children (F0=270.93Hz +/- 20.01, SF0=278.04 +/- 32.07). Comparison with other ethnicities indicated that F0 values of adult Jordanian Arabic-speaking males and females are generally consistent with adult Caucasian and African-American values. However, for Jordanian Arabic-speaking children, a higher trend in F0 values was present than their Western counterparts. SF0 values for adult Jordanian Arabic-speaking males are generally consistent with the adult Caucasian male SF0 values. However, SF0 values of adult Jordanian-speaking females and children were relatively higher than the reported Western values. It is recommended that speech-language pathologists in Arabic-speaking countries, Jordan in specific, utilize the new data provided (F0 and SF0) when evaluating and/or treating Arabic-speaking patients. Due to its cross-linguistic variability, SF0 emerged as a preferred measurement when conducting cross-cultural comparisons of voice features.
Voice recognition through phonetic features with Punjabi utterances
NASA Astrophysics Data System (ADS)
Kaur, Jasdeep; Juglan, K. C.; Sharma, Vishal; Upadhyay, R. K.
2017-07-01
This paper deals with perception and disorders of speech in view of Punjabi language. Visualizing the importance of voice identification, various parameters of speaker identification has been studied. The speech material was recorded with a tape recorder in their normal and disguised mode of utterances. Out of the recorded speech materials, the utterances free from noise, etc were selected for their auditory and acoustic spectrographic analysis. The comparison of normal and disguised speech of seven subjects is reported. The fundamental frequency (F0) at similar places, Plosive duration at certain phoneme, Amplitude ratio (A1:A2) etc. were compared in normal and disguised speech. It was found that the formant frequency of normal and disguised speech remains almost similar only if it is compared at the position of same vowel quality and quantity. If the vowel is more closed or more open in the disguised utterance the formant frequency will be changed in comparison to normal utterance. The ratio of the amplitude (A1: A2) is found to be speaker dependent. It remains unchanged in the disguised utterance. However, this value may shift in disguised utterance if cross sectioning is not done at the same location.
Computer and Voice Network Management Through Low Earth Orbiting Satellites
2006-03-01
Correction Chart” [web page] (29 July 2005 [cited 01 DEC 05]); available from World Wide Web @ http://www.amsat.orgamsat/ ariss /news...Available from World Wide Web @ http://www.amsat.orgamsat/ ariss /news/ISS_frequencies_and_Doppler_correction. rtf “Technical Specifications” [web
47 CFR 90.249 - Control stations.
Code of Federal Regulations, 2011 CFR
2011-10-01
... 47 Telecommunication 5 2011-10-01 2011-10-01 false Control stations. 90.249 Section 90.249... MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.249 Control stations. Control... following: (a) Frequencies for control stations. (1) Control stations may be authorized to operate on...
47 CFR 90.249 - Control stations.
Code of Federal Regulations, 2013 CFR
2013-10-01
... 47 Telecommunication 5 2013-10-01 2013-10-01 false Control stations. 90.249 Section 90.249... MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.249 Control stations. Control... following: (a) Frequencies for control stations. (1) Control stations may be authorized to operate on...
47 CFR 90.249 - Control stations.
Code of Federal Regulations, 2014 CFR
2014-10-01
... 47 Telecommunication 5 2014-10-01 2014-10-01 false Control stations. 90.249 Section 90.249... MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.249 Control stations. Control... following: (a) Frequencies for control stations. (1) Control stations may be authorized to operate on...
47 CFR 90.249 - Control stations.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 47 Telecommunication 5 2010-10-01 2010-10-01 false Control stations. 90.249 Section 90.249... MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.249 Control stations. Control... following: (a) Frequencies for control stations. (1) Control stations may be authorized to operate on...
Analysis of Over-the-Horizon Tactical Communications in an Immature Theater
2014-06-13
frequency bands, capacity, costs, and mobility, the research examines both alternate portions of the electromagnetic spectrum and rising technologies...IMMATURE THEATER, by Major Samuel Eugene Sinclair, 75 pages. This qualitative research in the field of over-the-horizon (OTH) voice communications
Lien, Yu-An S; Michener, Carolyn M; Eadie, Tanya L; Stepp, Cara E
2015-06-01
The acoustic measure relative fundamental frequency (RFF) was investigated as a potential objective measure to track variations in vocal effort within and across individuals. Twelve speakers with healthy voices created purposeful modulations in their vocal effort during speech tasks. RFF and an aerodynamic measure of vocal effort, the ratio of sound pressure level to subglottal pressure level, were estimated from the aerodynamic and acoustic signals. Twelve listeners also judged the speech samples for vocal effort using the visual sort and rate method. Relationships between RFF and both the aerodynamic and perceptual measures of vocal effort were weak across speakers (R2 = .06-.26). Within speakers, relationships were variable but much stronger on average (R2 = .45-.56). RFF showed stronger relationships between both the aerodynamic and perceptual measures of vocal effort when examined within individuals versus across individuals. Future work is necessary to establish these relationships in individuals with voice disorders across the therapeutic process.
Michener, Carolyn M.; Eadie, Tanya L.; Stepp, Cara E.
2015-01-01
Purpose The acoustic measure relative fundamental frequency (RFF) was investigated as a potential objective measure to track variations in vocal effort within and across individuals. Method Twelve speakers with healthy voices created purposeful modulations in their vocal effort during speech tasks. RFF and an aerodynamic measure of vocal effort, the ratio of sound pressure level to subglottal pressure level, were estimated from the aerodynamic and acoustic signals. Twelve listeners also judged the speech samples for vocal effort using the visual sort and rate method. Results Relationships between RFF and both the aerodynamic and perceptual measures of vocal effort were weak across speakers (R2 = .06–.26). Within speakers, relationships were variable but much stronger on average (R2 = .45–.56). Conclusions RFF showed stronger relationships between both the aerodynamic and perceptual measures of vocal effort when examined within individuals versus across individuals. Future work is necessary to establish these relationships in individuals with voice disorders across the therapeutic process. PMID:25675090
MacPherson, Megan K; Abur, Defne; Stepp, Cara E
2017-07-01
This study aimed to determine the relationship among cognitive load condition and measures of autonomic arousal and voice production in healthy adults. A prospective study design was conducted. Sixteen healthy young adults (eight men, eight women) produced a sentence containing an embedded Stroop task in each of two cognitive load conditions: congruent and incongruent. In both conditions, participants said the font color of the color words instead of the word text. In the incongruent condition, font color differed from the word text, creating an increase in cognitive load relative to the congruent condition in which font color and word text matched. Three physiologic measures of autonomic arousal (pulse volume amplitude, pulse period, and skin conductance response amplitude) and four acoustic measures of voice (sound pressure level, fundamental frequency, cepstral peak prominence, and low-to-high spectral energy ratio) were analyzed for eight sentence productions in each cognitive load condition per participant. A logistic regression model was constructed to predict the cognitive load condition (congruent or incongruent) using subject as a categorical predictor and the three autonomic measures and four acoustic measures as continuous predictors. It revealed that skin conductance response amplitude, cepstral peak prominence, and low-to-high spectral energy ratio were significantly associated with cognitive load condition. During speech produced under increased cognitive load, healthy young adults show changes in physiologic markers of heightened autonomic arousal and acoustic measures of voice quality. Future work is necessary to examine these measures in older adults and individuals with voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Kraaijenga, Sophie A C; van der Molen, Lisette; Jacobi, Irene; Hamming-Vrieze, Olga; Hilgers, Frans J M; van den Brekel, Michiel W M
2015-11-01
Concurrent chemoradiotherapy (CCRT) for advanced head and neck cancer (HNC) is associated with substantial early and late side effects, most notably regarding swallowing function, but also regarding voice quality and quality of life (QoL). Despite increased awareness/knowledge on acute dysphagia in HNC survivors, long-term (i.e., beyond 5 years) prospectively collected data on objective and subjective treatment-induced functional outcomes (and their impact on QoL) still are scarce. The objective of this study was the assessment of long-term CCRT-induced results on swallowing function and voice quality in advanced HNC patients. The study was conducted as a randomized controlled trial on preventive swallowing rehabilitation (2006-2008) in a tertiary comprehensive HNC center with twenty-two disease-free and evaluable HNC patients as participants. Multidimensional assessment of functional sequels was performed with videofluoroscopy, mouth opening measurements, Functional Oral Intake Scale, acoustic voice parameters, and (study specific, SWAL-QoL, and VHI) questionnaires. Outcome measures at 6 years post-treatment were compared with results at baseline and at 2 years post-treatment. At a mean follow-up of 6.1 years most initial tumor-, and treatment-related problems remained similarly low to those observed after 2 years follow-up, except increased xerostomia (68%) and increased (mild) pain (32%). Acoustic voice analysis showed less voicedness, increased fundamental frequency, and more vocal effort for the tumors located below the hyoid bone (n = 12), without recovery to baseline values. Patients' subjective vocal function (VHI score) was good. Functional swallowing and voice problems at 6 years post-treatment are minimal in this patient cohort, originating from preventive and continued post-treatment rehabilitation programs.
Cataldo, E; Soize, C
2018-06-06
Jitter, in voice production applications, is a random phenomenon characterized by the deviation of the glottal cycle length with respect to a mean value. Its study can help in identifying pathologies related to the vocal folds according to the values obtained through the different ways to measure it. This paper aims to propose a stochastic model, considering three control parameters, to generate jitter based on a deterministic one-mass model for the dynamics of the vocal folds and to identify parameters from the stochastic model taking into account real voice signals experimentally obtained. To solve the corresponding stochastic inverse problem, the cost function used is based on the distance between probability density functions of the random variables associated with the fundamental frequencies obtained by the experimental voices and the simulated ones, and also on the distance between features extracted from the voice signals, simulated and experimental, to calculate jitter. The results obtained show that the model proposed is valid and some samples of voices are synthesized considering the identified parameters for normal and pathological cases. The strategy adopted is also a novelty and mainly because a solution was obtained. In addition to the use of three parameters to construct the model of jitter, it is the discussion of a parameter related to the bandwidth of the power spectral density function of the stochastic process to measure the quality of the signal generated. A study about the influence of all the main parameters is also performed. The identification of the parameters of the model considering pathological cases is maybe of all novelties introduced by the paper the most interesting. Copyright © 2018 Elsevier Ltd. All rights reserved.
Speech Adjustments for Room Acoustics and Their Effects on Vocal Effort.
Bottalico, Pasquale
2017-05-01
The aims of the present study are (1) to analyze the effects of the acoustical environment and the voice style on time dose (D t_p ) and fundamental frequency (mean f 0 and standard deviation std_f 0 ) while taking into account the effect of short-term vocal fatigue and (2) to predict the self-reported vocal effort from the voice acoustical parameters. Ten male and ten female subjects were recorded while reading a text in normal and loud styles, in three rooms-anechoic, semi-reverberant, and reverberant-with and without acrylic glass panels 0.5 m from the mouth, which increased external auditory feedback. Subjects quantified how much effort was required to speak in each condition on a visual analogue scale after each task. (Aim1) In the loud style, D t_p , f 0 , and std_f 0 increased. The D t_p was higher in the reverberant room compared to the other two rooms. Both genders tended to increase f 0 in less reverberant environments, whereas a more monotonous speech was produced in rooms with greater reverberation. All three voice parameters increased with short-term vocal fatigue. (Aim2) A model of the vocal effort to acoustic vocal parameters is proposed. The sound pressure level contributed to 66% of the variance explained by the model, followed by the f 0 (30%) and the modulation in amplitude (4%). The results provide insight into how voice acoustical parameters can predict vocal effort. In particular, it increased when SPL and f 0 increased and when the amplitude voice modulation decreased. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Nixon, C W; Morris, L J; McCavitt, A R; McKinley, R L; Anderson, T R; McDaniel, M P; Yeager, D G
1998-07-01
Female produced speech, although more intelligible than male speech in some noise spectra, may be more vulnerable to degradation by high levels of some military aircraft cockpit noises. The acoustic features of female speech are higher in frequency, lower in power, and appear more susceptible than male speech to masking by some of these military noises. Current military aircraft voice communication systems were optimized for the male voice and may not adequately accommodate the female voice in these high level noises. This applied study investigated the intelligibility of female and male speech produced in the noise spectra of four military aircraft cockpits at levels ranging from 95 dB to 115 dB. The experimental subjects used standard flight helmets and headsets, noise-canceling microphones, and military aircraft voice communications systems during the measurements. The intelligibility of female speech was lower than that of male speech for all experimental conditions; however, differences were small and insignificant except at the highest levels of the cockpit noises. Intelligibility for both genders varied with aircraft noise spectrum and level. Speech intelligibility of both genders was acceptable during normal cruise noises of all four aircraft, but improvements are required in the higher levels of noise created during aircraft maximum operating conditions. The intelligibility of female speech was unacceptable at the highest measured noise level of 115 dB and may constitute a problem for other military aviators. The intelligibility degradation due to the noise can be neutralized by use of an available, improved noise-canceling microphone, by the application of current active noise reduction technology to the personal communication equipment, and by the development of a voice communications system to accommodate the speech produced by both female and male aviators.
! Boating Safety Beach Hazards Rip Currents Hypothermia Hurricanes Thunderstorms Lightning Coastal Flooding frequency) The U.S. Coast Guard broadcasts coastal forecasts and storm Warnings of interest to the mariner coverage of coastal U.S., Great Lakes, Hawaii, and populated Alaska coastline. Typical coverage is 20
ERIC Educational Resources Information Center
Larson, Kirstin
2001-01-01
This document gives voice to concerns raised by critics and supporters of commercialism in schools and provides brief descriptions of several important resources on this topic. "Commercial Activities in School" (U.S. General Accounting Office) reports on the nature and frequency of commercial activities in public schools, as well as the…
47 CFR 25.284 - Emergency Call Center Service.
Code of Federal Regulations, 2012 CFR
2012-10-01
... service to the extent that they offer real-time, two way switched voice service that is interconnected... provider to reuse frequencies and/or accomplish seamless hand-offs of subscriber calls. Emergency Call Center personnel must determine the emergency caller's phone number and location and then transfer or...
47 CFR 25.284 - Emergency Call Center Service.
Code of Federal Regulations, 2011 CFR
2011-10-01
... service to the extent that they offer real-time, two way switched voice service that is interconnected... provider to reuse frequencies and/or accomplish seamless hand-offs of subscriber calls. Emergency Call Center personnel must determine the emergency caller's phone number and location and then transfer or...
47 CFR 25.284 - Emergency Call Center Service.
Code of Federal Regulations, 2014 CFR
2014-10-01
... service to the extent that they offer real-time, two way switched voice service that is interconnected... provider to reuse frequencies and/or accomplish seamless hand-offs of subscriber calls. Emergency Call Center personnel must determine the emergency caller's phone number and location and then transfer or...
47 CFR 25.284 - Emergency Call Center Service.
Code of Federal Regulations, 2013 CFR
2013-10-01
... service to the extent that they offer real-time, two way switched voice service that is interconnected... provider to reuse frequencies and/or accomplish seamless hand-offs of subscriber calls. Emergency Call Center personnel must determine the emergency caller's phone number and location and then transfer or...
de Leede-Smith, Saskia; Barkus, Emma
2013-01-01
Over the years, the prevalence of auditory verbal hallucinations (AVHs) have been documented across the lifespan in varied contexts, and with a range of potential long-term outcomes. Initially the emphasis focused on whether AVHs conferred risk for psychosis. However, recent research has identified significant differences in the presentation and outcomes of AVH in patients compared to those in non-clinical populations. For this reason, it has been suggested that auditory hallucinations are an entity by themselves and not necessarily indicative of transition along the psychosis continuum. This review will examine the presentation of auditory hallucinations across the life span, as well as in various clinical groups. The stages described include childhood, adolescence, adult non-clinical populations, hypnagogic/hypnopompic experiences, high schizotypal traits, schizophrenia, substance induced AVH, AVH in epilepsy, and AVH in the elderly. In children, need for care depends upon whether the child associates the voice with negative beliefs, appraisals and other symptoms of psychosis. This theme appears to carry right through to healthy voice hearers in adulthood, in which a negative impact of the voice usually only exists if the individual has negative experiences as a result of their voice(s). This includes features of the voices such as the negative content, frequency, and emotional valence as well as anxiety and depression, independently or caused by voices presence. It seems possible that the mechanisms which maintain AVH in non-clinical populations are different from those which are behind AVH presentations in psychotic illness. For example, the existence of maladaptive coping strategies in patient populations is one significant difference between clinical and non-clinical groups which is associated with a need for care. Whether or not these mechanisms start out the same and have differential trajectories is not yet evidenced. Future research needs to focus on the comparison of underlying factors and mechanisms that lead to the onset of AVH in both patient and non-clinical populations. PMID:23882203
Jung, Soo Yeon; Ryu, Jung-Hwa; Park, Hae Sang; Chung, Sung Min; Ryu, Dong-Ryeol; Kim, Han Su
2014-03-01
Patients with end-stage renal disease (ESRD) who are treated with hemodialysis (HD) frequently complain about hoarseness after completion of each HD session. The HD treatment affects laryngeal volume and muscle function. This study attempted to evaluate the vocal effect of HD by acoustic and aerodynamic analysis and to determine the difference between voice change group (VCG) and nonvoice change group (NVCG). A total of 55 patients (34 females and 21 males) diagnosed with ESRD and undergoing outpatient HD were enrolled. The subjects were divided into the VCG (n=13) and NVCG (n=42) by the change of the Korean Voice Handicap Index score. Patients underwent weighing and acoustic, aerodynamic analysis before and after the HD. Fundamental frequency (F0), jitter, shimmer, noise-to-harmonics ratio (NHR), pitch range, habitual pitch, voice energy, and maximal phonation time (MPT) were obtained. The pre- and post-HD data were compared using paired t test. The results were compared after dividing the total group into the VCG and NVCG categories. Correlation between the change of the weight and change of the voice analysis result was certified by Pearson correlation coefficient. The F0 and habitual pitch increased in all subjects. The NHR and MPT parameters significantly decreased (P<0.05). In the NVCG group, all the results were same as the total group. In the VCG group, the NHR result differed from the total group. All acoustic parameters showed no statistically significant differences between the two groups. There was no correlation between the weight change (%) and the change of acoustic parameter results. The NVCG group of patient displayed improvement in NHR, whereas the VCG group showed no change. Weight change did not significantly correlate with the voice analysis results. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Master, Suely; Guzman, Marco; Carlos de Miranda, Helder; Lloyd, Adam
2013-03-01
Previous studies with long-term average spectrum (LTAS) showed the importance of the glottal source for understanding the projected voices of actresses. In this study, electroglottographic (EGG) analysis was used to investigate the contribution of the glottal source to the projected voice, comparing actresses and nonactresses' voices, in different levels of intensity. Thirty actresses and 30 nonactresses sustained vowels in habitual, moderate, and loud intensity levels. The EGG variables were contact quotient (CQ), closing quotient (QCQ), and opening quotient (QOQ). Other variables were sound pressure level (SPL) and fundamental frequency (F0). A KayPENTAX EGG was used. Variables were inputted in a general linear model. Actresses showed significantly higher values for SPL, in all levels, and both groups increased SPL significantly while changing from habitual to moderate and further to loud. There were no significant differences between groups for EGG quotients. There were significant differences between the levels only for F0 and CQ for both groups. SPL was significantly higher among actresses in all intensity levels, but in the EGG analysis, no differences were found. This apparently weak contribution of the glottal source in the supposedly projected voices of actresses, contrary to previous LTAS studies, might be because of a higher subglottal pressure or perhaps greater vocal tract contribution in SPL. Results from the present study suggest that trained subjects did not produce a significant higher SPL than untrained individuals by increasing the cost in terms of higher vocal fold collision and hence more impact stress. Future researches should explore the difference between trained and nontrained voices by aerodynamic measurements to evaluate the relationship between physiologic findings and the acoustic and EGG data. Moreover, further studies should consider both types of vocal tasks, sustained vowel and running speech, for both EGG and LTAS analysis. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
de Leede-Smith, Saskia; Barkus, Emma
2013-01-01
Over the years, the prevalence of auditory verbal hallucinations (AVHs) have been documented across the lifespan in varied contexts, and with a range of potential long-term outcomes. Initially the emphasis focused on whether AVHs conferred risk for psychosis. However, recent research has identified significant differences in the presentation and outcomes of AVH in patients compared to those in non-clinical populations. For this reason, it has been suggested that auditory hallucinations are an entity by themselves and not necessarily indicative of transition along the psychosis continuum. This review will examine the presentation of auditory hallucinations across the life span, as well as in various clinical groups. The stages described include childhood, adolescence, adult non-clinical populations, hypnagogic/hypnopompic experiences, high schizotypal traits, schizophrenia, substance induced AVH, AVH in epilepsy, and AVH in the elderly. In children, need for care depends upon whether the child associates the voice with negative beliefs, appraisals and other symptoms of psychosis. This theme appears to carry right through to healthy voice hearers in adulthood, in which a negative impact of the voice usually only exists if the individual has negative experiences as a result of their voice(s). This includes features of the voices such as the negative content, frequency, and emotional valence as well as anxiety and depression, independently or caused by voices presence. It seems possible that the mechanisms which maintain AVH in non-clinical populations are different from those which are behind AVH presentations in psychotic illness. For example, the existence of maladaptive coping strategies in patient populations is one significant difference between clinical and non-clinical groups which is associated with a need for care. Whether or not these mechanisms start out the same and have differential trajectories is not yet evidenced. Future research needs to focus on the comparison of underlying factors and mechanisms that lead to the onset of AVH in both patient and non-clinical populations.
Type 3 Thyroplasty for a Patient with Female-to-Male Gender Identity Disorder.
Saito, Yu; Nakamura, Kazuhiro; Itani, Shigeto; Tsukahara, Kiyoaki
2018-01-01
In most cases, about the voice of the patient with female-to-male/gender identity disorder (FTM/GID), hormone therapy makes the voice low-pitched. In success cases, there is no need for phonosurgery. However, hormone therapy is not effective in some cases. We perform type 3 thyroplasty in these cases. Hormone therapy was started in 2008 but did not lower the speaking fundamental frequencies (SFFs). We therefore performed TP3 under local anesthesia. In our case, the SFF at the first visit was 146 Hz. The postoperative SFF was 110 Hz. TP3 was performed under local anesthesia in a patient with FTM/GID in whom hormone therapy proved ineffective. With successful conversion to a lower-pitched voice, the patient could begin to live daily life as a male. QOL improved significantly with TP3. If hormone therapy proves ineffective, TP3 may be selected as an optional treatment and appears to show few surgical complications and was, in this case, a very effective treatment.
Short term effect of hubble-bubble smoking on voice.
Hamdan, A-L; Sibai, A; Mahfoud, L; Oubari, D; Ashkar, J; Fuleihan, N
2011-05-01
To investigate the short term effect of hubble-bubble smoking on voice. Prospective study. Eighteen non-dysphonic subjects (seven men and 11 women) with a history of hubble-bubble smoking and no history of cigarette smoking underwent acoustic analysis and laryngeal video-stroboscopic examination before and 30 minutes after hubble-bubble smoking. On laryngeal video-stroboscopy, none of the subjects had vocal fold erythema either before or after smoking. Five patients had mild vocal fold oedema both before and after smoking. After smoking, there was a slight increase in the number of subjects with thick mucus between the vocal folds (six, vs four before smoking) and with vocal fold vessel dilation (two, vs one before smoking). Acoustic analysis indicated a drop in habitual pitch, fundamental frequency and voice turbulence index after smoking, and an increase in noise-to-harmonics ratio. Even 30 minutes of hubble-bubble smoking can cause a drop in vocal pitch and an increase in laryngeal secretions and vocal fold vasodilation.
Vocal warm-up practices and perceptions in vocalists: a pilot survey.
Gish, Allison; Kunduk, Melda; Sims, Loraine; McWhorter, Andrew J
2012-01-01
Investigated in a pilot study the type, duration, and frequency of vocal warm-up regimens in the singing community using a survey. One hundred seventeen participants completed an online survey. Participants included voice students from undergraduate, masters, and doctoral music programs and professional singers. Fifty-four percent of participants reported always using vocal warm-up before singing. Twenty-two percent of the participants used vocal cool down. The most preferred warm-up duration was of 5-10 minutes in duration. Despite using vocal warm-up, 26% of the participants reported experiencing voice problems. Females tended to use vocal warm-up more frequently than males. Females also tended to use longer warm-up sessions than males. Education of the participants did not appear to have any noticeable effect on the vocal warm-up practices. The most commonly used singing warm-up exercises were ascending/descending five-note scales, ascending/descending octave scales, legato arpeggios, and glissandi. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
The Effectiveness of Pitch-raising Surgery in Male-to-Female Transsexuals: A Systematic Review.
Van Damme, Silke; Cosyns, Marjan; Deman, Sofie; Van den Eede, Zoë; Van Borsel, John
2017-03-01
This study aimed to review the evidence of the effectiveness of pitch-raising surgery performed in male-to-female transsexuals. A search for studies was performed in PubMed, Web of Science, Science Direct, EBSCOhost, Google Scholar, and the references in retrieved manuscripts, using as keywords "transsexual" or "transgender" combined with terms related to voice surgery. We included eight studies using cricothyroid approximation, six studies using anterior glottal web formation, and six studies using other surgery types or a combination of surgical techniques, leading to 20 studies in total. Objectively, a substantial rise in postoperative fundamental frequency was identified. Perceptually, mainly laryngeal web formation seems risky for decreasing voice quality. The majority of patients seemed satisfied with the outcome. However, none of the studies used a control group and randomization process. Further investigation regarding long-term results is necessary. Future research needs to investigate long-term effects of pitch-raising surgery using a stronger study design. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Dissecting choral speech: properties of the accompanist critical to stuttering reduction.
Kiefte, Michael; Armson, Joy
2008-01-01
The effects of choral speech and altered auditory feedback (AAF) on stuttering frequency were compared to identify those properties of choral speech that make it a more effective condition for stuttering reduction. Seventeen adults who stutter (AWS) participated in an experiment consisting of special choral speech conditions that were manipulated to selectively eliminate specific differences between choral speech and AAF. Consistent with previous findings, results showed that both choral speech and AAF reduced stuttering compared to solo reading. Although reductions under AAF were substantial, they were less dramatic than those for choral speech. Stuttering reduction for choral speech was highly robust even when the accompanist's voice temporally lagged that of the AWS, when there was no opportunity for dynamic interplay between the AWS and accompanist, and when the accompanist was replaced by the AWS's own voice, all of which approximate specific features of AAF. Choral speech was also highly effective in reducing stuttering across changes in speech rate and for both familiar and unfamiliar passages. We concluded that differences in properties between choral speech and AAF other than those that were manipulated in this experiment must account for differences in stuttering reduction. The reader will be able to (1) describe differences in stuttering reduction associated with altered auditory feedback compared to choral speech conditions and (2) describe differences between delivery of a second voice signal as an altered rendition of the speakers own voice (altered auditory feedback) and alterations in the voice of an accompanist (choral speech).
D'ALATRI, L.
2014-01-01
SUMMARY This study was carried out to compare the vocal limits obtained by speech range profile (SRP) with those of voice range profile (VRP) in untrained healthy and dysphonic females. Forty-six healthy voice volunteers (control group) and 148 dysphonic patients (dysphonic group) were evaluated using videolaryngostroboscopic assessment and phonetography for voice measurements. For VRP, subjects were asked to sustain the vowel /a/ as soft and as loud possible from the lowest to the highest frequencies using an automated procedure. The SRP was obtained by recording the speaking voice (SV) and the shouting voice (ShV) asking subjects to read a list of sentences aloud and to shout / ehi/ as loud as they could, respectively. All subjects in the control and dysphonic groups were able to perform SRP. fourty of 46 (85%) and 102 of 148 (68.91%) cases, respectively in control and dysphonic groups, were able to perform VRP. Most frequently, the VRP was not recorded because of the inability to perform or, especially in the dysphonic group, for inadequacy of the vocal signal. In the control group, there were no significant differences between the mean values of Fmin, Fmax, Imin and number of semitones (st) of the VRP and those of the SRP (p > 0.05). In the dysphonic group, the mean values of Fmin, Fmax and st SV+ShV for SRP were significantly higher than those of VRP. Our preliminary results suggest that the SRP may be a useful, alternative tool to assess vocal limits in both euphonic and dysphonic females. PMID:25210219
Fear of Public Speaking: Perception of College Students and Correlates.
Ferreira Marinho, Anna Carolina; Mesquita de Medeiros, Adriane; Côrtes Gama, Ana Cristina; Caldas Teixeira, Letícia
2017-01-01
The aims of the study were to determine the prevalence of fear of public speaking among college students and to assess its association with sociodemographic variables and those related to the voice and oral communication. A cross-sectional descriptive and analytic study was conducted with 1135 undergraduates aged 17-58 years. The assessment instruments were (1) a questionnaire addressing the variables sex, age, field of undergraduate study, voice, and frequency of exposure to public speaking, and (2) the Self-statements During Public Speaking Scale (SSPS), which includes variables implicated in specific domains of public speaking. A descriptive analysis was performed of the variables as well as uni- and multivariate logistic regressions to examine their association with fear of public speaking. The level of significance was set at 5%. In all, 63.9% of the college students reported fear of public speaking. As many as 89.3% of the students would like their undergraduate program to include classes to improve public speaking. Being female, having infrequent participation as speakers in groups, and perceiving their voice as high-pitched or too soft increase the odds of exhibiting fear of public speaking compared with students without those features. A great number of undergraduates report fear of public speaking. This fear is more prevalent among women, students who participate in few activities involving speaking to groups of people, and those who have a self-perception of their voice as high-pitched or too soft. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
A study of VHI scores and acoustic features in street vendors as occupational voice users.
Natour, Yaser S; Darawsheh, Wesam B; Bashiti, Sara; Wari, Majd; Taha, Juhayna; Odeh, Thair
to investigate acoustic features of phonation and perception of voice handicap in street vendors. Eighty-eight participants (44 street vendors, 44 controls) were recruited. The mean age of the group was 38.9±16.0 years (range: 20-78 years). Scores of the Arabic version of the Voice Handicap Index (VHI-Arab) were used for analysis. Acoustic measures of fundamental frequency (F 0 ), jitter, shimmer, and signal-to-noise ratio (SNR) were also analyzed. Analysis showed a significant difference between street vendors and controls in the total score of the VHI-Arab (p<0.001) as well as scores of all three VHI-Arab subsections: functional (p<0.001), physical (p<0.001), and emotional (p=0.025). Weak correlations were found among all of the VHI scores and acoustic measures (-0.219≤ r≤0.355), except for SNR where a moderate negative correlations were found (r=-0.555; -0.4) between the VHI (physical and total) scores and SNR values. Significant differences also were found in F 0 , jitter, and SNR among specific subgroups of street vendors when stratified by weekly hours worked (p<0.05), and in jitter (p=0.39) when stratified by educational level. Perception of voice handicap and a possible effect on vocal quality in street vendors were noted. The effect of factors, namely work hours and educational level, on voice quality should be further studied. Copyright © 2017. Published by Elsevier Inc.
The effects of stress on singing voice accuracy.
Larrouy-Maestri, Pauline; Morsomme, Dominique
2014-01-01
The quality of a music performance can be lessened or enhanced if the performer experiences stressful conditions. In addition, the quality of a sung performance requires control of the fundamental frequency of the voice, which is particularly sensitive to stress. The present study aimed to clarify the effects of stress on singing voice accuracy. Thirty-one music students were recorded in a stressful condition (ie, a music examination) and a nonstressful condition. Two groups were defined according to the challenge level of the music examination (first and second music levels). Measurements were made by self-reported state anxiety (CSAI-2R questionnaire) and by observing heart rate activity (electrocardiogram) during each performance. In addition, the vocal accuracy of the sung performances was objectively analyzed. As expected, state anxiety and heart rate were significantly higher on the day of the music examination than in the nonstressful condition for all the music students. However, the effect of stress was positive for the first-year students but negative for the second-year students, for whom the music examination was particularly challenging. In addition, highly significant correlations were found between the intensity of cognitive symptoms and the vocal accuracy criteria. This study highlights the contrasting effects of stress on singing voice accuracy but also the need to consider the challenge level and perception of the symptoms in experimental and pedagogical settings. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Using Voice Coils to Actuate Modular Soft Robots: Wormbot, an Example.
Nemitz, Markus P; Mihaylov, Pavel; Barraclough, Thomas W; Ross, Dylan; Stokes, Adam A
2016-12-01
In this study, we present a modular worm-like robot, which utilizes voice coils as a new paradigm in soft robot actuation. Drive electronics are incorporated into the actuators, providing a significant improvement in self-sufficiency when compared with existing soft robot actuation modes such as pneumatics or hydraulics. The body plan of this robot is inspired by the phylum Annelida and consists of three-dimensional printed voice coil actuators, which are connected by flexible silicone membranes. Each electromagnetic actuator engages with its neighbor to compress or extend the membrane of each segment, and the sequence in which they are actuated results in an earthworm-inspired peristaltic motion. We find that a minimum of three segments is required for locomotion, but due to our modular design, robots of any length can be quickly and easily assembled. In addition to actuation, voice coils provide audio input and output capabilities. We demonstrate transmission of data between segments by high-frequency carrier waves and, using a similar mechanism, we note that the passing of power between coupled coils in neighboring modules-or from an external power source-is also possible. Voice coils are a convenient multifunctional alternative to existing soft robot actuators. Their self-contained nature and ability to communicate with each other are ideal for modular robotics, and the additional functionality of sound input/output and power transfer will become increasingly useful as soft robots begin the transition from early proof-of-concept systems toward fully functional and highly integrated robotic systems.
Vocal perfection in yodelling--pitch stabilities and transition times.
Echternach, Matthias; Richter, Bernhard
2010-04-01
Yodelling is a special kind of vocal performance in traditional music which consists of rapid and repeated changes in pitch. It is assumed that these pitch changes are accompanied by register changes. We analysed, using the laryngograph, yodelling on different vowels by four professional yodelling teachers (two male, two female), four professional classically trained singers, and four untrained voices. Results reveal that pitch changes in yodelling are associated with decrease of electroglottograpgic (EGG) contact quotient for the upper pitch, indicating a register shift. Furthermore, in contrast to untrained voices, for the yodellers lower and upper pitches were more stable with respect to fundamental frequency and perturbation values, and the pitch transitions were faster.
A national voice network with satellite and small transceivers
NASA Technical Reports Server (NTRS)
Reilly, N. B.; Smith, J. G.
1978-01-01
A geostationary satellite utilizing a large multiple-beam UHF antenna is shown to be potentially capable of providing tens of thousands of voice channels for hundreds of thousands of mobile ground terminals using hand-held or vehicular-mounted transceivers with whip antennas. Inclusion of on-board network switching facilities permits full interconnection between any terminal pair within the continental United States (CONUS). Configuration tradeoff studies at selected frequencies from 150 to 1500 MHz, with antenna diameters ranging from 20 to 200 m, and CONUS-coverage multiple beams down to 0.3 deg beamwidth, establish that monthly system user costs in the range of $90 to $150, including leased and maintained ground equipment, are feasible.
77 FR 67171 - Comprehensive Review of Licensing and Operating Rules for Satellite Services
Federal Register 2010, 2011, 2012, 2013, 2014
2012-11-08
... operators of space stations that carry common-carrier voice or paging communications to report outages of 30... addressing radio frequency interference characteristics and orbital parameters of space stations and revise... Vol. 77 Thursday, No. 217 November 8, 2012 Part III Federal Communications Commission 47 CFR Part...
47 CFR 90.243 - Mobile relay stations.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 47 Telecommunication 5 2010-10-01 2010-10-01 false Mobile relay stations. 90.243 Section 90.243... MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.243 Mobile relay stations. (a) Mobile relay operations will be authorized on frequencies below 512 MHz, except in the Radiolocation...
Processing of No-Release Variants in Connected Speech
ERIC Educational Resources Information Center
LoCasto, Paul C.; Connine, Cynthia M.
2011-01-01
The cross modal repetition priming paradigm was used to investigate how potential lexically ambiguous no-release variants are processed. In particular we focus on segmental regularities that affect the variant's frequency of occurrence (voicing of the critical segment) and phonological context in which the variant occurs (status of the following…
Concept of Tone in Mandarin Revisited: A Perceptual Study on Tonal Coarticulation.
ERIC Educational Resources Information Center
Shen, Xiaonan Susan; Lin, Maocan
1991-01-01
Examination of the perceptibility of carryover coarticulatory perturbations occurring at syllabic vowels in Mandarin Chinese suggests that, in connected speech, a portion of fundamental frequency at intertonemic onset is perturbed, including initial voiced consonants and vowels, and that the perturbations result from preservative as well as…
Intensity Accents in French 2 Year Olds' Speech.
ERIC Educational Resources Information Center
Allen, George D.
The acoustic features and functions of accentuation in French are discussed, and features of accentuation in the speech of French 2-year-olds are explored. The four major acoustic features used to signal accentual distinctions are fundamental frequency of voicing, duration of segments and syllables, intensity of segments and syllables, and…
47 CFR 90.355 - LMS operations below 512 MHz.
Code of Federal Regulations, 2011 CFR
2011-10-01
... PRIVATE LAND MOBILE RADIO SERVICES Intelligent Transportation Systems Radio Service § 90.355 LMS... LMS station and the nearest co-channel base station of another licensee operating a voice system is 75... MHz, 150-170 MHz, and 450-512 MHz bands may use either base-mobile frequencies currently assigned the...
Immediate Effect of Alcohol on Voice Tremor Parameters and Speech Motor Control
ERIC Educational Resources Information Center
Krishnan, Gayathri; Ghosh, Vipin
2017-01-01
The complex neuro-muscular interplay of speech subsystems is susceptible to alcohol intoxication. Published reports have studied language formulation and fundamental frequency measures pre- and post-intoxication. This study aimed at tapping the speech motor control measure using rate, consistency, and accuracy measures of diadochokinesis and…
75 FR 5994 - Submission for OMB Review; Comment Request
Federal Register 2010, 2011, 2012, 2013, 2014
2010-02-05
... and OMB Number: Voice of Industry Survey; OMB Control Number 0704-TBD. Type of Request: New. Number of... Security Program (NISP)'' Section 202(a) stipulates that the Secretary of Defense shall serve as the... Security Agency (CSA), designated by the NISPOM, is responsible for determining the frequency of Security...
Guidi, Andrea; Salvi, Sergio; Ottaviano, Manuel; Gentili, Claudio; Bertschy, Gilles; de Rossi, Danilo; Scilingo, Enzo Pasquale; Vanello, Nicola
2015-11-06
Bipolar disorder is one of the most common mood disorders characterized by large and invalidating mood swings. Several projects focus on the development of decision support systems that monitor and advise patients, as well as clinicians. Voice monitoring and speech signal analysis can be exploited to reach this goal. In this study, an Android application was designed for analyzing running speech using a smartphone device. The application can record audio samples and estimate speech fundamental frequency, F0, and its changes. F0-related features are estimated locally on the smartphone, with some advantages with respect to remote processing approaches in terms of privacy protection and reduced upload costs. The raw features can be sent to a central server and further processed. The quality of the audio recordings, algorithm reliability and performance of the overall system were evaluated in terms of voiced segment detection and features estimation. The results demonstrate that mean F0 from each voiced segment can be reliably estimated, thus describing prosodic features across the speech sample. Instead, features related to F0 variability within each voiced segment performed poorly. A case study performed on a bipolar patient is presented.
Can blind persons accurately assess body size from the voice?
Pisanski, Katarzyna; Oleszkiewicz, Anna; Sorokowska, Agnieszka
2016-04-01
Vocal tract resonances provide reliable information about a speaker's body size that human listeners use for biosocial judgements as well as speech recognition. Although humans can accurately assess men's relative body size from the voice alone, how this ability is acquired remains unknown. In this study, we test the prediction that accurate voice-based size estimation is possible without prior audiovisual experience linking low frequencies to large bodies. Ninety-one healthy congenitally or early blind, late blind and sighted adults (aged 20-65) participated in the study. On the basis of vowel sounds alone, participants assessed the relative body sizes of male pairs of varying heights. Accuracy of voice-based body size assessments significantly exceeded chance and did not differ among participants who were sighted, or congenitally blind or who had lost their sight later in life. Accuracy increased significantly with relative differences in physical height between men, suggesting that both blind and sighted participants used reliable vocal cues to size (i.e. vocal tract resonances). Our findings demonstrate that prior visual experience is not necessary for accurate body size estimation. This capacity, integral to both nonverbal communication and speech perception, may be present at birth or may generalize from broader cross-modal correspondences. © 2016 The Author(s).
Guidi, Andrea; Salvi, Sergio; Ottaviano, Manuel; Gentili, Claudio; Bertschy, Gilles; de Rossi, Danilo; Scilingo, Enzo Pasquale; Vanello, Nicola
2015-01-01
Bipolar disorder is one of the most common mood disorders characterized by large and invalidating mood swings. Several projects focus on the development of decision support systems that monitor and advise patients, as well as clinicians. Voice monitoring and speech signal analysis can be exploited to reach this goal. In this study, an Android application was designed for analyzing running speech using a smartphone device. The application can record audio samples and estimate speech fundamental frequency, F0, and its changes. F0-related features are estimated locally on the smartphone, with some advantages with respect to remote processing approaches in terms of privacy protection and reduced upload costs. The raw features can be sent to a central server and further processed. The quality of the audio recordings, algorithm reliability and performance of the overall system were evaluated in terms of voiced segment detection and features estimation. The results demonstrate that mean F0 from each voiced segment can be reliably estimated, thus describing prosodic features across the speech sample. Instead, features related to F0 variability within each voiced segment performed poorly. A case study performed on a bipolar patient is presented. PMID:26561811
An integrated tool for the diagnosis of voice disorders.
Godino-Llorente, Juan I; Sáenz-Lechón, Nicolás; Osma-Ruiz, Víctor; Aguilera-Navarro, Santiago; Gómez-Vilda, Pedro
2006-04-01
A PC-based integrated aid tool has been developed for the analysis and screening of pathological voices. With it the user can simultaneously record speech, electroglottographic (EGG), and videoendoscopic signals, and synchronously edit them to select the most significant segments. These multimedia data are stored on a relational database, together with a patient's personal information, anamnesis, diagnosis, visits, explorations and any other comment the specialist may wish to include. The speech and EGG waveforms are analysed by means of temporal representations and the quantitative measurements of parameters such as spectrograms, frequency and amplitude perturbation measurements, harmonic energy, noise, etc. are calculated using digital signal processing techniques, giving an idea of the degree of hoarseness and quality of the voice register. Within this framework, the system uses a standard protocol to evaluate and build complete databases of voice disorders. The target users of this system are speech and language therapists and ear nose and throat (ENT) clinicians. The application can be easily configured to cover the needs of both groups of professionals. The software has a user-friendly Windows style interface. The PC should be equipped with standard sound and video capture cards. Signals are captured using common transducers: a microphone, an electroglottograph and a fiberscope or telelaryngoscope. The clinical usefulness of the system is addressed in a comprehensive evaluation section.
Dilley, Laura C; Wieland, Elizabeth A; Gamache, Jessica L; McAuley, J Devin; Redford, Melissa A
2013-02-01
As children mature, changes in voice spectral characteristics co-vary with changes in speech, language, and behavior. In this study, spectral characteristics were manipulated to alter the perceived ages of talkers' voices while leaving critical acoustic-prosodic correlates intact, to determine whether perceived age differences were associated with differences in judgments of prosodic, segmental, and talker attributes. Speech was modified by lowering formants and fundamental frequency, for 5-year-old children's utterances, or raising them, for adult caregivers' utterances. Next, participants differing in awareness of the manipulation (Experiment 1A) or amount of speech-language training (Experiment 1B) made judgments of prosodic, segmental, and talker attributes. Experiment 2 investigated the effects of spectral modification on intelligibility. Finally, in Experiment 3, trained analysts used formal prosody coding to assess prosodic characteristics of spectrally modified and unmodified speech. Differences in perceived age were associated with differences in ratings of speech rate, fluency, intelligibility, likeability, anxiety, cognitive impairment, and speech-language disorder/delay; effects of training and awareness of the manipulation on ratings were limited. There were no significant effects of the manipulation on intelligibility or formally coded prosody judgments. Age-related voice characteristics can greatly affect judgments of speech and talker characteristics, raising cautionary notes for developmental research and clinical work.
Can blind persons accurately assess body size from the voice?
Oleszkiewicz, Anna; Sorokowska, Agnieszka
2016-01-01
Vocal tract resonances provide reliable information about a speaker's body size that human listeners use for biosocial judgements as well as speech recognition. Although humans can accurately assess men's relative body size from the voice alone, how this ability is acquired remains unknown. In this study, we test the prediction that accurate voice-based size estimation is possible without prior audiovisual experience linking low frequencies to large bodies. Ninety-one healthy congenitally or early blind, late blind and sighted adults (aged 20–65) participated in the study. On the basis of vowel sounds alone, participants assessed the relative body sizes of male pairs of varying heights. Accuracy of voice-based body size assessments significantly exceeded chance and did not differ among participants who were sighted, or congenitally blind or who had lost their sight later in life. Accuracy increased significantly with relative differences in physical height between men, suggesting that both blind and sighted participants used reliable vocal cues to size (i.e. vocal tract resonances). Our findings demonstrate that prior visual experience is not necessary for accurate body size estimation. This capacity, integral to both nonverbal communication and speech perception, may be present at birth or may generalize from broader cross-modal correspondences. PMID:27095264
Voice Formants in Individuals With Congenital, Isolated, Lifetime Growth Hormone Deficiency.
Valença, Eugenia H O; Salvatori, Roberto; Souza, Anita H O; Oliveira-Neto, Luiz A; Oliveira, Alaíde H A; Gonçalves, Maria I R; Oliveira, Carla R P; D'Ávila, Jeferson S; Melo, Valdinaldo A; de Carvalho, Susana; de Andrade, Bruna M R; Nascimento, Larisse S; Rocha, Savinny B de V; Ribeiro, Thais R; Prado-Barreto, Valeria M; Melo, Enaldo V; Aguiar-Oliveira, Manuel H
2016-05-01
To analyze the voice formants (F1, F2, F3, and F4 in Hz) of seven oral vowels, in Brazilian Portuguese, [a, ε, e, i, ɔ, o, and u] in adult individuals with congenital lifetime untreated isolated growth hormone deficiency (IGHD). This is a cross-sectional study. Acoustic analysis of isolated vowels was performed in 33 individuals with IGHD, age 44.5 (17.6) years (16 women), and 29 controls, age 51.1 (17.6) years (15 women). Compared with controls, IGHD men showed higher values of F3 [i, e, and ε], P = 0.006, P = 0.022, and P = 0.006, respectively and F4 [i], P = 0.001 and lower values of F2 [u], P = 0.034; IGHD women presented higher values of F1 [i and e] P = 0.029 and P = 0.036; F2 [ɔ] P = 0.006; F4 [ɔ] P = 0.031 and lower values of F2 [i] P = 0.004. IGHD abolished most of the gender differences in formant frequencies present in controls. Congenital, severe IGHD results in higher values of most formant frequencies, suggesting smaller oral and pharyngeal cavities. In addition, it causes a reduction in the effect of gender on the structure of the formants, maintaining a prepubertal acoustic prediction. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
The effect of age of cochlear implantation on vocal characteristics in children.
Knight, Kerry; Ducasse, Simone; Coetzee, Ashley; van der Linde, Jeannie; Louw, Anel
2016-06-27
Early cochlear implantation aids auditory feedback and supports better communication and self-monitoring of the voice. The objective of this study was to determine whether the age of cochlear implantation has an impact on vocal development in children implanted before age 4. The study consisted of 19 participants in total. All implant recipients (experimental group) were 3-5 years post-implantation, including four prelingual (0-2 years) and five perilingual (2-4 years) implant recipients. The control group consisted of 10 children whose hearing was within normal limits between the ages 3-6 years and 10 months, which was compared to the experimental group. Established paediatric norms were used for additional comparison. A questionnaire was used to gather information from each of the participant's caregivers to determine whether other personal and contextual factors had an impact on voice production. An acoustic analysis was conducted for each participant using the Multi-Dimensional Voice Program of the Computerized Speech Lab. When the experimental group and the control group were compared, similar results were yielded for fundamental frequency and short-term perturbation (jitter and shimmer). More variability was noted in long-term frequency and amplitude measures, with significantly higher differences, and therefore further outside the norms, in the prelingual group when compared to the perilingual and control groups. In this study, age of implantation did not impact vocal characteristics. Further research should include larger sample sizes, with participants that are age and gender matched.
1990-06-01
reader is cautioned that computer programs developed in this research may not have been exercised for all cases of interest. While every effort has been...Source of Funding Numbers _. Program Element No Project No I Task No I Work Unit Accession No 11 Title (Include security classflcation) APPLICATION OF...formats. Previous applications of these encoding formats were on industry standard computers (PC) over a 16-20 klIz channel. This report discusses the
A guide to onboard checkout. Volume 7: RF communications
NASA Technical Reports Server (NTRS)
1971-01-01
The radio frequency communications subsystem for a space station is considered, with respect to onboard checkout requirements. The subsystem comprises all equipment necessary for transmitting and receiving, tracking and ranging, command, multiple voice and television information, and broadband experiment data. The communications subsystem provides a radio frequency interface between the space station and ground stations, either directly or indirectly, through a data relay satellite system, independent free-flying experiment modules, and logistics vehicles. Reliability, maintenance, and failure analyses are discussed, and computer programming techniques are presented.
Borch, D Zangger; Sundberg, Johan
2011-09-01
This investigation aims at describing voice function of four nonclassical styles of singing, Rock, Pop, Soul, and Swedish Dance Band. A male singer, professionally experienced in performing in these genres, sang representative tunes, both with their original lyrics and on the syllable /pae/. In addition, he sang tones in a triad pattern ranging from the pitch Bb2 to the pitch C4 on the syllable /pae/ in pressed and neutral phonation. An expert panel was successful in classifying the samples, thus suggesting that the samples were representative of the various styles. Subglottal pressure was estimated from oral pressure during the occlusion for the consonant [p]. Flow glottograms were obtained from inverse filtering. The four lowest formant frequencies differed between the styles. The mean of the subglottal pressure and the mean of the normalized amplitude quotient (NAQ), that is, the ratio between the flow pulse amplitude and the product of period and maximum flow declination rate, were plotted against the mean of fundamental frequency. In these graphs, Rock and Swedish Dance Band assumed opposite extreme positions with respect to subglottal pressure and mean phonation frequency, whereas the mean NAQ values differed less between the styles. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Concept and implementation of the Globalstar mobile satellite system
NASA Technical Reports Server (NTRS)
Schindall, Joel
1995-01-01
Globalstar is a satellite-based mobile communications system which provides quality wireless communications (voice and/or data) anywhere in the world except the polar regions. The Globalstar system concept is based upon technological advancements in Low Earth Orbit (LEO) satellite technology and in cellular telephone technology, including the commercial application of Code Division Multiple Access (CDMA) technologies. The Globalstar system uses elements of CDMA and Frequency Division Multiple Access (FDMA), combined with satellite Multiple Beam Antenna (MBA) technology and advanced variable-rate vocoder technology to arrive at one of the most efficient modulation and multiple access systems ever proposed for a satellite communications system. The technology used in Globalstar includes the following techniques in obtaining high spectral efficiency and affordable cost per channel: (1) CDMA modulation with efficient power control; (2) high efficiency vocoder with voice activity factor; (3) spot beam antenna for increased gain and frequency reuse; (4) weighted satellite antenna gain for broad geographic coverage; (5) multisatellite user links (diversity) to enhance communications reliability; and (6) soft hand-off between beams and satellites. Initial launch is scheduled in 1997 and the system is scheduled to be operational in 1998. The Globalstar system utilizes frequencies in L-, S- and C-bands which have the potential to offer worldwide availability with authorization by the appropriate regulatory agencies.
Keough, Dwayne; Jones, Jeffery A.
2009-01-01
Singing requires accurate control of the fundamental frequency (F0) of the voice. This study examined trained singers’ and untrained singers’ (nonsingers’) sensitivity to subtle manipulations in auditory feedback and the subsequent effect on the mapping between F0 feedback and vocal control. Participants produced the consonant-vowel ∕ta∕ while receiving auditory feedback that was shifted up and down in frequency. Results showed that singers and nonsingers compensated to a similar degree when presented with frequency-altered feedback (FAF); however, singers’ F0 values were consistently closer to the intended pitch target. Moreover, singers initiated their compensatory responses when auditory feedback was shifted up or down 6 cents or more, compared to nonsingers who began compensating when feedback was shifted up 26 cents and down 22 cents. Additionally, examination of the first 50 ms of vocalization indicated that participants commenced subsequent vocal utterances, during FAF, near the F0 value on previous shift trials. Interestingly, nonsingers commenced F0 productions below the pitch target and increased their F0 until they matched the note. Thus, singers and nonsingers rely on an internal model to regulate voice F0, but singers’ models appear to be more sensitive in response to subtle discrepancies in auditory feedback. PMID:19640048
Keough, Dwayne; Jones, Jeffery A
2009-08-01
Singing requires accurate control of the fundamental frequency (F0) of the voice. This study examined trained singers' and untrained singers' (nonsingers') sensitivity to subtle manipulations in auditory feedback and the subsequent effect on the mapping between F0 feedback and vocal control. Participants produced the consonant-vowel /ta/ while receiving auditory feedback that was shifted up and down in frequency. Results showed that singers and nonsingers compensated to a similar degree when presented with frequency-altered feedback (FAF); however, singers' F0 values were consistently closer to the intended pitch target. Moreover, singers initiated their compensatory responses when auditory feedback was shifted up or down 6 cents or more, compared to nonsingers who began compensating when feedback was shifted up 26 cents and down 22 cents. Additionally, examination of the first 50 ms of vocalization indicated that participants commenced subsequent vocal utterances, during FAF, near the F0 value on previous shift trials. Interestingly, nonsingers commenced F0 productions below the pitch target and increased their F0 until they matched the note. Thus, singers and nonsingers rely on an internal model to regulate voice F0, but singers' models appear to be more sensitive in response to subtle discrepancies in auditory feedback.
Vocal tract resonances in singing: The soprano voice
NASA Astrophysics Data System (ADS)
Joliveau, Elodie; Smith, John; Wolfe, Joe
2004-10-01
The vocal tract resonances of trained soprano singers were measured while they sang a range of vowels softly at different pitches. The measurements were made by broad band acoustic excitation at the mouth, which allowed the resonances of the tract to be measured simultaneously with and independently from the harmonics of the voice. At low pitch, when the lowest resonance frequency R1 exceeded f0, the values of the first two resonances R1 and R2 varied little with frequency and had values consistent with normal speech. At higher pitches, however, when f0 exceeded the value of R1 observed at low pitch, R1 increased with f0 so that R1 was approximately equal to f0. R2 also increased over this high pitch range, probably as an incidental consequence of the tuning of R1. R3 increased slightly but systematically, across the whole pitch range measured. There was no evidence that any resonances are tuned close to harmonics of the pitch frequency except for R1 at high pitch. The variations in R1 and R2 at high pitch mean that vowels move, converge, and overlap their positions on the vocal plane (R2,R1) to an extent that implies loss of intelligibility. .
A controlled study of Tourette syndrome. IV. Obsessions, compulsions, and schizoid behaviors.
Comings, D E; Comings, B G
1987-01-01
To determine the frequency of obsessive, compulsive, and schizoid behaviors in Tourette syndrome (TS), we prospectively questioned 246 patients with TS, 17 with attention-deficit disorder (ADD), 15 with ADD due to a TS gene, and 47 random controls. The comparative frequency of obsessive, compulsive, and repetitive behaviors--such as obsessive unpleasant thoughts, obsessive silly thoughts, echolalia, palilalia, touching things excessively, touching things a specific number of times, touching others excessively, sexual touching, biting or hurting oneself, head banging, rocking, mimicking others, counting things, and occasional or frequent public exhibitionism--were significantly more common in TS patients than in controls. The frequency of each of these was much higher for grade 3 (severe) TS. Most of these behaviors also occurred significantly more often in individuals with ADD or in individuals with ADD secondary to TS (ADD 2(0) TS). When these features were combined into an obsessive-compulsive score, 45.4% of TS patients had a score of 4-15, whereas 8.5% of controls had a score of 4 or 5. These results indicate that obsessive-compulsive behaviors are an integral part of the expression of the TS gene and can be inherited as an autosomal dominant trait. Schizoid symptoms, such as thinking that people were watching them or plotting against them, were significantly more common in TS patients than in controls. Auditory hallucinations of hearing voices were present in 14.6% of TS patients, compared with 2.1% of controls (P = .02). These symptoms were absent in ADD patients but present in ADD 2(0) TS patients. These voices were often blamed for telling them to do bad things and were frequently identified with the devil. None of the controls had a total schizoid behavior score greater than 3, whereas 10.9% of the TS patients had scores of 4-10 (P = .02). This frequency increased to 20.6% in the grade 3 TS patients. These quantitative results confirm our clinical impression that some TS patients have paranoid ideations, often feel that people are out to get them, and hear voices. PMID:3479015
Vocal fold vibrations: high-speed imaging, kymography, and acoustic analysis: a preliminary report.
Larsson, H; Hertegård, S; Lindestad, P A; Hammarberg, B
2000-12-01
To evaluate a new analysis system, High-Speed Tool Box (H. Larsson, custom-made program for image analysis, version 1.1, Department of Logopedics and Phoniatrics, Huddinge University Hospital, Huddinge, Sweden, 1998) for studying vocal fold vibrations using a high-speed camera and to relate findings from these analyses to sound characteristics. A Weinberger Speedcam + 500 system (Weinberger AG, Dietikon, Switzerland) was used with a frame rate of 1,904 frames per second. Images were stored and analyzed digitally. Analysis included automatic glottal edge detection and calculation of glottal area variations, as well as kymography. These signals were compared with acoustic waveforms using the Soundswell program (Hitech Development AB, Stockholm, Sweden). The High-Speed Tool Box was applied on two types of high-speed recordings: a diplophonic phonation and a tremor voice. Relations between glottal vibratory patterns and the sound waveform were analyzed. In the diplophonic phonation, the glottal area waveform, as well as the kymogram, showed a specific pattern of repetitive glottal closures, which was also seen in the acoustic waveform. In the tremor voice, fundamental frequency (F0) fluctuations in the acoustic waveform were reflected in slow variations in amplitude in the glottal area waveform. For studying details of mucosal movements during these kinds of abnormal vibrations, the glottal area waveform was particularly useful. Our results suggest that this combined high-speed acoustic-kymographic analysis package is a promising aid for separating and specifying different voice qualities such as diplophonia and voice tremor. Apart from clinical use, this finding should be of help for specification of the terminology of different voice qualities.
The speech range profile (SRP): an easy and useful tool to assess vocal limits.
D'Alatri, L; Marchese, M R
2014-08-01
This study was carried out to compare the vocal limits obtained by speech range profile (SRP) with those of voice range profile (VRP) in untrained healthy and dysphonic females. Forty-six healthy voice volunteers (control group) and 148 dysphonic patients (dysphonic group) were evaluated using videolaryngostroboscopic assessment and phonetography for voice measurements. For VRP, subjects were asked to sustain the vowel /a/ as soft and as loud possible from the lowest to the highest frequencies using an automated procedure. The SRP was obtained by recording the speaking voice (SV) and the shouting voice (ShV) asking subjects to read a list of sentences aloud and to shout / ehi/ as loud as they could, respectively. All subjects in the control and dysphonic groups were able to perform SRP. fourty of 46 (85%) and 102 of 148 (68.91%) cases, respectively in control and dysphonic groups, were able to perform VRP. Most frequently, the VRP was not recorded because of the inability to perform or, especially in the dysphonic group, for inadequacy of the vocal signal. In the control group, there were no significant differences between the mean values of Fmin, Fmax, Imin and number of semitones (st) of the VRP and those of the SRP (p > 0.05). In the dysphonic group, the mean values of Fmin, Fmax and st SV+ShV for SRP were significantly higher than those of VRP. Our preliminary results suggest that the SRP may be a useful, alternative tool to assess vocal limits in both euphonic and dysphonic females.
Using Voice Coils to Actuate Modular Soft Robots: Wormbot, an Example
Nemitz, Markus P.; Mihaylov, Pavel; Barraclough, Thomas W.; Ross, Dylan
2016-01-01
Abstract In this study, we present a modular worm-like robot, which utilizes voice coils as a new paradigm in soft robot actuation. Drive electronics are incorporated into the actuators, providing a significant improvement in self-sufficiency when compared with existing soft robot actuation modes such as pneumatics or hydraulics. The body plan of this robot is inspired by the phylum Annelida and consists of three-dimensional printed voice coil actuators, which are connected by flexible silicone membranes. Each electromagnetic actuator engages with its neighbor to compress or extend the membrane of each segment, and the sequence in which they are actuated results in an earthworm-inspired peristaltic motion. We find that a minimum of three segments is required for locomotion, but due to our modular design, robots of any length can be quickly and easily assembled. In addition to actuation, voice coils provide audio input and output capabilities. We demonstrate transmission of data between segments by high-frequency carrier waves and, using a similar mechanism, we note that the passing of power between coupled coils in neighboring modules—or from an external power source—is also possible. Voice coils are a convenient multifunctional alternative to existing soft robot actuators. Their self-contained nature and ability to communicate with each other are ideal for modular robotics, and the additional functionality of sound input/output and power transfer will become increasingly useful as soft robots begin the transition from early proof-of-concept systems toward fully functional and highly integrated robotic systems. PMID:28078195
Unilateral Vocal Fold Paralysis: A Systematic Review of Speech-Language Pathology Management.
Walton, Chloe; Conway, Erin; Blackshaw, Helen; Carding, Paul
2017-07-01
Dysphonia due to unilateral vocal fold paralysis (UVFP) can be characterized by hoarseness and weakness, resulting in a significant impact on patients' activity and participation. Voice therapy provided by a speech-language pathologist is designed to maximize vocal function and improve quality of life. The purpose of this paper is to systematically review literature surrounding the effectiveness of speech-language pathology intervention for the management of UVFP in adults. This is a systematic review. Electronic databases were searched using a range of key terms including dysphonia, vocal fold paralysis, and speech-language pathology. Eligible articles were extracted and reviewed by the authors for risk of bias, methodology, treatment efficacy, and clinical outcomes. Of the 3311 articles identified, 12 met the inclusion criteria: seven case series and five comparative studies. All 12 studies subjectively reported positive effects following the implementation of voice therapy for UVFP; however, the heterogeneity of participant characteristics, voice therapy, and voice outcome resulted in a low level of evidence. There is presently a lack of methodological rigor and clinical efficacy in the speech-language pathology management of dysphonia arising from UVFP in adults. Reasons for this reduced efficacy can be attributed to the following: (1) no standardized speech-language pathology intervention; (2) no consistency of assessment battery; (3) the variable etiology and clinical presentation of UVFP; and (4) inconsistent timing, frequency, and intensity of treatment. Further research is required to develop the evidence for the management of UVFP incorporating controlled treatment protocols and more rigorous clinical methodology. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Pannese, Alessia; Grandjean, Didier; Frühholz, Sascha
2016-12-01
Discriminating between auditory signals of different affective value is critical to successful social interaction. It is commonly held that acoustic decoding of such signals occurs in the auditory system, whereas affective decoding occurs in the amygdala. However, given that the amygdala receives direct subcortical projections that bypass the auditory cortex, it is possible that some acoustic decoding occurs in the amygdala as well, when the acoustic features are relevant for affective discrimination. We tested this hypothesis by combining functional neuroimaging with the neurophysiological phenomena of repetition suppression (RS) and repetition enhancement (RE) in human listeners. Our results show that both amygdala and auditory cortex responded differentially to physical voice features, suggesting that the amygdala and auditory cortex decode the affective quality of the voice not only by processing the emotional content from previously processed acoustic features, but also by processing the acoustic features themselves, when these are relevant to the identification of the voice's affective value. Specifically, we found that the auditory cortex is sensitive to spectral high-frequency voice cues when discriminating vocal anger from vocal fear and joy, whereas the amygdala is sensitive to vocal pitch when discriminating between negative vocal emotions (i.e., anger and fear). Vocal pitch is an instantaneously recognized voice feature, which is potentially transferred to the amygdala by direct subcortical projections. These results together provide evidence that, besides the auditory cortex, the amygdala too processes acoustic information, when this is relevant to the discrimination of auditory emotions. Copyright © 2016 Elsevier Ltd. All rights reserved.
Blend in Singing Ensemble Performance: Vibrato Production in a Vocal Quartet.
Daffern, Helena
2017-05-01
"Blend" is a defining characteristic of good vocal ensemble performance. To achieve this, directors often consider vibrato as a feature to be controlled and consequently restrict its use. Analysis of individual voices in ensemble situations presents several challenges, including the isolation of voices for analysis from recordings. This study considers vibrato production as a feature that contributes to blend through an ecological study of a vocal quartet. A vocal ensemble was recorded using head-worn microphones and electrolaryngograph electrodes to enable fundamental frequency analysis of the individual voices. The same four-part material was recorded over several weeks of rehearsal to allow analysis of conscious and subconscious changes to vibrato production over time. Alongside the recording of their rehearsal discussions, singers were also asked for opinions on vibrato production in connection with blend. The results indicate that vibrato is adjusted to some extent by individual singers to improve blend, with some instances of synchrony between voice parts. Some conscious alterations to vibrato were made to improve blend; however, these are not always evident in the data, suggesting that singers' own perceptions of their performance may be influenced by other factors. These findings indicate a need for further studies of vibrato as a feature of blend, particularly in terms of the synergies between expectation and actual production, and potential synchronicity between singers; increased understanding of vibrato in an ensemble setting will lead to more efficient rehearsal techniques and vocal training, and could prevent vocal misuse leading to pathology in the future. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Effects of type II thyroplasty on adductor spasmodic dysphonia.
Sanuki, Tetsuji; Yumoto, Eiji; Minoda, Ryosei; Kodama, Narihiro
2010-04-01
Type II thyroplasty, or laryngeal framework surgery, is based on the hypothesis that the effect of adductor spasmodic dysphonia (AdSD) on the voice is due to excessively tight closure of the glottis, hampering phonation. Most of the previous, partially effective treatments have aimed to relieve this tight closure, including recurrent laryngeal nerve section or avulsion, extirpation of the adductor muscle, and botulinum toxin injection, which is currently the most popular. The aim of this study was to assess the effects of type II thyroplasty on aerodynamic and acoustic findings in patients with AdSD. Case series. University hospital. Ten patients with AdSD underwent type II thyroplasty between August 2006 and December 2008. Aerodynamic and acoustic analyses were performed prior to and six months after surgery. Mean flow rates (MFRs) and voice efficiency were evaluated with a phonation analyzer. Jitter, shimmer, the harmonics-to-noise ratio (HNR), standard deviation of the fundamental frequency (SDF0), and degree of voice breaks (DVB) were measured from each subject's longest sustained phonation sample of the vowel /a/. Voice efficiency improved significantly after surgery. No significant difference was found in the MFRs between before and after surgery. Jitter, shimmer, HNR, SDF0, and DVB improved significantly after surgery. Treatment of AdSD with type II thyroplasty significantly improved aerodynamic and acoustic findings. The results of this study suggest that type II thyroplasty provides relief from voice strangulation in patients with AdSD. Copyright 2010 American Academy of Otolaryngology-Head and Neck Surgery Foundation. Published by Mosby, Inc. All rights reserved.
Acoustic voice analysis of prelingually deaf adults before and after cochlear implantation.
Evans, Maegan K; Deliyski, Dimitar D
2007-11-01
It is widely accepted that many severe to profoundly deaf adults have benefited from cochlear implants (CIs). However, limited research has been conducted to investigate changes in voice and speech of prelingually deaf adults who receive CIs, a population well known for presenting with a variety of voice and speech abnormalities. The purpose of this study was to use acoustic analysis to explore changes in voice and speech for three prelingually deaf males pre- and postimplantation over 6 months. The following measurements, some measured in varying contexts, were obtained: fundamental frequency (F0), jitter, shimmer, noise-to-harmonic ratio, voice turbulence index, soft phonation index, amplitude- and F0-variation, F0-range, speech rate, nasalance, and vowel production. Characteristics of vowel production were measured by determining the first formant (F1) and second formant (F2) of vowels in various contexts, magnitude of F2-variation, and rate of F2-variation. Perceptual measurements of pitch, pitch variability, loudness variability, speech rate, and intonation were obtained for comparison. Results are reported using descriptive statistics. The results showed patterns of change for some of the parameters while there was considerable variation across the subjects. All participants demonstrated a decrease in F0 in at least one context and demonstrated a change in nasalance toward the norm as compared to their normal hearing control. The two participants who were oral-language communicators were judged to produce vowels with an average of 97.2% accuracy and the sign-language user demonstrated low percent accuracy for vowel production.
Comparison of CDMA and FDMA for the MobileStar(sm) system
NASA Technical Reports Server (NTRS)
Jacobs, I. M.; Gilhousen, K. S.; Weaver, L. A.; Renshaw, K.; Murphy, T.
1988-01-01
Spread-spectrum code division multiple access (CDMA) and single channel per carrier frequency division multiple access (FDMA) systems are compared for spectrum efficiency. CDMA is shown to have greater maximum throughput than FDMA for the MobileStar(sm) system which uses digital voice activated carriers and directive circularly polarized satellite antennas.
Helium Speech: An Application of Standing Waves
ERIC Educational Resources Information Center
Wentworth, Christopher D.
2011-01-01
Taking a breath of helium gas and then speaking or singing to the class is a favorite demonstration for an introductory physics course, as it usually elicits appreciative laughter, which serves to energize the class session. Students will usually report that the helium speech "raises the frequency" of the voice. A more accurate description of the…
Threats and Strategies to Counter Threats: Voices of Elementary School Foreign Language Learniers
ERIC Educational Resources Information Center
Rosenbusch, Marcia Harmon; Sorensen, Laurie
2004-01-01
The experience described by Kay Hoag, Advocacy Chair of the National Network for Early Language Learning (NNELL), exemplifies the threat of program elimination and/or cutbacks that elementary school foreign language programs across the nation experienced with increased frequency during the 2002-2003 academic year. Reports of these threats…
Code of Federal Regulations, 2012 CFR
2012-10-01
... System: Alerting: 406.0-406.1 EPIRBs 406.0-406.1 MHz (Earth-to-space).1544-1545 MHz (space-to-Earth). INMARSAT Ship Earth Stations capable of voice and/or direct printing 1626.5-1645.5 MHz (Earth-to-space... safety communications and calling: Satellite 1530-1544 MHz (space-to-Earth) and 1626.5-1645.5 MHz (Earth...
Code of Federal Regulations, 2013 CFR
2013-10-01
... System: Alerting: 406.0-406.1 EPIRBs 406.0-406.1 MHz (Earth-to-space).1544-1545 MHz (space-to-Earth). INMARSAT Ship Earth Stations capable of voice and/or direct printing 1626.5-1645.5 MHz (Earth-to-space... safety communications and calling: Satellite 1530-1544 MHz (space-to-Earth) and 1626.5-1645.5 MHz (Earth...
Code of Federal Regulations, 2014 CFR
2014-10-01
... System: Alerting: 406.0-406.1 EPIRBs 406.0-406.1 MHz (Earth-to-space).1544-1545 MHz (space-to-Earth). INMARSAT Ship Earth Stations capable of voice and/or direct printing 1626.5-1645.5 MHz (Earth-to-space... safety communications and calling: Satellite 1530-1544 MHz (space-to-Earth) and 1626.5-1645.5 MHz (Earth...
47 CFR 90.247 - Mobile repeater stations.
Code of Federal Regulations, 2014 CFR
2014-10-01
... 47 Telecommunication 5 2014-10-01 2014-10-01 false Mobile repeater stations. 90.247 Section 90.247... MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.247 Mobile repeater stations. A mobile station authorized to operate on a mobile service frequency above 25 MHz may be used as a mobile...
47 CFR 90.247 - Mobile repeater stations.
Code of Federal Regulations, 2013 CFR
2013-10-01
... 47 Telecommunication 5 2013-10-01 2013-10-01 false Mobile repeater stations. 90.247 Section 90.247... MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.247 Mobile repeater stations. A mobile station authorized to operate on a mobile service frequency above 25 MHz may be used as a mobile...
47 CFR 90.247 - Mobile repeater stations.
Code of Federal Regulations, 2011 CFR
2011-10-01
... 47 Telecommunication 5 2011-10-01 2011-10-01 false Mobile repeater stations. 90.247 Section 90.247... MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.247 Mobile repeater stations. A mobile station authorized to operate on a mobile service frequency above 25 MHz may be used as a mobile...
47 CFR 90.247 - Mobile repeater stations.
Code of Federal Regulations, 2012 CFR
2012-10-01
... 47 Telecommunication 5 2012-10-01 2012-10-01 false Mobile repeater stations. 90.247 Section 90.247... MOBILE RADIO SERVICES Non-Voice and Other Specialized Operations § 90.247 Mobile repeater stations. A mobile station authorized to operate on a mobile service frequency above 25 MHz may be used as a mobile...
47 CFR 90.494 - Paging operations on shared channels in the 929-930 MHz band.
Code of Federal Regulations, 2010 CFR
2010-10-01
... under subpart B or C of this part, representatives of Federal Government agencies, individuals, and foreign governments and their representatives. The provisions of § 90.173(b) apply to all frequencies... desired (tone only, tone-voice, digital, tactile, optical readout, etc.). (e) There shall be no minimum or...
The 18/30 GHz fixed communications system service demand assessment. Volume 1: Executive summary
NASA Technical Reports Server (NTRS)
Gabriszeski, T.; Reiner, P.; Rogers, J.; Terbo, W.
1979-01-01
The total demand for voice, video, and data communications services, and satellite transmission services at the 4/6 GHz, 12/14 GHz, and 18/30 GHz frequencies is discussed. Major study objectives, overall methodology, results, and general observations about a satellite systems market characteristics and trends are summarized.
Civic Journalism and Nonelite Sourcing: Making Routine Newswork of Community Connectedness.
ERIC Educational Resources Information Center
Massey, Brian L.
1998-01-01
Compares the number of "average" citizens brought into the news in three newspapers. Finds nonelite information sources in numerical parity with elite sources in a civic-journalism newspaper, but finds the frequency and directness of their news voices largely unchanged. Finds that routine civic journalism did more to tone down elites'…
Sensory Aids Research Project - Clarke School for the Deaf.
ERIC Educational Resources Information Center
Boothroyd, Arthur
Described is a program of research into sensory aids for the deaf, emphasizing research on factors involved in the effective use of sensory aids rather than evaluation of particular devices. Aspects of the program are the development of a programed testing and training unit, the control of fundamental voice frequency using visual feedback, and…
Code of Federal Regulations, 2011 CFR
2011-10-01
... System: Alerting: 406.0-406.1 EPIRBs 406.0-406.1 MHz (Earth-to-space).1544-1545 MHz (space-to-Earth). INMARSAT-E EPIRBs 12 1626.5-1645.5 MHz (Earth-to-space). INMARSAT Ship Earth Stations capable of voice and... MARITIME SERVICES Global Maritime Distress and Safety System (GMDSS) General Provisions § 80.1077...
Speech Spectrum's Correlation with Speakers' Eysenck Personality Traits
Hu, Chao; Wang, Qiandong; Short, Lindsey A.; Fu, Genyue
2012-01-01
The current study explored the correlation between speakers' Eysenck personality traits and speech spectrum parameters. Forty-six subjects completed the Eysenck Personality Questionnaire. They were instructed to verbally answer the questions shown on a computer screen and their responses were recorded by the computer. Spectrum parameters of /sh/ and /i/ were analyzed by Praat voice software. Formant frequencies of the consonant /sh/ in lying responses were significantly lower than that in truthful responses, whereas no difference existed on the vowel /i/ speech spectrum. The second formant bandwidth of the consonant /sh/ speech spectrum was significantly correlated with the personality traits of Psychoticism, Extraversion, and Neuroticism, and the correlation differed between truthful and lying responses, whereas the first formant frequency of the vowel /i/ speech spectrum was negatively correlated with Neuroticism in both response types. The results suggest that personality characteristics may be conveyed through the human voice, although the extent to which these effects are due to physiological differences in the organs associated with speech or to a general Pygmalion effect is yet unknown. PMID:22439014
NASA Technical Reports Server (NTRS)
1971-01-01
The testing program with the ATS-1 and ATS-3 spacecraft showed that geostationary satellites can provide superior communications and position surveillance for mobile craft. Inexpensive modifications to conventional mobile communications equipment aboard the craft can provide reliable, high quality voice and digital communications with distant ground stations and other vehicles, and automatic surveillance of the positions of all the craft by a ground facility. The tests also demonstrated the location and automatic readout of remote data collection platforms. Frequency modulation signals with the narrow audio and radio frequency bandwidths of terrestrial mobile radio communications were relayed through the VHF transponders of the geostationary satellites. The voice and digital communications were far superior in reliability and quality to long-distance mobile communications by other means. It was shown that one satellite can provide nearly uniform high quality performance over approximately one-third of the earth's surface. Position fixes by range measurement from the two satellites were accurate to approximately one nautical mile, except near the equator and the poles.
Vibration isolation and dual-stage actuation pointing system for space precision payloads
NASA Astrophysics Data System (ADS)
Kong, Yongfang; Huang, Hai
2018-02-01
Pointing and stability requirements for future space missions are becoming more and more stringent. This work follows the pointing control method which consists of a traditional spacecraft attitude control system and a payload active pointing loop, further proposing a vibration isolation and dual-stage actuation pointing system for space precision payloads based on a soft Stewart platform. Central to the concept is using the dual-stage actuator instead of the traditional voice coil motor single-stage actuator to improve the payload active pointing capability. Based on a specified payload, the corresponding platform was designed to be installed between the spacecraft bus and the payload. The performance of the proposed system is demonstrated by preliminary closed-loop control investigations in simulations. With the ordinary spacecraft bus, the line-of-sight pointing accuracy can be controlled to below a few milliarcseconds in tip and tilt. Meanwhile, utilizing the voice coil motor with the softening spring in parallel, which is a portion of the dual-stage actuator, the system effectively achieves low-frequency motion transmission and high-frequency vibration isolation along the other four degree-of-freedom directions.
Li, Tianhao; Fu, Qian-Jie
2011-08-01
(1) To investigate whether voice gender discrimination (VGD) could be a useful indicator of the spectral and temporal processing abilities of individual cochlear implant (CI) users; (2) To examine the relationship between VGD and speech recognition with CI when comparable acoustic cues are used for both perception processes. VGD was measured using two talker sets with different inter-gender fundamental frequencies (F(0)), as well as different acoustic CI simulations. Vowel and consonant recognition in quiet and noise were also measured and compared with VGD performance. Eleven postlingually deaf CI users. The results showed that (1) mean VGD performance differed for different stimulus sets, (2) VGD and speech recognition performance varied among individual CI users, and (3) individual VGD performance was significantly correlated with speech recognition performance under certain conditions. VGD measured with selected stimulus sets might be useful for assessing not only pitch-related perception, but also spectral and temporal processing by individual CI users. In addition to improvements in spectral resolution and modulation detection, the improvement in higher modulation frequency discrimination might be particularly important for CI users in noisy environments.
The siren song of vocal fundamental frequency for romantic relationships.
Weusthoff, Sarah; Baucom, Brian R; Hahlweg, Kurt
2013-01-01
A multitude of factors contribute to why and how romantic relationships are formed as well as whether they ultimately succeed or fail. Drawing on evolutionary models of attraction and speech production as well as integrative models of relationship functioning, this review argues that paralinguistic cues (more specifically the fundamental frequency of the voice) that are initially a strong source of attraction also increase couples' risk for relationship failure. Conceptual similarities and differences between the multiple operationalizations and interpretations of vocal fundamental frequency are discussed and guidelines are presented for understanding both convergent and non-convergent findings. Implications for clinical practice and future research are discussed.
A study and experiment plan for digital mobile communication via satellite
NASA Technical Reports Server (NTRS)
Jones, J. J.; Craighill, E. J.; Evans, R. G.; Vincze, A. D.; Tom, N. N.
1978-01-01
The viability of mobile communications is examined within the context of a frequency division multiple access, single channel per carrier satellite system emphasizing digital techniques to serve a large population of users. The intent is to provide the mobile users with a grade of service consistant with the requirements for remote, rural (perhaps emergency) voice communications, but which approaches toll quality speech. A traffic model is derived on which to base the determination of the required maximum number of satellite channels to provide the anticipated level of service. Various voice digitalization and digital modulation schemes are reviewed along with a general link analysis of the mobile system. Demand assignment multiple access considerations and analysis tradeoffs are presented. Finally, a completed configuration is described.
Voice Register in Mon: Acoustics and Electroglottography
Abramson, Arthur S.; Tiede, Mark K.; Luangthongkum, Theraphan
2016-01-01
Mon is spoken in villages in Thailand and Myanmar. The dialect of Ban Nakhonchum, Thailand has two voice registers, modal and breathy; these phonation types, along with other phonetic properties, distinguish minimal pairs. Four native speakers of this dialect recorded repetitions of 14 randomized words (seven minimal pairs) for acoustic analysis. We used a subset of these pairs in a listening test to verify the perceptual robustness of the register distinction. Acoustic analysis found significant differences in noise component, spectral slope, and fundamental frequency. In a subsequent session four speakers were also recorded using electroglottography (EGG), which showed systematic differences in the contact quotient (CQ). The salience of these properties in maintaining the register distinction is discussed in the context of possible tonogenesis for this language. PMID:26636544
Cavalot, A L; Palonta, F; Preti, G; Nazionale, G; Ricci, E; Vione, N; Albera, R; Cortesina, G
2001-12-01
The insertion of a prosthesis and restoration with pectoralis major myocutaneous flaps for patients subjected to total pharyngolaryngectomy is a technique now universally accepted; however the literature on the subject is lacking. Our study considers 10 patients subjected to total pharyngolaryngectomy and restoration with pectoralis major myocutaneous flaps who were fitted with vocal function prostheses and a control group of 50 subjects treated with a total laryngectomy without pectoralis major myocutaneous flaps and who were fitted with vocal function prostheses. Specific qualitative and quantitative parameters were compared. The quantitative measurement of the levels of voice intensity and the evaluation of the harmonics-to-noise ratio were not statistically significant (p > 0.05) between the two study groups at either high- or low-volume speech. On the contrary, statistically significant differences were found (p < 0.05) for the basic frequency of both the low and the high volume voice. For the qualitative analysis seven parameters were established for evaluation by trained and untrained listeners: on the basis of these parameters the control group had statistically better voices.
Display-based communications for advanced transport aircraft
NASA Technical Reports Server (NTRS)
Lee, Alfred T.
1989-01-01
The next generation of civil transport aircraft will depend increasingly upon ground-air-ground and satellite data link for information critical to safe and efficient air transportation. Previous studies which examined the concept of display-based communications in addition to, or in lieu of, conventional voice transmissions are reviewed. A full-mission flight simulation comparing voice and display-based communication modes in an advanced transport aircraft is also described. The results indicate that a display-based mode of information transfer does not result in significantly increased aircrew workload, but does result in substantially increased message acknowledgment times when compared to conventional voice transmissions. User acceptance of the display-based communication system was generally high, replicating the findings of previous studies. However, most pilots tested expressed concern over the potential loss of information available from frequency monitoring which might result from the introduction of discrete address communications. Concern was expressed by some pilots for the reduced time available to search for conflicting traffic when using the communications display system. The implications of the findings for the design of display-based communications are discussed.
Women use voice parameters to assess men's characteristics
Bruckert, Laetitia; Liénard, Jean-Sylvain; Lacroix, André; Kreutzer, Michel; Leboucher, Gérard
2005-01-01
The purpose of this study was: (i) to provide additional evidence regarding the existence of human voice parameters, which could be reliable indicators of a speaker's physical characteristics and (ii) to examine the ability of listeners to judge voice pleasantness and a speaker's characteristics from speech samples. We recorded 26 men enunciating five vowels. Voices were played to 102 female judges who were asked to assess vocal attractiveness and speakers' age, height and weight. Statistical analyses were used to determine: (i) which physical component predicted which vocal component and (ii) which vocal component predicted which judgment. We found that men with low-frequency formants and small formant dispersion tended to be older, taller and tended to have a high level of testosterone. Female listeners were consistent in their pleasantness judgment and in their height, weight and age estimates. Pleasantness judgments were based mainly on intonation. Female listeners were able to correctly estimate age by using formant components. They were able to estimate weight but we could not explain which acoustic parameters they used. However, female listeners were not able to estimate height, possibly because they used intonation incorrectly. Our study confirms that in all mammal species examined thus far, including humans, formant components can provide a relatively accurate indication of a vocalizing individual's characteristics. Human listeners have the necessary information at their disposal; however, they do not necessarily use it. PMID:16519239
Dilley, Laura C.; Wieland, Elizabeth A.; Gamache, Jessica L.; McAuley, J. Devin; Redford, Melissa A.
2013-01-01
Purpose As children mature, changes in voice spectral characteristics covary with changes in speech, language, and behavior. Spectral characteristics were manipulated to alter the perceived ages of talkers’ voices while leaving critical acoustic-prosodic correlates intact, to determine whether perceived age differences were associated with differences in judgments of prosodic, segmental, and talker attributes. Method Speech was modified by lowering formants and fundamental frequency, for 5-year-old children’s utterances, or raising them, for adult caregivers’ utterances. Next, participants differing in awareness of the manipulation (Exp. 1a) or amount of speech-language training (Exp. 1b) made judgments of prosodic, segmental, and talker attributes. Exp. 2 investigated the effects of spectral modification on intelligibility. Finally, in Exp. 3 trained analysts used formal prosody coding to assess prosodic characteristics of spectrally-modified and unmodified speech. Results Differences in perceived age were associated with differences in ratings of speech rate, fluency, intelligibility, likeability, anxiety, cognitive impairment, and speech-language disorder/delay; effects of training and awareness of the manipulation on ratings were limited. There were no significant effects of the manipulation on intelligibility or formally coded prosody judgments. Conclusions Age-related voice characteristics can greatly affect judgments of speech and talker characteristics, raising cautionary notes for developmental research and clinical work. PMID:23275414
Behroozmand, Roozbeh; Korzyukov, Oleg; Larson, Charles R.
2012-01-01
Previous studies have shown that the pitch of a sound is perceived in the absence of its fundamental frequency (F0), suggesting that a distinct mechanism may resolve pitch based on a pattern that exists between harmonic frequencies. The present study investigated whether such a mechanism is active during voice pitch control. ERPs were recorded in response to +200 cents pitch shifts in the auditory feedback of self-vocalizations and complex tones with and without the F0. The absence of the fundamental induced no difference in ERP latencies. However, a right-hemisphere difference was found in the N1 amplitudes with larger responses to complex tones that included the fundamental compared to when it was missing. The P1 and N1 latencies were shorter in the left hemisphere, and the N1 and P2 amplitudes were larger bilaterally for pitch shifts in voice and complex tones compared with pure tones. These findings suggest hemispheric differences in neural encoding of pitch in sounds with missing fundamental. Data from the present study suggest that the right cortical auditory areas, thought to be specialized for spectral processing, may utilize different mechanisms to resolve pitch in sounds with missing fundamental. The left hemisphere seems to perform faster processing to resolve pitch based on the rate of temporal variations in complex sounds compared with pure tones. These effects indicate that the differential neural processing of pitch in the left and right hemispheres may enable the audio-vocal system to detect temporal and spectral variations in the auditory feedback for vocal pitch control. PMID:22386045
The effect of age of cochlear implantation on vocal characteristics in children
Knight, Kerry; Ducasse, Simone; Coetzee, Ashley; Louw, Anel
2016-01-01
Background Early cochlear implantation aids auditory feedback and supports better communication and self-monitoring of the voice. The objective of this study was to determine whether the age of cochlear implantation has an impact on vocal development in children implanted before age 4. Method and procedures The study consisted of 19 participants in total. All implant recipients (experimental group) were 3–5 years post-implantation, including four prelingual (0–2 years) and five perilingual (2–4 years) implant recipients. The control group consisted of 10 children whose hearing was within normal limits between the ages 3–6 years and 10 months, which was compared to the experimental group. Established paediatric norms were used for additional comparison. A questionnaire was used to gather information from each of the participant’s caregivers to determine whether other personal and contextual factors had an impact on voice production. An acoustic analysis was conducted for each participant using the Multi-Dimensional Voice Program of the Computerized Speech Lab. Results When the experimental group and the control group were compared, similar results were yielded for fundamental frequency and short-term perturbation (jitter and shimmer). More variability was noted in long-term frequency and amplitude measures, with significantly higher differences, and therefore further outside the norms, in the prelingual group when compared to the perilingual and control groups. Conclusion In this study, age of implantation did not impact vocal characteristics. Further research should include larger sample sizes, with participants that are age and gender matched. PMID:27380914