Federal Register 2010, 2011, 2012, 2013, 2014
2010-06-02
... request vouchers for distribution of grant funds using the automated Voice Response System (VRS). An... Payment and Line of Credit Control System (LOCCS) Voice Response System Access Authorization AGENCY... subject proposal. Payment request vouchers for distribution of grant funds using the automated Voice...
2009-06-01
Blackberry handheld) device. After each voice command activation, the medic provided voice comments to be recorded in Observer Notepad over Voice...vial (up-right corner of picture) upon voice activation from the medic’s Blackberry handheld. The NPS UAS which was controlled by voice commands...Voice Portal using a standard Blackberry handheld with a head set. The results demonstrated sufficient accuracy for controlling the tactical sensor
Central Nervous System Control of Voice and Swallowing
Ludlow, Christy L.
2015-01-01
This review of the central nervous control systems for voice and swallowing has suggested that the traditional concepts of a separation between cortical and limbic and brain stem control should be refined and more integrative. For voice production, a separation of the non-human vocalization system from the human learned voice production system has been posited based primarily on studies of non-human primates. However, recent humans studies of emotionally based vocalizations and human volitional voice production has shown more integration between these two systems than previously proposed. Recent human studies have shown that reflexive vocalization as well as learned voice production not involving speech, involve a common integrative system. On the other hand, recent studies of non-human primates have provided evidence of some cortical activity during vocalization and cortical changes with training during vocal behavior. For swallowing, evidence from the macaque and functional brain imaging in humans indicates that the control for the pharyngeal phase of swallowing is not primarily under brain stem mechanisms as previously proposed. Studies suggest that the initiation and patterning of swallowing for the pharyngeal phase is also under active cortical control for both spontaneous as well as volitional swallowing in awake humans and non-human primates. PMID:26241238
A simulation study of the effects of communication delay on air traffic control
DOT National Transportation Integrated Search
1990-09-01
This study was conducted to examine the impacts of voice communications delays : characteristic of Voice Switching and Control System (VSCS) and satellite : communications systems on air traffic system performance, controller stress : and workload, a...
Voice loops as coordination aids in space shuttle mission control.
Patterson, E S; Watts-Perotti, J; Woods, D D
1999-01-01
Voice loops, an auditory groupware technology, are essential coordination support tools for experienced practitioners in domains such as air traffic management, aircraft carrier operations and space shuttle mission control. They support synchronous communication on multiple channels among groups of people who are spatially distributed. In this paper, we suggest reasons for why the voice loop system is a successful medium for supporting coordination in space shuttle mission control based on over 130 hours of direct observation. Voice loops allow practitioners to listen in on relevant communications without disrupting their own activities or the activities of others. In addition, the voice loop system is structured around the mission control organization, and therefore directly supports the demands of the domain. By understanding how voice loops meet the particular demands of the mission control environment, insight can be gained for the design of groupware tools to support cooperative activity in other event-driven domains.
Voice loops as coordination aids in space shuttle mission control
NASA Technical Reports Server (NTRS)
Patterson, E. S.; Watts-Perotti, J.; Woods, D. D.
1999-01-01
Voice loops, an auditory groupware technology, are essential coordination support tools for experienced practitioners in domains such as air traffic management, aircraft carrier operations and space shuttle mission control. They support synchronous communication on multiple channels among groups of people who are spatially distributed. In this paper, we suggest reasons for why the voice loop system is a successful medium for supporting coordination in space shuttle mission control based on over 130 hours of direct observation. Voice loops allow practitioners to listen in on relevant communications without disrupting their own activities or the activities of others. In addition, the voice loop system is structured around the mission control organization, and therefore directly supports the demands of the domain. By understanding how voice loops meet the particular demands of the mission control environment, insight can be gained for the design of groupware tools to support cooperative activity in other event-driven domains.
[Research on Control System of an Exoskeleton Upper-limb Rehabilitation Robot].
Wang, Lulu; Hu, Xin; Hu, Jie; Fang, Youfang; He, Rongrong; Yu, Hongliu
2016-12-01
In order to help the patients with upper-limb disfunction go on rehabilitation training,this paper proposed an upper-limb exoskeleton rehabilitation robot with four degrees of freedom(DOF),and realized two control schemes,i.e.,voice control and electromyography control.The hardware and software design of the voice control system was completed based on RSC-4128 chips,which realized the speech recognition technology of a specific person.Besides,this study adapted self-made surface eletromyogram(sEMG)signal extraction electrodes to collect sEMG signals and realized pattern recognition by conducting sEMG signals processing,extracting time domain features and fixed threshold algorithm.In addition,the pulse-width modulation(PWM)algorithm was used to realize the speed adjustment of the system.Voice control and electromyography control experiments were then carried out,and the results showed that the mean recognition rate of the voice control and electromyography control reached 93.1%and 90.9%,respectively.The results proved the feasibility of the control system.This study is expected to lay a theoretical foundation for the further improvement of the control system of the upper-limb rehabilitation robot.
Brain systems mediating voice identity processing in blind humans.
Hölig, Cordula; Föcker, Julia; Best, Anna; Röder, Brigitte; Büchel, Christian
2014-09-01
Blind people rely more on vocal cues when they recognize a person's identity than sighted people. Indeed, a number of studies have reported better voice recognition skills in blind than in sighted adults. The present functional magnetic resonance imaging study investigated changes in the functional organization of neural systems involved in voice identity processing following congenital blindness. A group of congenitally blind individuals and matched sighted control participants were tested in a priming paradigm, in which two voice stimuli (S1, S2) were subsequently presented. The prime (S1) and the target (S2) were either from the same speaker (person-congruent voices) or from two different speakers (person-incongruent voices). Participants had to classify the S2 as either a old or a young person. Person-incongruent voices (S2) compared with person-congruent voices elicited an increased activation in the right anterior fusiform gyrus in congenitally blind individuals but not in matched sighted control participants. In contrast, only matched sighted controls showed a higher activation in response to person-incongruent compared with person-congruent voices (S2) in the right posterior superior temporal sulcus. These results provide evidence for crossmodal plastic changes of the person identification system in the brain after visual deprivation. Copyright © 2014 Wiley Periodicals, Inc.
DTO-675: Voice Control of the Closed Circuit Television System
NASA Technical Reports Server (NTRS)
Salazar, George; Gaston, Darilyn M.; Haynes, Dena S.
1996-01-01
This report presents the results of the Detail Test Object (DTO)-675 "Voice Control of the Closed Circuit Television (CCTV)" system. The DTO is a follow-on flight of the Voice Command System (VCS) that flew as a secondary payload on STS-41. Several design changes were made to the VCS for the STS-78 mission. This report discusses those design changes, the data collected during the mission, recognition problems encountered, and findings.
Central nervous system control of the laryngeal muscles in humans
Ludlow, Christy L.
2005-01-01
Laryngeal muscle control may vary for different functions such as: voice for speech communication, emotional expression during laughter and cry, breathing, swallowing, and cough. This review discusses the control of the human laryngeal muscles for some of these different functions. Sensori-motor aspects of laryngeal control have been studied by eliciting various laryngeal reflexes. The role of audition in learning and monitoring ongoing voice production for speech is well known; while the role of somatosensory feedback is less well understood. Reflexive control systems involving central pattern generators may contribute to swallowing, breathing and cough with greater cortical control during volitional tasks such as voice production for speech. Volitional control is much less well understood for each of these functions and likely involves the integration of cortical and subcortical circuits. The new frontier is the study of the central control of the laryngeal musculature for voice, swallowing and breathing and how volitional and reflexive control systems may interact in humans. PMID:15927543
Speech versus manual control of camera functions during a telerobotic task
NASA Technical Reports Server (NTRS)
Bierschwale, John M.; Sampaio, Carlos E.; Stuart, Mark A.; Smith, Randy L.
1989-01-01
Voice input for control of camera functions was investigated in this study. Objective were to (1) assess the feasibility of a voice-commanded camera control system, and (2) identify factors that differ between voice and manual control of camera functions. Subjects participated in a remote manipulation task that required extensive camera-aided viewing. Each subject was exposed to two conditions, voice and manual input, with a counterbalanced administration order. Voice input was found to be significantly slower than manual input for this task. However, in terms of remote manipulator performance errors and subject preference, there was no difference between modalities. Voice control of continuous camera functions is not recommended. It is believed that the use of voice input for discrete functions, such as multiplexing or camera switching, could aid performance. Hybrid mixes of voice and manual input may provide the best use of both modalities. This report contributes to a better understanding of the issues that affect the design of an efficient human/telerobot interface.
Design of digital voice storage and playback system
NASA Astrophysics Data System (ADS)
Tang, Chao
2018-03-01
Based on STC89C52 chip, this paper presents a single chip microcomputer minimum system, which is used to realize the logic control of digital speech storage and playback system. Compared with the traditional tape voice recording system, the system has advantages of small size, low power consumption, The effective solution of traditional voice recording system is limited in the use of electronic and information processing.
STS-41 Voice Command System Flight Experiment Report
NASA Technical Reports Server (NTRS)
Salazar, George A.
1981-01-01
This report presents the results of the Voice Command System (VCS) flight experiment on the five-day STS-41 mission. Two mission specialists,Bill Shepherd and Bruce Melnick, used the speaker-dependent system to evaluate the operational effectiveness of using voice to control a spacecraft system. In addition, data was gathered to analyze the effects of microgravity on speech recognition performance.
Voice control of the space shuttle video system
NASA Technical Reports Server (NTRS)
Bejczy, A. K.; Dotson, R. S.; Brown, J. W.; Lewis, J. L.
1981-01-01
A pilot voice control system developed at the Jet Propulsion Laboratory (JPL) to test and evaluate the feasibility of controlling the shuttle TV cameras and monitors by voice commands utilizes a commercially available discrete word speech recognizer which can be trained to the individual utterances of each operator. Successful ground tests were conducted using a simulated full-scale space shuttle manipulator. The test configuration involved the berthing, maneuvering and deploying a simulated science payload in the shuttle bay. The handling task typically required 15 to 20 minutes and 60 to 80 commands to 4 TV cameras and 2 TV monitors. The best test runs show 96 to 100 percent voice recognition accuracy.
Controller/Computer Interface with an Air-Ground Data Link
DOT National Transportation Integrated Search
1976-06-01
This report describes the results of an experiment for evaluating the controller/computer interface in an ARTS III/M&S system modified for use with a simulated digital data link and a voice link utilizing a computer-generated voice system. A modified...
Internet-Based System for Voice Communication With the ISS
NASA Technical Reports Server (NTRS)
Chamberlain, James; Myers, Gerry; Clem, David; Speir, Terri
2005-01-01
The Internet Voice Distribution System (IVoDS) is a voice-communication system that comprises mainly computer hardware and software. The IVoDS was developed to supplement and eventually replace the Enhanced Voice Distribution System (EVoDS), which, heretofore, has constituted the terrestrial subsystem of a system for voice communications among crewmembers of the International Space Station (ISS), workers at the Payloads Operations Center at Marshall Space Flight Center, principal investigators at diverse locations who are responsible for specific payloads, and others. The IVoDS utilizes a communication infrastructure of NASA and NASArelated intranets in addition to, as its name suggests, the Internet. Whereas the EVoDS utilizes traditional circuitswitched telephony, the IVoDS is a packet-data system that utilizes a voice over Internet protocol (VOIP). Relative to the EVoDS, the IVoDS offers advantages of greater flexibility and lower cost for expansion and reconfiguration. The IVoDS is an extended version of a commercial Internet-based voice conferencing system that enables each user to participate in only one conference at a time. In the IVoDS, a user can receive audio from as many as eight conferences simultaneously while sending audio to one of them. The IVoDS also incorporates administrative controls, beyond those of the commercial system, that provide greater security and control of the capabilities and authorizations for talking and listening afforded to each user.
The effects of voice and manual control mode on dual task performance
NASA Technical Reports Server (NTRS)
Wickens, C. D.; Zenyuh, J.; Culp, V.; Marshak, W.
1986-01-01
Two fundamental principles of human performance, compatibility and resource competition, are combined with two structural dichotomies in the human information processing system, manual versus voice output, and left versus right cerebral hemisphere, in order to predict the optimum combination of voice and manual control with either hand, for time-sharing performance of a dicrete and continuous task. Eight right handed male subjected performed a discrete first-order tracking task, time-shared with an auditorily presented Sternberg Memory Search Task. Each task could be controlled by voice, or by the left or right hand, in all possible combinations except for a dual voice mode. When performance was analyzed in terms of a dual-task decrement from single task control conditions, the following variables influenced time-sharing efficiency in diminishing order of magnitude, (1) the modality of control, (discrete manual control of tracking was superior to discrete voice control of tracking and the converse was true with the memory search task), (2) response competition, (performance was degraded when both tasks were responded manually), (3) hemispheric competition, (performance degraded whenever two tasks were controlled by the left hemisphere) (i.e., voice or right handed control). The results confirm the value of predictive models invoice control implementation.
Interface Anywhere: Development of a Voice and Gesture System for Spaceflight Operations
NASA Technical Reports Server (NTRS)
Thompson, Shelby; Haddock, Maxwell; Overland, David
2013-01-01
The Interface Anywhere Project was funded through Innovation Charge Account (ICA) at NASA JSC in the Fall of 2012. The project was collaboration between human factors and engineering to explore the possibility of designing an interface to control basic habitat operations through gesture and voice control; (a) Current interfaces require the users to be physically near an input device in order to interact with the system; and (b) By using voice and gesture commands, the user is able to interact with the system anywhere they want within the work environment.
DOT National Transportation Integrated Search
1976-03-01
Two Quadrature Modulation/Phase Shift Keyed (QM/PSK) Voice/Data Modem systems have been developed as part of the satellite communications hardware for advanced air traffic control systems. These systems consist of a modulator and demodulator unti whi...
Quantitative evaluation of the voice range profile in patients with voice disorder.
Ikeda, Y; Masuda, T; Manako, H; Yamashita, H; Yamamoto, T; Komiyama, S
1999-01-01
In 1953, Calvet first displayed the fundamental frequency (pitch) and sound pressure level (intensity) of a voice on a two-dimensional plane and created a voice range profile. This profile has been used to evaluate clinically various vocal disorders, although such evaluations to date have been subjective without quantitative assessment. In the present study, a quantitative system was developed to evaluate the voice range profile utilizing a personal computer. The area of the voice range profile was defined as the voice volume. This volume was analyzed in 137 males and 175 females who were treated for various dysphonias at Kyushu University between 1984 and 1990. Ten normal subjects served as controls. The voice volume in cases with voice disorders significantly decreased irrespective of the disease and sex. Furthermore, cases having better improvement after treatment showed a tendency for the voice volume to increase. These findings illustrated the voice volume as a useful clinical test for evaluating voice control in cases with vocal disorders.
NASA Technical Reports Server (NTRS)
1977-01-01
Michael Condon, a quadraplegic from Pasadena, California, demonstrates the NASA-developed voice-controlled wheelchair and its manipulator, which can pick up packages, open doors, turn a TV knob, and perform a variety of other functions. A possible boon to paralyzed and other severely handicapped persons, the chair-manipulator system responds to 35 one-word voice commands, such as "go," "stop," "up," "down," "right," "left," "forward," "backward." The heart of the system is a voice-command analyzer which utilizes a minicomputer. Commands are taught I to the computer by the patient's repeating them a number of times; thereafter the analyzer recognizes commands only in the patient's particular speech pattern. The computer translates commands into electrical signals which activate appropriate motors and cause the desired motion of chair or manipulator. Based on teleoperator and robot technology for space-related programs, the voice-controlled system was developed by Jet Propulsion Laboratory under the joint sponsorship of NASA and the Veterans Administration. The wheelchair-manipulator has been tested at Rancho Los Amigos Hospital, Downey, California, and is being evaluated at the VA Prosthetics Center in New York City.
Do What I Say! Voice Recognition Makes Major Advances.
ERIC Educational Resources Information Center
Ruley, C. Dorsey
1994-01-01
Explains voice recognition technology applications in the workplace, schools, and libraries. Highlights include a voice-controlled work station using the DragonDictate system that can be used with dyslexic students, converting text to speech, and converting speech to text. (LRW)
Design and realization of intelligent tourism service system based on voice interaction
NASA Astrophysics Data System (ADS)
Hu, Lei-di; Long, Yi; Qian, Cheng-yang; Zhang, Ling; Lv, Guo-nian
2008-10-01
Voice technology is one of the important contents to improve the intelligence and humanization of tourism service system. Combining voice technology, the paper concentrates on application needs and the composition of system to present an overall intelligent tourism service system's framework consisting of presentation layer, Web services layer, and tourism application service layer. On the basis, the paper further elaborated the implementation of the system and its key technologies, including intelligent voice interactive technology, seamless integration technology of multiple data sources, location-perception-based guides' services technology, and tourism safety control technology. Finally, according to the situation of Nanjing tourism, a prototype of Tourism Services System is realized.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-10-24
... system. The information collected on the payment voucher will also be used as an internal control measure... LOCCS/VRS voice activated system. The information collected on the form serves also as an internal control measure to ensure the lawful and appropriate disbursement of Federal funds. DATES: Comments Due...
NASA Astrophysics Data System (ADS)
Sherley, Patrick L.; Pujol, Alfonso, Jr.; Meadow, John S.
1990-07-01
To provide a means of rendering complex computer architectures languages and input/output modalities transparent to experienced and inexperienced users research is being conducted to develop a voice driven/voice response computer graphics imaging system. The system will be used for reconstructing and displaying computed tomography and magnetic resonance imaging scan data. In conjunction with this study an artificial intelligence (Al) control strategy was developed to interface the voice components and support software to the computer graphics functions implemented on the Sun Microsystems 4/280 color graphics workstation. Based on generated text and converted renditions of verbal utterances by the user the Al control strategy determines the user''s intent and develops and validates a plan. The program type and parameters within the plan are used as input to the graphics system for reconstructing and displaying medical image data corresponding to that perceived intent. If the plan is not valid the control strategy queries the user for additional information. The control strategy operates in a conversation mode and vocally provides system status reports. A detailed examination of the various AT techniques is presented with major emphasis being placed on their specific roles within the total control strategy structure. 1.
DOT National Transportation Integrated Search
1975-10-01
This report details the result of an experiment performed by the Transportation Systems Center of the Department of Transportation to evaluate candidate voice and data modulation systems for use in an L-Band Air Traffic Control System. The experiment...
NASA Astrophysics Data System (ADS)
White, R. W.; Parks, D. L.
1985-07-01
A study was conducted to determine potential commercial aircraft flight deck applications and implementation guidelines for voice recognition and synthesis. At first, a survey of voice recognition and synthesis technology was undertaken to develop a working knowledge base. Then, numerous potential aircraft and simulator flight deck voice applications were identified and each proposed application was rated on a number of criteria in order to achieve an overall payoff rating. The potential voice recognition applications fell into five general categories: programming, interrogation, data entry, switch and mode selection, and continuous/time-critical action control. The ratings of the first three categories showed the most promise of being beneficial to flight deck operations. Possible applications of voice synthesis systems were categorized as automatic or pilot selectable and many were rated as being potentially beneficial. In addition, voice system implementation guidelines and pertinent performance criteria are proposed. Finally, the findings of this study are compared with those made in a recent NASA study of a 1995 transport concept.
NASA Technical Reports Server (NTRS)
White, R. W.; Parks, D. L.
1985-01-01
A study was conducted to determine potential commercial aircraft flight deck applications and implementation guidelines for voice recognition and synthesis. At first, a survey of voice recognition and synthesis technology was undertaken to develop a working knowledge base. Then, numerous potential aircraft and simulator flight deck voice applications were identified and each proposed application was rated on a number of criteria in order to achieve an overall payoff rating. The potential voice recognition applications fell into five general categories: programming, interrogation, data entry, switch and mode selection, and continuous/time-critical action control. The ratings of the first three categories showed the most promise of being beneficial to flight deck operations. Possible applications of voice synthesis systems were categorized as automatic or pilot selectable and many were rated as being potentially beneficial. In addition, voice system implementation guidelines and pertinent performance criteria are proposed. Finally, the findings of this study are compared with those made in a recent NASA study of a 1995 transport concept.
Voice Over Internet Protocol (VoIP) in a Control Center Environment
NASA Technical Reports Server (NTRS)
Pirani, Joseph; Calvelage, Steven
2010-01-01
The technology of transmitting voice over data networks has been available for over 10 years. Mass market VoIP services for consumers to make and receive standard telephone calls over broadband Internet networks have grown in the last 5 years. While operational costs are less with VoIP implementations as opposed to time division multiplexing (TDM) based voice switches, is it still advantageous to convert a mission control center s voice system to this newer technology? Marshall Space Flight Center (MSFC) Huntsville Operations Support Center (HOSC) has converted its mission voice services to a commercial product that utilizes VoIP technology. Results from this testing, design, and installation have shown unique considerations that must be addressed before user operations. There are many factors to consider for a control center voice design. Technology advantages and disadvantages were investigated as they refer to cost. There were integration concerns which could lead to complex failure scenarios but simpler integration for the mission infrastructure. MSFC HOSC will benefit from this voice conversion with less product replacement cost, less operations cost and a more integrated mission services environment.
Assured Information Flow Capping Architecture.
1985-05-01
Air Control System Deployment, ESD-TR-71-371, AD 733 584, Electronic Systems Division, AFSC, Hanscom Air Force Base, MA, November 1971. 3. I. Gitman and...H. Frank, "Economic Analysis of Integrated Voice and Data Networks: A Case Study," Proceedings of the IEEE, November 1978. 4. H. Frank and I. Gitman ... Gitman , "Study Shows Packet Switching Best for Voice Traffic, Too," Data Communications, March 1979. ___ "Economic Analysis of Integrated Voice and
1988-09-01
Group Subgroup Command and control; Computational linguistics; expert system voice recognition; man- machine interface; U.S. Government 19 Abstract...simulates the characteristics of FRESH on a smaller scale. This study assisted NOSC in developing a voice-recognition, man- machine interface that could...scale. This study assisted NOSC in developing a voice-recogni- tion, man- machine interface that could be used with TONE and upgraded at a later date
Central voice production and pathophysiology of spasmodic dysphonia.
Mor, Niv; Simonyan, Kristina; Blitzer, Andrew
2018-01-01
Our ability to speak is complex, and the role of the central nervous system in controlling speech production is often overlooked in the field of otolaryngology. In this brief review, we present an integrated overview of speech production with a focus on the role of central nervous system. The role of central control of voice production is then further discussed in relation to the potential pathophysiology of spasmodic dysphonia (SD). Peer-review articles on central laryngeal control and SD were identified from PUBMED search. Selected articles were augmented with designated relevant publications. Publications that discussed central and peripheral nervous system control of voice production and the central pathophysiology of laryngeal dystonia were chosen. Our ability to speak is regulated by specialized complex mechanisms coordinated by high-level cortical signaling, brainstem reflexes, peripheral nerves, muscles, and mucosal actions. Recent studies suggest that SD results from a primary central disturbance associated with dysfunction at our highest levels of central voice control. The efficacy of botulinum toxin in treating SD may not be limited solely to its local effect on laryngeal muscles and also may modulate the disorder at the level of the central nervous system. Future therapeutic options that target the central nervous system may help modulate the underlying disorder in SD and allow clinicians to better understand the principal pathophysiology. NA.Laryngoscope, 128:177-183, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-05-28
... request vouchers for distribution of grant funds using the automated Voice Response System (VRS). An... Information Collection for Public Comment; Request Voucher for Grant Payment and Line of Credit Control System (LOCCS) Voice Response System Access AGENCY: Office of the Chief Financial Officer, HUD. ACTION: Notice...
Evaluation of Different Speech and Touch Interfaces to In-Vehicle Music Retrieval Systems
Garay-Vega, L.; Pradhan, A. K.; Weinberg, G.; Schmidt-Nielsen, B.; Harsham, B.; Shen, Y.; Divekar, G.; Romoser, M.; Knodler, M.; Fisher, D. L.
2010-01-01
In-vehicle music retrieval systems are becoming more and more popular. Previous studies have shown that they pose a real hazard to drivers when the interface is a tactile one which requires multiple entries and a combination of manual control and visual feedback. Voice interfaces exist as an alternative. Such interfaces can require either multiple or single conversational turns. In this study, each of 17 participants between the ages of 18 and 30 years old was asked to use three different music-retrieval systems (one with a multiple entry touch interface, the iPod™, one with a multiple turn voice interface, interface B, and one with a single turn voice interface, interface C) while driving through a virtual world. Measures of secondary task performance, eye behavior, vehicle control, and workload were recorded. When compared with the touch interface, the voice interfaces reduced the total time drivers spent with their eyes off the forward roadway, especially in prolonged glances, as well as both the total number of glances away from the forward roadway and the perceived workload. Furthermore, when compared with driving without a secondary task, both voice interfaces did not significantly impact hazard anticipation, the frequency of long glances away from the forward roadway, or vehicle control. The multiple turn voice interface (B) significantly increased both the time it took drivers to complete the task and the workload. The implications for interface design and safety are discussed. PMID:20380920
Evaluation of a voice recognition system for the MOTAS pseudo pilot station function
NASA Technical Reports Server (NTRS)
Houck, J. A.
1982-01-01
The Langley Research Center has undertaken a technology development activity to provide a capability, the mission oriented terminal area simulation (MOTAS), wherein terminal area and aircraft systems studies can be performed. An experiment was conducted to evaluate state-of-the-art voice recognition technology and specifically, the Threshold 600 voice recognition system to serve as an aircraft control input device for the MOTAS pseudo pilot station function. The results of the experiment using ten subjects showed a recognition error of 3.67 percent for a 48-word vocabulary tested against a programmed vocabulary of 103 words. After the ten subjects retrained the Threshold 600 system for the words which were misrecognized or rejected, the recognition error decreased to 1.96 percent. The rejection rates for both cases were less than 0.70 percent. Based on the results of the experiment, voice recognition technology and specifically the Threshold 600 voice recognition system were chosen to fulfill this MOTAS function.
Defining the ATC Controller Interface for Data Link Clearances
NASA Technical Reports Server (NTRS)
Rankin, James
1998-01-01
The Controller Interface (CI) is the primary method for Air Traffic Controllers to communicate with aircraft via Controller-Pilot Data Link Communications (CPDLC). The controller, wearing a microphone/headset, aurally gives instructions to aircraft as he/she would with today's voice radio systems. The CI's voice recognition system converts the instructions to digitized messages that are formatted according to the RTCA DO-219 Operational Performance Standards for ATC Two-Way Data Link Communications. The DO-219 messages are transferred via RS-232 to the ATIDS system for uplink using a Mode-S datalink. Pilot acknowledgments of controller messages are downlinked to the ATIDS system and transferred to the Cl. A computer monitor is used to convey information to the controller. Aircraft data from the ARTS database are displayed on flight strips. The flight strips are electronic versions of the strips currently used in the ATC system. Outgoing controller messages cause the respective strip to change color to indicate an unacknowledged transmission. The message text is shown on the flight strips for reference. When the pilot acknowledges the message, the strip returns to its normal color. A map of the airport can also be displayed on the monitor. In addition to voice recognition, the controller can enter messages using the monitor's touch screen or by mouse/keyboard.
NASA Technical Reports Server (NTRS)
Birch, J. N.; Getzin, N.
1971-01-01
Analog and digital voice coding techniques for application to an L-band satellite-basedair traffic control (ATC) system for over ocean deployment are examined. In addition to performance, the techniques are compared on the basis of cost, size, weight, power consumption, availability, reliability, and multiplexing features. Candidate systems are chosen on the bases of minimum required RF bandwidth and received carrier-to-noise density ratios. A detailed survey of automated and nonautomated intelligibility testing methods and devices is presented and comparisons given. Subjective evaluation of speech system by preference tests is considered. Conclusion and recommendations are developed regarding the selection of the voice system. Likewise, conclusions and recommendations are developed for the appropriate use of intelligibility tests, speech quality measurements, and preference tests with the framework of the proposed ATC system.
Park, Kyihwan; Choi, Dongyoub; Ozer, Abdullah; Kim, Sangyoo; Lee, Yongkwan; Joo, Dongik
2008-06-01
We develop a four-mount active vibration isolation system (AVIS) using voice coil actuators. The flexible body modes in the upper plate of the AVIS can cause an instability problem due to control signal whose frequency is close to the resonant frequency of the flexible modes. The loop shaping technique is applied to reduce the amplitude of the control signal. We investigate the performances of the active vibration isolation system proposed in the word in the time domain and frequency domain by comparing to the passive isolation system.
NASA Astrophysics Data System (ADS)
Bose, Sanjay K.
1991-02-01
Various mobile satellite communication systems are being developed for providing integrated voice/data services over a shared satellite transponder which is power-limited in nature. A common strategy is to use slotted ALOHA request channels to request channel assignments for voice/data calls from a network management station. To maximize efficiency in a system with a power-limited satellite transponder, it is proposed that the bursty nature of voice sources be exploited by the NMS to 'over-assign' channels. This may cause problems of inefficiency and potential instability, as well as a degradation in the quality of service. Augmenting this with the introduction of simple state-dependent control procedures provides systems which exhibit more desirable operational features.
NASA Astrophysics Data System (ADS)
Bose, Sanjay K.; Gordon, J. J.
The modeling and analysis of a system providing integrated voice/data services to mobile terminals over a power-limited satellite channel are discussed. The mobiles use slotted Aloha random access to send requests for channel assignments to a central station. For successful requests, the actual transmission of voice/data within a call is done using the channel assigned for this purpose by the central station. The satellite channel is assumed to be power limited. Taking into account the known burstiness of voice sources (which use a voice-activated switch), the central station overassigns channels so that the average total power is below the power limit of the satellite transponder. The performance of this model is analyzed. Certain simple, static control strategies for improving performance are also proposed.
Exploring the anatomical encoding of voice with a mathematical model of the vocal system.
Assaneo, M Florencia; Sitt, Jacobo; Varoquaux, Gael; Sigman, Mariano; Cohen, Laurent; Trevisan, Marcos A
2016-11-01
The faculty of language depends on the interplay between the production and perception of speech sounds. A relevant open question is whether the dimensions that organize voice perception in the brain are acoustical or depend on properties of the vocal system that produced it. One of the main empirical difficulties in answering this question is to generate sounds that vary along a continuum according to the anatomical properties the vocal apparatus that produced them. Here we use a mathematical model that offers the unique possibility of synthesizing vocal sounds by controlling a small set of anatomically based parameters. In a first stage the quality of the synthetic voice was evaluated. Using specific time traces for sub-glottal pressure and tension of the vocal folds, the synthetic voices generated perceptual responses, which are indistinguishable from those of real speech. The synthesizer was then used to investigate how the auditory cortex responds to the perception of voice depending on the anatomy of the vocal apparatus. Our fMRI results show that sounds are perceived as human vocalizations when produced by a vocal system that follows a simple relationship between the size of the vocal folds and the vocal tract. We found that these anatomical parameters encode the perceptual vocal identity (male, female, child) and show that the brain areas that respond to human speech also encode vocal identity. On the basis of these results, we propose that this low-dimensional model of the vocal system is capable of generating realistic voices and represents a novel tool to explore the voice perception with a precise control of the anatomical variables that generate speech. Furthermore, the model provides an explanation of how auditory cortices encode voices in terms of the anatomical parameters of the vocal system. Copyright © 2016 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Chen, Charlie C.; Vannoy, Sandra
2013-01-01
Voice over Internet Protocol- (VoIP) enabled online learning service providers struggling with high attrition rates and low customer loyalty issues despite VoIP's high degree of system fit for online global learning applications. Effective solutions to this prevalent problem rely on the understanding of system quality, information quality, and…
An automatic speech recognition system with speaker-independent identification support
NASA Astrophysics Data System (ADS)
Caranica, Alexandru; Burileanu, Corneliu
2015-02-01
The novelty of this work relies on the application of an open source research software toolkit (CMU Sphinx) to train, build and evaluate a speech recognition system, with speaker-independent support, for voice-controlled hardware applications. Moreover, we propose to use the trained acoustic model to successfully decode offline voice commands on embedded hardware, such as an ARMv6 low-cost SoC, Raspberry PI. This type of single-board computer, mainly used for educational and research activities, can serve as a proof-of-concept software and hardware stack for low cost voice automation systems.
Perceived control and voice handicap in patients with voice disorders.
Frazier, Patricia; Merians, Addie; Misono, Stephanie
2017-11-01
The purpose of the study was to replicate and extend previous research on the relation between perceived present control and voice handicap and to further examine the psychometric properties of a present control scale adapted for patients with voice disorders (Misono, Meredith, Peterson, & Frazier, 2016). Sample 1 consisted of 1,129 patients recruited from a voice disorder clinic who completed measures of perceived present control, distress, and voice handicap in the clinic. Sample 2 consisted of 62 patients from the same clinic who completed measures of present control, distress, voice handicap, and general control beliefs online at baseline and measures of present control and voice handicap again 3 weeks later (n = 59). With regard to the psychometric properties of the voice-adapted present control scale, alpha coefficients were above .80 and the 3-week test-reliability coefficient was .69. There was mixed support for the hypothesized 1-factor structure of the scale. In Sample 1, present control was more strongly associated with lower voice handicap than was distress and accounted for significant variance in voice handicap controlling for distress. In Sample 2, present control at baseline predicted later voice handicap, controlling for general control beliefs and distress. Present control appears to be a promising target for adjunctive interventions for patients with voice disorders. An evidence-based online present control intervention (Hintz, Frazier, & Meredith, 2015) is being adapted for this patient population. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Real-Time Reconfigurable Adaptive Speech Recognition Command and Control Apparatus and Method
NASA Technical Reports Server (NTRS)
Salazar, George A. (Inventor); Haynes, Dena S. (Inventor); Sommers, Marc J. (Inventor)
1998-01-01
An adaptive speech recognition and control system and method for controlling various mechanisms and systems in response to spoken instructions and in which spoken commands are effective to direct the system into appropriate memory nodes, and to respective appropriate memory templates corresponding to the voiced command is discussed. Spoken commands from any of a group of operators for which the system is trained may be identified, and voice templates are updated as required in response to changes in pronunciation and voice characteristics over time of any of the operators for which the system is trained. Provisions are made for both near-real-time retraining of the system with respect to individual terms which are determined not be positively identified, and for an overall system training and updating process in which recognition of each command and vocabulary term is checked, and in which the memory templates are retrained if necessary for respective commands or vocabulary terms with respect to an operator currently using the system. In one embodiment, the system includes input circuitry connected to a microphone and including signal processing and control sections for sensing the level of vocabulary recognition over a given period and, if recognition performance falls below a given level, processing audio-derived signals for enhancing recognition performance of the system.
NASA Technical Reports Server (NTRS)
Johnson, Walter W.; Lachter, Joel B.; Battiste, Vernol; Lim, Veranika; Brandt, Summer L.; Koteskey, Robert W.; Dao, Arik-Quang V.; Ligda, Sarah V.; Wu, Shu-Chieh
2011-01-01
A primary feature of the Next Generation Air Transportation System (NextGen) is trajectory based operations (TBO). Under TBO, aircraft flight plans are known to computer systems on the ground that aid in scheduling and separation. The Future Air Navigation System (FANS) was developed to support TBO, but relatively few aircraft in the US are FANSequipped. Thus, any near-term implementation must provide TBO procedures for non-FANS aircraft. Previous research has explored controller clearances, but any implementation must also provide procedures for aircraft requests. The work presented here aims to surface issues surrounding TBO communication procedures for non-FANS aircraft and for aircraft requesting deviations around weather. Three types of communication were explored: Voice, FANS, and ACARS,(Aircraft Communications Addressing and Reporting System). ACARS and FANS are datacom systems that differ in that FANS allows uplinked flight plans to be loaded into the Flight Management System (FMS), while ACARS delivers flight plans as text that must be entered manually via the Control Display Unit (CDU). Sixteen pilots (eight two-person flight decks) and four controllers participated in 32 20-minute scenarios that required the flight decks to navigate through convective weather as they approached their top of descents (TODs). Findings: The rate of non-conformance was higher than anticipated, with aircraft off path more than 20% of the time. Controllers did not differentiate between the ACARS and FANS datacom, and were mixed in their preference for Voice vs. datacom (ACARS and FANS). Pilots uniformly preferred Voice to datacom, particularly ACARS. Much of their dislike appears to result from the slow response times in the datacom conditions. As a result, participants frequently resorted to voice communication. These results imply that, before implementing TBO in environments where pilots make weather deviation requests, further research is needed to develop communication procedures that integrate voice and datacom.
Factors associated with voice disorders among teachers: a case-control study.
Giannini, Susana Pimentel Pinto; Latorre, Maria do Rosário Dias de Oliveira; Ferreira, Léslie Piccolotto
2013-01-01
We aimed at verifying an association between voice disorders/stress and loss of work ability among female teachers who work in São Paulo's public school system. This is a paired case- control study. The case group was composed offiteachers with alterations in speech and larynges assessments, and the control group was formed by teachers without alterations in these evaluations who work in the same schools. Both groups answered the following questionnaires: Conditions of Vocal Production-Teachers, Job Stress Scale, and Work Ability Index. The analysis was performed using the chi-square association test and logistic regression models with the purpose of estimating the association between independent variables and voice disorders. We found differences between the groups in relation to stress in the workplace under high demand, a situation that poses greater risks of adverse reactions to the workers' physical and mental health. Regarding the ability to work, the categories poor and moderate ability for work are associated with voice disorders, regardless of job stress factors, age, and the unsatisfactory acoustic properties of the classrooms. This study confirmed the association between voice disorders and job stress, as well as between voice disorders and loss of work ability.
NASA Technical Reports Server (NTRS)
Voorhees, J. W.; Bucher, N. M.
1983-01-01
The cockpit has been one of the most rapidly changing areas of new aircraft design over the past thirty years. In connection with these developments, a pilot can now be considered a decision maker/system manager as well as a vehicle controller. There is, however, a trend towards an information overload in the cockpit, and information processing problems begin to occur for the rotorcraft pilot. One approach to overcome the arising difficulties is based on the utilization of voice technology to improve the information transfer rate in the cockpit with respect to both input and output. Attention is given to the background of speech technology, the application of speech technology within the cockpit, voice interactive electronic warning system (VIEWS) simulation, and methodology. Information subsystems are considered along with a dynamic simulation study, and data collection.
Mechanics of human voice production and control
Zhang, Zhaoyan
2016-01-01
As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed. PMID:27794319
Mechanics of human voice production and control.
Zhang, Zhaoyan
2016-10-01
As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed.
Telephony-based voice pathology assessment using automated speech analysis.
Moran, Rosalyn J; Reilly, Richard B; de Chazal, Philip; Lacy, Peter D
2006-03-01
A system for remotely detecting vocal fold pathologies using telephone-quality speech is presented. The system uses a linear classifier, processing measurements of pitch perturbation, amplitude perturbation and harmonic-to-noise ratio derived from digitized speech recordings. Voice recordings from the Disordered Voice Database Model 4337 system were used to develop and validate the system. Results show that while a sustained phonation, recorded in a controlled environment, can be classified as normal or pathologic with accuracy of 89.1%, telephone-quality speech can be classified as normal or pathologic with an accuracy of 74.2%, using the same scheme. Amplitude perturbation features prove most robust for telephone-quality speech. The pathologic recordings were then subcategorized into four groups, comprising normal, neuromuscular pathologic, physical pathologic and mixed (neuromuscular with physical) pathologic. A separate classifier was developed for classifying the normal group from each pathologic subcategory. Results show that neuromuscular disorders could be detected remotely with an accuracy of 87%, physical abnormalities with an accuracy of 78% and mixed pathology voice with an accuracy of 61%. This study highlights the real possibility for remote detection and diagnosis of voice pathology.
The Effect of Hydration on the Voice Quality of Future Professional Vocal Performers.
van Wyk, Liezl; Cloete, Mariaan; Hattingh, Danel; van der Linde, Jeannie; Geertsema, Salome
2017-01-01
The application of systemic hydration as an instrument for optimal voice quality has been a common practice by several professional voice users over the years. Although the physiological action has been determined, the benefits on acoustic and perceptual characteristics are relatively unknown. The present study aimed to determine whether systemic hydration has beneficial outcomes on the voice quality of future professional voice users. A within-subject, pretest posttest design is applied to determine quantitative research results of female singing students between 18 and 32 years of age without a history of voice pathology. Acoustic and perceptual data were collected before and after a 2-hour singing rehearsal. The difference between the hypohydrated condition (controlled) and the hydrated condition (experimental) and the relationship between adequate hydration and acoustic and perceptual parameters of voice was then investigated. A statistical significant (P = 0.041) increase in jitter values were obtained for the hypohydrated condition. Increased maximum phonation time (MPT/z/) and higher maximum frequency for hydration indicated further statistical significant changes in voice quality (P = 0.028 and P = 0.015, respectively). Systemic hydration has positive outcomes on perceptual and acoustic parameters of voice quality for future professional singers. The singer's ability to sustain notes for longer and reach higher frequencies may reflect well in performances. Any positive change in voice quality may benefit the singer's occupational success and subsequently their social, emotional, and vocational well-being. More research evidence is needed to determine the parameters for implementing adequate hydration in vocal hygiene programs. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Mapping Phonetic Features for Voice-Driven Sound Synthesis
NASA Astrophysics Data System (ADS)
Janer, Jordi; Maestre, Esteban
In applications where the human voice controls the synthesis of musical instruments sounds, phonetics convey musical information that might be related to the sound of the imitated musical instrument. Our initial hypothesis is that phonetics are user- and instrument-dependent, but they remain constant for a single subject and instrument. We propose a user-adapted system, where mappings from voice features to synthesis parameters depend on how subjects sing musical articulations, i.e. note to note transitions. The system consists of two components. First, a voice signal segmentation module that automatically determines note-to-note transitions. Second, a classifier that determines the type of musical articulation for each transition based on a set of phonetic features. For validating our hypothesis, we run an experiment where subjects imitated real instrument recordings with their voice. Performance recordings consisted of short phrases of saxophone and violin performed in three grades of musical articulation labeled as: staccato, normal, legato. The results of a supervised training classifier (user-dependent) are compared to a classifier based on heuristic rules (user-independent). Finally, from the previous results we show how to control the articulation in a sample-concatenation synthesizer by selecting the most appropriate samples.
[Study on the application of pyroelectric infrared sensor to safety protection system].
Wang, Song-de; Zhang, Shuan-ji; Zhu, Xiao-long; Yang, Jie-hui
2006-11-01
Using the infrared ray of human body, which is received and magnified by pyroelectric infrared sensor to form a certain voltage control signal, and using the control signal to trigger a voice recording-reproducing circuit, a pyroelectric infrared detector voice device with auto-control function designed. The circuit adopted new pyroelectric infrared detector assembly and voice recording-reproducing assembly. When someone is present in the detectable range of the pyroelectric infrared detector, first, the pyroelectric infrared sensor will transform the incepted radiation energy to a electric signal, which is then magnified and compared by an inside circuit, and an output control signal, touches off the voice recording-reproducing assembly with the reproducer sending out a beforehand transcribed caution voice to wise the man who does not know well the surrounding condition that the frontage is a danger zone and should not be approched. With the design of integrated structures, the distance-warning device has the advantages of strong anti-jamming ability, low temperature resistance, working stability and use-convenience, and it can be suitably installed and used in several locations which may endanger person safety, such as substation, high voltage switch panel, electric transformer, etc.
Voice Tremor in Parkinson's Disease: An Acoustic Study.
Gillivan-Murphy, Patricia; Miller, Nick; Carding, Paul
2018-01-30
Voice tremor associated with Parkinson disease (PD) has not been characterized. Its relationship with voice disability and disease variables is unknown. This study aimed to evaluate voice tremor in people with PD (pwPD) and a matched control group using acoustic analysis, and to examine correlations with voice disability and disease variables. Acoustic voice tremor analysis was completed on 30 pwPD and 28 age-gender matched controls. Voice disability (Voice Handicap Index), and disease variables of disease duration, Activities of Daily Living (Unified Parkinson's Disease Rating Scale [UPDRS II]), and motor symptoms related to PD (UPDRS III) were examined for relationship with voice tremor measures. Voice tremor was detected acoustically in pwPD and controls with similar frequency. PwPD had a statistically significantly higher rate of amplitude tremor (Hz) than controls (P = 0.001). Rate of amplitude tremor was negatively and significantly correlated with UPDRS III total score (rho -0.509). For pwPD, the magnitude and periodicity of acoustic tremor was higher than for controls without statistical significance. The magnitude of frequency tremor (Mftr%) was positively and significantly correlated with disease duration (rho 0.463). PwPD had higher Voice Handicap Index total, functional, emotional, and physical subscale scores than matched controls (P < 0.001). Voice disability did not correlate significantly with acoustic voice tremor measures. Acoustic analysis enhances understanding of PD voice tremor characteristics, its pathophysiology, and its relationship with voice disability and disease symptomatology. Copyright © 2018 The Voice Foundation. All rights reserved.
29 CFR 1910.66 - Powered platforms for building maintenance.
Code of Federal Regulations, 2013 CFR
2013-07-01
... used to supply electrical power and/or control current for equipment or to provide voice communication... access to, and egress from, the equipment and sufficient space to conduct necessary maintenance of the... in use; and (vi) An effective two-way voice communication system shall be provided between the...
29 CFR 1910.66 - Powered platforms for building maintenance.
Code of Federal Regulations, 2014 CFR
2014-07-01
... used to supply electrical power and/or control current for equipment or to provide voice communication... access to, and egress from, the equipment and sufficient space to conduct necessary maintenance of the... in use; and (vi) An effective two-way voice communication system shall be provided between the...
Relationship Between Voice and Motor Disabilities of Parkinson's Disease.
Majdinasab, Fatemeh; Karkheiran, Siamak; Soltani, Majid; Moradi, Negin; Shahidi, Gholamali
2016-11-01
To evaluate voice of Iranian patients with Parkinson's disease (PD) and find any relationship between motor disabilities and acoustic voice parameters as speech motor components. We evaluated 27 Farsi-speaking PD patients and 21 age- and sex-matched healthy persons as control. Motor performance was assessed by the Unified Parkinson's Disease Rating Scale part III and Hoehn and Yahr rating scale in the "on" state. Acoustic voice evaluation, including fundamental frequency (f0), standard deviation of f0, minimum of f0, maximum of f0, shimmer, jitter, and harmonic to noise ratio, was done using the Praat software via /a/ prolongation. No difference was seen between the voice of the patients and the voice of the controls. f0 and its variation had a significant correlation with the duration of the disease, but did not have any relationships with the Unified Parkinson's Disease Rating Scale part III. Only limited relationship was observed between voice and motor disabilities. Tremor is an important main feature of PD that affects motor and phonation systems. Females had an older age at onset, more prolonged disease, and more severe motor disabilities (not statistically significant), but phonation disorders were more frequent in males and showed more relationship with severity of motor disabilities. Voice is affected by PD earlier than many other motor components and is more sensitive to disease progression. Tremor is the most effective part of PD that impacts voice. PD has more effect on voice of male versus female patients. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Acoustic-Perceptual Correlates of Voice in Indian Hindu Purohits.
Balasubramanium, Radish Kumar; Karuppali, Sudhin; Bajaj, Gagan; Shastry, Anuradha; Bhat, Jayashree
2018-05-16
Purohit, in the Indian religious context (Hindu), means priest. Purohits are professional voice users who use their voice while performing regular worships and rituals in temples and homes. Any deviations in their voice can have an impact on their profession. Hence, there is a need to investigate the voice characteristics of purohits using perceptual and acoustic analyses. A total of 44 men in the age range of 18-30 years were divided into two groups. Group 1 consisted of purohits who were trained since childhood (n = 22) in the traditional gurukul system. Group 2 (n = 22) consisted of normal controls. Phonation and spontaneous speech samples were obtained from all the participants at a comfortable pitch and loudness. The Praat software (Version 5.3.31) and the Speech tool were used to analyze the traditional acoustic and cepstral parameters, respectively, whereas GRBAS was used to perceptually evaluate the voice. Results of the independent t test revealed no significant differences across the groups for perceptual and traditional acoustic measures except for intensity, which was significantly higher in purohits' voices at P < 0.05. However, the cepstral values (cepstral peak prominence and smoothened cepstral peak prominence) were much higher in purohits than in controls at P < 0.05 CONCLUSIONS: Results revealed that purohits did not exhibit vocal deviations as analyzed through perceptual and acoustic parameters. In contrast, cepstral measures were higher in Indian Hindu purohits in comparison with normal controls, suggestive of a higher degree of harmonic organization in purohits. Further studies are required to analyze the physiological correlates of increased cepstral measures in purohits' voices. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Intra-oral pressure-based voicing control of electrolaryngeal speech with intra-oral vibrator.
Takahashi, Hirokazu; Nakao, Masayuki; Kikuchi, Yataro; Kaga, Kimitaka
2008-07-01
In normal speech, coordinated activities of intrinsic laryngeal muscles suspend a glottal sound at utterance of voiceless consonants, automatically realizing a voicing control. In electrolaryngeal speech, however, the lack of voicing control is one of the causes of unclear voice, voiceless consonants tending to be misheard as the corresponding voiced consonants. In the present work, we developed an intra-oral vibrator with an intra-oral pressure sensor that detected utterance of voiceless phonemes during the intra-oral electrolaryngeal speech, and demonstrated that an intra-oral pressure-based voicing control could improve the intelligibility of the speech. The test voices were obtained from one electrolaryngeal speaker and one normal speaker. We first investigated on the speech analysis software how a voice onset time (VOT) and first formant (F1) transition of the test consonant-vowel syllables contributed to voiceless/voiced contrasts, and developed an adequate voicing control strategy. We then compared the intelligibility of consonant-vowel syllables among the intra-oral electrolaryngeal speech with and without online voicing control. The increase of intra-oral pressure, typically with a peak ranging from 10 to 50 gf/cm2, could reliably identify utterance of voiceless consonants. The speech analysis and intelligibility test then demonstrated that a short VOT caused the misidentification of the voiced consonants due to a clear F1 transition. Finally, taking these results together, the online voicing control, which suspended the prosthetic tone while the intra-oral pressure exceeded 2.5 gf/cm2 and during the 35 milliseconds that followed, proved efficient to improve the voiceless/voiced contrast.
Burnett, Greg C [Livermore, CA; Holzrichter, John F [Berkeley, CA; Ng, Lawrence C [Danville, CA
2006-08-08
The present invention is a system and method for characterizing human (or animate) speech voiced excitation functions and acoustic signals, for removing unwanted acoustic noise which often occurs when a speaker uses a microphone in common environments, and for synthesizing personalized or modified human (or other animate) speech upon command from a controller. A low power EM sensor is used to detect the motions of windpipe tissues in the glottal region of the human speech system before, during, and after voiced speech is produced by a user. From these tissue motion measurements, a voiced excitation function can be derived. Further, the excitation function provides speech production information to enhance noise removal from human speech and it enables accurate transfer functions of speech to be obtained. Previously stored excitation and transfer functions can be used for synthesizing personalized or modified human speech. Configurations of EM sensor and acoustic microphone systems are described to enhance noise cancellation and to enable multiple articulator measurements.
Burnett, Greg C.; Holzrichter, John F.; Ng, Lawrence C.
2004-03-23
The present invention is a system and method for characterizing human (or animate) speech voiced excitation functions and acoustic signals, for removing unwanted acoustic noise which often occurs when a speaker uses a microphone in common environments, and for synthesizing personalized or modified human (or other animate) speech upon command from a controller. A low power EM sensor is used to detect the motions of windpipe tissues in the glottal region of the human speech system before, during, and after voiced speech is produced by a user. From these tissue motion measurements, a voiced excitation function can be derived. Further, the excitation function provides speech production information to enhance noise removal from human speech and it enables accurate transfer functions of speech to be obtained. Previously stored excitation and transfer functions can be used for synthesizing personalized or modified human speech. Configurations of EM sensor and acoustic microphone systems are described to enhance noise cancellation and to enable multiple articulator measurements.
Burnett, Greg C.; Holzrichter, John F.; Ng, Lawrence C.
2006-02-14
The present invention is a system and method for characterizing human (or animate) speech voiced excitation functions and acoustic signals, for removing unwanted acoustic noise which often occurs when a speaker uses a microphone in common environments, and for synthesizing personalized or modified human (or other animate) speech upon command from a controller. A low power EM sensor is used to detect the motions of windpipe tissues in the glottal region of the human speech system before, during, and after voiced speech is produced by a user. From these tissue motion measurements, a voiced excitation function can be derived. Further, the excitation function provides speech production information to enhance noise removal from human speech and it enables accurate transfer functions of speech to be obtained. Previously stored excitation and transfer functions can be used for synthesizing personalized or modified human speech. Configurations of EM sensor and acoustic microphone systems are described to enhance noise cancellation and to enable multiple articulator measurements.
Burnett, Greg C.; Holzrichter, John F.; Ng, Lawrence C.
2006-04-25
The present invention is a system and method for characterizing human (or animate) speech voiced excitation functions and acoustic signals, for removing unwanted acoustic noise which often occurs when a speaker uses a microphone in common environments, and for synthesizing personalized or modified human (or other animate) speech upon command from a controller. A low power EM sensor is used to detect the motions of windpipe tissues in the glottal region of the human speech system before, during, and after voiced speech is produced by a user. From these tissue motion measurements, a voiced excitation function can be derived. Further, the excitation function provides speech production information to enhance noise removal from human speech and it enables accurate transfer functions of speech to be obtained. Previously stored excitation and transfer functions can be used for synthesizing personalized or modified human speech. Configurations of EM sensor and acoustic microphone systems are described to enhance noise cancellation and to enable multiple articulator measurements.
2015-09-01
Gateway 2 4. Voice Packet Flow: SIP , Session Description Protocol (SDP), and RTP 3 5. Voice Data Analysis 5 6. Call Analysis 6 7. Call Metrics 6...analysis processing is designed for a general VoIP system architecture based on Session Initiation Protocol ( SIP ) for negotiating call sessions and...employs Skinny Client Control Protocol for network communication between the phone and the local CallManager (e.g., for each dialed digit), SIP
Research on Precision Tracking on Fast Steering Mirror and Control Strategy
NASA Astrophysics Data System (ADS)
Di, Lin; Yi-ming, Wu; Fan, Zhu
2018-01-01
Fast steering mirror is a device used for controlling the beam direction precisely. Due to the short travel of the push-pull FSM, a compound fast steering mirror system driven by both limited-angle voice coil motor and push-pull FSM together is proposed. In the compound FSM system, limited-angle voice coil motor quickly swings at wide angle, while the push-pull FSM do high frequency movement in a small range, which provides the system with the high bandwidth and long travel. In the control strategy, the method of combining feed-forward control in Kalman filtering with auto-disturbance rejection control is used to improve trajectory tracking accuracy. The simulation result shows that tracking accuracy measured by the compound method can be improved by more than 5 times than that of the conventional PID.
NASA Technical Reports Server (NTRS)
Huber, W. C.
1986-01-01
Voice synthesizer tells what key is about to be depressed. Verbal feedback useful for blind operators or where dim light prevents sighted operator from seeing keyboard. Also used where operator is busy observing other things while keying data into control system. Used as training aid for touch typing, and to train blind operators to use both standard and braille keyboards. Concept adapted to such equipment as typewriters, computers, calculators, telephones, cash registers, and on/off controls.
Voice intelligibility in satellite mobile communications
NASA Technical Reports Server (NTRS)
Wishna, S.
1973-01-01
An amplitude control technique is reported that equalizes low level phonemes in a satellite narrow band FM voice communication system over channels having low carrier to noise ratios. This method presents at the transmitter equal amplitude phonemes so that the low level phonemes, when they are transmitted over the noisey channel, are above the noise and contribute to output intelligibility. The amplitude control technique provides also for squelching of noise when speech is not being transmitted.
Voice Disorders in Teacher Students-A Prospective Study and a Randomized Controlled Trial.
Ohlsson, Ann-Christine; Andersson, Eva M; Södersten, Maria; Simberg, Susanna; Claesson, Silwa; Barregård, Lars
2016-11-01
Teachers are at risk of developing voice disorders, but longitudinal studies on voice problems among teachers are lacking. The aim of this randomized trial was to investigate long-term effects of voice education for teacher students with mild voice problems. In addition, vocal health was examined prospectively in a group of students without voice problems. First-semester students answered three questionnaires: one about background factors, one about voice symptoms (Screen6), and the Voice Handicap Index. Students with voice problems according to the questionnaire results were randomized to a voice training group or a control group. At follow-up in the sixth semester, all students answered Screen6 again together with four questions about factors that could have affected vocal health during their teacher education. The training group and the control group also answered the Voice Handicap Index a second time. At follow-up, 400 students remained in the study: 27 in the training group, 54 in the control group, and 319 without voice problems at baseline. Voice problems had decreased somewhat more in the training group than in the control group, but the difference was not statistically significant (P = 0.1). However, subgroup analyses showed significantly larger improvement among the students in the group with complete participation in the training program compared with the group with incomplete participation. Of the 319 students without voice problems at baseline, 14% had developed voice problems. Voice problems often develop in teacher students. Despite extensive dropout, our results support the hypothesis that voice education for teacher students has a preventive effect. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
A pneumatic Bionic Voice prosthesis-Pre-clinical trials of controlling the voice onset and offset.
Ahmadi, Farzaneh; Noorian, Farzad; Novakovic, Daniel; van Schaik, André
2018-01-01
Despite emergent progress in many fields of bionics, a functional Bionic Voice prosthesis for laryngectomy patients (larynx amputees) has not yet been achieved, leading to a lifetime of vocal disability for these patients. This study introduces a novel framework of Pneumatic Bionic Voice Prostheses as an electronic adaptation of the Pneumatic Artificial Larynx (PAL) device. The PAL is a non-invasive mechanical voice source, driven exclusively by respiration with an exceptionally high voice quality, comparable to the existing gold standard of Tracheoesophageal (TE) voice prosthesis. Following PAL design closely as the reference, Pneumatic Bionic Voice Prostheses seem to have a strong potential to substitute the existing gold standard by generating a similar voice quality while remaining non-invasive and non-surgical. This paper designs the first Pneumatic Bionic Voice prosthesis and evaluates its onset and offset control against the PAL device through pre-clinical trials on one laryngectomy patient. The evaluation on a database of more than five hours of continuous/isolated speech recordings shows a close match between the onset/offset control of the Pneumatic Bionic Voice and the PAL with an accuracy of 98.45 ±0.54%. When implemented in real-time, the Pneumatic Bionic Voice prosthesis controller has an average onset/offset delay of 10 milliseconds compared to the PAL. Hence it addresses a major disadvantage of previous electronic voice prostheses, including myoelectric Bionic Voice, in meeting the short time-frames of controlling the onset/offset of the voice in continuous speech.
A pneumatic Bionic Voice prosthesis—Pre-clinical trials of controlling the voice onset and offset
Noorian, Farzad; Novakovic, Daniel; van Schaik, André
2018-01-01
Despite emergent progress in many fields of bionics, a functional Bionic Voice prosthesis for laryngectomy patients (larynx amputees) has not yet been achieved, leading to a lifetime of vocal disability for these patients. This study introduces a novel framework of Pneumatic Bionic Voice Prostheses as an electronic adaptation of the Pneumatic Artificial Larynx (PAL) device. The PAL is a non-invasive mechanical voice source, driven exclusively by respiration with an exceptionally high voice quality, comparable to the existing gold standard of Tracheoesophageal (TE) voice prosthesis. Following PAL design closely as the reference, Pneumatic Bionic Voice Prostheses seem to have a strong potential to substitute the existing gold standard by generating a similar voice quality while remaining non-invasive and non-surgical. This paper designs the first Pneumatic Bionic Voice prosthesis and evaluates its onset and offset control against the PAL device through pre-clinical trials on one laryngectomy patient. The evaluation on a database of more than five hours of continuous/isolated speech recordings shows a close match between the onset/offset control of the Pneumatic Bionic Voice and the PAL with an accuracy of 98.45 ±0.54%. When implemented in real-time, the Pneumatic Bionic Voice prosthesis controller has an average onset/offset delay of 10 milliseconds compared to the PAL. Hence it addresses a major disadvantage of previous electronic voice prostheses, including myoelectric Bionic Voice, in meeting the short time-frames of controlling the onset/offset of the voice in continuous speech. PMID:29466455
ATC/pilot voice communications: A survey of the literature
NASA Astrophysics Data System (ADS)
Prinzo, O. Veronika; Britton, Thomas W.
1993-11-01
The first radio-equipped control tower in the United States opened at the Cleveland Municipal Airport in 1930. From that time to the present, voice radio communications have played a primary role in air safety. Verbal communications in air traffic control (ATC) operations have been frequently cited as causal factors in operational errors and pilot deviations in the FAA Operational Error and Deviation System, the NASA Aviation Safety Reporting System (ASRS), and reports derived from government sponsored research projects. Collectively, the data provided by these programs indicate that communications constitute a significant problem for pilots and controllers. Although the communications problem was well known the research literature was fragmented, making it difficult to appreciate the various types of verbal communications problems that existed and their unique influence on the quality of ATC/pilot communications. This is a survey of the voice radio communications literature. The 43 reports in the review represent survey data, field studies, laboratory studies, narrative reports, and reviews. The survey topics pertain to communications taxonomies, acoustical correlates and cognitive/psycholinguistic perspectives. Communications taxonomies were used to identify the frequency and types of information that constitute routine communications, as well as those communications involved in operational errors, pilot deviations, and other safety-related events. Acoustical correlate methodologies identified some qualities of a speaker's voice, such as loudness, pitch, and speech rate, which might be used potentially to monitor stress, mental workload, and other forms of psychological or physiological factors that affect performance. Cognitive/psycho-linguistic research offered an information processing perspective for understanding how pilots' and controllers' memory and language comprehension processes affect their ability to communicate effectively with one another. This analysis of the ATC/pilot voice radio communications literature was performed to provide an organized summary for the systematic study of interactive communications between controllers and pilots. Recommendations are given for new research initiatives, communications-based instructional materials, and human factors applications for new communications systems.
Systems concept for speech technology application in general aviation
NASA Technical Reports Server (NTRS)
North, R. A.; Bergeron, H.
1984-01-01
The application potential of voice recognition and synthesis circuits for general aviation, single-pilot IFR (SPIFR) situations is examined. The viewpoint of the pilot was central to workload analyses and assessment of the effectiveness of the voice systems. A twin-engine, high performance general aviation aircraft on a cross-country fixed route was employed as the study model. No actual control movements were considered and other possible functions were scored by three IFR-rated instructors. The SPIFR was concluded helpful in alleviating visual and manual workloads during take-off, approach and landing, particularly for data retrieval and entry tasks. Voice synthesis was an aid in alerting a pilot to in-flight problems. It is expected that usable systems will be available within 5 yr.
Muthukumar, P; Balasubramaniam, P; Ratnavelu, K
2017-07-26
This paper proposes a generalized robust synchronization method for different dimensional fractional order dynamical systems with mismatched fractional derivatives in the presence of function uncertainty and external disturbance by a designing sliding mode controller. Based on the proposed theory of generalized robust synchronization criterion, a novel audio cryptosystem is proposed for sending or sharing voice messages secretly via insecure channel. Numerical examples are given to verify the potency of the proposed theories. Copyright © 2017 ISA. Published by Elsevier Ltd. All rights reserved.
NASA Technical Reports Server (NTRS)
Parker, J. F., Jr.; Duffy, J. W.
1982-01-01
Key problems in single pilot instrument flight operations are in the management of flight data and the processing of cockpit information during conditions of heavy workload. A flight data console was developed to allow simulation of a digital data link to replace the current voice communications stem used in air traffic control. This is a human factors evaluation of a data link communications system to determine how such a system might reduce cockpit workload, improve flight proficiency, and be accepted by general aviation pilots. The need for a voice channel as backup to a digital link is examined. The evaluations cover both airport terminal area operations and full mission instrument flight. Results show that general aviation pilots operate well with a digital data link communications system. The findings indicate that a data link system for pilot/ATC communications, with a backup voice channel, is well accepted by general aviation pilots and is considered to be safer, more efficient, and result in less workload than the current voice system.
Design and development of a Space Station proximity operations research and development mockup
NASA Technical Reports Server (NTRS)
Haines, Richard F.
1986-01-01
Proximity operations (Prox-Ops) on-orbit refers to all activities taking place within one km of the Space Station. Designing a Prox-Ops control station calls for a comprehensive systems approach which takes into account structural constraints, orbital dynamics including approach/departure flight paths, myriad human factors and other topics. This paper describes a reconfigurable full-scale mock-up of a Prox-Ops station constructed at Ames incorporating an array of windows (with dynamic star field, target vehicle(s), and head-up symbology), head-down perspective display of manned and unmanned vehicles, voice- actuated 'electronic checklist', computer-generated voice system, expert system (to help diagnose subsystem malfunctions), and other displays and controls. The facility is used for demonstrations of selected Prox-Ops approach scenarios, human factors research (work-load assessment, determining external vision envelope requirements, head-down and head-up symbology design, voice synthesis and recognition research, etc.) and development of engineering design guidelines for future module interiors.
Advanced Electronic Technology
1977-11-15
Electronics 15 III. Materials Research 15 TV. Microelectronics 16 V. Surface- Wave Technology 16 DATA SYSTEMS DIVISION 2 INTRODUCTION This...Processing Digital Voice Processing Packet Speech Wideband Integrated Voice/Data Technology Radar Signal Processing Technology Nuclear Safety Designs...facilities make it possible to track the status of these jobs, retrieve their job control language listings, and direct a copy of printed or punched
Designing of Intelligent Multilingual Patient Reported Outcome System (IMPROS)
Pourasghar, Faramarz; Partovi, Yeganeh
2015-01-01
Background: By self-reporting outcome procedure the patients themselves record disease symptoms outside medical centers and then report them to medical staff in specific periods of time. One of the self-reporting methods is the application of interactive voice response (IVR), in which some pre-designed questions in the form of voice tracks would be played and then the caller responses the questions by pressing phone’s keypad bottoms. Aim: The present research explains the main framework of such system designing according to IVR technology that is for the first time designed and administered in Iran. Methods: Interactive Voice Response system was composed by two main parts of hardware and software. Hardware section includes one or several digital phone lines, a modem card with voice playing capability and a PC. IVR software on the other hand, acts as an intelligent control center, records call information and controls incoming data. Results: One of the main features of the system is its capability to be administered in common PCs, utilizing simple and cheap modems, high speed to take responses and it’s appropriateness to low literate patients. The system is applicable for monitoring chronic diseases, cancer and also in psychological diseases and can be suitable for taking care of elders and Children who require long term cares. Other features include user-friendly, decrease in direct and indirect costs of disease treatment and enjoying from high level of security to access patients’ profiles. Conclusions: Intelligent multilingual patient reported outcome system (IMPROS) by controlling diseases gives the opportunity to patients to have more participation during treatment and it improves mutual interaction between patient and medical staff. Moreover it increases the quality of medical services, Additional to empowering patients and their followers. PMID:26635441
Research on realization scheme of interactive voice response (IVR) system
NASA Astrophysics Data System (ADS)
Jin, Xin; Zhu, Guangxi
2003-12-01
In this paper, a novel interactive voice response (IVR) system is proposed, which is apparently different from the traditional. Using software operation and network control, the IVR system is presented which only depends on software in the server in which the system lies and the hardware in network terminals on user side, such as gateway (GW), personal gateway (PG), PC and so on. The system transmits the audio using real time protocol (RTP) protocol via internet to the network terminals and controls flow using finite state machine (FSM) stimulated by H.245 massages sent from user side and the system control factors. Being compared with other existing schemes, this IVR system results in several advantages, such as greatly saving the system cost, fully utilizing the existing network resources and enhancing the flexibility. The system is capable to be put in any service server anywhere in the Internet and even fits for the wireless applications based on packet switched communication. The IVR system has been put into reality and passed the system test.
A flight investigation of simulated data link communications during single-pilot IFR flight
NASA Technical Reports Server (NTRS)
Parker, J. F.; Duffy, J. W.; Christensen, D. G.
1983-01-01
A Flight Data Console (FDC) was developed to allow simulation of a digital communications link to replace the current voice communication system used in air traffic control (ATC). The voice system requires manipulation of radio equipment, read-back of clearances, and mental storage of critical information items, all contributing to high workload, particularly during single-pilot operations. This was an inflight study to determine how a digital communications system might reduce cockpit workload, improve flight proficiency, and be accepted by general aviation pilots. Results show that instrument flight, including approach and landing, can be accomplished quite effectively using a digital data link system for ATC communications. All pilots expressed a need for a back-up voice channel. When included, this channel was used sparingly and principally to confirm any item of information about which there might be uncertainty.
Watts, Christopher; Murphy, Jessica; Barnes-Burroughs, Kathryn
2003-06-01
At a physiological level, the act of singing involves control and coordination of several systems involved in the production of sound, including respiration, phonation, resonance, and afferent systems used to monitor production. The ability to produce a melodious singing voice (eg, in tune with accurate pitch) is dependent on control over these motor and sensory systems. To test this position, trained singers and untrained subjects with and without expressed singing talent were asked to match pitches of target pure tones. The ability to match pitch reflected the ability to accurately integrate sensory perception with motor planning and execution. Pitch-matching accuracy was measured at the onset of phonation (prephonatory set) before external feedback could be utilized to adjust the voiced source, during phonation when external auditory feedback could be utilized, and during phonation when external auditory feedback was masked. Results revealed trained singers and untrained subjects with singing talent were no different in their pitch-matching abilities when measured before or after external feedback could be utilized. The untrained subjects with singing talent were also significantly more accurate than the trained singers when external auditory feedback was masked. Both groups were significantly more accurate than the untrained subjects without singing talent.
Effects of Voice Harmonic Complexity on ERP Responses to Pitch-Shifted Auditory Feedback
Behroozmand, Roozbeh; Korzyukov, Oleg; Larson, Charles R.
2011-01-01
Objective The present study investigated the neural mechanisms of voice pitch control for different levels of harmonic complexity in the auditory feedback. Methods Event-related potentials (ERPs) were recorded in response to +200 cents pitch perturbations in the auditory feedback of self-produced natural human vocalizations, complex and pure tone stimuli during active vocalization and passive listening conditions. Results During active vocal production, ERP amplitudes were largest in response to pitch shifts in the natural voice, moderately large for non-voice complex stimuli and smallest for the pure tones. However, during passive listening, neural responses were equally large for pitch shifts in voice and non-voice complex stimuli but still larger than that for pure tones. Conclusions These findings suggest that pitch change detection is facilitated for spectrally rich sounds such as natural human voice and non-voice complex stimuli compared with pure tones. Vocalization-induced increase in neural responses for voice feedback suggests that sensory processing of naturally-produced complex sounds such as human voice is enhanced by means of motor-driven mechanisms (e.g. efference copies) during vocal production. Significance This enhancement may enable the audio-vocal system to more effectively detect and correct for vocal errors in the feedback of natural human vocalizations to maintain an intended vocal output for speaking. PMID:21719346
Emirati Teachers' Perceptions of Voice Handicap.
Natour, Yaser S; Sartawi, Abdealaziz M; Al Muhairy, Ousha; Efthymiou, Effie; Marie, Basem S
2016-05-01
The purpose of the study was to explore Emirati teachers' perceptions of voice handicap and to analyze their acoustic characteristics to determine whether acoustic measures of teachers' voice would verify their perceptions of voice handicap. Sixty-six Emirati school teachers (33 men and 33 women), with different years of teaching experience and age, and 100 control participants (50 men and 50 women) underwent vocal assessment that included the Voice Handicap Index (VHI-Arab) and acoustic measures (F0, jitter%, shimmer%, signal to noise ratio [SNR]). Significant differences between the teachers' group scores and the control group scores on the following subscales of VHI-Arab: physical (P = 0.006), emotional (P = 0.004), and total score of the test (P = 0.002). No significant differences were found among teachers in the three VHI subscales, and the total score regarding gender (functional P = 0.307; physical P = 0.341; emotional P = 0.126; and total P = 0.184), age (functional P = 0.972; physical P = 0.525; emotional P = 0.772; and total P = 0.848), and years of teaching experience (functional P = 0.319; physical P = 0.619; emotional P = 0.926; and total P = 0.638). The significant differences between the teacher's group and the control group in three acoustic measures: F0 (P = 0.000), shimmer% (P = 0.000), and SNR (P = 0.000) were further investigated. Significant differences were found among female and male teachers in F0 (P = 0.00) and SNR (P = 0.007). As for teachers' age, significant differences were found in SNR (P = 0.028). Teachers' years of experience did not show significant differences in any of the acoustic measures. Teachers have a higher perception of voice handicap. However, they were able to produce better voice quality than control participants were, as expressed in better SNRs. This might have been caused either by manipulation of vocal properties or abusive overloading the vocal system to produce a procedurally acceptable voice quality. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
The Voice as Computer Interface: A Look at Tomorrow's Technologies.
ERIC Educational Resources Information Center
Lange, Holley R.
1991-01-01
Discussion of voice as the communications device for computer-human interaction focuses on voice recognition systems for use within a library environment. Voice technologies are described, including voice response and voice recognition; examples of voice systems in use in libraries are examined; and further possibilities, including use with…
A miniaturized digital telemetry system for physiological data transmission
NASA Technical Reports Server (NTRS)
Portnoy, W. M.; Stotts, L. J.
1978-01-01
A physiological date telemetry system, consisting basically of a portable unit and a ground base station was designed, built, and tested. The portable unit to be worn by the subject is composed of a single crystal controlled transmitter with AM transmission of digital data and narrowband FM transmission of voice; a crystal controlled FM receiver; thirteen input channels follwed by a PCM encoder (three of these channels are designed for ECG data); a calibration unit; and a transponder control system. The ground base station consists of a standard telemetry reciever, a decoder, and an FM transmitter for transmission of voice and transponder signals to the portable unit. The ground base station has complete control of power to all subsystems in the portable unit. The phase-locked loop circuit which is used to decode the data, remains in operation even when the signal from the portable unit is interrupted.
MITRE-Bedford: Description of the ALEMBIC System as Used for MUC-4
1992-01-01
of control, and its corresponding encoding in SGML, consider the first paragraph of message TST2- MUC4 -0048 : SALVADORAN PRESIDENT-ELECT ALFREDO...markup simply encodes th e templates that the system has produced, e .g . , <p><template> <slotname>0 . MESSAGE :ID</slotname> <slotval>TST2- MUC4 -0048...verb in TST2- MUC4 -0048), this includes the active voice (e .g ., "Cristiani . condemned the terrorist killing"), the passive voice, and ancillary forms
Military and Government Applications of Human-Machine Communication by Voice
NASA Astrophysics Data System (ADS)
Weinstein, Clifford J.
1995-10-01
This paper describes a range of opportunities for military and government applications of human-machine communication by voice, based on visits and contacts with numerous user organizations in the United States. The applications include some that appear to be feasible by careful integration of current state-of-the-art technology and others that will require a varying mix of advances in speech technology and in integration of the technology into applications environments. Applications that are described include (1) speech recognition and synthesis for mobile command and control; (2) speech processing for a portable multifunction soldier's computer; (3) speech- and language-based technology for naval combat team tactical training; (4) speech technology for command and control on a carrier flight deck; (5) control of auxiliary systems, and alert and warning generation, in fighter aircraft and helicopters; and (6) voice check-in, report entry, and communication for law enforcement agents or special forces. A phased approach for transfer of the technology into applications is advocated, where integration of applications systems is pursued in parallel with advanced research to meet future needs.
Szabo Portela, Annika; Granqvist, Svante; Ternström, Sten; Södersten, Maria
2018-01-01
This study aimed to assess vocal behavior in women with voice-intensive occupations to investigate differences between patients and controls and between work and leisure conditions with environmental noise level as an experimental factor. Patients with work-related voice disorders, 10 with phonasthenia and 10 with vocal nodules, were matched regarding age, profession, and workplace with 20 vocally healthy colleagues. The sound pressure level of environmental noise and the speakers' voice, fundamental frequency, and phonation ratio were registered from morning to night during 1 week with a voice accumulator. Voice data were assessed in low (≤55 dBA), moderate, and high (>70 dBA) environmental noise levels. The average environmental noise level was significantly higher during the work condition for patients with vocal nodules (73.9 dBA) and their controls (73.0 dBA) compared with patients with phonasthenia (68.3 dBA) and their controls (67.1 dBA). The average voice level and the fundamental frequency were also significantly higher during work for the patients with vocal nodules and their controls. During the leisure condition, there were no significant differences in average noise and voice level nor fundamental frequency between the groups. The patients with vocal nodules and their controls spent significantly more time and used their voices significantly more in high-environmental noise levels. High noise levels during work and demands from the occupation impact vocal behavior. Thus, assessment of voice ergonomics should be part of the work environmental management. To reduce environmental noise levels is important to improve voice ergonomic conditions in communication-intensive and vocally demanding workplaces. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Noise Source Visualization Using a Digital Voice Recorder and Low-Cost Sensors
Cho, Yong Thung
2018-01-01
Accurate sound visualization of noise sources is required for optimal noise control. Typically, noise measurement systems require microphones, an analog-digital converter, cables, a data acquisition system, etc., which may not be affordable for potential users. Also, many such systems are not highly portable and may not be convenient for travel. Handheld personal electronic devices such as smartphones and digital voice recorders with relatively lower costs and higher performance have become widely available recently. Even though such devices are highly portable, directly implementing them for noise measurement may lead to erroneous results since such equipment was originally designed for voice recording. In this study, external microphones were connected to a digital voice recorder to conduct measurements and the input received was processed for noise visualization. In this way, a low cost, compact sound visualization system was designed and introduced to visualize two actual noise sources for verification with different characteristics: an enclosed loud speaker and a small air compressor. Reasonable accuracy of noise visualization for these two sources was shown over a relatively wide frequency range. This very affordable and compact sound visualization system can be used for many actual noise visualization applications in addition to educational purposes. PMID:29614038
NASA Technical Reports Server (NTRS)
Hymer, R. L.
1970-01-01
System provides automatic volume control for an audio amplifier or a voice communication system without introducing noise surges during pauses in the input, and without losing the initial signal when the input resumes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tuomi, Lisa, E-mail: lisa.tuomi@vgregion.se; Andréll, Paulin; Finizia, Caterina
Background: Patients treated with radiation therapy for laryngeal cancer often experience voice problems. The aim of this randomized controlled trial was to assess the efficacy of voice rehabilitation for laryngeal cancer patients after having undergone radiation therapy and to investigate whether differences between different tumor localizations with regard to rehabilitation outcomes exist. Methods and Materials: Sixty-nine male patients irradiated for laryngeal cancer participated. Voice recordings and self-assessments of communicative dysfunction were performed 1 and 6 months after radiation therapy. Thirty-three patients were randomized to structured voice rehabilitation with a speech-language pathologist and 36 to a control group. Furthermore, comparisons withmore » 23 healthy control individuals were made. Acoustic analyses were performed for all patients, including the healthy control individuals. The Swedish version of the Self Evaluation of Communication Experiences after Laryngeal Cancer and self-ratings of voice function were used to assess vocal and communicative function. Results: The patients who received vocal rehabilitation experienced improved self-rated vocal function after rehabilitation. Patients with supraglottic tumors who received voice rehabilitation had statistically significant improvements in voice quality and self-rated vocal function, whereas the control group did not. Conclusion: Voice rehabilitation for male patients with laryngeal cancer is efficacious regarding patient-reported outcome measurements. The patients experienced better voice function after rehabilitation. Patients with supraglottic tumors also showed an improvement in terms of acoustic voice outcomes. Rehabilitation with a speech-language pathologist is recommended for laryngeal cancer patients after radiation therapy, particularly for patients with supraglottic tumors.« less
Using Ambulatory Voice Monitoring to Investigate Common Voice Disorders: Research Update
Mehta, Daryush D.; Van Stan, Jarrad H.; Zañartu, Matías; Ghassemi, Marzyeh; Guttag, John V.; Espinoza, Víctor M.; Cortés, Juan P.; Cheyne, Harold A.; Hillman, Robert E.
2015-01-01
Many common voice disorders are chronic or recurring conditions that are likely to result from inefficient and/or abusive patterns of vocal behavior, referred to as vocal hyperfunction. The clinical management of hyperfunctional voice disorders would be greatly enhanced by the ability to monitor and quantify detrimental vocal behaviors during an individual’s activities of daily life. This paper provides an update on ongoing work that uses a miniature accelerometer on the neck surface below the larynx to collect a large set of ambulatory data on patients with hyperfunctional voice disorders (before and after treatment) and matched-control subjects. Three types of analysis approaches are being employed in an effort to identify the best set of measures for differentiating among hyperfunctional and normal patterns of vocal behavior: (1) ambulatory measures of voice use that include vocal dose and voice quality correlates, (2) aerodynamic measures based on glottal airflow estimates extracted from the accelerometer signal using subject-specific vocal system models, and (3) classification based on machine learning and pattern recognition approaches that have been used successfully in analyzing long-term recordings of other physiological signals. Preliminary results demonstrate the potential for ambulatory voice monitoring to improve the diagnosis and treatment of common hyperfunctional voice disorders. PMID:26528472
Crossmodal plasticity in the fusiform gyrus of late blind individuals during voice recognition.
Hölig, Cordula; Föcker, Julia; Best, Anna; Röder, Brigitte; Büchel, Christian
2014-12-01
Blind individuals are trained in identifying other people through voices. In congenitally blind adults the anterior fusiform gyrus has been shown to be active during voice recognition. Such crossmodal changes have been associated with a superiority of blind adults in voice perception. The key question of the present functional magnetic resonance imaging (fMRI) study was whether visual deprivation that occurs in adulthood is followed by similar adaptive changes of the voice identification system. Late blind individuals and matched sighted participants were tested in a priming paradigm, in which two voice stimuli were subsequently presented. The prime (S1) and the target (S2) were either from the same speaker (person-congruent voices) or from two different speakers (person-incongruent voices). Participants had to classify the S2 as either coming from an old or a young person. Only in late blind but not in matched sighted controls, the activation in the anterior fusiform gyrus was modulated by voice identity: late blind volunteers showed an increase of the BOLD signal in response to person-incongruent compared with person-congruent trials. These results suggest that the fusiform gyrus adapts to input of a new modality even in the mature brain and thus demonstrate an adult type of crossmodal plasticity. Copyright © 2014 Elsevier Inc. All rights reserved.
Voice Based City Panic Button System
NASA Astrophysics Data System (ADS)
Febriansyah; Zainuddin, Zahir; Bachtiar Nappu, M.
2018-03-01
The development of voice activated panic button application aims to design faster early notification of hazardous condition in community to the nearest police by using speech as the detector where the current application still applies touch-combination on screen and use coordination of orders from control center then the early notification still takes longer time. The method used in this research was by using voice recognition as the user voice detection and haversine formula for the comparison of closest distance between the user and the police. This research was equipped with auto sms, which sent notification to the victim’s relatives, that was also integrated with Google Maps application (GMaps) as the map to the victim’s location. The results show that voice registration on the application reaches 100%, incident detection using speech recognition while the application is running is 94.67% in average, and the auto sms to the victim relatives reaches 100%.
Simpler ISS Flight Control Communications and Log Keeping via Social Tools and Techniques
NASA Technical Reports Server (NTRS)
Scott, David W.; Cowart, Hugh; Stevens, Dan
2012-01-01
The heart of flight operations control involves a) communicating effectively in real time with other controllers in the room and/or in remote locations and b) tracking significant events, decisions, and rationale to support the next set of decisions, provide a thorough shift handover, and troubleshoot/improve operations. International Space Station (ISS) flight controllers speak with each other via multiple voice circuits or loops, each with a particular purpose and constituency. Controllers monitor and/or respond to several loops concurrently. The primary tracking tools are console logs, typically kept by a single operator and not visible to others in real-time. Information from telemetry, commanding, and planning systems also plays into decision-making. Email is very secondary/tertiary due to timing and archival considerations. Voice communications and log entries supporting ISS operations have increased by orders of magnitude because the number of control centers, flight crew, and payload operations have grown. This paper explores three developmental ground system concepts under development at Johnson Space Center s (JSC) Mission Control Center Houston (MCC-H) and Marshall Space Flight Center s (MSFC) Payload Operations Integration Center (POIC). These concepts could reduce ISS control center voice traffic and console logging yet increase the efficiency and effectiveness of both. The goal of this paper is to kindle further discussion, exploration, and tool development.
Effects of voice harmonic complexity on ERP responses to pitch-shifted auditory feedback.
Behroozmand, Roozbeh; Korzyukov, Oleg; Larson, Charles R
2011-12-01
The present study investigated the neural mechanisms of voice pitch control for different levels of harmonic complexity in the auditory feedback. Event-related potentials (ERPs) were recorded in response to+200 cents pitch perturbations in the auditory feedback of self-produced natural human vocalizations, complex and pure tone stimuli during active vocalization and passive listening conditions. During active vocal production, ERP amplitudes were largest in response to pitch shifts in the natural voice, moderately large for non-voice complex stimuli and smallest for the pure tones. However, during passive listening, neural responses were equally large for pitch shifts in voice and non-voice complex stimuli but still larger than that for pure tones. These findings suggest that pitch change detection is facilitated for spectrally rich sounds such as natural human voice and non-voice complex stimuli compared with pure tones. Vocalization-induced increase in neural responses for voice feedback suggests that sensory processing of naturally-produced complex sounds such as human voice is enhanced by means of motor-driven mechanisms (e.g. efference copies) during vocal production. This enhancement may enable the audio-vocal system to more effectively detect and correct for vocal errors in the feedback of natural human vocalizations to maintain an intended vocal output for speaking. Copyright © 2011 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
Acoustical analysis of trained and untrained singers onsite before and after prolonged voice use
NASA Astrophysics Data System (ADS)
Jackson, Christophe E.
Controlled acoustic environments are important in voice research. Recording environment affects the quality of voice recordings. While sound booths and anechoic chambers are examples of controlled acoustic environments widely used in research, they are both costly and not portable. The long-term goal of this project is to compare the voice usage and efficiency of trained and untrained singers onsite immediately before and after vocal performance. The specific goal of this project is the further of development a Portable Sound Booth (PSB) and standardization of onsite voice recording procedures under controlled conditions. We hypothesized that the simple and controlled acoustic environment provided by the PSB would enable consistent reliable onsite voice recordings and the immediate differences as a consequence of voice usage were measurable. Research has suggested that it would be possible to conduct onsite voice recordings. Proof of concept research titled "Construction and Characterization of a Portable Sound Booth for Onsite Measurement" was conducted before initiating the full research effort. Preliminary findings revealed that: (1) it was possible to make high-quality voice recordings onsite, (2) the use of a Portable Sound Booth (PSB) required further acoustic characterization of its inherent acoustic properties, and (3) testable differences before and after performance were evident. The specific aims were to (1) develop and refine onsite objective voice measurements in the PSB and (2) evaluate use of the PSB to measure voice quality changes before and after voice usage.
Voice handicap in essential tremor: a comparison with normal controls and Parkinson's disease.
Louis, Elan D; Gerbin, Marina
2013-01-01
Although voice tremor is one of the most commonly noted clinical features of essential tremor (ET), there are nearly no published data on the handicap associated with it. The Voice Handicap Index (VHI) was self-administered by participants enrolled in a research study at Columbia University Medical Center. The VHI quantifies patients' perceptions of handicap due to voice difficulties. Data from 98 ET cases were compared with data from 100 controls and 85 patients with another movement disorder (Parkinson's disease, PD). Voice tremor was present on examination in 25 (25.5%) ET cases; 12 had mild voice tremor (ETMild VT) and 13 had marked voice tremor (ETMarked VT). VHI scores were higher in ET cases than controls (p = 0.02). VHI scores among ETMarked VT were similar to those of PD cases; both were significantly higher than controls (p<0.001). The three VHI subscale scores (physical, functional, emotional) were highest in ETMarked VT, with values that were similar to those observed in PD. The voice handicap associated with ET had multiple (i.e., physical, functional, and emotional) dimensions. Moreover, ET cases with marked voice tremor on examination had a level of self-reported voice handicap that was similar to that observed in patients with PD.
Kim, Yongdae; Kim, Sangyoo; Park, Kyihwan
2009-04-01
A six-axis active vibration isolation system (AVIS) is developed using voice coil actuators. Point contact configuration is employed to have an easy assembly of eight voice coil actuators to an upper and a base plates. The velocity sensor, using an electromagnetic principle that is commonly used in the vibration control, is investigated since its phase lead characteristic causes an instability problem for a low frequency vibration. The performances of the AVIS are investigated in the frequency domain and finally validated by comparing with the passive isolation system using the atomic force microscope images.
Robotics control using isolated word recognition of voice input
NASA Technical Reports Server (NTRS)
Weiner, J. M.
1977-01-01
A speech input/output system is presented that can be used to communicate with a task oriented system. Human speech commands and synthesized voice output extend conventional information exchange capabilities between man and machine by utilizing audio input and output channels. The speech input facility is comprised of a hardware feature extractor and a microprocessor implemented isolated word or phrase recognition system. The recognizer offers a medium sized (100 commands), syntactically constrained vocabulary, and exhibits close to real time performance. The major portion of the recognition processing required is accomplished through software, minimizing the complexity of the hardware feature extractor.
Audio-vocal system regulation in children with autism spectrum disorders.
Russo, Nicole; Larson, Charles; Kraus, Nina
2008-06-01
Do children with autism spectrum disorders (ASD) respond similarly to perturbations in auditory feedback as typically developing (TD) children? Presentation of pitch-shifted voice auditory feedback to vocalizing participants reveals a close coupling between the processing of auditory feedback and vocal motor control. This paradigm was used to test the hypothesis that abnormalities in the audio-vocal system would negatively impact ASD compensatory responses to perturbed auditory feedback. Voice fundamental frequency (F(0)) was measured while children produced an /a/ sound into a microphone. The voice signal was fed back to the subjects in real time through headphones. During production, the feedback was pitch shifted (-100 cents, 200 ms) at random intervals for 80 trials. Averaged voice F(0) responses to pitch-shifted stimuli were calculated and correlated with both mental and language abilities as tested via standardized tests. A subset of children with ASD produced larger responses to perturbed auditory feedback than TD children, while the other children with ASD produced significantly lower response magnitudes. Furthermore, robust relationships between language ability, response magnitude and time of peak magnitude were identified. Because auditory feedback helps to stabilize voice F(0) (a major acoustic cue of prosody) and individuals with ASD have problems with prosody, this study identified potential mechanisms of dysfunction in the audio-vocal system for voice pitch regulation in some children with ASD. Objectively quantifying this deficit may inform both the assessment of a subgroup of ASD children with prosody deficits, as well as remediation strategies that incorporate pitch training.
[Computed assisted voice recognition. A dream or reality in the pathologist's routine work?].
Delling, G; Delling, D
1999-03-01
During the last 30 years the analysis of human speech with powerful computers has taken great strides; therefore, cost-effective, comfortable solutions are now available for use in professional routine work. The advantages of using voice recognition are the creation of new documentation or archives, reduced personnel costs and, last but not least, independence in cases of unforeseen notification of illness or owing to annual leave. For voice recognition systems to be used easily, a considerable amount of time must be invested for the first 3 months. Younger colleagues in particular will be more motivated to dictate more precisely and more detailed because of the introduction of voice recognition. The effects on other sectors of medical training, quality control, histology report preparation, and transmission can only be speculated.
Investigation of air transportation technology at Princeton University, 1983
NASA Technical Reports Server (NTRS)
Stengel, Robert F.
1987-01-01
Progress is discussed for each of the following areas: voice recognition technology for flight control; guidance and control strategies for penetration of microbursts and wind shear; application of artificial intelligence in flight control systems; and computer-aided aircraft design.
Analysis of Controller-Pilot Voice Communications from Kansas City Air Route Traffic Control Center
DOT National Transportation Integrated Search
2017-07-01
The implementation of Controller Pilot Datalink Communications (CPDLC) in domestic en route airspace is a key enabling technology in the Next Generation Air Transportation System. The Federal Aviation Administration plans to implement en route CPDLC ...
Code of Federal Regulations, 2014 CFR
2014-10-01
... satellite systems and non-voice, non-geostationary satellite systems in the 137-138 MHz band. 25.259 Section... systems and non-voice, non-geostationary satellite systems in the 137-138 MHz band. (a) The space stations of a non-voice, non-geostationary Mobile-Satellite Service (NVNG MSS) system time-sharing downlink...
Code of Federal Regulations, 2013 CFR
2013-10-01
... satellite systems and non-voice, non-geostationary satellite systems in the 137-138 MHz band. 25.259 Section... systems and non-voice, non-geostationary satellite systems in the 137-138 MHz band. (a) The space stations of a non-voice, non-geostationary Mobile-Satellite Service (NVNG MSS) system time-sharing downlink...
Koç, Eltaf Ayça Özbal; Koç, Bülent; Erbek, Selim
2014-12-01
In our experience Allergic Rhinitis (AR) patients suffer from voice problems more than health subjects. To investigate the acoustic analysis of voice, stroscopic findings of larynx and Voice Handicap Index scores in allergic rhinitis patients compared with healthy controls. Case-control study. Thirty adult patients diagnosed with perennial allergic rhinitis were compared with 30 age- and sex-matched healthy controls without allergy. All assessments were performed in the speech physiology laboratory and the testing sequence was as follows: 1. Voice Handicap Index (VHI) questionnaire, 2. Laryngovideostroboscopy, 3. Acoustic analyses. No difference was observed between the allergic rhinitis and control groups regarding mean Maximum Phonation Time (MPT) values, Fo values, and stroboscopic assessment (p>0.05). On the other hand, mean VHI score (p=0.001) and s/z ratio (p=0.011) were significantly higher in the allergic rhinitis group than in controls. Our findings suggest that the presence of allergies could have effects on laryngeal dysfunction and voice-related quality of life.
Lyberg Åhlander, Viveka; Rydell, Roland; Löfqvist, Anders
2012-07-01
This randomized case-control study compares teachers with self-reported voice problems to age-, gender-, and school-matched colleagues with self-reported voice health. The self-assessed voice function is related to factors known to influence the voice: laryngeal findings, voice quality, personality, psychosocial and coping aspects, searching for causative factors of voice problems in teachers. Subjects and controls, recruited from a teacher group in an earlier questionnaire study, underwent examinations of the larynx by high-speed imaging and kymograms; voice recordings; voice range profile; audiometry; self-assessment of voice handicap and voice function; teaching and environmental aspects; personality; coping; burnout, and work-related issues. The laryngeal and voice recordings were assessed by experienced phoniatricians and speech pathologists. The subjects with self-assessed voice problems differed from their peers with self-assessed voice health by significantly longer recovery time from voice problems and scored higher on all subscales of the Voice Handicap Index-Throat. The results show that the cause of voice dysfunction in this group of teachers with self-reported voice problems is not found in the vocal apparatus or within the individual. The individual's perception of a voice problem seems to be based on a combination of the number of symptoms and of how often the symptoms occur, along with the recovery time. The results also underline the importance of using self-assessed reports of voice dysfunction. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
The effect of deep brain stimulation on the speech motor system.
Mücke, Doris; Becker, Johannes; Barbe, Michael T; Meister, Ingo; Liebhart, Lena; Roettger, Timo B; Dembek, Till; Timmermann, Lars; Grice, Martine
2014-08-01
Chronic deep brain stimulation of the nucleus ventralis intermedius is an effective treatment for individuals with medication-resistant essential tremor. However, these individuals report that stimulation has a deleterious effect on their speech. The present study investigates one important factor leading to these effects: the coordination of oral and glottal articulation. Sixteen native-speaking German adults with essential tremor, between 26 and 86 years old, with and without chronic deep brain stimulation of the nucleus ventralis intermedius and 12 healthy, age-matched subjects were recorded performing a fast syllable repetition task (/papapa/, /tatata/, /kakaka/). Syllable duration and voicing-to-syllable ratio as well as parameters related directly to consonant production, voicing during constriction, and frication during constriction were measured. Voicing during constriction was greater in subjects with essential tremor than in controls, indicating a perseveration of voicing into the voiceless consonant. Stimulation led to fewer voiceless intervals (voicing-to-syllable ratio), indicating a reduced degree of glottal abduction during the entire syllable cycle. Stimulation also induced incomplete oral closures (frication during constriction), indicating imprecise oral articulation. The detrimental effect of stimulation on the speech motor system can be quantified using acoustic measures at the subsyllabic level.
Vocal Responses to Perturbations in Voice Auditory Feedback in Individuals with Parkinson's Disease
Liu, Hanjun; Wang, Emily Q.; Metman, Leo Verhagen; Larson, Charles R.
2012-01-01
Background One of the most common symptoms of speech deficits in individuals with Parkinson's disease (PD) is significantly reduced vocal loudness and pitch range. The present study investigated whether abnormal vocalizations in individuals with PD are related to sensory processing of voice auditory feedback. Perturbations in loudness or pitch of voice auditory feedback are known to elicit short latency, compensatory responses in voice amplitude or fundamental frequency. Methodology/Principal Findings Twelve individuals with Parkinson's disease and 13 age- and sex- matched healthy control subjects sustained a vowel sound (/α/) and received unexpected, brief (200 ms) perturbations in voice loudness (±3 or 6 dB) or pitch (±100 cents) auditory feedback. Results showed that, while all subjects produced compensatory responses in their voice amplitude or fundamental frequency, individuals with PD exhibited larger response magnitudes than the control subjects. Furthermore, for loudness-shifted feedback, upward stimuli resulted in shorter response latencies than downward stimuli in the control subjects but not in individuals with PD. Conclusions/Significance The larger response magnitudes in individuals with PD compared with the control subjects suggest that processing of voice auditory feedback is abnormal in PD. Although the precise mechanisms of the voice feedback processing are unknown, results of this study suggest that abnormal voice control in individuals with PD may be related to dysfunctional mechanisms of error detection or correction in sensory feedback processing. PMID:22448258
Vibration isolation and dual-stage actuation pointing system for space precision payloads
NASA Astrophysics Data System (ADS)
Kong, Yongfang; Huang, Hai
2018-02-01
Pointing and stability requirements for future space missions are becoming more and more stringent. This work follows the pointing control method which consists of a traditional spacecraft attitude control system and a payload active pointing loop, further proposing a vibration isolation and dual-stage actuation pointing system for space precision payloads based on a soft Stewart platform. Central to the concept is using the dual-stage actuator instead of the traditional voice coil motor single-stage actuator to improve the payload active pointing capability. Based on a specified payload, the corresponding platform was designed to be installed between the spacecraft bus and the payload. The performance of the proposed system is demonstrated by preliminary closed-loop control investigations in simulations. With the ordinary spacecraft bus, the line-of-sight pointing accuracy can be controlled to below a few milliarcseconds in tip and tilt. Meanwhile, utilizing the voice coil motor with the softening spring in parallel, which is a portion of the dual-stage actuator, the system effectively achieves low-frequency motion transmission and high-frequency vibration isolation along the other four degree-of-freedom directions.
Google Home: smart speaker as environmental control unit.
Noda, Kenichiro
2017-08-23
Environmental Control Units (ECU) are devices or a system that allows a person to control appliances in their home or work environment. Such system can be utilized by clients with physical and/or functional disability to enhance their ability to control their environment, to promote independence and improve their quality of life. Over the last several years, there have been an emergence of several inexpensive, commercially-available, voice activated smart speakers into the market such as Google Home and Amazon Echo. These smart speakers are equipped with far field microphone that supports voice recognition, and allows for complete hand-free operation for various purposes, including for playing music, for information retrieval, and most importantly, for environmental control. Clients with disability could utilize these features to turn the unit into a simple ECU that is completely voice activated and wirelessly connected to appliances. Smart speakers, with their ease of setup, low cost and versatility, may be a more affordable and accessible alternative to the traditional ECU. Implications for Rehabilitation Environmental Control Units (ECU) enable independence for physically and functionally disabled clients, and reduce burden and frequency of demands on carers. Traditional ECU can be costly and may require clients to learn specialized skills to use. Smart speakers have the potential to be used as a new-age ECU by overcoming these barriers, and can be used by a wider range of clients.
Code of Federal Regulations, 2014 CFR
2014-10-01
... satellite systems and non-voice, non-geostationary satellite systems in the 400.15-401 MHz band. 25.260... systems and non-voice, non-geostationary satellite systems in the 400.15-401 MHz band. (a) The space stations of a non-voice, non-geostationary Mobile-Satellite Service (NVNG MSS) system time-sharing downlink...
Code of Federal Regulations, 2010 CFR
2010-10-01
... satellite systems and non-voice, non-geostationary satellite systems in the 400.15-401 MHz band. 25.260... systems and non-voice, non-geostationary satellite systems in the 400.15-401 MHz band. (a) A non-voice, non-geostationary mobile-satellite service system licensee (“NVNG licensee”) time-sharing spectrum in...
Code of Federal Regulations, 2013 CFR
2013-10-01
... satellite systems and non-voice, non-geostationary satellite systems in the 400.15-401 MHz band. 25.260... systems and non-voice, non-geostationary satellite systems in the 400.15-401 MHz band. (a) The space stations of a non-voice, non-geostationary Mobile-Satellite Service (NVNG MSS) system time-sharing downlink...
Code of Federal Regulations, 2011 CFR
2011-10-01
... satellite systems and non-voice, non-geostationary satellite systems in the 400.15-401 MHz band. 25.260... systems and non-voice, non-geostationary satellite systems in the 400.15-401 MHz band. (a) A non-voice, non-geostationary mobile-satellite service system licensee (“NVNG licensee”) time-sharing spectrum in...
Code of Federal Regulations, 2010 CFR
2010-10-01
... satellite systems and non-voice, non-geostationary satellite systems in the 137-138 MHz band. 25.259 Section... systems and non-voice, non-geostationary satellite systems in the 137-138 MHz band. (a) A non-voice, non-geostationary mobile-satellite service system licensee (“NVNG licensee”) time-sharing spectrum in the 137-138...
Code of Federal Regulations, 2012 CFR
2012-10-01
... satellite systems and non-voice, non-geostationary satellite systems in the 400.15-401 MHz band. 25.260... systems and non-voice, non-geostationary satellite systems in the 400.15-401 MHz band. (a) A non-voice, non-geostationary mobile-satellite service system licensee (“NVNG licensee”) time-sharing spectrum in...
Code of Federal Regulations, 2011 CFR
2011-10-01
... satellite systems and non-voice, non-geostationary satellite systems in the 137-138 MHz band. 25.259 Section... systems and non-voice, non-geostationary satellite systems in the 137-138 MHz band. (a) A non-voice, non-geostationary mobile-satellite service system licensee (“NVNG licensee”) time-sharing spectrum in the 137-138...
Code of Federal Regulations, 2012 CFR
2012-10-01
... satellite systems and non-voice, non-geostationary satellite systems in the 137-138 MHz band. 25.259 Section... systems and non-voice, non-geostationary satellite systems in the 137-138 MHz band. (a) A non-voice, non-geostationary mobile-satellite service system licensee (“NVNG licensee”) time-sharing spectrum in the 137-138...
Borowiak, Kamila; von Kriegstein, Katharina
2016-01-01
The ability to recognise the identity of others is a key requirement for successful communication. Brain regions that respond selectively to voices exist in humans from early infancy on. Currently, it is unclear whether dysfunction of these voice-sensitive regions can explain voice identity recognition impairments. Here, we used two independent functional magnetic resonance imaging studies to investigate voice processing in a population that has been reported to have no voice-sensitive regions: autism spectrum disorder (ASD). Our results refute the earlier report that individuals with ASD have no responses in voice-sensitive regions: Passive listening to vocal, compared to non-vocal, sounds elicited typical responses in voice-sensitive regions in the high-functioning ASD group and controls. In contrast, the ASD group had a dysfunction in voice-sensitive regions during voice identity but not speech recognition in the right posterior superior temporal sulcus/gyrus (STS/STG)—a region implicated in processing complex spectrotemporal voice features and unfamiliar voices. The right anterior STS/STG correlated with voice identity recognition performance in controls but not in the ASD group. The findings suggest that right STS/STG dysfunction is critical for explaining voice recognition impairments in high-functioning ASD and show that ASD is not characterised by a general lack of voice-sensitive responses. PMID:27369067
NASA Technical Reports Server (NTRS)
1987-01-01
Skylab derived Heating System offers computerized control with an innovative voice synthesizer that literally allows the control unit to talk to the system user. It reports time of day, outside temperature and system temperature, and asks questions as to how the user wants the system programmed. Master Module collects energy from the Sun and either transfers it directly to the home water heater or stores it until needed.
Baker, Janet; Ben-Tovim, David; Butcher, Andrew; Esterman, Adrian; McLaughlin, Kristin
2013-12-01
This study aimed to explore psychosocial factors contributing to the development of functional voice disorders (FVD) and those differentiating between organic voice disorders (OVD) and a non-voice-disordered control group. A case-control study was undertaken of 194 women aged 18-80 years diagnosed with FVD (n = 73), OVD (n = 55), and controls (n = 66). FVD women were allocated into psychogenic voice disorder (PVD) (n = 37) and muscle tension voice disorder (MTVD) (n = 36) for sub-group analysis. Dependent variables included biographical and voice assessment data, the number and severity of life events and difficulties and conflict over speaking out (COSO) situations derived from the Life Events and Difficulties Schedule (LEDS), and psychological traits including emotional expressiveness scales. Four psychosocial components differentiated between the FVD and control group accounting for 84.9% of the variance: severe events, moderate events, severe COSO, and mild COSO difficulties. Severe events, severe and mild COSO difficulties differentiated between FVD and OVD groups, accounting for 80.5% of the variance. Moderate events differentiated between PVD and MTVD sub-groups, accounting for 58.9% of the variance. Psychological traits did not differentiate between groups. Stressful life events and COSO situations best differentiated FVD from OVD and control groups. More refined aetiological studies are needed to differentiate between PVD and MTVD.
Military and government applications of human-machine communication by voice.
Weinstein, C J
1995-01-01
This paper describes a range of opportunities for military and government applications of human-machine communication by voice, based on visits and contacts with numerous user organizations in the United States. The applications include some that appear to be feasible by careful integration of current state-of-the-art technology and others that will require a varying mix of advances in speech technology and in integration of the technology into applications environments. Applications that are described include (1) speech recognition and synthesis for mobile command and control; (2) speech processing for a portable multifunction soldier's computer; (3) speech- and language-based technology for naval combat team tactical training; (4) speech technology for command and control on a carrier flight deck; (5) control of auxiliary systems, and alert and warning generation, in fighter aircraft and helicopters; and (6) voice check-in, report entry, and communication for law enforcement agents or special forces. A phased approach for transfer of the technology into applications is advocated, where integration of applications systems is pursued in parallel with advanced research to meet future needs. Images Fig. 1 Fig. 2 Fig. 3 Fig. 4 Fig. 5 Fig. 6 PMID:7479718
Design and control of six degree-of-freedom active vibration isolation table.
Hong, Jinpyo; Park, Kyihwan
2010-03-01
A six-axis active vibration isolation system (AVIS) is designed by using the direct driven guide and ball contact mechanisms in order to have no cross-coupling between actuators. The point contact configuration gives an advantage of having an easy assembly of eight voice coil actuators to an upper and a base plate. A voice coil actuator is used since it can provide a large displacement and sufficient bandwidth required for vibration control. The AVIS is controlled considering the effect of flexible vibration mode in the upper plate and velocity sensor dynamics. A loop shaping technique and phase margin condition are applied to design a vibration controller. The performances of the AVIS are investigated in the frequency domain and finally validated by comparing with the passive isolation system. The scanning profiles of the specimen are compared together by using the atomic force microscope. The robustness of the AVIS is verified by showing the impulse response.
Design and control of six degree-of-freedom active vibration isolation table
NASA Astrophysics Data System (ADS)
Hong, Jinpyo; Park, Kyihwan
2010-03-01
A six-axis active vibration isolation system (AVIS) is designed by using the direct driven guide and ball contact mechanisms in order to have no cross-coupling between actuators. The point contact configuration gives an advantage of having an easy assembly of eight voice coil actuators to an upper and a base plate. A voice coil actuator is used since it can provide a large displacement and sufficient bandwidth required for vibration control. The AVIS is controlled considering the effect of flexible vibration mode in the upper plate and velocity sensor dynamics. A loop shaping technique and phase margin condition are applied to design a vibration controller. The performances of the AVIS are investigated in the frequency domain and finally validated by comparing with the passive isolation system. The scanning profiles of the specimen are compared together by using the atomic force microscope. The robustness of the AVIS is verified by showing the impulse response.
Stegemöller, Elizabeth L; Radig, Hollie; Hibbing, Paul; Wingate, Judith; Sapienza, Christine
2017-03-01
Purpose Interventions focused on singing may provide additional benefits to established voice and respiratory therapies, due to their greater emphasis on the respiratory muscle control system in those with Parkinson's disease (PD) progresses. The purpose of this study was to examine if singing can improve voice, respiratory pressure and quality of life (QOL) in persons with PD. Methods This pilot study measured the effects of a singing intervention in 27 participants with PD. Participants were assigned to a high (met twice weekly) or low (met once weekly) dosage group. Voice, respiratory and QOL measures were recorded before and after an 8-week singing intervention. Sessions were led by board-certified music therapists and included a series of vocal and articulation exercises and group singing. Results Both groups demonstrated significant improvements in maximum inspiratory and expiratory pressure, as well as phonation time. While other voice measures improved, they did not reach statistical significance. Voice QOL and whole health QOL also significantly improved. Conclusion These results suggest singing may be a beneficial and engaging treatment choice for improving and maintaining vocal function and respiratory pressure in persons with PD. Implications for Rehabilitation In a small sample, group singing proved beneficial for improving voice and respiratory impairment in persons with Parkinson's disease. Completing group singing one time per week for 8 weeks was as effective as completing group singing two times per week for 8 weeks in persons with Parkinson's disease. Group singing is an effective means of improving overall quality of life in persons with Parkinson's disease.
Knowledge-based operation and management of communications systems
NASA Technical Reports Server (NTRS)
Heggestad, Harold M.
1988-01-01
Expert systems techniques are being applied in operation and control of the Defense Communications System (DCS), which has the mission of providing reliable worldwide voice, data and message services for U.S. forces and commands. Thousands of personnel operate DCS facilities, and many of their functions match the classical expert system scenario: complex, skill-intensive environments with a full spectrum of problems in training and retention, cost containment, modernization, and so on. Two of these functions are: (1) fault isolation and restoral of dedicated circuits at Tech Control Centers, and (2) network management for the Defense Switched Network (the modernized dial-up voice system currently replacing AUTOVON). An expert system for the first of these is deployed for evaluation purposes at Andrews Air Force Base, and plans are being made for procurement of operational systems. In the second area, knowledge obtained with a sophisticated simulator is being embedded in an expert system. The background, design and status of both projects are described.
Knowledge-based operation and management of communications systems
NASA Astrophysics Data System (ADS)
Heggestad, Harold M.
1988-11-01
Expert systems techniques are being applied in operation and control of the Defense Communications System (DCS), which has the mission of providing reliable worldwide voice, data and message services for U.S. forces and commands. Thousands of personnel operate DCS facilities, and many of their functions match the classical expert system scenario: complex, skill-intensive environments with a full spectrum of problems in training and retention, cost containment, modernization, and so on. Two of these functions are: (1) fault isolation and restoral of dedicated circuits at Tech Control Centers, and (2) network management for the Defense Switched Network (the modernized dial-up voice system currently replacing AUTOVON). An expert system for the first of these is deployed for evaluation purposes at Andrews Air Force Base, and plans are being made for procurement of operational systems. In the second area, knowledge obtained with a sophisticated simulator is being embedded in an expert system. The background, design and status of both projects are described.
A Comparison of Voice Activity and Participation Profiles Among Etiological Groups.
Lee, Seung Jin; Choi, Hong-Shik; Kim, HyangHee
2018-05-11
The purpose of this study was to determine whether patients with functional voice disorders show voice activity and participation profiles different from those of the organic and neurogenic groups. The Korean Version of the Voice Activity and Participation Profile (K-VAPP) was administered to 200 participants (150 patients with functional, organic, and neurogenic voice disorders, 50 for each etiological group, 50 controls without vocal complaint). The K-VAPP subscale scores of the etiological groups were compared, controlling for age, professional use of voice, and severity of voice disorder measured by overall severity of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). Results of a one-way analysis of variance indicated significant differences in the overall severity across groups (neurogenic > functional = organic > control). Among four groups, the organic group showed higher mean Z-scores of the K-VAPP than the control group, and the functional group showed higher mean Z-scores of the K-VAPP than the organic group. Compared with the neurogenic group, the functional group showed lower mean Z-scores for total score, Activity Limitation Score, SUB3, and SUB5. A comparison among three etiological groups showed that the functional group did not show higher scores than the organic group. On the contrary, the functional group showed a lower total score, Participation Restriction Score, and score for subsection 3 (effect on daily communication) than the neurogenic group. Psychometric assessment of voice disorders using the K-VAPP could provide clinicians with baseline information that is applicable to various voice disorders. Further studies pertaining to the follow-up of voice disorders with various etiologies are needed to extend its clinical usefulness. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
DOT National Transportation Integrated Search
2017-05-01
Fundamental differences exist between transmissions of Air Traffic Control clearances over voice and those transmitted via Controller Pilot Data Link Communications (CPDLC). This paper provides flight deck human factors issues that apply to processin...
Compensation for pitch-shifted auditory feedback during the production of Mandarin tone sequences
NASA Astrophysics Data System (ADS)
Xu, Yi; Larson, Charles R.; Bauer, Jay J.; Hain, Timothy C.
2004-08-01
Recent research has found that while speaking, subjects react to perturbations in pitch of voice auditory feedback by changing their voice fundamental frequency (F0) to compensate for the perceived pitch-shift. The long response latencies (150-200 ms) suggest they may be too slow to assist in on-line control of the local pitch contour patterns associated with lexical tones on a syllable-to-syllable basis. In the present study, we introduced pitch-shifted auditory feedback to native speakers of Mandarin Chinese while they produced disyllabic sequences /ma ma/ with different tonal combinations at a natural speaking rate. Voice F0 response latencies (100-150 ms) to the pitch perturbations were shorter than syllable durations reported elsewhere. Response magnitudes increased from 50 cents during static tone to 85 cents during dynamic tone productions. Response latencies and peak times decreased in phrases involving a dynamic change in F0. The larger response magnitudes and shorter latency and peak times in tasks requiring accurate, dynamic control of F0, indicate this automatic system for regulation of voice F0 may be task-dependent. These findings suggest that auditory feedback may be used to help regulate voice F0 during production of bi-tonal Mandarin phrases.
Dysphonia, Perceived Control, and Psychosocial Distress: A Qualitative Study.
Misono, Stephanie; Haut, Caroline; Meredith, Liza; Frazier, Patricia A; Stockness, Ali; Michael, Deirdre D; Butcher, Lisa; Harwood, Eileen M
2018-05-11
The purpose of this qualitative study was to examine relationships between psychological factors, particularly perceived control, and voice symptoms in adults seeking treatment for a voice problem. Semistructured interviews of adult patients with a clinical diagnosis of muscle tension dysphonia were conducted and transcribed. Follow-up interviews were conducted as needed for further information or clarification. A multidisciplinary team analyzed interview content using inductive techniques. Common themes and subthemes were identified. A conceptual model was developed describing the association between voice symptoms, psychological factors, precipitants of ongoing voice symptoms, and perceived control. Thematic saturation was reached after 23 interviews. No participants reported a direct psychological cause for their voice problem, although half described significant life events preceding voice problem onset (eg, miscarriage and other health events, interpersonal conflicts, and family members' illnesses, injuries, and deaths). Participants described psychological influences on voice symptoms that led to rapid exacerbation of their voice symptoms. Participants described the helpfulness of speech therapy and sometimes also challenges of applying techniques in daily life. They also discussed personal coping strategies that included behavioral (eg, avoiding triggers and seeking social support) and psychological (eg, mind-body awareness and emotion regulation) components. Voice-related perceived control was associated with adaptive emotional and behavioral responses, which appeared to facilitate symptom improvement. In this qualitative pilot study, participant narratives suggested that psychological factors and emotions influence voice symptoms, facilitating development of a preliminary conceptual model of how adaptive and maladaptive responses develop and how they influence vocal function. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Ocular attention-sensing interface system
NASA Technical Reports Server (NTRS)
Zaklad, Allen; Glenn, Floyd A., III; Iavecchia, Helene P.; Stokes, James M.
1986-01-01
The purpose of the research was to develop an innovative human-computer interface based on eye movement and voice control. By eliminating a manual interface (keyboard, joystick, etc.), OASIS provides a control mechanism that is natural, efficient, accurate, and low in workload.
Interference effects of vocalization on dual task performance
NASA Astrophysics Data System (ADS)
Owens, J. M.; Goodman, L. S.; Pianka, M. J.
1984-09-01
Voice command and control systems have been proposed as a potential means of off-loading the typically overburdened visual information processing system. However, prior to introducing novel human-machine interfacing technologies in high workload environments, consideration must be given to the integration of the new technologists within existing task structures to ensure that no new sources of workload or interference are systematically introduced. This study examined the use of voice interactive systems technology in the joint performance of two cognitive information processing tasks requiring continuous memory and choice reaction wherein a basis for intertask interference might be expected. Stimuli for the continuous memory task were presented aurally and either voice or keyboard responding was required in the choice reaction task. Performance was significantly degraded in each task when voice responding was required in the choice reaction time task. Performance degradation was evident in higher error scores for both the choice reaction and continuous memory tasks. Performance decrements observed under conditions of high intertask stimulus similarity were not statistically significant. The results signal the need to consider further the task requirements for verbal short-term memory when applying speech technology in multitask environments.
Effectiveness of a Voice Training Program for Student Teachers on Vocal Health.
Richter, Bernhard; Nusseck, Manfred; Spahn, Claudia; Echternach, Matthias
2016-07-01
The effectiveness of a preventive training program on vocal health for German student teachers was investigated on specific vocal parameters. The voice quality as described by the Dysphonia Severity Index of 204 student teachers (training group: n = 123; control group: n = 81) was measured at the beginning and at the end of the student teachers training period (duration 1.5 years). Additionally, for investigating the voice-carrying capacity, a vocal loading test (VLT) was performed. Finally, participants had to provide a subjective judgment of a possible Voice Handicap Index. The training program improved the voice quality of the trained group compared with that of the control group, whose voice quality declined. The trained group was also able to better sustain their voice quality across the VLT than the control group. Both groups, however, reported a similar increase in subjective vocal strain. The presented training program clearly showed a positive impact on the voice quality and the vocal capacity. The results maintain the importance of such a training program to be integrated in the education and occupational routine of teachers. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Effect on long-term average spectrum of pop singers' vocal warm-up with vocal function exercises.
Guzman, Marco; Angulo, Mabel; Muñoz, Daniel; Mayerhoff, Ross
2013-04-01
Abstract This case-control study aimed to investigate if there is any change on the spectral slope declination immediately after vocal function exercises (VFE) vs traditional vocal warm-up exercises in normal singers. Thirty-eight pop singers with perceptually normal voices were divided into two groups: an experimental group (n = 20) and a control group (n = 18). One single session with VFE for the experimental group and traditional singing warm-up exercises for the control group was applied. Voice was recorded before and after the exercises. The recorded tasks were to read a phonetically balanced text and to sing a song. Long-term average spectrum (LTAS) analysis included alpha ratio, L1-L0 ratio, and singing power ratio (SPR). Acoustic parameters of voice samples pre- and post-training were compared. Comparison between VFE and control group was also performed. Significant changes after treatment included the alpha ratio and singing power ratio in speaking voice, and SPR in the singing voice for VFE group. The traditional vocal warm-up of the control group also showed pre-post changes. Significant differences between VFE group and control group for alpha ratio and SPR were found in speaking voice samples. This study demonstrates that VFE have an immediate effect on the spectrum of the voice, specifically a decrease on the spectral slope declination. The results of this study provide support for the advantageous effect of VFE as vocal warm-up on voice quality.
Sykes, Ingrid
2011-10-01
This essay explores new models of the citizen-patient by attending to the post-Revolutionary blind 'voice'. Voice, in both a literal and figurative sense, was central to the way in which members of the Hospice des Quinze-Vingts, an institution for the blind and partially sighted, interacted with those in the community. Musical voices had been used by members to collect alms and to project the particular spiritual principle of their institution since its foundation in the thirteenth century. At the time of the Revolution, the Quinze-Vingts voice was understood by some political authorities as an exemplary call of humanity. Yet many others perceived it as deeply threatening. After 1800, productive dialogue between those in political control and Quinze-Vingts blind members broke down. Authorities attempted to silence the voice of members through the control of blind musicians and institutional management. The Quinze-Vingts blind continued to reassert their voices until around 1850, providing a powerful form of resistance to political control. The blind 'voice' ultimately recognised the right of the citizen-patient to dialogue with their political carers.
Birth Control Pills and Nonprofessional Voice: Acoustic Analyses
ERIC Educational Resources Information Center
Amir, Ofer; Biron-Shental, Tal; Shabtai, Esther
2006-01-01
Purpose: Two studies are presented here. Study 1 was aimed at evaluating whether the voice characteristics of women who use birth control pills that contain different progestins differ from the voice characteristics of a control group. Study 2 presents a meta-analysis that combined the results of Study 1 with those from 3 recent studies that…
Characteristics and professional use of voice in street children in Aracaju, Brazil.
Sales, Neuza Josina; Gurgel, Ricardo Queiroz; Gonçalves, Maria Inês Rebelo; Cunha, Edílson; Barreto, Valeria Maria Prado; Todt Neto, João Carlos; D'Avila, Jeferson Sampaio
2010-07-01
The objective of the study was to evaluate voice characteristics of children engaged in street selling, which involves an essentially professional use of voice in this population. A controlled cross-sectional study was carried out. A randomly chosen sample of 200 school children with a history of street selling assisted by public social services and 400 school children without this experience was selected. Seven- to 10-year-old children of both sexes were studied. Both groups were interviewed and given vocal assessment (auditory-perceptual assessment and spectrographic acoustic measures) and otorhinolaryngological evaluation (physical and videonasolaryngoscopic examination). Children with abnormal results in both groups were compared using chi(2) (Chi-squared test). The significance level was established at 5% (P<0.05). Voice problems were detected more frequently in working children (106-53%) than in regular school children (90-22.5%). The control group achieved better school performance as more children in this group attend school regularly than street children, although age-for-grade deficit was similar. The control group had more access to medical visits (80-40%) and treatment with a doctor (34-17%). Language assessment has shown that the control group had more dysphonia (73-37%) and myofunctional orofacial disorders (20-10%). Street children had more normal voice but had more nasal disorders and greater glottal closure than the school control group. Voice disorders were present in both groups, but less frequently in street children. Although subject to inadequate living conditions, street children had better voice quality than the control group. An explanation could be that by adapting their voice professionally for selling goods in the streets, they developed adequate resilience to their difficult living conditions. Copyright (c) 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Talking Fire Alarms Calm Kids.
ERIC Educational Resources Information Center
Executive Educator, 1984
1984-01-01
The new microprocessor-based fire alarm systems can help to control smoke movement throughout school buildings by opening vents and doors, identify the burning section, activate voice alarms, provide firefighters with telephone systems during the fire, and release fire-preventing gas. (KS)
The effect of singing training on voice quality for people with quadriplegia.
Tamplin, Jeanette; Baker, Felicity A; Buttifant, Mary; Berlowitz, David J
2014-01-01
Despite anecdotal reports of voice impairment in quadriplegia, the exact nature of these impairments is not well described in the literature. This article details objective and subjective voice assessments for people with quadriplegia at baseline and after a respiratory-targeted singing intervention. Randomized controlled trial. Twenty-four participants with quadriplegia were randomly assigned to a 12-week program of either a singing intervention or active music therapy control. Recordings of singing and speech were made at baseline, 6 weeks, 12 weeks, and 6 months postintervention. These deidentified recordings were used to measure sound pressure levels and assess voice quality using the Multidimensional Voice Profile and the Perceptual Voice Profile. Baseline voice quality data indicated deviation from normality in the areas of breathiness, strain, and roughness. A greater percentage of intervention participants moved toward more normal voice quality in terms of jitter, shimmer, and noise-to-harmonic ratio; however, the improvements failed to achieve statistical significance. Subjective and objective assessments of voice quality indicate that quadriplegia may have a detrimental effect on voice quality; in particular, causing a perception of roughness and breathiness in the voice. The results of this study suggest that singing training may have a role in ameliorating these voice impairments. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Kim, Kyoung Min; Park, Kyeong Seon; Lee, Hyun Ju; Lee, Yun Hee; Bae, Ji Seon; Lee, Young Joon; Choi, Sung Hee; Jang, Hak Chul; Lim, Soo
2015-12-11
We have demonstrated previously that an individualized health management system using advanced medical information technology, named ubiquitous (u)-healthcare, was helpful in achieving better glycemic control than routine care. Recently, we generated a new u-healthcare system using a voice inception technique for elderly diabetic patients to communicate information about their glucose control, physical activity, and diet more easily. In a randomized clinical trial, 70 diabetic patients aged 60-85 years were assigned randomly to a standard care group or u-healthcare group for 6 months. The primary end points were the changes in glycated hemoglobin (HbA1c) and glucose fluctuation assessed by the mean amplitude glycemic excursion (MAGE). Changes in body weight, lifestyle, and knowledge about diabetes were also investigated. After 6 months, the HbA1c levels decreased significantly in the u-healthcare group (from 8.6 ± 1.0% to 7.5 ± 0.6%) compared with the standard care group (from 8.7 ± 0.9% to 8.2 ± 1.1%, P < 0.01). The MAGE decreased more in the u-healthcare group than in the standard care group. Systolic blood pressure and body weight decreased and liver functions improved in the u-healthcare group, but not in the standard care group. The u-healthcare system with voice inception technique was effective in achieving glycemic control without hypoglycemia in elderly diabetic patients (Clinicaltrials.gov: NCT01891474).
Changes After Voice Therapy in Acoustic Voice Analysis of Chinese Patients With Voice Disorders.
Lu, Dan; Chen, Fei; Yang, Hui; Yu, Rong; Zhou, Qi; Zhang, Xinyuan; Ren, Jia; Zheng, Yitao; Zhang, Xiaoyan; Zou, Jian; Wang, Haiyang; Liu, Jun
2018-05-01
This study aimed to evaluate the effects of voice therapy on patients with voice disorders by comparing the acoustic parameter changes before and after treatment. This is a retrospective study. Forty-five female patients with early-stage vocal nodules or polyps, postoperative patients, and patients with chronic laryngitis were divided into three subgroups. Videostroboscopic, acoustic analysis (fundamental frequency, jitter, shimmer, mean harmonics-to-noise ratio), and maximum phonation time (MPT) were measured before and after treatment. Fifty healthy female volunteers were the control group. After treatment, 24.4% of nodules or polyps had decreased in size, 11.1% of patients with chronic laryngitis and postoperative patients had reduced edema, and the mucosal wave of vocal folds had different degrees of recovery in postoperative patients. All acoustic analysis values and MPT in the patient group were statistically worse than in the control group, except for fundamental frequency before treatment (P > 0.05). After treatment, the acoustic analysis and MPT values were improved. However, the jitter, mean harmonics-to-noise ratio, and MPT values in the patient group were still worse after voice therapy than in the control group (P < 0.05). Most of acoustic analysis values can be useful as a complementary tool in diagnosis and assessment of voice disorders; however, it is not recommended to use a single parameter to assess voice quality. Voice therapy can improve voice quality in patients with voice disorders, but a period longer than 8 weeks is recommended for these patients. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Ma, E P; Yiu, E M
2001-06-01
Traditional clinical voice evaluation focuses primarily on the severity of voice impairment, with little emphasis on the impact of voice disorders on the individual's quality of life. This study reports the development of a 28-item assessment tool that evaluates the perception of voice problem, activity limitation, and participation restriction using the International Classification of Impairments, Disabilities and Handicaps-2 Beta-1 concept (World Health Organization, 1997). The questionnaire was administered to 40 subjects with dysphonia and 40 control subjects with normal voices. Results showed that the dysphonic group reported significantly more severe voice problems, limitation in daily voice activities, and restricted participation in these activities than the control group. The study also showed that the perception of a voice problem by the dysphonic subjects correlated positively with the perception of limitation in voice activities and restricted participation. However, the self-perceived voice problem had little correlation with the degree of voice-quality impairment measured acoustically and perceptually by speech pathologists. The data also showed that the aggregate scores of activity limitation and participation restriction were positively correlated, and the extent of activity limitation and participation restriction was similar in all except the job area. These findings highlight the importance of identifying and quantifying the impact of dysphonia on the individual's quality of life in the clinical management of voice disorders.
Wireless Augmented Reality Prototype (WARP)
NASA Technical Reports Server (NTRS)
Devereaux, A. S.
1999-01-01
Initiated in January, 1997, under NASA's Office of Life and Microgravity Sciences and Applications, the Wireless Augmented Reality Prototype (WARP) is a means to leverage recent advances in communications, displays, imaging sensors, biosensors, voice recognition and microelectronics to develop a hands-free, tetherless system capable of real-time personal display and control of computer system resources. Using WARP, an astronaut may efficiently operate and monitor any computer-controllable activity inside or outside the vehicle or station. The WARP concept is a lightweight, unobtrusive heads-up display with a wireless wearable control unit. Connectivity to the external system is achieved through a high-rate radio link from the WARP personal unit to a base station unit installed into any system PC. The radio link has been specially engineered to operate within the high- interference, high-multipath environment of a space shuttle or space station module. Through this virtual terminal, the astronaut will be able to view and manipulate imagery, text or video, using voice commands to control the terminal operations. WARP's hands-free access to computer-based instruction texts, diagrams and checklists replaces juggling manuals and clipboards, and tetherless computer system access allows free motion throughout a cabin while monitoring and operating equipment.
Crovato, César David Paredes; Schuck, Adalberto
2007-10-01
This paper presents a dysphonic voice classification system using the wavelet packet transform and the best basis algorithm (BBA) as dimensionality reductor and 06 artificial neural networks (ANN) acting as specialist systems. Each ANN was a 03-layer multilayer perceptron with 64 input nodes, 01 output node and in the intermediary layer the number of neurons depends on the related training pathology group. The dysphonic voice database was separated in five pathology groups and one healthy control group. Each ANN was trained and associated with one of the 06 groups, and fed by the best base tree (BBT) nodes' entropy values, using the multiple cross validation (MCV) method and the leave-one-out (LOO) variation technique and success rates obtained were 87.5%, 95.31%, 87.5%, 100%, 96.87% and 89.06% for the groups 01 to 06, respectively.
Comparison of two MAC protocols based on LEO satellite networks
NASA Astrophysics Data System (ADS)
Guan, Mingxiang; Wang, Ruichun
2009-12-01
With the development of LEO satellite communication, it is the basic requirement that various kinds of services will be provided. Considering that weak channel collision detection ability, long propagation delay and heavy load in LEO satellite communication system, a valid adaptive access control protocol APRMA is proposed. Different access probability functions for different services are obtained and appropriate access probabilities for voice and data users are updated slot by slot based on the estimation of the voice traffic and the channel status. Finally simulation results demonstrate that the performance of system is improved by the APRMA compared with the conventional PRMA, with an acceptable trade-off between QoS of voice and delay of data. Also the APRMA protocol will be suitable for HAPS (high altitude platform station) with the characters of weak channel collision detection ability, long propagation delay and heavy load.
Single-channel voice-response-system program documentation volume I : system description
DOT National Transportation Integrated Search
1977-01-01
This report documents the design and implementation of a Voice Response System (VRS) using Adaptive Differential Pulse Code Modulation (ADPCM) voice coding. Implemented on a Digital Equipment Corporation PDP-11/20,R this VRS system supports a single ...
Lin, Szu-Han Joanna; Johnson, Russell E
2015-09-01
One way that employees contribute to organizational effectiveness is by expressing voice. They may offer suggestions for how to improve the organization (promotive voice behavior), or express concerns to prevent harmful events from occurring (prohibitive voice behavior). Although promotive and prohibitive voices are thought to be distinct types of behavior, very little is known about their unique antecedents and consequences. In this study we draw on regulatory focus and ego depletion theories to derive a theoretical model that outlines a dynamic process of the antecedents and consequences of voice behavior. Results from 2 multiwave field studies revealed that promotion and prevention foci have unique ties to promotive and prohibitive voice, respectively. Promotive and prohibitive voice, in turn, were associated with decreases and increases, respectively, in depletion. Consistent with the dynamic nature of self-control, depletion was associated with reductions in employees' subsequent voice behavior, regardless of the type of voice (promotive or prohibitive). Results were consistent across 2 studies and remained even after controlling for other established antecedents of voice and alternative mediating mechanisms beside depletion. (c) 2015 APA, all rights reserved).
Fantini, Marco; Succo, Giovanni; Crosetti, Erika; Borragán Torre, Alfonso; Demo, Roberto; Fussi, Franco
2017-05-01
The current study aimed at investigating the immediate effects of a semi-occluded vocal tract exercise with a ventilation mask in a group of contemporary commercial singers. A randomized controlled study was carried out. Thirty professional or semi-professional singers with no voice complaints were randomly divided into two groups on recruitment: an experimental group and a control group. The same warm-up exercise was performed by the experimental group with an occluded ventilation mask placed over the nose and the mouth and by the control group without the ventilation mask. Voice was recorded before and after the exercise. Acoustic and self-assessment analysis were accomplished. The acoustic parameters of the voice samples recorded before and after training were compared, as well as the parameters' variations between the experimental and the control group. Self-assessment results of the experimental and the control group were compared too. Significant changes after the warm-up exercise included jitter, shimmer, and singing power ratio (SPR) in the experimental group. No significant changes were recorded in the control group. Significant differences between the experimental and the control group were found for ΔShimmer and ΔSPR. Self-assessment analysis confirmed a significantly higher phonatory comfort and voice quality perception for the experimental group. The results of the present study support the immediate advantageous effects on singing voice of a semi-occluded vocal tract exercise with a ventilation mask in terms of acoustic quality, phonatory comfort, and voice quality perception in contemporary commercial singers. Long-term effects still remain to be studied. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Mawson, Amy; Berry, Katherine; Murray, Craig; Hayward, Mark
2011-09-01
Research has found relational qualities of power and intimacy to exist within hearer-voice interactions. The present study aimed to provide a deeper understanding of the interpersonal context of voice hearing by exploring participants' relationships with their voices and other people in their lives. This research was designed in consultation with service users and employed a qualitative, phenomenological, and idiographic design using semi-structured interviews. Ten participants, recruited via mental health services, and who reported hearing voices in the previous week, completed the interviews. These were transcribed verbatim and analysed using interpretative phenomenological analysis. Five themes resulted from the analysis. Theme 1: 'person and voice' demonstrated that participants' voices often reflected the identity, but not always the quality of social acquaintances. Theme 2: 'voices changing and confirming relationship with the self' explored the impact of voice hearing in producing an inferior sense-of-self in comparison to others. Theme 3: 'a battle for control' centred on issues of control and a dilemma of independence within voice relationships. Theme 4: 'friendships facilitating the ability to cope' and theme 5: 'voices creating distance in social relationships' explored experiences of social relationships within the context of voice hearing, and highlighted the impact of social isolation for voice hearers. The study demonstrated the potential role of qualitative research in developing theories of voice hearing. It extended previous research by highlighting the interface between voices and the social world of the hearer, including reciprocal influences of social relationships on voices and coping. Improving voice hearers' sense-of-self may be a key factor in reducing the distress caused by voices. ©2010 The British Psychological Society.
``The perceptual bases of speaker identity'' revisited
NASA Astrophysics Data System (ADS)
Voiers, William D.
2003-10-01
A series of experiments begun 40 years ago [W. D. Voiers, J. Acoust. Soc. Am. 36, 1065-1073 (1964)] was concerned with identifying the perceived voice traits (PVTs) on which human recognition of voices depends. It culminated with the development of a voice taxonomy based on 20 PVTs and a set of highly reliable rating scales for classifying voices with respect to those PVTs. The development of a perceptual voice taxonomy was motivated by the need for a practical method of evaluating speaker recognizability in voice communication systems. The Diagnostic Speaker Recognition Test (DSRT) evaluates the effects of systems on speaker recognizability as reflected in changes in the inter-listener reliability of voice ratings on the 20 PVTs. The DSRT thus provides a qualitative, as well as quantitative, evaluation of the effects of a system on speaker recognizability. A fringe benefit of this project is PVT rating data for a sample of 680 voices. [Work partially supported by USAFRL.
Awan, Shaheen N; Roy, Nelson; Zhang, Dong; Cohen, Seth M
2016-03-01
The purposes of this study were to (1) evaluate the performance of the Cepstral Spectral Index of Dysphonia (CSID--a multivariate estimate of dysphonia severity) as a potential screening tool for voice disorder identification and (2) identify potential clinical cutoff scores to classify voice-disordered cases versus controls. Subjects were 332 men and women (116 men, 216 women) comprised of subjects who presented to a physician with a voice-related complaint and a group of non-voice-related control subjects. Voice-disordered cases versus controls were initially defined via three reference standards: (1) auditory-perceptual judgment (dysphonia +/-); (2) Voice Handicap Index (VHI) score (VHI +/-); and (3) laryngoscopic description (laryngoscopic +/-). Speech samples were analyzed using the Analysis of Dysphonia in Speech and Voice program. Cepstral and spectral measures were combined into a CSID multivariate formula which estimated dysphonia severity for Rainbow Passage samples (i.e., the CSIDR). The ability of the CSIDR to accurately classify cases versus controls in relation to each reference standard was evaluated via a combination of logistic regression and receiver operating characteristic (ROC) analyses. The ability of the CSIDR to discriminate between cases and controls was represented by the "area under the ROC curve" (AUC). ROC classification of dysphonia-positive cases versus controls resulted in a strong AUC = 0.85. A CSIDR cutoff of ≈24 achieved the best balance between sensitivity and specificity, whereas a more liberal cutoff score of ≈19 resulted in higher sensitivity while maintaining respectable specificity which may be preferred for screening purposes. Weaker but adequate AUCs = 0.75 and 0.73 were observed for the classification of VHI-positive and laryngoscopic-positive cases versus controls, respectively. Logistic regression analyses indicated that subject age may be a significant covariate in the discrimination of dysphonia-positive and VHI-positive cases versus controls. The CSIDR can provide a strong level of accuracy for the classification of voice-disordered cases versus controls, particularly when auditory-perceptual judgment is used as the reference standard. Although users often focus on a cutoff score that achieves a balance between sensitivity and specificity, more liberal cutoffs for screening purposes versus conservative cutoffs when cost or risk of further evaluation is deemed to be high should also be considered. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Overgeneral autobiographical memory bias in clinical and non-clinical voice hearers.
Jacobsen, Pamela; Peters, Emmanuelle; Ward, Thomas; Garety, Philippa A; Jackson, Mike; Chadwick, Paul
2018-03-14
Hearing voices can be a distressing and disabling experience for some, whilst it is a valued experience for others, so-called 'healthy voice-hearers'. Cognitive models of psychosis highlight the role of memory, appraisal and cognitive biases in determining emotional and behavioural responses to voices. A memory bias potentially associated with distressing voices is the overgeneral memory bias (OGM), namely the tendency to recall a summary of events rather than specific occasions. It may limit access to autobiographical information that could be helpful in re-appraising distressing experiences, including voices. We investigated the possible links between OGM and distressing voices in psychosis by comparing three groups: (1) clinical voice-hearers (N = 39), (2) non-clinical voice-hearers (N = 35) and (3) controls without voices (N = 77) on a standard version of the autobiographical memory test (AMT). Clinical and non-clinical voice-hearers also completed a newly adapted version of the task, designed to assess voices-related memories (vAMT). As hypothesised, the clinical group displayed an OGM bias by retrieving fewer specific autobiographical memories on the AMT compared with both the non-clinical and control groups, who did not differ from each other. The clinical group also showed an OGM bias in recall of voice-related memories on the vAMT, compared with the non-clinical group. Clinical voice-hearers display an OGM bias when compared with non-clinical voice-hearers on both general and voices-specific recall tasks. These findings have implications for the refinement and targeting of psychological interventions for psychosis.
Voice responses to changes in pitch of voice or tone auditory feedback
NASA Astrophysics Data System (ADS)
Sivasankar, Mahalakshmi; Bauer, Jay J.; Babu, Tara; Larson, Charles R.
2005-02-01
The present study was undertaken to examine if a subject's voice F0 responded not only to perturbations in pitch of voice feedback but also to changes in pitch of a side tone presented congruent with voice feedback. Small magnitude brief duration perturbations in pitch of voice or tone auditory feedback were randomly introduced during sustained vowel phonations. Results demonstrated a higher rate and larger magnitude of voice F0 responses to changes in pitch of the voice compared with a triangular-shaped tone (experiment 1) or a pure tone (experiment 2). However, response latencies did not differ across voice or tone conditions. Data suggest that subjects responded to the change in F0 rather than harmonic frequencies of auditory feedback because voice F0 response prevalence, magnitude, or latency did not statistically differ across triangular-shaped tone or pure-tone feedback. Results indicate the audio-vocal system is sensitive to the change in pitch of a variety of sounds, which may represent a flexible system capable of adapting to changes in the subject's voice. However, lower prevalence and smaller responses to tone pitch-shifted signals suggest that the audio-vocal system may resist changes to the pitch of other environmental sounds when voice feedback is present. .
Interventions for preventing voice disorders in adults.
Ruotsalainen, J H; Sellman, J; Lehto, L; Jauhiainen, M; Verbeek, J H
2007-10-17
Poor voice quality due to a voice disorder can lead to a reduced quality of life. In occupations where voice use is substantial it can lead to periods of absence from work. To evaluate the effectiveness of interventions to prevent voice disorders in adults. We searched MEDLINE (PubMed, 1950 to 2006), EMBASE (1974 to 2006), CENTRAL (The Cochrane Library, Issue 2 2006), CINAHL (1983 to 2006), PsychINFO (1967 to 2006), Science Citation Index (1986 to 2006) and the Occupational Health databases OSH-ROM (to 2006). The date of the last search was 05/04/06. Randomised controlled clinical trials (RCTs) of interventions evaluating the effectiveness of treatments to prevent voice disorders in adults. For work-directed interventions interrupted time series and prospective cohort studies were also eligible. Two authors independently extracted data and assessed trial quality. Meta-analysis was performed where appropriate. We identified two randomised controlled trials including a total of 53 participants in intervention groups and 43 controls. One study was conducted with teachers and the other with student teachers. Both trials were poor quality. Interventions were grouped into 1) direct voice training, 2) indirect voice training and 3) direct and indirect voice training combined.1) Direct voice training: One study did not find a significant decrease of the Voice Handicap Index for direct voice training compared to no intervention.2) Indirect voice training: One study did not find a significant decrease of the Voice Handicap Index for indirect voice training when compared to no intervention.3) Direct and indirect voice training combined: One study did not find a decrease of the Voice Handicap Index for direct and indirect voice training combined when compared to no intervention. The same study did however find an improvement in maximum phonation time (Mean Difference -3.18 sec; 95 % CI -4.43 to -1.93) for direct and indirect voice training combined when compared to no intervention. No work-directed studies were found. None of the studies found evaluated the effectiveness of prevention in terms of sick leave or number of diagnosed voice disorders. We found no evidence that either direct or indirect voice training or the two combined are effective in improving self-reported vocal functioning when compared to no intervention. The current practice of giving training to at-risk populations for preventing the development of voice disorders is therefore not supported by definitive evidence of effectiveness. Larger and methodologically better trials are needed with outcome measures that better reflect the aims of interventions.
Suggestions for Layout and Functional Behavior of Software-Based Voice Switch Keysets
NASA Technical Reports Server (NTRS)
Scott, David W.
2010-01-01
Marshall Space Flight Center (MSFC) provides communication services for a number of real time environments, including Space Shuttle Propulsion support and International Space Station (ISS) payload operations. In such settings, control team members speak with each other via multiple voice circuits or loops. Each loop has a particular purpose and constituency, and users are assigned listen and/or talk capabilities for a given loop based on their role in fulfilling the purpose. A voice switch is a given facility's hardware and software that supports such communication, and may be interconnected with other facilities switches to create a large network that, from an end user perspective, acts like a single system. Since users typically monitor and/or respond to several voice loops concurrently for hours on end and real time operations can be very dynamic and intense, it s vital that a control panel or keyset for interfacing with the voice switch be a servant that reduces stress, not a master that adds it. Implementing the visual interface on a computer screen provides tremendous flexibility and configurability, but there s a very real risk of overcomplication. (Remember how office automation made life easier, which led to a deluge of documents that made life harder?) This paper a) discusses some basic human factors considerations related to keysets implemented as application software windows, b) suggests what to standardize at the facility level and what to leave to the user's preference, and c) provides screen shot mockups for a robust but reasonably simple user experience. Concepts apply to keyset needs in almost any type of operations control or support center.
Liu, Hanjun; Wang, Emily Q.; Chen, Zhaocong; Liu, Peng; Larson, Charles R.; Huang, Dongfeng
2010-01-01
The purpose of this cross-language study was to examine whether the online control of voice fundamental frequency (F0) during vowel phonation is influenced by language experience. Native speakers of Cantonese and Mandarin, both tonal languages spoken in China, participated in the experiments. Subjects were asked to vocalize a vowel sound ∕u∕ at their comfortable habitual F0, during which their voice pitch was unexpectedly shifted (±50, ±100, ±200, or ±500 cents, 200 ms duration) and fed back instantaneously to them over headphones. The results showed that Cantonese speakers produced significantly smaller responses than Mandarin speakers when the stimulus magnitude varied from 200 to 500 cents. Further, response magnitudes decreased along with the increase in stimulus magnitude in Cantonese speakers, which was not observed in Mandarin speakers. These findings suggest that online control of voice F0 during vocalization is sensitive to language experience. Further, systematic modulations of vocal responses across stimulus magnitude were observed in Cantonese speakers but not in Mandarin speakers, which indicates that this highly automatic feedback mechanism is sensitive to the specific tonal system of each language. PMID:21218905
Misono, Stephanie; Meredith, Liza; Peterson, Carol B; Frazier, Patricia A
2016-03-01
Although an association between psychosocial distress (depression, anxiety, somatization, and perceived stress) and voice disorders has been observed, little is known about the relationship between distress and patient-reported voice handicap. Furthermore, the psychological mechanisms underlying this relationship are poorly understood. Perceived control plays an important role in distress associated with other medical disorders. The objectives of this study were to (1) characterize the relationship between distress and patient-reported voice handicap and (2) examine the role of perceived control in this relationship. This is a cross-sectional study in a tertiary care academic voice clinic. Distress, perceived stress, voice handicap, and perceived control were measured using established assessment scales. Association was measured with Pearson correlation coefficients; moderation was assessed using multiple hierarchical regression. A total of 533 patients enrolled. Thirty-four percent of the patients met criteria for clinically significant distress (ie, depression, anxiety, and/or somatization). A weak association (r = 0.13; P = 0.003) was observed between severity of psychosocial distress and vocal handicap. Present perceived control was inversely associated with distress (r = -0.41; P < 0.0001), stress (r = -0.30; P < 0.0001), and voice handicap (r = -0.30; P < 0.0001). The relationship between voice handicap and psychosocial distress was moderated by perceived control (b for interaction term, -0.15; P < 0.001); greater vocal handicap was associated with greater distress in patients with low perceived control. Severity of distress and vocal handicap were positively related, and the relation between them was moderated by perceived control. Vocal handicap was more related to distress among those with low perceived control; targeting this potential mechanism may facilitate new approaches for improved care. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Föcker, Julia; Best, Anna; Hölig, Cordula; Röder, Brigitte
2012-07-01
Blind people rely much more on voices compared to sighted individuals when identifying other people. Previous research has suggested a faster processing of auditory input in blind individuals than sighted controls and an enhanced activation of temporal cortical regions during voice processing. The present study used event-related potentials (ERPs) to single out the sub-processes of auditory person identification that change and allow for superior voice processing after congenital blindness. A priming paradigm was employed in which two successive voices (S1 and S2) of either the same (50% of the trials) or different actors were presented. Congenitally blind and matched sighted participants made an old-young decision on the S2. During the pre-experimental familiarization with the stimuli, congenitally blind individuals showed faster learning rates than sighted controls. Reaction times were shorter in person-congruent trials than in person-incongruent trials in both groups. ERPs to S2 stimuli in person-incongruent as compared to person-congruent trials were significantly enhanced at early processing stages (100-160 ms) in congenitally blind participants only. A later negative ERP effect (>200 ms) was found in both groups. The scalp topographies of the experimental effects were characterized by a central and parietal distribution in the sighted but a more posterior distribution in the congenitally blind. These results provide evidence for an improvement of early voice processing stages and a reorganization of the person identification system as a neural correlate of compensatory behavioral improvements following congenital blindness. Copyright © 2012 Elsevier Ltd. All rights reserved.
The Lincoln Training System: A Summary Report.
ERIC Educational Resources Information Center
Butman, Robert C.; Frick, Frederick C.
The current status of the Lincoln Training System (LTS) is reported. This document describes LTS as a computer supported microfiche system which: 1) provides random access to voice quality audio and to graphics; 2) supports student-controlled interactive processes; and 3) functions in a variety of environments. The report offers a detailed…
NASA Technical Reports Server (NTRS)
1983-01-01
NASA research and design has significantly improved crew alert systems. The Engine Indication and Crew Alerting System (EICAS), developed by Psycho-Linguistic Research Associates, is technologically advanced and able to order alerts by priority. Ames has also developed computer controlled voice synthesizers for readouts during difficult landing approaches. This is available to airplane manufacturers.
Processing of voices in deafness rehabilitation by auditory brainstem implant.
Coez, Arnaud; Zilbovicius, Monica; Ferrary, Evelyne; Bouccara, Didier; Mosnier, Isabelle; Ambert-Dahan, Emmanuèle; Kalamarides, Michel; Bizaguet, Eric; Syrota, André; Samson, Yves; Sterkers, Olivier
2009-10-01
The superior temporal sulcus (STS) is specifically involved in processing the human voice. Profound acquired deafness by post-meningitis ossified cochlea and by bilateral vestibular schwannoma in neurofibromatosis type 2 patients are two indications for auditory brainstem implantation (ABI). In order to objectively measure the cortical voice processing of a group of ABI patients, we studied the activation of the human temporal voice areas (TVA) by PET H(2)(15)O, performed in a group of implanted deaf adults (n=7) with more than two years of auditory brainstem implant experience, with an intelligibility score average of 17%+/-17 [mean+/-SD]. Relative cerebral blood flow (rCBF) was measured in the three following conditions: during silence, while passive listening to human voice, and to non-voice stimuli. Compared to silence, the activations induced by voice and non-voice stimuli were bilaterally located in the superior temporal regions. However, compared to non-voice stimuli, the voice stimuli did not induce specific supplementary activation of the TVA along the STS. The comparison of ABI group with a normal-hearing controls group (n=7) showed that TVA activations were significantly enhanced among controls group. ABI allowed the transmission of sound stimuli to temporal brain regions but lacked transmitting the specific cues of the human voice to the TVA. Moreover, among groups, during silent condition, brain visual regions showed higher rCBF in ABI group, although temporal brain regions had higher rCBF in the controls group. ABI patients had consequently developed enhanced visual strategies to keep interacting with their environment.
Speech versus manual control of camera functions during a telerobotic task
NASA Technical Reports Server (NTRS)
Bierschwale, John M.; Sampaio, Carlos E.; Stuart, Mark A.; Smith, Randy L.
1993-01-01
This investigation has evaluated the voice-commanded camera control concept. For this particular task, total voice control of continuous and discrete camera functions was significantly slower than manual control. There was no significant difference between voice and manual input for several types of errors. There was not a clear trend in subjective preference of camera command input modality. Task performance, in terms of both accuracy and speed, was very similar across both levels of experience.
Meredith, Liza; Peterson, Carol B.; Frazier, Patricia A.
2015-01-01
Objectives Although an association between psychosocial distress (depression, anxiety, somatization, and perceived stress) and voice disorders has been observed, little is known about the relationship between distress and patient-reported voice handicap. Further, the psychological mechanisms underlying this relationship are poorly understood. Perceived control plays an important role in distress associated with other medical disorders. The objectives of this study were to 1) characterize the relationship between distress and patient-reported voice handicap and 2) examine the role of perceived control in this relationship. Study Design Cross-sectional study in tertiary care academic voice clinic. Methods Distress, perceived stress, voice handicap, and perceived control were measured using established assessment scales. Association was measured with Pearson’s correlation coefficient; moderation was assessed using multiple hierarchical regression. Results 533 patients enrolled. 34% met criteria for clinically significant distress (i.e., depression, anxiety, and/or somatization). A weak association (r=0.13, p=0.003) was observed between severity of psychosocial distress and vocal handicap. Present perceived control was inversely associated with distress (r=−0.41, p<0.0001), stress (r=−0.30, p<0.0001), and voice handicap (r=−0.30, p<0.0001). The relationship between voice handicap and psychosocial distress was moderated by perceived control (b for interaction term −0.15, p<0.001); greater vocal handicap was associated with greater distress in patients with low perceived control. Conclusions Severity of distress and vocal handicap were positively related, and the relation between them was moderated by perceived control. Vocal handicap was more related to distress among those with low perceived control; targeting this potential mechanism may facilitate new approaches for improved care. PMID:25795347
The Glasgow Voice Memory Test: Assessing the ability to memorize and recognize unfamiliar voices.
Aglieri, Virginia; Watson, Rebecca; Pernet, Cyril; Latinus, Marianne; Garrido, Lúcia; Belin, Pascal
2017-02-01
One thousand one hundred and twenty subjects as well as a developmental phonagnosic subject (KH) along with age-matched controls performed the Glasgow Voice Memory Test, which assesses the ability to encode and immediately recognize, through an old/new judgment, both unfamiliar voices (delivered as vowels, making language requirements minimal) and bell sounds. The inclusion of non-vocal stimuli allows the detection of significant dissociations between the two categories (vocal vs. non-vocal stimuli). The distributions of accuracy and sensitivity scores (d') reflected a wide range of individual differences in voice recognition performance in the population. As expected, KH showed a dissociation between the recognition of voices and bell sounds, her performance being significantly poorer than matched controls for voices but not for bells. By providing normative data of a large sample and by testing a developmental phonagnosic subject, we demonstrated that the Glasgow Voice Memory Test, available online and accessible from all over the world, can be a valid screening tool (~5 min) for a preliminary detection of potential cases of phonagnosia and of "super recognizers" for voices.
Spirituality and hearing voices: considering the relation
McCarthy-Jones, Simon; Waegeli, Amanda; Watkins, John
2013-01-01
For millennia, some people have heard voices that others cannot hear. These have been variously understood as medical, psychological and spiritual phenomena. In this article we consider the specific role of spirituality in voice-hearing in two ways. First, we examine how spirituality may help or hinder people who hear voices. Benefits are suggested to include offering an alternative meaning to the experience which can give more control and comfort, enabling the development of specific coping strategies, increasing social support, and encouraging forgiveness. Potential drawbacks are noted to include increased distress and reduced control resulting from placing frightening or coercive constructions on voices, social isolation, the development of dysfunctional beliefs, and missed/delayed opportunities for successful mental health interventions. After examining problems surrounding classifying voices as either spiritual or psychotic, we move beyond an essentialist position to examine how such a classification is likely to be fluid, and how a given voice may move between these designations. We also highlight tensions between modernist and postmodernist approaches to voice-hearing. PMID:24273597
Vocal Identity Recognition in Autism Spectrum Disorder
Lin, I-Fan; Yamada, Takashi; Komine, Yoko; Kato, Nobumasa; Kato, Masaharu; Kashino, Makio
2015-01-01
Voices can convey information about a speaker. When forming an abstract representation of a speaker, it is important to extract relevant features from acoustic signals that are invariant to the modulation of these signals. This study investigated the way in which individuals with autism spectrum disorder (ASD) recognize and memorize vocal identity. The ASD group and control group performed similarly in a task when asked to choose the name of the newly-learned speaker based on his or her voice, and the ASD group outperformed the control group in a subsequent familiarity test when asked to discriminate the previously trained voices and untrained voices. These findings suggest that individuals with ASD recognized and memorized voices as well as the neurotypical individuals did, but they categorized voices in a different way: individuals with ASD categorized voices quantitatively based on the exact acoustic features, while neurotypical individuals categorized voices qualitatively based on the acoustic patterns correlated to the speakers' physical and mental properties. PMID:26070199
Vocal Identity Recognition in Autism Spectrum Disorder.
Lin, I-Fan; Yamada, Takashi; Komine, Yoko; Kato, Nobumasa; Kato, Masaharu; Kashino, Makio
2015-01-01
Voices can convey information about a speaker. When forming an abstract representation of a speaker, it is important to extract relevant features from acoustic signals that are invariant to the modulation of these signals. This study investigated the way in which individuals with autism spectrum disorder (ASD) recognize and memorize vocal identity. The ASD group and control group performed similarly in a task when asked to choose the name of the newly-learned speaker based on his or her voice, and the ASD group outperformed the control group in a subsequent familiarity test when asked to discriminate the previously trained voices and untrained voices. These findings suggest that individuals with ASD recognized and memorized voices as well as the neurotypical individuals did, but they categorized voices in a different way: individuals with ASD categorized voices quantitatively based on the exact acoustic features, while neurotypical individuals categorized voices qualitatively based on the acoustic patterns correlated to the speakers' physical and mental properties.
Temporal Lobe Epilepsy Alters Auditory-motor Integration For Voice Control
Li, Weifeng; Chen, Ziyi; Yan, Nan; Jones, Jeffery A.; Guo, Zhiqiang; Huang, Xiyan; Chen, Shaozhen; Liu, Peng; Liu, Hanjun
2016-01-01
Temporal lobe epilepsy (TLE) is the most common drug-refractory focal epilepsy in adults. Previous research has shown that patients with TLE exhibit decreased performance in listening to speech sounds and deficits in the cortical processing of auditory information. Whether TLE compromises auditory-motor integration for voice control, however, remains largely unknown. To address this question, event-related potentials (ERPs) and vocal responses to vocal pitch errors (1/2 or 2 semitones upward) heard in auditory feedback were compared across 28 patients with TLE and 28 healthy controls. Patients with TLE produced significantly larger vocal responses but smaller P2 responses than healthy controls. Moreover, patients with TLE exhibited a positive correlation between vocal response magnitude and baseline voice variability and a negative correlation between P2 amplitude and disease duration. Graphical network analyses revealed a disrupted neuronal network for patients with TLE with a significant increase of clustering coefficients and path lengths as compared to healthy controls. These findings provide strong evidence that TLE is associated with an atypical integration of the auditory and motor systems for vocal pitch regulation, and that the functional networks that support the auditory-motor processing of pitch feedback errors differ between patients with TLE and healthy controls. PMID:27356768
Voice Response Systems Technology.
ERIC Educational Resources Information Center
Gerald, Jeanette
1984-01-01
Examines two methods of generating synthetic speech in voice response systems, which allow computers to communicate in human terms (speech), using human interface devices (ears): phoneme and reconstructed voice systems. Considerations prior to implementation, current and potential applications, glossary, directory, and introduction to Input Output…
Abrams, Daniel A.; Chen, Tianwen; Odriozola, Paola; Cheng, Katherine M.; Baker, Amanda E.; Padmanabhan, Aarthi; Ryali, Srikanth; Kochalka, John; Feinstein, Carl; Menon, Vinod
2016-01-01
The human voice is a critical social cue, and listeners are extremely sensitive to the voices in their environment. One of the most salient voices in a child’s life is mother's voice: Infants discriminate their mother’s voice from the first days of life, and this stimulus is associated with guiding emotional and social function during development. Little is known regarding the functional circuits that are selectively engaged in children by biologically salient voices such as mother’s voice or whether this brain activity is related to children’s social communication abilities. We used functional MRI to measure brain activity in 24 healthy children (mean age, 10.2 y) while they attended to brief (<1 s) nonsense words produced by their biological mother and two female control voices and explored relationships between speech-evoked neural activity and social function. Compared to female control voices, mother’s voice elicited greater activity in primary auditory regions in the midbrain and cortex; voice-selective superior temporal sulcus (STS); the amygdala, which is crucial for processing of affect; nucleus accumbens and orbitofrontal cortex of the reward circuit; anterior insula and cingulate of the salience network; and a subregion of fusiform gyrus associated with face perception. The strength of brain connectivity between voice-selective STS and reward, affective, salience, memory, and face-processing regions during mother’s voice perception predicted social communication skills. Our findings provide a novel neurobiological template for investigation of typical social development as well as clinical disorders, such as autism, in which perception of biologically and socially salient voices may be impaired. PMID:27185915
Abrams, Daniel A; Chen, Tianwen; Odriozola, Paola; Cheng, Katherine M; Baker, Amanda E; Padmanabhan, Aarthi; Ryali, Srikanth; Kochalka, John; Feinstein, Carl; Menon, Vinod
2016-05-31
The human voice is a critical social cue, and listeners are extremely sensitive to the voices in their environment. One of the most salient voices in a child's life is mother's voice: Infants discriminate their mother's voice from the first days of life, and this stimulus is associated with guiding emotional and social function during development. Little is known regarding the functional circuits that are selectively engaged in children by biologically salient voices such as mother's voice or whether this brain activity is related to children's social communication abilities. We used functional MRI to measure brain activity in 24 healthy children (mean age, 10.2 y) while they attended to brief (<1 s) nonsense words produced by their biological mother and two female control voices and explored relationships between speech-evoked neural activity and social function. Compared to female control voices, mother's voice elicited greater activity in primary auditory regions in the midbrain and cortex; voice-selective superior temporal sulcus (STS); the amygdala, which is crucial for processing of affect; nucleus accumbens and orbitofrontal cortex of the reward circuit; anterior insula and cingulate of the salience network; and a subregion of fusiform gyrus associated with face perception. The strength of brain connectivity between voice-selective STS and reward, affective, salience, memory, and face-processing regions during mother's voice perception predicted social communication skills. Our findings provide a novel neurobiological template for investigation of typical social development as well as clinical disorders, such as autism, in which perception of biologically and socially salient voices may be impaired.
Pilot study on the feasibility of a computerized speech recognition charting system.
Feldman, C A; Stevens, D
1990-08-01
The objective of this study was to determine the feasibility of developing and using a voice recognition computerized charting system to record dental clinical examination data. More specifically, the study was designed to analyze the time and error differential between the traditional examiner/recorder method (ASSISTANT) and computerized voice recognition method (VOICE). DMFS examinations were performed twice on 20 patients using the traditional ASSISTANT and the VOICE charting system. A statistically significant difference was found when comparing the mean ASSISTANT time of 2.69 min to the VOICE time of 3.72 min (P less than 0.001). No statistically significant difference was found when comparing the mean ASSISTANT recording errors of 0.1 to VOICE recording errors of 0.6 (P = 0.059). 90% of the patients indicated they felt comfortable with the dentist talking to a computer and only 5% of the sample indicated they opposed VOICE. Results from this pilot study indicate that a charting system utilizing voice recognition technology could be considered a viable alternative to traditional examiner/recorder methods of clinical charting.
Cannito, Michael P; Chorna, Lesya B; Kahane, Joel C; Dworkin, James P
2014-05-01
This study evaluated the hypotheses that sentence production by speakers with adductor (AD) and abductor (AB) spasmodic dysphonia (SD) may be differentially influenced by consonant voicing and manner features, in comparison with healthy, matched, nondysphonic controls. This was a prospective, single blind study, using a between-groups, repeated measures design for the independent variables of perceived voice quality and sentence duration. Sixteen subjects with ADSD and 10 subjects with ABSD, as well as 26 matched healthy controls produced four short, simple sentences that were systematically loaded with voiced or voiceless consonants of either obstruant or continuant manner categories. Experienced voice clinicians, who were "blind" as to speakers' group affixations, used visual analog scaling to judge the overall voice quality of each sentence. Acoustic sentence durations were also measured. Speakers with ABSD or ADSD demonstrated significantly poorer than normal voice quality on all sentences. Speakers with ABSD exhibited longer than normal duration for voiceless consonant sentences. Speakers with ADSD had poorer voice quality for voiced than for voiceless consonant sentences. Speakers with ABSD had longer durations for voiceless than for voiced consonant sentences. The two subtypes of SD exhibit differential performance on the basis of consonant voicing in short, simple sentences; however, each subgroup manifested voicing-related differences on a different variable (voice quality vs sentence duration). Findings suggest different underlying pathophysiological mechanisms for ABSD and ADSD. Findings also support inclusion of short, simple sentences containing voiced or voiceless consonants as part of the diagnostic protocol for SD, with measurement of sentence duration in addition to judments of voice quality severity. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
The Mobile Agents Integrated Field Test: Mars Desert Research Station April 2003
NASA Technical Reports Server (NTRS)
Clancey, William J.; Sierhuis, Maarten; Alena, Rick; Crawford, Sekou; Dowding, John; Graham, Jeff; Kaskiris, Charis; Tyree, Kim S.; vanHoof, Ron
2003-01-01
The Mobile Agents model-based, distributed architecture, which integrates diverse components in a system for lunar and planetary surface operations, was extensively tested in a two-week field "technology retreat" at the Mars Society s Desert Research Station (MDRS) during April 2003. More than twenty scientists and engineers from three NASA centers and two universities refined and tested the system through a series of incremental scenarios. Agent software, implemented in runtime Brahms, processed GPS, health data, and voice commands-monitoring, controlling and logging science data throughout simulated EVAs with two geologists. Predefined EVA plans, modified on the fly by voice command, enabled the Mobile Agents system to provide navigation and timing advice. Communications were maintained over five wireless nodes distributed over hills and into canyons for 5 km; data, including photographs and status was transmitted automatically to the desktop at mission control in Houston. This paper describes the system configurations, communication protocols, scenarios, and test results.
Communications dashboard (control rooms, take a cue from Facebook® !) Chapter 1
NASA Astrophysics Data System (ADS)
Scott, David W.
Papers published via IEEE and AIAA conferences have presented an overview of how social media could benefit NASA working environments in general [1] and proposed three specific social applications to benefit space flight control operations [2]. One of them, Communications Dashboard, would help a real time flight controller keep up with both the “ big picture” and significant details of operations via a cohesive interface similar to those of social networking services (SNS). Instead of recreational social features, “ CommDash” would support functions like console logging, categorized and threaded text chat streams with enhanced accountability and graphics display features, high-level status displays driven by telemetry or other events, and an on-screen hailing function for requesting voice or text stream conversation. Moving certain voice conversations to text streams would reduce confusion and stress in two ways. Within text conversations, there would be far less repetition of content since text conversations have visual persistence and are reviewable instantly, e.g., there's no need to brief new participants to a discussion - they just read what's already there. Remaining voice traffic would stand out more clearly, and quieter voice loops means fewer “ say again” calls and less distraction from visual and mental tasks, thus less stress. (Most flight controllers monitor 4 or 5 voice loops at once.) Links could be created from console log entries to chat selections so that underlying details are readily available yet unobtrusive. This would reduce the confusion that rises from having multiple and sometimes divergent copies of the same information due to cut/copy and paste operations, attachments, and asynchronous editing. This concept could apply to a plethora of real time control environments and to other settings with lots of information juggling. This paper explores the dashboard concept in further detail and chronic- es the first phase of a NASA IT Labs (Information Technology) project that could lead to a working system.
Communications Dashboard (Control Rooms Take a Cue from Facebook), Chapter 1
NASA Technical Reports Server (NTRS)
Scott, David w.
2013-01-01
Papers published via IEEE and AIAA conferences have presented an overview of how social media could benefit NASA working environments in general and proposed three specific social applications to benefit space flight control operations. One of them, Communications Dashboard, would help a real time flight controller keep up with both the "big picture" and significant details of operations via a cohesive interface similar to those of social networking services (SNS). Instead of recreational social features, "CommDash" would support functions like console logging, categorized and threaded text chat streams with enhanced accountability and graphics display features, high-level status displays driven by telemetry or other events, and an on-screen hailing function for requesting voice or text stream conversation. Moving certain voice conversations to text streams would reduce confusion and stress in two ways. Within text conversations, there would be far less repetition of content since text conversations have visual persistence and are reviewable instantly, e.g., there s no need to brief new participants to a discussion -- they just read what s already there. Remaining voice traffic would stand out more clearly, and quieter voice loops means fewer "say again" calls and less distraction from visual and mental tasks, thus less stress. (Most flight controllers monitor 4 or 5 voice loops at once.) Links could be created from console log entries to chat selections so that underlying details are readily available yet unobtrusive. This would reduce the confusion that rises from having multiple and sometimes divergent copies of the same information due to cut/copy and paste operations, attachments, and asynchronous editing. This concept could apply to a plethora of real time control environments and to other settings with lots of information juggling. This paper explores the dashboard concept in further detail and chronicles the first phase of a NASA IT Labs (Information Technology) project that could lead to a working system
Petrovic-Lazic, Mirjana; Jovanovic, Nadica; Kulic, Milan; Babac, Snezana; Jurisic, Vladimir
2015-03-01
The aim of the study was to assess the effect of endolaryngeal phonomicrosurgery (EPM) and voice therapy in patients with vocal fold polyps using perceptual and acoustic analysis before and after both therapies. The acoustic tests and perceptual evaluation of voice were carried out on 41 female patients with vocal fold polyp before and after EPM and voice therapy. Both therapy strategies were performed. Used acoustic parameters were Jitter percent (Jitt), pitch perturbation quotient (PPQ), shimmer percent (Shim), amplitude perturbation quotient (APQ), fundamental frequency variation (vF0), noise-to-harmonic ratio (NHR), Voice Turbulence Index (VTI). For perceptual evaluation, GRB scale was used. Results indicated higher values of investigated parameters in patients' group than in the control group (P < 0.01). Good correlation between the perceptual hoarseness factors of GRB scale and objective acoustic voice parameters were observed. All analyzed acoustic parameters improved after the phonomicrosurgery and voice therapy and tend to approach to values of the control group. For Jitt percent, Shim percent, vF0, VTI, and NHR, there were statistically significant differences. Perceptual voice evaluation revealed statistically significantly (P < 0.01) decreased rating of G (grade), R (rough) and B (breathy) after surgery and voice therapy. Our data indicated that both acoustic and perceptual characteristic of voice in patients with vocal polyps significantly improved after phonomicrosurgical and voice treatment. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Roy, Nelson; Weinrich, Barbara; Gray, Steven D; Tanner, Kristine; Toledo, Sue Walker; Dove, Heather; Corbin-Lewis, Kim; Stemple, Joseph C
2002-08-01
Voice problems are common among schoolteachers. This prospective, randomized clinical trial used patient-based treatment outcomes measures combined with acoustic analysis to evaluate the effectiveness of two treatment programs. Forty-four voice-disordered teachers were randomly assigned to one of three groups: voice amplification using the ChatterVox portable amplifier (VA, n = 15), vocal hygiene (VH, n = 15), and a nontreatment control group (n = 14). Before and after a 6-week treatment phase, all teachers completed: (a) the Voice Handicap Index (VHI), an instrument designed to appraise the self-perceived psychosocial consequences of voice disorders; (b) a voice severity self-rating scale; and (c) an audiorecording for later acoustic analysis. Based on pre- and posttreatment comparisons, only the amplification group experienced significant reductions on mean VHI scores (p = .045), voice severity self-ratings (p = .012), and the acoustic measures of percent jitter (p = .031) and shimmer (p = .008). The nontreatment control group reported a significant increase in level of vocal handicap as assessed by the VHI (p = .012). Although most pre- to posttreatment changes were in the desired direction, no significant improvements were observed within the VH group on any of the dependent measures. Between-group comparisons involving the three possible pairings of the groups revealed a pattern of results to suggest that: (a) compared to the control group, both treatment groups (i.e., VA and VH) experienced significantly more improvement on specific outcomes measures and (b) there were no significant differences between the VA and VH groups to indicate superiority of one treatment over another. Results, however, from a posttreatment questionnaire regarding the perceived benefits of treatment revealed that, compared to the VH group, the VA group reported more clarity of their speaking and singing voice (p = .061), greater ease of voice production (p = .001), and greater compliance with the treatment program (p = .045). These findings clearly support the clinical utility of voice amplification as an alternative for the treatment of voice problems in teachers.
Evidence for Auditory-Motor Impairment in Individuals with Hyperfunctional Voice Disorders
ERIC Educational Resources Information Center
Stepp, Cara E.; Lester-Smith, Rosemary A.; Abur, Defne; Daliri, Ayoub; Noordzij, J. Pieter; Lupiani, Ashling A.
2017-01-01
Purpose: The vocal auditory-motor control of individuals with hyperfunctional voice disorders was examined using a sensorimotor adaptation paradigm. Method: Nine individuals with hyperfunctional voice disorders and 9 individuals with typical voices produced sustained vowels over 160 trials in 2 separate conditions: (a) while experiencing gradual…
An Analysis of TRACON (Terminal Radar Approach Control) Controller-Pilot Voice Communication
DOT National Transportation Integrated Search
1996-06-01
The purpose of this analysis was to examine pilot-controller communication practices in the TRACONI (Terminal Radar Approach : Control) environment. Forty-eight hours of communications recorded on the voice tapes from eight TRACONs were analyzed. : T...
NASA Technical Reports Server (NTRS)
Degaudenzi, R.; Elia, C.; Viola, R.
1990-01-01
Discussed here is a new approach to code division multiple access applied to a mobile system for voice (and data) services based on Band Limited Quasi Synchronous Code Division Multiple Access (BLQS-CDMA). The system requires users to be chip synchronized to reduce the contribution of self-interference and to make use of voice activation in order to increase the satellite power efficiency. In order to achieve spectral efficiency, Nyquist chip pulse shaping is used with no detection performance impairment. The synchronization problems are solved in the forward link by distributing a master code, whereas carrier forced activation and closed loop control techniques have been adopted in the return link. System performance sensitivity to nonlinear amplification and timing/frequency synchronization errors are analyzed.
The persuasiveness of synthetic speech versus human speech.
Stern, S E; Mullennix, J W; Dyson, C; Wilson, S J
1999-12-01
Is computer-synthesized speech as persuasive as the human voice when presenting an argument? After completing an attitude pretest, 193 participants were randomly assigned to listen to a persuasive appeal under three conditions: a high-quality synthesized speech system (DECtalk Express), a low-quality synthesized speech system (Monologue), and a tape recording of a human voice. Following the appeal, participants completed a posttest attitude survey and a series of questionnaires designed to assess perceptions of speech qualities, perceptions of the speaker, and perceptions of the message. The human voice was generally perceived more favorably than the computer-synthesized voice, and the speaker was perceived more favorably when the voice was a human voice than when it was computer synthesized. There was, however, no evidence that computerized speech, as compared with the human voice, affected persuasion or perceptions of the message. Actual or potential applications of this research include issues that should be considered when designing synthetic speech systems.
Varieties of Voice-Hearing: Psychics and the Psychosis Continuum
Powers, Albert R.; Kelley, Megan S.; Corlett, Philip R.
2017-01-01
Hearing voices that are not present is a prominent symptom of serious mental illness. However, these experiences may be common in the non-help-seeking population, leading some to propose the existence of a continuum of psychosis from health to disease. Thus far, research on this continuum has focused on what is impaired in help-seeking groups. Here we focus on protective factors in non-help-seeking voice-hearers. We introduce a new study population: clairaudient psychics who receive daily auditory messages. We conducted phenomenological interviews with these subjects, as well as with patients diagnosed with a psychotic disorder who hear voices, people with a diagnosis of a psychotic disorder who do not hear voices, and matched control subjects (without voices or a diagnosis). We found the hallucinatory experiences of psychic voice-hearers to be very similar to those of patients who were diagnosed. We employed techniques from forensic psychiatry to conclude that the psychics were not malingering. Critically, we found that this sample of non-help-seeking voice hearers were able to control the onset and offset of their voices, that they were less distressed by their voice-hearing experiences and that, the first time they admitted to voice-hearing, the reception by others was much more likely to be positive. Patients had much more negative voice-hearing experiences, were more likely to receive a negative reaction when sharing their voices with others for the first time, and this was subsequently more disruptive to their social relationships. We predict that this sub-population of healthy voice-hearers may have much to teach us about the neurobiology, cognitive psychology and ultimately the treatment of voices that are distressing. PMID:28053132
Valadez, Victor; Ysunza, Antonio; Ocharan-Hernandez, Esther; Garrido-Bustamante, Norma; Sanchez-Valerio, Araceli; Pamplona, Ma C
2012-09-01
Vocal Nodules (VN) are a functional voice disorder associated with voice misuse and abuse in children. There are few reports addressing vocal parameters in children with VN, especially after a period of vocal rehabilitation. The purpose of this study is to describe measurements of vocal parameters including Fundamental Frequency (FF), Shimmer (S), and Jitter (J), videonasolaryngoscopy examination and clinical perceptual assessment, before and after voice therapy in children with VN. Voice therapy was provided using visual support through Speech-Viewer software. Twenty patients with VN were studied. An acoustical analysis of voice was performed and compared with data from subjects from a control group matched by age and gender. Also, clinical perceptual assessment of voice and videonasolaryngoscopy were performed to all patients with VN. After a period of voice therapy, provided with visual support using Speech Viewer-III (SV-III-IBM) software, new acoustical analyses, perceptual assessments and videonasolaryngoscopies were performed. Before the onset of voice therapy, there was a significant difference (p<0.05) in mean FF, S and J, between the patients with VN and subjects from the control group. After the voice therapy period, a significant improvement (p<0.05) was found in all acoustic voice parameters. Moreover, perceptual voice analysis demonstrated improvement in all cases. Finally, videonasolaryngoscopy demonstrated that vocal nodules were no longer discernible on the vocal folds in any of the cases. SV-III software seems to be a safe and reliable method for providing voice therapy in children with VN. Acoustic voice parameters, perceptual data and videonasolaryngoscopy were significantly improved after the speech therapy period was completed. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Code of Federal Regulations, 2010 CFR
2010-01-01
.... Hurley Air Traffic Control System Command Center. It is responsible for the administration of... 6). “Enhanced Computer Voice Reservation System (e-CVRS)” is the system used by the FAA to make... charter flights; hired aircraft service; ferry flights; and other non-passenger flights. Section 3...
Installing an Integrated Information System in a Centralized Network.
ERIC Educational Resources Information Center
Mendelson, Andrew D.
1992-01-01
Many schools are looking at ways to centralize the distribution and retrieval of video, voice, and data transmissions in an integrate information system (IIS). A centralized system offers greater control of hardware and software. Describes media network planning to retrofit an Illinois' high school with a fiber optic-based IIS. (MLF)
Investigation of air transportation technology at Princeton University, 1985
NASA Technical Reports Server (NTRS)
Stengel, Robert F.
1987-01-01
The program proceeded along five avenues during 1985. Guidance and control strategies for penetration of microbursts and wind shear, application of artificial intelligence in flight control and air traffic control systems, the use of voice recognition in the cockpit, the effects of control saturation on closed-loop stability and response of open-loop unstable aircraft, and computer aided control system design are among the topics briefly considered. Areas of investigation relate to guidance and control of commercial transports as well as general aviation aircraft. Interaction between the flight crew and automatic systems is the subject of principal concern.
Is there an effect of dysphonic teachers' voices on children's processing of spoken language?
Rogerson, Jemma; Dodd, Barbara
2005-03-01
There is a vast body of literature on the causes, prevalence, implications, and issues of vocal dysfunction in teachers. However, the educational effect of teacher vocal impairment is largely unknown. The purpose of this study was to investigate the effect of impaired voice quality on children's processing of spoken language. One hundred and seven children (age range, 9.2 to 10.6, mean 9.8, SD 3.76 months) listened to three video passages, one read in a control voice, one in a mild dysphonic voice, and one in a severe dysphonic voice. After each video passage, children were asked to answer six questions, with multiple-choice answers. The results indicated that children's perceptions of speech across the three voice qualities differed, regardless of gender, IQ, and school attended. Performance in the control voice passages was better than performance in the mild and severe dysphonic voice passages. No difference was found between performance in the mild and severe dysphonic voice passages, highlighting that any form of vocal impairment is detrimental to children's speech processing and is therefore likely to have a negative educational effect. These findings, in light of the high rate of vocal dysfunction in teachers, further support the implementation of specific voice care education for those in the teaching profession.
A Voice-Based E-Examination Framework for Visually Impaired Students in Open and Distance Learning
ERIC Educational Resources Information Center
Azeta, Ambrose A.; Inam, Itorobong A.; Daramola, Olawande
2018-01-01
Voice-based systems allow users access to information on the internet over a voice interface. Prior studies on Open and Distance Learning (ODL) e-examination systems that make use of voice interface do not sufficiently exhibit intelligent form of assessment, which diminishes the rigor of examination. The objective of this paper is to improve on…
ERIC Educational Resources Information Center
Foster, Karen R.; Kersh, Mildred E.; Masztal, Nancy B.
This study investigated the way kindergarten classroom teachers delivered information to students to see if it affected the amount of information students could remember about the solar system. The study also examined whether this difference would be related to the degree of musical aptitude possessed by each student. The students were pretested…
Definition of problems of persons in sheltered care environments
NASA Technical Reports Server (NTRS)
Fetzner, W. N.
1979-01-01
Innovations in health care using aerospace technologies are described. Voice synthesizer and voice recognition technologies were used in developing voice controlled wheel chairs and optacons. Telephone interface modules are also described.
Validation and Adaptation of the Singing Voice Handicap Index for Egyptian Singing Voice.
Abou-Elsaad, Tamer; Baz, Hemmat; Afsah, Omayma; Abo-Elsoud, Hend
2017-01-01
Measuring the severity of a voice disorder is difficult. This can be achieved by both subjective and objective measures. The Voice Handicap Index is the most known and used self-rating tool for voice disorders. The Classical Singing Handicap Index (CSHI) is a self-administered questionnaire measuring the impact of vocal deviation on the quality of life of singers. The objective of this study was to develop an Arabic version of the CSHI and to test its validity and reliability in Egyptian singers with different singing styles with normal voice and with voice disorders. The interpreted version was administered to 70 Egyptian singers including artistic singers (classical and popular) and specialized singers (Quran reciters and priests) who were divided into 40 asymptomatic singers (control group) and 30 singers with voice disorders. Participants' responses were statistically analyzed to assess the validity and reliability, and to compare the patient group with the control group. Quran reciters, patients with no previous professional training, and patients with vocal fold lesions demonstrated the highest scores. The Arabic version of CSHI is found to be a reliable, valid, and sensitive self-assessment tool that can be used in the clinical practice for the evaluation of the impact of voice disorders on singing voice. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Acoustic analysis of voice in children with cleft palate and velopharyngeal insufficiency.
Villafuerte-Gonzalez, Rocio; Valadez-Jimenez, Victor M; Hernandez-Lopez, Xochiquetzal; Ysunza, Pablo Antonio
2015-07-01
Acoustic analysis of voice can provide instrumental data concerning vocal abnormalities. These findings can be used for monitoring clinical course in cases of voice disorders. Cleft palate severely affects the structure of the vocal tract. Hence, voice quality can also be also affected. To study whether the main acoustic parameters of voice, including fundamental frequency, shimmer and jitter are significantly different in patients with a repaired cleft palate, as compared with normal children without speech, language and voice disorders. Fourteen patients with repaired unilateral cleft lip and palate and persistent or residual velopharyngeal insufficiency (VPI) were studied. A control group was assembled with healthy volunteer subjects matched by age and gender. Hypernasality and nasal emission were perceptually assessed in patients with VPI. Size of the gap as assessed by videonasopharyngoscopy was classified in patients with VPI. Acoustic analysis of voice including Fundamental frequency (F0), shimmer and jitter were compared between patients with VPI and control subjects. F0 was significantly higher in male patients as compared with male controls. Shimmer was significantly higher in patients with VPI regardless of gender. Moreover, patients with moderate VPI showed a significantly higher shimmer perturbation, regardless of gender. Although future research regarding voice disorders in patients with VPI is needed, at the present time it seems reasonable to include strategies for voice therapy in the speech and language pathology intervention plan for patients with VPI. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Arrabito, G R; McFadden, S M; Crabtree, R B
2001-07-01
Auditory speech thresholds were measured in this study. Subjects were required to discriminate a female voice recording of three-digit numbers in the presence of diotic speech babble. The voice stimulus was spatialized at 11 static azimuth positions on the horizontal plane using three different head-related transfer functions (HRTFs) measured on individuals who did not participate in this study. The diotic presentation of the voice stimulus served as the control condition. The results showed that two of the HRTFS performed similarly and had significantly lower auditory speech thresholds than the third HRTF. All three HRTFs yielded significantly lower auditory speech thresholds compared with the diotic presentation of the voice stimulus, with the largest difference at 60 degrees azimuth. The practical implications of these results suggest that lower headphone levels of the communication system in military aircraft can be achieved without sacrificing intelligibility, thereby lessening the risk of hearing loss.
[Voice disorders related to job stress in teaching: a case-control study].
Giannini, Susana Pimentel Pinto; Latorre, Maria do Rosário Dias de Oliveira; Ferreira, Leslie Piccolotto
2012-11-01
This case-control study aimed to test the association between voice disorders and job stress among public schoolteachers in São Paulo, Brazil. The groups consisted of teachers with (n = 165) and without (n = 105) voice-related complaints. Both groups answered the questionnaires Conditions of Vocal Production and Job Stress Scale. Analysis of cases and controls showed comparable samples, differing only by vocal symptoms. There was a statistically significant difference between cases and controls in relation to job stress involving high strain (OR = 2.1; 95%CI: 1.1-3.9), which places high demands combined with low job control. High strain in cases in this study represents the highest risk of physical and mental disorders for teachers. Loss of voice prevents teachers from continuing in their professional role, eliminates their professional identity, and jeopardizes their career.
NASA Technical Reports Server (NTRS)
Dunbar, Melisa; McGann, Alison; Mackintosh, Margaret-Anne; Lozito, Sandra; Ashford, Rose (Technical Monitor)
2001-01-01
A simulation in the B747-400 was conducted at NASA Ames Research Center that compared how crews handled voice and data link air traffic control (ATC) messages in a single medium versus a mixed voice and data link ATC environment The interval between ATC messages was also varied to examine the influence of time pressure in voice, data link, and mixed ATC environments. For messages sent via voice, transaction times were lengthened in the mixed media environment for closely spaced messages. The type of environment did not affect data link times. However, messages times were lengthened in both single and mixed-modality environments under time pressure. Closely spaced messages also increased the number of requests for clarification for voice messages in the mixed environment and review menu use for data link messages. Results indicated that when time pressure is introduced, the mix of voice and data link does not necessarily capitalize on the advantages of both media. These findings emphasize the need to develop procedures for managing communication in mixed voice and data link environments.
Combined Functional Voice Therapy in Singers With Muscle Tension Dysphonia in Singing.
Sielska-Badurek, Ewelina; Osuch-Wójcikiewicz, Ewa; Sobol, Maria; Kazanecka, Ewa; Rzepakowska, Anna; Niemczyk, Kazimierz
2017-07-01
The purpose of this study was to evaluate vocal tract function and the voice quality in singers with muscle tension dysphonia (MTD) after undergoing combined functional voice therapy of the singing voice. This is a prospective, randomized study. Forty singers (29 females and 11 males, mean age: 24.6 ± 8.8 years) with MTD were enrolled in the study. The study group consisted of 20 singers who underwent combined functional voice therapy (10-15 individual sessions, 30-40 minutes each). Singers who did not opt for vocal rehabilitation consisted of the control group. Effects of rehabilitation were assessed with videolaryngostroboscopy, palpation of the vocal tract structures, flexible fiberoptic evaluation of the pharynx and the larynx, perceptual speaking and singing voice assessment, acoustic analysis, maximal phonation time, and the Voice Handicap Index. After combined functional voice therapy in the study group, great improvement was noticed in palpation of the vocal tract structures (P < 0.001), perceptual voice assessment (P < 0.001), phonetograms (P = 0.002), and singing range obtained from acoustic analysis of glissando (P < 0.001). In the control group, no statistically significant differences were found between the first and the second assessments. Combined functional voice therapy proved to be an efficacious treatment method in singers with MTD in singing. Development of palpation and perceptual singing voice examination protocols enables one to compare results before and after rehabilitation in clinics. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Comparison of FDMA and CDMA for second generation land-mobile satellite communications
NASA Technical Reports Server (NTRS)
Yongacoglu, A.; Lyons, R. G.; Mazur, B. A.
1990-01-01
Code Division Multiple Access (CDMA) and Frequency Division Multiple Access (FDMA) (both analog and digital) systems capacities are compared on the basis of identical link availabilities and physical propagation models. Parameters are optimized for a bandwidth limited, multibeam environment. For CDMA, the benefits of voice activated carriers, antenna discrimination, polarization reuse, return link power control and multipath suppression are included in the analysis. For FDMA, the advantages of bandwidth efficient modulation/coding combinations, voice activated carriers, polarization reuse, beam placement, and frequency staggering were taken into account.
NASA Technical Reports Server (NTRS)
Hinton, David A.; Lohr, Gary W.
1988-01-01
Studies have shown that radio communications between pilots and air traffic control contribute to high pilot workload and are subject to various errors. These errors result from congestion on the voice radio channel, and missed and misunderstood messages. The use of digital data link has been proposed as a means of reducing this workload and error rate. A critical factor, however, in determining the potential benefit of data link will be the interface between future data link systems and the operator of those systems, both in the air and on the ground. The purpose of this effort was to evaluate the pilot interface with various levels of data link capability, in simulated general aviation, single-pilot instrument flight rule operations. Results show that the data link reduced demands on pilots' short-term memory, reduced the number of communication transmissions, and permitted the pilots to more easily allocate time to critical cockpit tasks while receiving air traffic control messages. The pilots who participated unanimously indicated a preference for data link communications over voice-only communications. There were, however, situations in which the pilot preferred the use of voice communications, and the ability for pilots to delay processing the data link messages, during high workload events, caused delays in the acknowledgement of messages to air traffic control.
Andreas Vesalius' 500th Anniversary: Initial Integral Understanding of Voice Production.
Brinkman, Romy J; Hage, J Joris
2017-01-01
Voice production relies on the integrated functioning of a three-part system: respiration, phonation and resonance, and articulation. To commemorate the 500th anniversary of the great anatomist Andreas Vesalius (1515-1564), we report on his understanding of this integral system. The text of Vesalius' masterpiece De Humani Corporis Fabrica Libri Septum and an eyewitness report of the public dissection of three corpses by Vesalius in Bologna, Italy, in 1540, were searched for references to the voice-producing anatomical structures and their function. We clustered the traced, separate parts for the first time. We found that Vesalius recognized the importance for voice production of many details of the respiratory system, the voice box, and various structures of resonance and articulation. He stressed that voice production was a cerebral function and extensively recorded the innervation of the voice-producing organs by the cranial nerves. Vesalius was the first to publicly record the concept of voice production as an integrated and cerebrally directed function of respiration, phonation and resonance, and articulation. In doing so nearly 500 years ago, he laid a firm basis for the understanding of the physiology of voice production and speech and its management as we know it today. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Van Stan, Jarrad H.; Mehta, Daryush D.; Zeitels, Steven M.; Burns, James A.; Barbu, Anca M.; Hillman, Robert E.
2015-01-01
Objectives Clinical management of phonotraumatic vocal fold lesions (nodules, polyps) is based largely on assumptions that abnormalities in habitual levels of sound pressure level (SPL), fundamental frequency (f0), and/or amount of voice use play a major role in lesion development and chronic persistence. This study used ambulatory voice monitoring to evaluate if significant differences in voice use exist between patients with phonotraumatic lesions and normal matched controls. Methods Subjects were 70 adult females: 35 with vocal fold nodules or polyps and 35 age-, sex-, and occupation-matched normal individuals. Weeklong summary statistics of voice use were computed from anterior neck surface acceleration recorded using a smartphone-based ambulatory voice monitor. Results Paired t-tests and Kolmogorov-Smirnov tests resulted in no statistically significant differences between patients and matched controls regarding average measures of SPL, f0, vocal dose measures, and voicing/voice rest periods. Paired t-tests comparing f0 variability between the groups resulted in statistically significant differences with moderate effect sizes. Conclusions Individuals with phonotraumatic lesions did not exhibit differences in average ambulatory measures of vocal behavior when compared with matched controls. More refined characterizations of underlying phonatory mechanisms and other potentially contributing causes are warranted to better understand risk factors associated with phonotraumatic lesions. PMID:26024911
Scientific bases of human-machine communication by voice.
Schafer, R W
1995-01-01
The scientific bases for human-machine communication by voice are in the fields of psychology, linguistics, acoustics, signal processing, computer science, and integrated circuit technology. The purpose of this paper is to highlight the basic scientific and technological issues in human-machine communication by voice and to point out areas of future research opportunity. The discussion is organized around the following major issues in implementing human-machine voice communication systems: (i) hardware/software implementation of the system, (ii) speech synthesis for voice output, (iii) speech recognition and understanding for voice input, and (iv) usability factors related to how humans interact with machines. PMID:7479802
Dejonckere, P H; Wieneke, G H; Bloemenkamp, D; Lebacq, J
1996-04-01
Sustained phonations were compared in two groups of children (aged 7-12), one with special artistic voice education and one from a normal school, without voice complaints or problems. The hypothesis of specific (better) biomechanical vocal fold properties in the first group is confronted with the hypothesis of differences solely related to training of voice control. In both groups, Fo-aperiodicity was measured in a sustained phonation at 3 different SPL levels. As a general rule, aperiodicity clearly decreases when the voice becomes louder. Aperiodicity is highly significantly lower, at all SPL-levels, in children with trained singing voices: this implies better mechanical properties of the vocal oscillator. The Fo/SPL relation on a sustained /a:/ does not differ in trained and untrained children's voices: out of singing context, trained children do not spontaneously control the Fo/SPL dynamics differently from untrained children. The higher regularity of vocal fold pulses is not related to the duration of training.
Understanding The Neural Mechanisms Involved In Sensory Control Of Voice Production
Parkinson, Amy L.; Flagmeier, Sabina G.; Manes, Jordan L.; Larson, Charles R.; Rogers, Bill; Robin, Donald A.
2012-01-01
Auditory feedback is important for the control of voice fundamental frequency (F0). In the present study we used neuroimaging to identify regions of the brain responsible for sensory control of the voice. We used a pitch-shift paradigm where subjects respond to an alteration, or shift, of voice pitch auditory feedback with a reflexive change in F0. To determine the neural substrates involved in these audio-vocal responses, subjects underwent fMRI scanning while vocalizing with or without pitch-shifted feedback. The comparison of shifted and unshifted vocalization revealed activation bilaterally in the superior temporal gyrus (STG) in response to the pitch shifted feedback. We hypothesize that the STG activity is related to error detection by auditory error cells located in the superior temporal cortex and efference copy mechanisms whereby this region is responsible for the coding of a mismatch between actual and predicted voice F0. PMID:22406500
Practical applications of interactive voice technologies: Some accomplishments and prospects
NASA Technical Reports Server (NTRS)
Grady, Michael W.; Hicklin, M. B.; Porter, J. E.
1977-01-01
A technology assessment of the application of computers and electronics to complex systems is presented. Three existing systems which utilize voice technology (speech recognition and speech generation) are described. Future directions in voice technology are also described.
Utilization of Internet Protocol-Based Voice Systems in Remote Payload Operations
NASA Technical Reports Server (NTRS)
Best, Susan; Nichols, Kelvin; Bradford, Robert
2003-01-01
This viewgraph presentation provides an overview of a proposed voice communication system for use in remote payload operations performed on the International Space Station. The system, Internet Voice Distribution System (IVoDS), would make use of existing Internet protocols, and offer a number of advantages over the system currently in use. Topics covered include: system description and operation, system software and hardware, system architecture, project status, and technology transfer applications.
Körner Gustafsson, Joakim; Södersten, Maria; Ternström, Sten; Schalling, Ellika
2018-02-15
This study examines the effects of an intensive voice treatment focusing on increasing voice intensity, LSVT LOUD ® Lee Silverman Voice Treatment, on voice use in daily life in a participant with Parkinson's disease, using a portable voice accumulator, the VoxLog. A secondary aim was to compare voice use between the participant and a matched healthy control. Participants were an individual with Parkinson's disease and his healthy monozygotic twin. Voice use was registered with the VoxLog during 9 weeks for the individual with Parkinson's disease and 2 weeks for the control. This included baseline registrations for both participants, 4 weeks during LSVT LOUD for the individual with Parkinson's disease and 1 week after treatment for both participants. For the participant with Parkinson's disease, follow-up registrations at 3, 6, and 12 months post-treatment were made. The individual with Parkinson's disease increased voice intensity during registrations in daily life with 4.1 dB post-treatment and 1.4 dB at 1-year follow-up compared to before treatment. When monitored during laboratory recordings an increase of 5.6 dB was seen post-treatment and 3.8 dB at 1-year follow-up. Changes in voice intensity were interpreted as a treatment effect as no significant correlations between changes in voice intensity and background noise were found for the individual with Parkinson's disease. The increase in voice intensity in a laboratory setting was comparable to findings previously reported following LSVT LOUD. The increase registered using ambulatory monitoring in daily life was lower but still reflecting a clinically relevant change.
An Analysis of En Route Controller-Pilot Voice Communications
DOT National Transportation Integrated Search
1993-03-01
The purposes of this analysis were to examine current pilot-controller communication practices in the en route : environment. Forty-eight hours of voice tapes from eight different Air Route Traffic Control Centers (ARTCCs) were : examined. There were...
The Army word recognition system
NASA Technical Reports Server (NTRS)
Hadden, David R.; Haratz, David
1977-01-01
The application of speech recognition technology in the Army command and control area is presented. The problems associated with this program are described as well as as its relevance in terms of the man/machine interactions, voice inflexions, and the amount of training needed to interact with and utilize the automated system.
Superior voice recognition in a patient with acquired prosopagnosia and object agnosia.
Hoover, Adria E N; Démonet, Jean-François; Steeves, Jennifer K E
2010-11-01
Anecdotally, it has been reported that individuals with acquired prosopagnosia compensate for their inability to recognize faces by using other person identity cues such as hair, gait or the voice. Are they therefore superior at the use of non-face cues, specifically voices, to person identity? Here, we empirically measure person and object identity recognition in a patient with acquired prosopagnosia and object agnosia. We quantify person identity (face and voice) and object identity (car and horn) recognition for visual, auditory, and bimodal (visual and auditory) stimuli. The patient is unable to recognize faces or cars, consistent with his prosopagnosia and object agnosia, respectively. He is perfectly able to recognize people's voices and car horns and bimodal stimuli. These data show a reverse shift in the typical weighting of visual over auditory information for audiovisual stimuli in a compromised visual recognition system. Moreover, the patient shows selectively superior voice recognition compared to the controls revealing that two different stimulus domains, persons and objects, may not be equally affected by sensory adaptation effects. This also implies that person and object identity recognition are processed in separate pathways. These data demonstrate that an individual with acquired prosopagnosia and object agnosia can compensate for the visual impairment and become quite skilled at using spared aspects of sensory processing. In the case of acquired prosopagnosia it is advantageous to develop a superior use of voices for person identity recognition in everyday life. Copyright © 2010 Elsevier Ltd. All rights reserved.
Development of an air ground data exchange concept: Flight deck perspective
NASA Technical Reports Server (NTRS)
Flathers, G. W., II
1987-01-01
The planned modernization of the U.S. National Airspace System (NAS) includes the development and use of a digital data link as a means to exchange information between aircraft and ground-based facilities. This report presents an operationally-oriented concept on how data link could be used for applications related directly to air traffic control. The specific goal is to establish the role that data link could play in the air-ground communications. Due regard is given to the unique characteristics of data link and voice communications, current principles of air traffic control, operational procedures, human factors/man-machine interfaces, and the integration of data link with other air and ground systems. The resulting concept is illustrated in the form of a paper-and-pencil simulation in which data link and voice communications during the course of a hypothetical flight are described.
Flight tests with a data link used for air traffic control information exchange
NASA Technical Reports Server (NTRS)
Knox, Charles E.; Scanlon, Charles H.
1991-01-01
Previous studies showed that air traffic control (ATC) message exchange with a data link offers the potential benefits of increased airspace system safety and efficiency. To accomplish these benefits, data link can be used to reduce communication errors and relieve overloaded ATC voice radio frequencies, which hamper efficient message exchange during peak traffic periods. Flight tests with commercial airline pilots as test subjects were conducted in the NASA Transport Systems Research Vehicle Boeing 737 airplane to contrast flight operations that used current voice communications with flight operations that used data link to transmit both strategic and tactical ATC clearances during a typical commercial airflight from takeoff to landing. The results of these tests that used data link as the primary communication source with ATC showed flight crew acceptance, a perceived reduction in crew work load, and a reduction in crew communication errors.
NWR (National Weather Service) voice synthesis project, phase 1
NASA Astrophysics Data System (ADS)
Sampson, G. W.
1986-01-01
The purpose of the NOAA Weather Radio (NWR) Voice Synthesis Project is to provide a demonstration of the current voice synthesis technology. Phase 1 of this project is presented, providing a complete automation of an hourly surface aviation observation for broadcast over NWR. In examining the products currently available on the market, the decision was made that synthetic voice technology does not have the high quality speech required for broadcast over the NWR. Therefore the system presented uses the phrase concatenation type of technology for a very high quality, versatile, voice synthesis system.
Identification and human condition analysis based on the human voice analysis
NASA Astrophysics Data System (ADS)
Mieshkov, Oleksandr Yu.; Novikov, Oleksandr O.; Novikov, Vsevolod O.; Fainzilberg, Leonid S.; Kotyra, Andrzej; Smailova, Saule; Kozbekova, Ainur; Imanbek, Baglan
2017-08-01
The paper presents a two-stage biotechnical system for human condition analysis that is based on analysis of human voice signal. At the initial stage, the voice signal is pre-processed and its characteristics in time domain are determined. At the first stage, the developed system is capable of identifying the person in the database on the basis of the extracted characteristics. At the second stage, the model of a human voice is built on the basis of the real voice signals after clustering the whole database.
Gökdoğan, Çağıl; Gökdoğan, Ozan; Şahin, Esra; Yılmaz, Metin
2014-01-01
This study aims to evaluate phonetogram data of the students in the department of music who passed the entrance exam. The phonetogram data of 44 individuals with a good voice quality in the department of music and age-matched individuals who were not trained in the field of music or not involved in music amateurish as the control group were compared. The voice of both groups were recorded using the voice range profile within the scope of Kay Elemetrics CSL (Model 4300 B) programmed. There was a significant difference in the voice range profile parameters including max Fo, Fo range, Fo range (St), min dB SPL, and max dB sound pressure level (p<0.05). Our study results suggest that the voice interval of the department of music is higher than the control group and that plays a major role in their acceptance to the department of music.
An Analysis of Tower (Local) Controller - Pilot Voice Communications
DOT National Transportation Integrated Search
1994-06-01
The purposes of this analysis were to examine current pilot-controller communication practices in the terminal environment. Forty-nine hours of voice tapes from local positions in ten Air Traffic Control Towers (ATCTs) were examined. There were 8,444...
Coprevalence of anxiety and depression with spasmodic dysphonia: a case-control study.
White, Laura J; Hapner, Edie R; Klein, Adam M; Delgaudio, John M; Hanfelt, John J; Jinnah, H A; Johns, Michael M
2012-09-01
There is evidence supporting an association between depression and anxiety in patients with chronic disease. Spasmodic dysphonia (SD) is a chronic, incurable, and disabling voice disorder. Reported rates of depression and anxiety in SD range from 7.1% to 72%, with a maximum number of 18 patients. The goal of this study was to define the coprevalence of depression and anxiety with SD. A single-institution case-control study was performed from May to July 2010. Consecutive patients with SD and benign voice disorders were enrolled prospectively. On enrollment, patients were asked to fill out a questionnaire that reviewed the duration of the voice disorder and personal history of anxiety and depression, including current and lifetime diagnosis. One hundred forty-six controls with benign voice disorders and 128 patients with SD were enrolled. Patients with SD were no more likely to be diagnosed with depression or anxiety than those of the control group (odds ratio [OR]=0.985, 95% confidence interval [CI]=0.59-1.63; and OR=1.314; 95% CI=0.75-2.3, respectively). Additionally, duration of disease was a risk factor for depression in both the SD group and the control group, and the association was not significantly different between groups. Patients with SD were no more likely to have depression or anxiety than those with other voice disorders. It is important for otolaryngologists to be aware of the increased rates of depression in patients diagnosed with chronic diseases, including voice disorders, and to refer to a psychiatrist when appropriate. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Patel, Sona; Lodhavia, Anjli; Frankford, Saul; Korzyukov, Oleg; Larson, Charles R.
2016-01-01
Objective/Hypothesis It is known that singers are able to control their voice to maintain a relatively constant vocal quality while transitioning between vocal registers; however, the neural mechanisms underlying this effect are not understood. It was hypothesized that greater attention to the acoustical feedback of the voice and increased control of the vocal musculature during register transitions compared to singing within a register would be represented as neurological differences in event-related potentials (ERPs). Study Design/Methods Nine singers sang musical notes at the high end of the modal register (the boundary between the modal and head/falsetto registers) and at the low end (the boundary between the modal and fry/pulse registers). While singing, the pitch of the voice auditory feedback was unexpectedly shifted either into the adjacent register (“toward” the register boundary) or within the modal register (“away from” the boundary). Singers were instructed to maintain a constant pitch and ignore any changes to their voice feedback. Results Vocal response latencies and magnitude of the accompanying N1 and P2 ERPs were greatest at the lower (modal-fry) boundary when the pitch shift carried the subjects’ voices into the fry register as opposed to remaining within the modal register. Conclusions These findings suggest that when a singer lowers the pitch of their voice such that it enters the fry register from the modal register, there is increased sensory-motor control of the voice, reflected as increased magnitude of the neural potentials to help minimize qualitative changes in the voice. PMID:26739860
[Psychological effects of preventive voice care training in student teachers].
Nusseck, M; Richter, B; Echternach, M; Spahn, C
2017-07-01
Studies on the effectiveness of preventive voice care programs have focused mainly on voice parameters. Psychological parameters, however, have not been investigated in detail so far. The effect of a voice training program for German student teachers on psychological health parameters was investigated in a longitudinal study. The sample of 204 student teachers was divided into the intervention group (n = 123), who participated in the voice training program, and the control group (n = 81), who received no voice training. Voice training contained ten 90-min group courses and an individual visit by the voice trainer in a teaching situation with feedback afterwards. Participants were asked to fill out questionnaires (self-efficacy, Short-Form Health Survey, self-consciousness, voice self-concept, work-related behaviour and experience patterns) at the beginning and the end of their student teacher training period. The training program showed significant positive influences on psychological health, voice self-concept (i.e. more positive perception and increased awareness of one's own voice) and work-related coping behaviour in the intervention group. On average, the mental health status of all participants reduced over time, whereas the status in the trained group diminished significantly less than in the control group. Furthermore, the trained student teachers gained abilities to cope with work-related stress better than those without training. The training program clearly showed a positive impact on mental health. The results maintain the importance of such a training program not only for voice health, but also for wide-ranging aspects of constitutional health.
Pizolato, Raquel Aparecida; Rehder, Maria Inês Beltrati Cornacchioni; Meneghim, Marcelo de Castro; Ambrosano, Glaucia Maria Bovi; Mialhe, Fábio Luiz; Pereira, Antonio Carlos
2013-02-27
Voice problems are more common in teachers due to intensive voice use during routine at work. There is evidence that occupational disphonia prevention programs are important in improving the quality voice and consequently the quality of subjects' lives. To investigate the impact of educational voice interventions for teachers on quality of life and voice. A longitudinal interventional study involving 70 teachers randomly selected from 11 public schools, 30 to receive educational intervention with vocal training exercises and vocal hygiene habits (experimental group) and 40 to receive guidance on vocal hygiene habits (control group control). Before the process of educational activities, the Voice-Related Quality of Life instrument (V-RQOL) was applied, and 3 months after conclusion of the activities, the subjects were interviewed again, using the same instrument. For data analysis, Prox MIXED were applied, with a level of significance α < 0.05. Teachers showed significantly higher domain and overall V-RQOL scores after preventive intervention, in both control and experimental groups. Nevertheless, there was no statistical difference in scores between the groups. Educational actions for vocal health had a positive impact on the quality of life of the participants, and the incorporation of permanent educational actions at institutional level is suggested.
2013-01-01
Background Voice problems are more common in teachers due to intensive voice use during routine at work. There is evidence that occupational disphonia prevention programs are important in improving the quality voice and consequently the quality of subjects’ lives. Aim To investigate the impact of educational voice interventions for teachers on quality of life and voice. Methods A longitudinal interventional study involving 70 teachers randomly selected from 11 public schools, 30 to receive educational intervention with vocal training exercises and vocal hygiene habits (experimental group) and 40 to receive guidance on vocal hygiene habits (control group control). Before the process of educational activities, the Voice-Related Quality of Life instrument (V-RQOL) was applied, and 3 months after conclusion of the activities, the subjects were interviewed again, using the same instrument. For data analysis, Prox MIXED were applied, with a level of significance α < 0.05. Results: Teachers showed significantly higher domain and overall V-RQOL scores after preventive intervention, in both control and experimental groups. Nevertheless, there was no statistical difference in scores between the groups. Conclusion Educational actions for vocal health had a positive impact on the quality of life of the participants, and the incorporation of permanent educational actions at institutional level is suggested. PMID:23445566
A long distance voice transmission system based on the white light LED
NASA Astrophysics Data System (ADS)
Tian, Chunyu; Wei, Chang; Wang, Yulian; Wang, Dachi; Yu, Benli; Xu, Feng
2017-10-01
A long distance voice transmission system based on a visible light communication technology (VLCT) is proposed in the paper. Our proposed system includes transmitter, receiver and the voice signal processing of single chip microcomputer. In the compact-sized LED transmitter, we use on-off-keying and not-return-to-zero (OOK-NRZ) to easily realize high speed modulation, and then systematic complexity is reduced. A voice transmission system, which possesses the properties of the low-noise and wide modulation band, is achieved by the design of high efficiency receiving optical path and using filters to reduce noise from the surrounding light. To improve the speed of the signal processing, we use single chip microcomputer to code and decode voice signal. Furthermore, serial peripheral interface (SPI) is adopted to accurately transmit voice signal data. The test results of our proposed system show that the transmission distance of this system is more than100 meters with the maximum data rate of 1.5 Mbit/s and a SNR of 30dB. This system has many advantages, such as simple construction, low cost and strong practicality. Therefore, it has extensive application prospect in the fields of the emergency communication and indoor wireless communication, etc.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-11-29
... (FEHB) Open Season Express Interactive Voice Response (IVR) System and Open Season Web site AGENCY: U.S... Benefits (FEHB) Open Season Express Interactive Voice Response (IVR) System and the Open Season Web site... Season Express Interactive Voice Response (IVR) System, and the Open Season Web site, Open Season Online...
Khalid, Umer; Conti, Gerry E; Erlandson, Robert F; Ellis, Richard D; Brown, Vince; Pandya, Abhilash K
2015-01-01
The focus of this research was to design a functional and user-friendly reacher for people with spinal cord injuries (SCIs). Engineering advancements have taken assistive robotics to new dimensions. Technologies such as wheelchair robotics and myo-electronically controlled systems have opened up a wide range of new applications to assist people with physical disabilities. Similarly, exo-skeletal limbs and body suits have provided new foundations from which technologies can aid function. Unfortunately, these devices have issues of usability, weight, and discomfort with donning. The Smart Assistive Reacher Arm (SARA) system, developed in this research, is a voice-activated, lightweight, mobile device that can be used when needed. SARA was built to help overcome daily reach challenges faced by individuals with limited arm and hand movement capability, such as people with cervical level 5-6 (C5-6) SCI. This article shows that a functional reacher arm with voice control can be beneficial for this population. Comparison study with healthy participants and an SCI participant shows that, when using SARA, a person with SCI can perform simple reach and grasp tasks independently, without someone else's help. This suggests that the interface is intuitive and can be easily used to a high level of proficiency by a SCI individual.
Can You Hear Me Now? The Impact of Voice in an Online Gaming Community
ERIC Educational Resources Information Center
Williams, Dmitri; Caplan, Scott; Xiong, Li
2007-01-01
This paper reports the results of a controlled field experiment in which voice communication was introduced into an existing online community (online gaming guilds within the popular game "World of Warcraft"), comparing a mix of voice and text with text only. Quantitative results suggest increases in liking and trust due to the addition of voice,…
Deguchi, Shinji; Kawashima, Kazutaka; Washio, Seiichi
2008-12-01
The effect of artificially altered transglottal pressures on the voice fundamental frequency (F0) is known to be associated with vocal fold stiffness. Its measurement, though useful as a potential diagnostic tool for noncontact assessment of vocal fold stiffness, often requires manual and painstaking determination of an unstable F0 of voice. Here, we provide a computer-aided technique that enables one to carry out the determination easily and accurately. Human subjects vocalized in accordance with a series of reference sounds from a speaker controlled by a computer. Transglottal pressures were altered by means of a valve embedded in a mouthpiece. Time-varying vocal F0 was extracted, without manual procedures, from a specific range of the voice spectrum determined on the basis of the controlled reference sounds. The validity of the proposed technique was assessed for 11 healthy subjects. Fluctuating voice F0 was tracked automatically during experiments, providing the relationship between transglottal pressure change and F0 on the computer. The proposed technique overcomes the difficulty in automatic determination of the voice F0, which tends to be transient both in normal voice and in some types of pathological voice.
Zhu, Yanmei; Witt, Rachel E.; MacCallum, Julia K.; Jiang, Jack J.
2010-01-01
Objective In this study, a Voice over Internet Protocol (VoIP) communication based on G.729 protocol was simulated to determine the effects of this system on acoustic perturbation parameters of normal and pathological voice signals. Patients and Methods: Fifty recordings of normal voice and 48 recordings of pathological voice affected by laryngeal paralysis were transmitted through a VoIP communication system. The acoustic analysis programs of CSpeech and MDVP were used to determine the percent jitter and percent shimmer from the voice samples before and after VoIP transmission. The effects of three frequently used audio compression protocols (MP3, WMA, and FLAC) on the perturbation measures were also studied. Results It was found that VoIP transmission disrupts the waveform and increases the percent jitter and percent shimmer of voice samples. However, after VoIP transmission, significant discrimination between normal and pathological voices affected by laryngeal paralysis was still possible. It was found that the lossless compression method FLAC does not exert any influence on the perturbation measures. The lossy compression methods MP3 and WMA increase percent jitter and percent shimmer values. Conclusion This study validates the feasibility of these transmission and compression protocols in developing remote voice signal data collection and assessment systems. PMID:20588051
Effect of adenoid hypertrophy on the voice and laryngeal mucosa in children.
Gomaa, Mohammed A; Mohammed, Haitham M; Abdalla, Adel A; Nasr, Dalia M
2013-12-01
The adenoids, or pharyngeal tonsils, are lymphatic tissue localized at the mucous layer of the roof and posterior wall of nasopharynx. Dysphonia defined as perceptual audible change of a patient's habitual voice as self judged or judged by his or her listeners. The diagnosis of dysphonia relies on clinical judgment based on phoniatric symptoms, auditory perceptual assessment of voice (APA) and full laryngeal examination. Our study was conducted to evaluate the effect of adenoid hypertrophy on voice and laryngeal mucosa. The study sample composed of sixty children, forty of them had adenoid hypertrophy (patient's group) and twenty healthy children (control group). Patient's group composed of 17 boys (42.5%) and 23 girls (57.5%), while control group consists of 8 males (40%) and 12 females (60%). All patients and control group subjected to history taking, clinical examination, lateral soft tissue X-ray on the nasopharynx, APA based on the modified GRBAS scale and full laryngeal examination. The data are collected and analyzed statistically by using software SPSS. Our results showed that there is a significant association between adenoid hypertrophy and, degree of dysphonia, leaky voice, pitch of voice and laryngeal lesion. Adenoid hypertrophy did not associate with loudness of voice, as well as character (irregular, breathy and strained). Laryngeal lesions were detected in thirteen children from patient group (32.5%): nodules (n = 6), thickening (n = 5), congestion (n = 2), while one child only out of 20 children of the control group had congestion (5.0%). Our results showed the importance of the assessment of voice and laryngeal examination in patients with adenoid hypertrophy, also treating the minimal mucosal lesions that results from adenoid hypertrophy should be taken in consideration. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
MIT-NASA/KSC space life science experiments - A telescience testbed
NASA Technical Reports Server (NTRS)
Oman, Charles M.; Lichtenberg, Byron K.; Fiser, Richard L.; Vordermark, Deborah S.
1990-01-01
Experiments performed at MIT to better define Space Station information system telescience requirements for effective remote coaching of astronauts by principal investigators (PI) on the ground are described. The experiments were conducted via satellite video, data, and voice links to surrogate crewmembers working in a laboratory at NASA's Kennedy Space Center. Teams of two PIs and two crewmembers performed two different space life sciences experiments. During 19 three-hour interactive sessions, a variety of test conditions were explored. Since bit rate limits are necessarily imposed on Space Station video experiments surveillance video was varied down to 50 Kb/s and the effectiveness of PI controlled frame rate, resolution, grey scale, and color decimation was investigated. It is concluded that remote coaching by voice works and that dedicated crew-PI voice loops would be of great value on the Space Station.
Design of the ultraprecision stage for lithography using VCM
NASA Astrophysics Data System (ADS)
Kim, Jung-Han; Kim, Mun-Su; Oh, Min-Taek
2007-12-01
This paper presents a new design of precision stage for the reticle in lithography process and a low hunting control method for the stage. The stage has three axes for X,Y, θ Z, those actuated by three voice coil motors individually. The proposed precision stage system has three gap sensors and voice coil motors, and supported by four air bearings, so it do not have any mechanical contact and nonlinear effect such as hysterisis which usually degrade performance in nano level movement. The reticle stage has cross coupled dynamics between X,Y,θ Z, axes, so the forward and inverse kinematics were solved to get an accurate reference position. When the stage is in regulating control mode, there always exist small fluctuations (stage hunting) in the stage movement. Because the low stage hunting characteristic is very important in recent lithography and nano-level applications, the proposed stage has a special regulating controller composed of digital filter, adjustor and switching algorithm. Another importance factor that generates hunting noise is the system noise inside the lithography machine such as EMI from another motor and solenoids. For reducing such system noises, the proposed controller has a two-port transmission system that transfers torque command signal from the DSP board to the amplifier. The low hunting control algorithm and two-port transmission system reduced hunting noise as 35nm(rms) when a conventional PID generates 77nm(rms) in the same mechanical system. The experimental results showed that the reticle system has 100nm linear accuracy and 1μ rad rotation accuracy at the control frequency of 8 kHz.
Apollo experience report: Voice communications techniques and performance
NASA Technical Reports Server (NTRS)
Dabbs, J. H.; Schmidt, O. L.
1972-01-01
The primary performance requirement of the spaceborne Apollo voice communications system is percent word intelligibility, which is related to other link/channel parameters. The effect of percent word intelligibility on voice channel design and a description of the verification procedures are included. Development and testing performance problems and the techniques used to solve the problems are also discussed. Voice communications performance requirements should be comprehensive and verified easily; the total system must be considered in component design, and the necessity of voice processing and the associated effect on noise, distortion, and cross talk should be examined carefully.
Virtual Integrated Planning and Execution Resource System (VIPERS): The High Ground of 2025
1996-04-01
earth to one meter will allow modeling of 21 enemy actions, to a degree only dreamed of before.8 For example, before starting an air campaign, an...that is facilitated by the system. Interaction may take the form of the written word, voice, video conferencing, or mental telepathy . Control speaks
Subjective and objective voice evaluation in Sjögren's syndrome.
Saltürk, Ziya; Özdemir, Erdi; Kumral, Tolgar Lütfi; Karabacakoğlu, Zeynep; Kumral, Esra; Yildiz, Hatice Elvin; Mersinlioğlu, Gökhan; Atar, Yavuz; Berkiten, Güler; Yildirim, Güven; Uyar, Yavuz
2017-04-01
Objective The aim of this study is to assess the subjective and objective aspects of voice in Sjögren's syndrome. Methods The study enrolled 10 women with Sjögren's syndrome and 12 healthy women. Maximum phonation time, fundamental frequency, jitter, shimmer, and noise-to-harmonics ratio were determined during acoustic voice analysis. The Stroboscopy Evaluation Rating Form was used for the laryngostroboscopic evaluation. A subjective evaluation was performed using the Turkish version of Voice Handicap Index-10. Results The mean age of the Sjögren's syndrome and control groups was 46 ± 13.89 and 41.27 ± 6.99 years, respectively, and did not differ (P = 0.131). In the laryngostroboscopic evaluation, the smoothness and straightness of vocal folds, regularity, and glottal closure differed significantly. In the acoustic and aerodynamic analyses, none of the parameters differed statistically, while the Sjögren's syndrome group had significantly higher Voice Handicap Index-10 scores than the controls. Conclusion Sjögren's syndrome affects the voice and voice quality.
Mehler, Bruce; Kidd, David; Reimer, Bryan; Reagan, Ian; Dobres, Jonathan; McCartt, Anne
2016-03-01
One purpose of integrating voice interfaces into embedded vehicle systems is to reduce drivers' visual and manual distractions with 'infotainment' technologies. However, there is scant research on actual benefits in production vehicles or how different interface designs affect attentional demands. Driving performance, visual engagement, and indices of workload (heart rate, skin conductance, subjective ratings) were assessed in 80 drivers randomly assigned to drive a 2013 Chevrolet Equinox or Volvo XC60. The Chevrolet MyLink system allowed completing tasks with one voice command, while the Volvo Sensus required multiple commands to navigate the menu structure. When calling a phone contact, both voice systems reduced visual demand relative to the visual-manual interfaces, with reductions for drivers in the Equinox being greater. The Equinox 'one-shot' voice command showed advantages during contact calling but had significantly higher error rates than Sensus during destination address entry. For both secondary tasks, neither voice interface entirely eliminated visual demand. Practitioner Summary: The findings reinforce the observation that most, if not all, automotive auditory-vocal interfaces are multi-modal interfaces in which the full range of potential demands (auditory, vocal, visual, manipulative, cognitive, tactile, etc.) need to be considered in developing optimal implementations and evaluating drivers' interaction with the systems. Social Media: In-vehicle voice-interfaces can reduce visual demand but do not eliminate it and all types of demand need to be taken into account in a comprehensive evaluation.
Mehler, Bruce; Kidd, David; Reimer, Bryan; Reagan, Ian; Dobres, Jonathan; McCartt, Anne
2016-01-01
Abstract One purpose of integrating voice interfaces into embedded vehicle systems is to reduce drivers’ visual and manual distractions with ‘infotainment’ technologies. However, there is scant research on actual benefits in production vehicles or how different interface designs affect attentional demands. Driving performance, visual engagement, and indices of workload (heart rate, skin conductance, subjective ratings) were assessed in 80 drivers randomly assigned to drive a 2013 Chevrolet Equinox or Volvo XC60. The Chevrolet MyLink system allowed completing tasks with one voice command, while the Volvo Sensus required multiple commands to navigate the menu structure. When calling a phone contact, both voice systems reduced visual demand relative to the visual–manual interfaces, with reductions for drivers in the Equinox being greater. The Equinox ‘one-shot’ voice command showed advantages during contact calling but had significantly higher error rates than Sensus during destination address entry. For both secondary tasks, neither voice interface entirely eliminated visual demand. Practitioner Summary: The findings reinforce the observation that most, if not all, automotive auditory–vocal interfaces are multi-modal interfaces in which the full range of potential demands (auditory, vocal, visual, manipulative, cognitive, tactile, etc.) need to be considered in developing optimal implementations and evaluating drivers’ interaction with the systems. Social Media: In-vehicle voice-interfaces can reduce visual demand but do not eliminate it and all types of demand need to be taken into account in a comprehensive evaluation. PMID:26269281
2015-12-01
Rack Mount System Mechanical Drawing Package by Steven P Callaway Approved for public release; distribution unlimited...Laboratory White House Communications Agency (WHCA) Presidential Voice Communications Rack Mount System Mechanical Drawing Package by Steven P...Note 3. DATES COVERED (From - To) 04/2013 4. TITLE AND SUBTITLE White House Communications Agency (WHCA) Presidential Voice Communications Rack
Neurobiological correlates of emotional intelligence in voice and face perception networks
Karle, Kathrin N; Ethofer, Thomas; Jacob, Heike; Brück, Carolin; Erb, Michael; Lotze, Martin; Nizielski, Sophia; Schütz, Astrid; Wildgruber, Dirk; Kreifelts, Benjamin
2018-01-01
Abstract Facial expressions and voice modulations are among the most important communicational signals to convey emotional information. The ability to correctly interpret this information is highly relevant for successful social interaction and represents an integral component of emotional competencies that have been conceptualized under the term emotional intelligence. Here, we investigated the relationship of emotional intelligence as measured with the Salovey-Caruso-Emotional-Intelligence-Test (MSCEIT) with cerebral voice and face processing using functional and structural magnetic resonance imaging. MSCEIT scores were positively correlated with increased voice-sensitivity and gray matter volume of the insula accompanied by voice-sensitivity enhanced connectivity between the insula and the temporal voice area, indicating generally increased salience of voices. Conversely, in the face processing system, higher MSCEIT scores were associated with decreased face-sensitivity and gray matter volume of the fusiform face area. Taken together, these findings point to an alteration in the balance of cerebral voice and face processing systems in the form of an attenuated face-vs-voice bias as one potential factor underpinning emotional intelligence. PMID:29365199
Neurobiological correlates of emotional intelligence in voice and face perception networks.
Karle, Kathrin N; Ethofer, Thomas; Jacob, Heike; Brück, Carolin; Erb, Michael; Lotze, Martin; Nizielski, Sophia; Schütz, Astrid; Wildgruber, Dirk; Kreifelts, Benjamin
2018-02-01
Facial expressions and voice modulations are among the most important communicational signals to convey emotional information. The ability to correctly interpret this information is highly relevant for successful social interaction and represents an integral component of emotional competencies that have been conceptualized under the term emotional intelligence. Here, we investigated the relationship of emotional intelligence as measured with the Salovey-Caruso-Emotional-Intelligence-Test (MSCEIT) with cerebral voice and face processing using functional and structural magnetic resonance imaging. MSCEIT scores were positively correlated with increased voice-sensitivity and gray matter volume of the insula accompanied by voice-sensitivity enhanced connectivity between the insula and the temporal voice area, indicating generally increased salience of voices. Conversely, in the face processing system, higher MSCEIT scores were associated with decreased face-sensitivity and gray matter volume of the fusiform face area. Taken together, these findings point to an alteration in the balance of cerebral voice and face processing systems in the form of an attenuated face-vs-voice bias as one potential factor underpinning emotional intelligence.
Finding and Learning to Use the Singing Voice: A Manual for Teachers.
ERIC Educational Resources Information Center
Gould, A. Oren
The child who is unable to reproduce a melody at a given pitch range can begin to "carry a tune" by learning to hear and control his singing voice and to match his voice with voices of other singers or with instruments. The "too low problem singer," the child with the most common difficulty, must learn to make successful song responses in his…
Optimal Duration for Voice Rest After Vocal Fold Surgery: Randomized Controlled Clinical Study.
Kaneko, Mami; Shiromoto, Osamu; Fujiu-Kurachi, Masako; Kishimoto, Yo; Tateya, Ichiro; Hirano, Shigeru
2017-01-01
Voice rest is commonly recommended after phonomicrosurgery to prevent worsening of vocal fold injuries. However, the most effective duration of voice rest is unknown. Recently, early vocal stimulation was recommended as a means to improve wound healing. The purpose of this study is to examine the optimal duration of voice rest after phonomicrosurgery. Randomized controlled clinical study. Patients undergoing phonomicrosurgery for leukoplakia, carcinoma in situ, vocal fold polyp, Reinke's edema, and cyst were chosen. Participants were randomly assigned to voice rest for 3 or 7 postoperative days. Voice therapy was administered to both groups after voice rest. Grade, roughness, breathiness, asthenia, and strain (GRBAS) scale, stroboscopic examination, aerodynamic assessment, acoustic analysis, and Voice Handicap Index-10 (VHI-10) were performed pre- and postoperatively at 1, 3, and 6 months. Stroboscopic examination evaluated normalized mucosal wave amplitude (NMWA). Parameters were compared between both groups. Thirty-one patients were analyzed (3-day group, n = 16; 7-day group, n = 15). Jitter, shimmer, and VHI-10 were significantly better in the 3-day group at 1 month post operation. GRBAS was significantly better in the 3-day group at 1 and 3 months post operation, and NMWA was significantly better in the 3-day group at 1, 3, and 6 months post operation compared to the 7-day group. The data suggest that 3 days of voice rest followed by voice therapy may lead to better wound healing of the vocal fold compared to 7 days of voice rest. Appropriate mechanical stimulation during early stages of vocal fold wound healing may lead to favorable functional recovery. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Sykes, Ingrid
2011-01-01
This essay explores new models of the citizen–patient by attending to the post-Revolutionary blind ‘voice’. Voice, in both a literal and figurative sense, was central to the way in which members of the Hospice des Quinze-Vingts, an institution for the blind and partially sighted, interacted with those in the community. Musical voices had been used by members to collect alms and to project the particular spiritual principle of their institution since its foundation in the thirteenth century. At the time of the Revolution, the Quinze-Vingts voice was understood by some political authorities as an exemplary call of humanity. Yet many others perceived it as deeply threatening. After 1800, productive dialogue between those in political control and Quinze-Vingts blind members broke down. Authorities attempted to silence the voice of members through the control of blind musicians and institutional management. The Quinze-Vingts blind continued to reassert their voices until around 1850, providing a powerful form of resistance to political control. The blind ‘voice’ ultimately recognised the right of the citizen–patient to dialogue with their political carers. PMID:22025797
Hopkins, David James [Livermore, CA
2008-05-13
A control system and method for actively reducing vibration in a spindle housing caused by unbalance forces on a rotating spindle, by measuring the force-induced spindle-housing motion, determining control signals based on synchronous demodulation, and provide compensation for the measured displacement to cancel or otherwise reduce or attenuate the vibration. In particular, the synchronous demodulation technique is performed to recover a measured spindle housing displacement signal related only to the rotation of a machine tool spindle, and consequently rejects measured displacement not related to spindle motion or synchronous to a cycle of revolution. Furthermore, the controller actuates at least one voice-coil (VC) motor, to cancel the original force-induced motion, and adapts the magnitude of voice coil signal until this measured displacement signal is brought to a null. In order to adjust the signal to a null, it must have the correct phase relative to the spindle angle. The feedback phase signal is used to adjust a common (to both outputs) commutation offset register (offset relative to spindle encoder angle) to force the feedback phase signal output to a null. Once both of these feedback signals are null, the system is compensating properly for the spindle-induced motion.
Literature review of voice recognition and generation technology for Army helicopter applications
NASA Astrophysics Data System (ADS)
Christ, K. A.
1984-08-01
This report is a literature review on the topics of voice recognition and generation. Areas covered are: manual versus vocal data input, vocabulary, stress and workload, noise, protective masks, feedback, and voice warning systems. Results of the studies presented in this report indicate that voice data entry has less of an impact on a pilot's flight performance, during low-level flying and other difficult missions, than manual data entry. However, the stress resulting from such missions may cause the pilot's voice to change, reducing the recognition accuracy of the system. The noise present in helicopter cockpits also causes the recognition accuracy to decrease. Noise-cancelling devices are being developed and improved upon to increase the recognition performance in noisy environments. Future research in the fields of voice recognition and generation should be conducted in the areas of stress and workload, vocabulary, and the types of voice generation best suited for the helicopter cockpit. Also, specific tasks should be studied to determine whether voice recognition and generation can be effectively applied.
Loss of controller-pilot voice communications in domestic en route airspace
DOT National Transportation Integrated Search
2017-02-10
With the planned implementation of ControllerPilot Data Link Communications (CPDLC) in en route airspace, information on voice communication performance in this airspace can help to predict specific benefits associated with CPDLC, identify adjustm...
NASA Astrophysics Data System (ADS)
Lazar, Aurel A.; White, John S.
1987-07-01
Theoretical analysis of integrated local area network model of MAGNET, an integrated network testbed developed at Columbia University, shows that the bandwidth freed up during video and voice calls during periods of little movement in the images and periods of silence in the speech signals could be utilized efficiently for graphics and data transmission. Based on these investigations, an architecture supporting adaptive protocols that are dynamicaly controlled by the requirements of a fluctuating load and changing user environment has been advanced. To further analyze the behavior of the network, a real-time packetized video system has been implemented. This system is embedded in the real-time multimedia workstation EDDY, which integrates video, voice, and data traffic flows. Protocols supporting variable-bandwidth, fixed-quality packetized video transport are described in detail.
NASA Astrophysics Data System (ADS)
Lazar, Aurel A.; White, John S.
1986-11-01
Theoretical analysis of an ILAN model of MAGNET, an integrated network testbed developed at Columbia University, shows that the bandwidth freed up by video and voice calls during periods of little movement in the images and silence periods in the speech signals could be utilized efficiently for graphics and data transmission. Based on these investigations, an architecture supporting adaptive protocols that are dynamically controlled by the requirements of a fluctuating load and changing user environment has been advanced. To further analyze the behavior of the network, a real-time packetized video system has been implemented. This system is embedded in the real time multimedia workstation EDDY that integrates video, voice and data traffic flows. Protocols supporting variable bandwidth, constant quality packetized video transport are descibed in detail.
Meerschman, Iris; Van Lierde, Kristiane; Peeters, Karen; Meersman, Eline; Claeys, Sofie; D'haeseleer, Evelien
2017-09-18
The purpose of this study was to determine the short-term effect of 2 semi-occluded vocal tract training programs, "resonant voice training using nasal consonants" versus "straw phonation," on the vocal quality of vocally healthy future occupational voice users. A multigroup pretest-posttest randomized control group design was used. Thirty healthy speech-language pathology students with a mean age of 19 years (range: 17-22 years) were randomly assigned into a resonant voice training group (practicing resonant exercises across 6 weeks, n = 10), a straw phonation group (practicing straw phonation across 6 weeks, n = 10), or a control group (receiving no voice training, n = 10). A voice assessment protocol consisting of both subjective (questionnaire, participant's self-report, auditory-perceptual evaluation) and objective (maximum performance task, aerodynamic assessment, voice range profile, acoustic analysis, acoustic voice quality index, dysphonia severity index) measurements and determinations was used to evaluate the participants' voice pre- and posttraining. Groups were compared over time using linear mixed models and generalized linear mixed models. Within-group effects of time were determined using post hoc pairwise comparisons. No significant time × group interactions were found for any of the outcome measures, indicating no differences in evolution over time among the 3 groups. Within-group effects of time showed a significant improvement in dysphonia severity index in the resonant voice training group, and a significant improvement in the intensity range in the straw phonation group. Results suggest that the semi-occluded vocal tract training programs using resonant voice training and straw phonation may have a positive impact on the vocal quality and vocal capacities of future occupational voice users. The resonant voice training caused an improved dysphonia severity index, and the straw phonation training caused an expansion of the intensity range in this population.
Screening value of V-RQOL in the evaluation of occupational voice disorders.
Morawska, Joanna; Niebudek-Bogusz, Ewa; Wiktorowicz, Justyna; Śliwińska-Kowalska, Mariola
2018-03-09
Given the growing number of occupational voice users, easy and quick broad-scale screening is necessary to provide prophylaxis of voice disorders. The aim of the study was to assess applicability of the Voice Related Quality of Life questionnaire (V-RQOL) to screening occupational voice disorders. The research comprised 284 subjects divided into 3 groups: 0 - the control group of normophonic subjects, non-professional voice users (N = 60), 1 - occupational voice users with objectively confirmed voice disorders (N = 124), 2 - the non-randomized group of occupational voice users with and without voice problems (N = 100). Self-assessment of voice was performed by means of the V-RQOL in comparison to the Voice Handicap Index (VHI). The relation between the V-RQOL and VHI was determined by means of linear regression. Receiver Operating Characteristic (ROC) curves were constructed and the cut-off point of the VRQOL was determined to discriminate between normophonic and dysphonic subjects. The relationship between the VHI and V-RQOL scores indicated a satisfactory coefficient of determination: R2 = 0.7266. High values of Cronbach's α confirmed high reliability of the V-RQOL test (0.867). Voice-Related Quality of Life questionnaire (V-RQOL) results were significantly worse in the study group than for normophonic controls (p < 0.001). The cut-off point for the test was set at 79 points. The determined area under the curve (AUC) = 0.910 (p < 0.001) showed high diagnostic accuracy of the V-RQOL. Results of the VRQOL differed for diagnose-based subgroups of dysphonic patients. The study gives grounds for application of the V-RQOL as a reliable tool for screening occupational voice disorders. Med Pr 2018;69(2):119-128. This work is available in Open Access model and licensed under a CC BY-NC 3.0 PL license.
Next generation keyboards: The importance of cognitive compatibility
NASA Technical Reports Server (NTRS)
Amell, John R.; Ewry, Michael E.; Colle, Herbert A.
1988-01-01
The computer keyboard of today is essentially the same as it has been for many years. Few advances have been made in keyboard design even though computer systems in general have made remarkable progress in improvements. This paper discusses the future of keyboards, their competition and compatibility with voice input systems, and possible special-application intelligent keyboards for controlling complex systems.
A model for treating voice disorders in school-age children within a video gaming environment.
King, Suzanne N; Davis, Larry; Lehman, Jeffrey J; Ruddy, Bari Hoffman
2012-09-01
Clinicians use a variety of approaches to motivate children with hyperfunctional voice disorders to comply with voice therapy in a therapeutic session and improve the motivation of children to practice home-based exercises. Utilization of current entertainment technology in such approaches may improve participation and motivation in voice therapy. The purpose of this study is to test the feasibility of using an entertainment video game as a therapy device. Prospective cohort and case-control study. Three levels of game testing were conducted to an existing entertainment video game for use as a voice therapy protocol. The game was tested by two computer programmers and five normal participants. The third level of testing was a case study with a child diagnosed with a hyperfunctional voice disorder. Modifications to the game were made after each feasibility test. Errors with the video game performance were modified, including the addition of a time stamp directory and game controller. Resonance voice exercises were modified to accommodate the gaming environment and unique competitive situation, including speech rate, acoustic parameters, game speed, and point allocations. The development of video games for voice therapeutic purposes attempt to replicate the high levels of engagement and motivation attained with entertainment video games, stimulating a more productive means of learning while doing. This case study found that a purely entertainment video game can be implemented as a voice therapeutic protocol based on information obtained from the case study. Copyright © 2012 The Voice Foundation. All rights reserved.
Voice characteristics in the progression of Parkinson's disease.
Holmes, R J; Oates, J M; Phyland, D J; Hughes, A J
2000-01-01
This study examined the acoustic and perceptual voice characteristics of patients with Parkinson's disease according to disease severity. The perceptual and acoustic voice characteristics of 30 patients with early stage PD and 30 patients with later stage PD were compared with data from 30 normal control subjects. Voice recordings consisted of prolongation of the vowel /a/, scale singing, and a 1-min monologue. In comparison with controls and previously published normative data, both early and later stage PD patients' voices were characterized perceptually by limited pitch and loudness variability, breathiness, harshness and reduced loudness. High modal pitch levels also characterized the voices of males in both early and later stages of PD. Acoustically, the voices of both groups of PD patients demonstrated lower mean intensity levels and reduced maximum phonational frequency ranges in comparison with normative data. Although less clear, the present data also suggested that the PD patients' voices were characterized by excess jitter, a high-speaking fundamental frequency for males and a reduced fundamental frequency variability for females. While several of these voice features did not appear to deteriorate with disease progression (i.e. harshness, high modal pitch and speaking fundamental frequency in males, fundamental frequency variability in females, low intensity and jitter), breathiness, monopitch and monoloudness, low loudness and reduced maximum phonational frequency range were all worse in the later stages of PD. Tremor was the sole voice feature which was associated only with later stage PD.
Acoustic Analysis of Voice and Electroglottography in Patients With Laryngopharyngeal Reflux.
Ramírez, Daphne Anahit Morales; Jiménez, Víctor Manuel Valadez; López, Xochiquetzal Hernández; Ysunza, Pablo Antonio
2018-05-01
Laryngopharyngeal reflux (LPR) refers to the flow of gastric acid content into the laryngopharynx. It has been reported that 10% of the patients consulting an otolaryngologist present with this condition. Signs of LPR can be identified during flexible or rigid laryngoscopy. The Voice Handicap Index (VHI) is a reliable tool for detecting the impact of voice disorders, and acoustic assessment of voice including acoustic analysis of voice (AAV) and electroglottography (EGG) provide objective data of voice production and voice disorders. This study aimed to describe changes in AAV, EGG, and VHI in patients who present with LPR compared with a matched control group of healthy subjects. Seventeen patients with LPR were studied. A group of healthy subjects matched by age and gender without any history of voice disorder, LPR, or gastroesophageal reflux disease was assembled. Both groups of patients were studied by VHI, flexible laryngoscopy, AAV, and EGG. All patients with LPR demonstrated abnormal VHI values. Shimmer, jitter, open quotient, and irregularity were significantly increased in the patients with LPR. Nonsignificant correlations were found between VHI scores and abnormal acoustic parameters in patients with LPR. Although abnormal acoustic parameters of patients with LPR were not predictive of the overall VHI score, the abnormal acoustic parameters of patients with LPR suggest a decrease in adequate laryngeal control during phonation. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Design and fabrication of a new electrolarynx and voice amplifier for laryngectomees.
Sundeep Krishna, M; Jayanthy, A K; Divakar, C; Mekhala, R
2005-01-01
A Laryngectomee is a person whose vocal cords i.e. voice box is surgically removed owing to cancer or due to automobile accidents, burns or trauma. The patient, therefore permanently loses the ability to speak normally. An Electrolarynx is an electronic speech aid that enables the Laryngectomee to communicate with other people as quickly as possible after the successful removal of the larynx. A neck type Electrolarynx has been designed. Earlier designs could not alter frequency and intensity simultaneously during conversation. The Electrolarynx developed can control both frequency and intensity simultaneously during conversation. The device has been tested on the patient and found to be very effective. A portable, pocket size, battery powered voice amplifier (PA system) has also been developed which uses an electret condenser microphone as the input. The voice amplifier developed is a two stage amplifier which uses a preamplifier stage and a power amplifier stage. The output of the power amplifier is connected to a speaker. The device is being used by the patient and found to be very useful.
Impact of a voice recognition system on report cycle time and radiologist reading time
NASA Astrophysics Data System (ADS)
Melson, David L.; Brophy, Robert; Blaine, G. James; Jost, R. Gilbert; Brink, Gary S.
1998-07-01
Because of its exciting potential to improve clinical service, as well as reduce costs, a voice recognition system for radiological dictation was recently installed at our institution. This system will be clinically successful if it dramatically reduces radiology report turnaround time without substantially affecting radiologist dictation and editing time. This report summarizes an observer study currently under way in which radiologist reporting times using the traditional transcription system and the voice recognition system are compared. Four radiologists are observed interpreting portable intensive care unit (ICU) chest examinations at a workstation in the chest reading area. Data are recorded with the radiologists using the transcription system and using the voice recognition system. The measurements distinguish between time spent performing clerical tasks and time spent actually dictating the report. Editing time and the number of corrections made are recorded. Additionally, statistics are gathered to assess the voice recognition system's impact on the report cycle time -- the time from report dictation to availability of an edited and finalized report -- and the length of reports.
Voice disorder in systemic lupus erythematosus
de Macedo, Milena S. F. C.; da Silva Filho, Manoel
2017-01-01
Systemic lupus erythematosus (SLE) is a chronic disease characterized by progressive tissue damage. In recent decades, novel treatments have greatly extended the life span of SLE patients. This creates a high demand for identifying the overarching symptoms associated with SLE and developing therapies that improve their life quality under chronic care. We hypothesized that SLE patients would present dysphonic symptoms. Given that voice disorders can reduce life quality, identifying a potential SLE-related dysphonia could be relevant for the appraisal and management of this disease. We measured objective vocal parameters and perceived vocal quality with the GRBAS (Grade, Roughness, Breathiness, Asthenia, Strain) scale in SLE patients and compared them to matched healthy controls. SLE patients also filled a questionnaire reporting perceived vocal deficits. SLE patients had significantly lower vocal intensity and harmonics to noise ratio, as well as increased jitter and shimmer. All subjective parameters of the GRBAS scale were significantly abnormal in SLE patients. Additionally, the vast majority of SLE patients (29/36) reported at least one perceived vocal deficit, with the most prevalent deficits being vocal fatigue (19/36) and hoarseness (17/36). Self-reported voice deficits were highly correlated with altered GRBAS scores. Additionally, tissue damage scores in different organ systems correlated with dysphonic symptoms, suggesting that some features of SLE-related dysphonia are due to tissue damage. Our results show that a large fraction of SLE patients suffers from perceivable dysphonia and may benefit from voice therapy in order to improve quality of life. PMID:28414781
Quantitative Analysis of Voice in Parkinson Disease Compared to Motor Performance: A Pilot Study.
Silbergleit, Alice K; LeWitt, Peter A; Peterson, Edward L; Gardner, Glendon M
2015-01-01
Characteristic features of hypokinetic dysarthria develop in Parkinson disease (PD). We hypothesized that quantified acoustic changes of voice might provide a correlate of disease severity. To determine if there are significant differences in acoustic measures of voice between mild and moderate PD; 2) To evaluate correlations between acoustic parameters of voice and subtests of the UPDRS in mild and moderate PD. Twenty six participants with PD underwent vocal acoustic testing while off PD medication, for comparison to 22 healthy controls. Participants with PD were divided into two groups based upon UPDRS activities of daily living (ADL) ratings: summed scores were used to define mild and moderate PD. Participants voiced /i/ ("ee") at comfort, high, and low pitch (3 trials/pitch). The CSpeech Waveform Analysis Program was used to analyze cycle-to-cycle frequency ("jitter") and amplitude ("shimmer") irregularities of the vocal signal, signal-to-noise ratio, and maximum phonation frequency range converted to semitones. Sections of UPDRS scores were correlated to acoustic variables of voice. Key findings included a significant difference between the semitone range of the control subjects and the moderate PD group (p = 0.036). Further analyses revealed significant differences in semitone range for males between the controls vs. mild PD (p = 0.014), and controls vs. moderate PD (p = 0.005). Significant correlations were also found between acoustic findings and both the ADL and motor portions of the UPDRS. Acoustic analysis of voice, particularly frequency range, may provide a quantifiable correlate of disease progression in PD.
Abrantes, D; Gomes, P; Pereira, D; Coimbra, M
2016-08-01
The gastroenterology specialty could benefit from the introduction of Computer Assisted Decision (CAD) systems, since gastric cancer is a serious concern in which an accurate and early diagnosis usually leads to a good prognosis. Still, the way doctors interact with these systems is very important because it will often determine its embracement or rejection, as any gains in productivity will frequently hinge on how comfortable they are with it. Using other types of interaction paradigms such as voice and motion control, is important in a way that typical inputs such as keyboard and mouse are sometimes not the best choice for certain clinical scenarios. In order to ascertain how a doctor could control a hypothetical CAD system during a gastroenterology exam, we measured the natural response of users when faced with three different task requests, using three types of interaction paradigms: voice, gesture and endoscope. Results fit in what was expected, with gesture control being the most intuitive to use, and the endoscope being on the other edge. All the technologies are mature enough to cope with the response concepts the participants gave us. However, when having into account the scenario context, better natural response scores may not always be the best choice for implementation. That way, simplification or reduction of tasks, along with a well tought-out interface, or even mixing more oriented paradigms for particular requests, could allow for better system control with fewer inconveniences for the user.
Effects of Radioactive Iodine Ablation Therapy on Voice Quality.
Aydoğdu, İmran; Atar, Yavuz; Saltürk, Ziya; Sarı, Hüseyin; Ataç, Enes; Aydoğdu, Zeynep; İnan, Muzaffer; Mersinlioğlu, Gökhan; Uyar, Yavuz
2017-01-01
The goal of this study was to evaluate the effects of radioactive iodine ablation therapy on voice quality of patients diagnosed with well-differentiated thyroid carcinoma. We enrolled 36 patients who underwent total or subtotal thyroidectomy due to well-differentiated thyroid carcinoma. Voice recordings from patients were analyzed for acoustic and aerodynamic voice. The Voice Handicap Index-10 was used for subjective analysis. The control group consisted of 36 healthy participants. Results taken before and after therapy were compared statistically. There were no differences in the results taken before and after therapy for the radioactive iodine ablation group. The Voice Handicap Index-10 results did not differ between groups before and after therapy. Radioactive iodine ablation therapy has no effect on voice quality objectively or subjectively. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Alexa, Siri, Cortana, and More: An Introduction to Voice Assistants.
Hoy, Matthew B
2018-01-01
Voice assistants are software agents that can interpret human speech and respond via synthesized voices. Apple's Siri, Amazon's Alexa, Microsoft's Cortana, and Google's Assistant are the most popular voice assistants and are embedded in smartphones or dedicated home speakers. Users can ask their assistants questions, control home automation devices and media playback via voice, and manage other basic tasks such as email, to-do lists, and calendars with verbal commands. This column will explore the basic workings and common features of today's voice assistants. It will also discuss some of the privacy and security issues inherent to voice assistants and some potential future uses for these devices. As voice assistants become more widely used, librarians will want to be familiar with their operation and perhaps consider them as a means to deliver library services and materials.
14 CFR 25.1457 - Cockpit voice recorders.
Code of Federal Regulations, 2014 CFR
2014-01-01
... 14 Aeronautics and Space 1 2014-01-01 2014-01-01 false Cockpit voice recorders. 25.1457 Section 25... recorders. (a) Each cockpit voice recorder required by the operating rules of this chapter must be approved... interphone system. (4) Voice or audio signals identifying navigation or approach aids introduced into a...
14 CFR 25.1457 - Cockpit voice recorders.
Code of Federal Regulations, 2013 CFR
2013-01-01
... 14 Aeronautics and Space 1 2013-01-01 2013-01-01 false Cockpit voice recorders. 25.1457 Section 25... recorders. (a) Each cockpit voice recorder required by the operating rules of this chapter must be approved... interphone system. (4) Voice or audio signals identifying navigation or approach aids introduced into a...
14 CFR 29.1457 - Cockpit voice recorders.
Code of Federal Regulations, 2012 CFR
2012-01-01
... 14 Aeronautics and Space 1 2012-01-01 2012-01-01 false Cockpit voice recorders. 29.1457 Section 29... recorders. (a) Each cockpit voice recorder required by the operating rules of this chapter must be approved... interphone system. (4) Voice or audio signals identifying navigation or approach aids introduced into a...
14 CFR 29.1457 - Cockpit voice recorders.
Code of Federal Regulations, 2013 CFR
2013-01-01
... 14 Aeronautics and Space 1 2013-01-01 2013-01-01 false Cockpit voice recorders. 29.1457 Section 29... recorders. (a) Each cockpit voice recorder required by the operating rules of this chapter must be approved... interphone system. (4) Voice or audio signals identifying navigation or approach aids introduced into a...
14 CFR 25.1457 - Cockpit voice recorders.
Code of Federal Regulations, 2012 CFR
2012-01-01
... 14 Aeronautics and Space 1 2012-01-01 2012-01-01 false Cockpit voice recorders. 25.1457 Section 25... recorders. (a) Each cockpit voice recorder required by the operating rules of this chapter must be approved... interphone system. (4) Voice or audio signals identifying navigation or approach aids introduced into a...
Johansson, Kerstin; Strömbergsson, Sofia; Robieux, Camille; McAllister, Anita
2017-01-01
Reduced respiratory function following lower cervical spinal cord injuries (CSCIs) may indirectly result in vocal dysfunction. Although self-reports indicate voice change and limitations following CSCI, earlier efforts using global perceptual ratings to distinguish speakers with CSCI from noninjured speakers have not been very successful. We investigate the use of an audience response system-based approach to distinguish speakers with CSCI from noninjured speakers, and explore whether specific vocal traits can be identified as characteristic for speakers with CSCI. Fourteen speech-language pathologists participated in a web-based perceptual task, where their overt reactions to vocal dysfunction were registered during the continuous playback of recordings of 36 speakers (18 with CSCI, and 18 matched controls). Dysphonic events were identified through manual perceptual analysis, to allow the exploration of connections between dysphonic events and listener reactions. More dysphonic events, and more listener reactions, were registered for speakers with CSCI than for noninjured speakers. Strain (particularly in phrase-final position) and creak (particularly in nonphrase-final position) distinguish speakers with CSCI from noninjured speakers. For the identification of intermittent and subtle signs of vocal dysfunction, an approach where the temporal distribution of symptoms is registered offers a viable means to distinguish speakers affected by voice dysfunction from non-affected speakers. In speakers with CSCI, clinicians should listen for presence of final strain and nonfinal creak, and pay attention to self-reported voice function and voice problems, to identify individuals in need for clinical assessment and intervention. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Motorcycle Start-stop System based on Intelligent Biometric Voice Recognition
NASA Astrophysics Data System (ADS)
Winda, A.; E Byan, W. R.; Sofyan; Armansyah; Zariantin, D. L.; Josep, B. G.
2017-03-01
Current mechanical key in the motorcycle is prone to bulgary, being stolen or misplaced. Intelligent biometric voice recognition as means to replace this mechanism is proposed as an alternative. The proposed system will decide whether the voice is belong to the user or not and the word utter by the user is ‘On’ or ‘Off’. The decision voice will be sent to Arduino in order to start or stop the engine. The recorded voice is processed in order to get some features which later be used as input to the proposed system. The Mel-Frequency Ceptral Coefficient (MFCC) is adopted as a feature extraction technique. The extracted feature is the used as input to the SVM-based identifier. Experimental results confirm the effectiveness of the proposed intelligent voice recognition and word recognition system. It show that the proposed method produces a good training and testing accuracy, 99.31% and 99.43%, respectively. Moreover, the proposed system shows the performance of false rejection rate (FRR) and false acceptance rate (FAR) accuracy of 0.18% and 17.58%, respectively. In the intelligent word recognition shows that the training and testing accuracy are 100% and 96.3%, respectively.
Network Speech Systems Technology Program
NASA Astrophysics Data System (ADS)
Weinstein, C. J.
1980-09-01
This report documents work performed during FY 1980 on the DCA-sponsored Network Speech Systems Technology Program. The areas of work reported are: (1) communication systems studies in Demand-Assignment Multiple Access (DAMA), voice/data integration, and adaptive routing, in support of the evolving Defense Communications System (DCS) and Defense Switched Network (DSN); (2) a satellite/terrestrial integration design study including the functional design of voice and data interfaces to interconnect terrestrial and satellite network subsystems; and (3) voice-conferencing efforts dealing with support of the Secure Voice and Graphics Conferencing (SVGC) Test and Evaluation Program. Progress in definition and planning of experiments for the Experimental Integrated Switched Network (EISN) is detailed separately in an FY 80 Experiment Plan Supplement.
Satellite switched FDMA advanced communication technology satellite program
NASA Technical Reports Server (NTRS)
Atwood, S.; Higton, G. H.; Wood, K.; Kline, A.; Furiga, A.; Rausch, M.; Jan, Y.
1982-01-01
The satellite switched frequency division multiple access system provided a detailed system architecture that supports a point to point communication system for long haul voice, video and data traffic between small Earth terminals at Ka band frequencies at 30/20 GHz. A detailed system design is presented for the space segment, small terminal/trunking segment at network control segment for domestic traffic model A or B, each totaling 3.8 Gb/s of small terminal traffic and 6.2 Gb/s trunk traffic. The small terminal traffic (3.8 Gb/s) is emphasized, for the satellite router portion of the system design, which is a composite of thousands of Earth stations with digital traffic ranging from a single 32 Kb/s CVSD voice channel to thousands of channels containing voice, video and data with a data rate as high as 33 Mb/s. The system design concept presented, effectively optimizes a unique frequency and channelization plan for both traffic models A and B with minimum reorganization of the satellite payload transponder subsystem hardware design. The unique zoning concept allows multiple beam antennas while maximizing multiple carrier frequency reuse. Detailed hardware design estimates for an FDMA router (part of the satellite transponder subsystem) indicate a weight and dc power budget of 353 lbs, 195 watts for traffic model A and 498 lbs, 244 watts for traffic model B.
Al-Mamgani, A; van Rooij, P H; Woutersen, D P; Mehilal, R; Tans, L; Monserez, D; Baatenburg de Jong, R J
2013-08-01
To evaluate the outcomes of patients with early stage glottic cancer (GC) treated with radiotherapy (RT). The current study report on a retrospective analysis of oncologic outcome of 1050 patients with T1-2N0 glottic cancer treated with radiotherapy. Prospective assessment of quality of life (QoL) and voice handicap index (VHI) was performed in all patients treated from 2006 onwards (n = 233). Local control (LC), regional control (RC), disease-free survival (DFS), overall survival (OS), quality of life and voice handicap index. After a median follow-up of 90 months (range 3-309), the actuarial rates of local control, regional control, disease-free survival and overall survival were 85%, 99%, 84% and 81% at 5 years and 82%, 98%, 80% and 61% at 10 years, respectively. On multivariate analysis, T2 tumours, smoking after radiotherapy and conventional radiation scheme correlated significantly with poor local control. Patients who continued smoking after radiotherapy had also significantly lower overall survival rates (OR 4.3, P < 0.001). Hypothyroidism was reported in 18% of patients. Slight and temporary deterioration of quality of life scores was reported. Patient-reported xerostomia and dysphagia at 48 months were -7.1 and -6.5, compared with baseline, respectively. Voice handicap index improved significantly from 37 at baseline to 18 at 48 months. Patients with T2b and those who continued smoking had significantly worse voice handicap index. In the current study, excellent outcome with good quality of life and voice handicap index scores were reported. T2 tumours, in particular T2b, and continuing smoking after radiotherapy correlated significantly with poor local control and worse voice handicap index. © 2013 John Wiley & Sons Ltd.
Singing ability is rooted in vocal-motor control of pitch.
Hutchins, Sean; Larrouy-Maestri, Pauline; Peretz, Isabelle
2014-11-01
The inability to vocally match a pitch can be caused by poor pitch perception or by poor vocal-motor control. Although previous studies have tried to examine the relationship between pitch perception and vocal production, they have failed to control for the timbre of the target to be matched. In the present study, we compare pitch-matching accuracy with an unfamiliar instrument (the slider) and with the voice, designed such that the slider plays back recordings of the participant's own voice. We also measured pitch accuracy in singing a familiar melody ("Happy Birthday") to assess the relationship between single-pitch-matching tasks and melodic singing. Our results showed that participants (all nonmusicians) were significantly better at matching recordings of their own voices with the slider than with their voice, indicating that vocal-motor control is an important limiting factor on singing ability. We also found significant correlations between the ability to sing a melody in tune and vocal pitch matching, but not pitch matching on the slider. Better melodic singers also tended to have higher quality voices (as measured by acoustic variables). These results provide important evidence about the role of vocal-motor control in poor singing ability and demonstrate that single-pitch-matching tasks can be useful in measuring general singing abilities.
Conceptual Sound System Design for Clifford Odets' "GOLDEN BOY"
NASA Astrophysics Data System (ADS)
Yang, Yen Chun
There are two different aspects in the process of sound design, "Arts" and "Science". In my opinion, the sound design should engage both aspects strongly and in interaction with each other. I started the process of designing the sound for GOLDEN BOY by building the city soundscape of New York City in 1937. The scenic design for this piece is designed in the round, putting the audience all around the stage; this gave me a great opportunity to use surround and specialization techniques to transform the space into a different sonic world. My specialization design is composed of two subsystems -- one is the four (4) speakers center cluster diffusing towards the four (4) sections of audience, and the other is the four (4) speakers on the four (4) corners of the theatre. The outside ring provides rich sound source localization and the inside ring provides more support for control of the specialization details. In my design four (4) lavalier microphones are hung under the center iron cage from the four (4) corners of the stage. Each microphone is ten (10) feet above the stage. The signal for each microphone is sent to the two (2) center speakers in the cluster diagonally opposite the microphone. With the appropriate level adjustment of the microphones, the audience will not notice the amplification of the voices; however, through my specialization system, the presence and location of the voices of all actors are preserved for all audiences clearly. With such vocal reinforcements provided by the microphones, I no longer need to worry about overwhelming the dialogue on stage by the underscoring. A successful sound system design should not only provide a functional system, but also take the responsibility of bringing actors' voices to the audience and engaging the audience with the world that we create on stage. By designing a system which reinforces the actors' voices while at the same time providing control over localization of movement of sound effects, I was able not only to make the text present and clear for the audiences, but also to support the storyline strongly through my composed music, environmental soundscapes, and underscoring.
Hazell, Cassie M; Hayward, Mark; Cavanagh, Kate; Jones, Anna-Marie; Strauss, Clara
2018-05-01
Few patients have access to cognitive behaviour therapy for psychosis (CBTp) even though at least 16 sessions of CBTp is recommended in treatment guidelines. Briefer CBTp could improve access as the same number of therapists could see more patients. In addition, focusing on single psychotic symptoms, such as auditory hallucinations ('voices'), rather than on psychosis more broadly, may yield greater benefits. This pilot RCT recruited 28 participants (with a range of diagnoses) from NHS mental health services who were distressed by hearing voices. The study compared an 8-session guided self-help CBT intervention for distressing voices with a wait-list control. Data were collected at baseline and at 12weeks with post-therapy assessments conducted blind to allocation. Voice-impact was the pre-determined primary outcome. Secondary outcomes were depression, anxiety, wellbeing and recovery. Mechanism measures were self-esteem, beliefs about self, beliefs about voices and voice-relating. Recruitment and retention was feasible with low study (3.6%) and therapy (14.3%) dropout. There were large, statistically significant between-group effects on the primary outcome of voice-impact (d=1.78; 95% CIs: 0.86-2.70), which exceeded the minimum clinically important difference. Large, statistically significant effects were found on a number of secondary and mechanism measures. Large effects on the pre-determined primary outcome of voice-impact are encouraging, and criteria for progressing to a definitive trial are met. Significant between-group effects on measures of self-esteem, negative beliefs about self and beliefs about voice omnipotence are consistent with these being mechanisms of change and this requires testing in a future trial. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Göker, Ayşe Enise; Aydoğdu, İmran; Saltürk, Ziya; Berkiten, Güler; Atar, Yavuz; Kumral, Tolgar Lütfi; Uyar, Yavuz
2017-01-01
The aim of this study was to analyze and compare the vocal quality in patients who underwent either submucosal turbinectomy or radiofrequency cauterization. In this study, we enrolled 60 patients diagnosed with inferior concha hypertrophy. These patients were divided into two groups by using computer program "Research Randomizer." Of the 60 patients, 30 underwent submucosal inferior turbinoplasty and 30 underwent radiofrequency cauterization. The control group was composed of 30 healthy adults with no nasal or upper aerodigestive system pathology. The patients were checked at weeks 1, 2, and 4. Voice records were taken before the procedure and at week 4 postprocedure. The mean age of patients in the inferior turbinoplasty group was 29.4 years (range: 19-42 years); in the radiofrequency group, it was 30.30 years (range: 18-50 years). There was no statistical difference in age between groups. In the inferior turbinoplasty group, there were 16 male and 14 female patients, and in the radiofrequency group, there were 13 male and 17 female patients. There was no significant difference in the number of males and females between groups. Voice professionals, especially singers, actors, and actresses, should be informed about possible voice changes before undergoing endonasal surgery because these individuals are more sensitive to changes in resonance organs. We believe that voice quality should be regarded as a highly important parameter when measuring the success of endonasal surgery. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
A screening questionnaire for voice problems after treatment of early glottic cancer
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gogh, Christine D.L. van; Verdonck-de Leeuw, Irma M.; Boon-Kamma, Brigitte A.
Purpose: After treatment for early glottic cancer, a considerable number of patients end up with voice problems interfering with daily life activities. A 5-item screening questionnaire was designed for detection of voice impairment. The purpose of this study is to assess psychometric properties of this questionnaire in clinical practice. Methods and Materials: The questionnaire was completed by 110 controls without voice complaints and 177 patients after radiotherapy or laser surgery for early glottic cancer. Results: Based on normative data of the controls, a score of 5 or less on at least 1 of the 5 questions was considered to statemore » overall voice impairment. Reliability of the questionnaire proved to be good. Voice impairment was reported in 44% of the patients treated with radiotherapy vs. 29% of the patients treated with endoscopic laser surgery. Conclusions: The questionnaire proved to be a reliable, valid, and feasible method to detect voice impairment in daily life. The questionnaire is easy to fill in, and interpretation is straightforward. It is useful for both radiation oncologists and otorhinolaryngologists in their follow-up of patients treated for early glottic cancer.« less
[Case-control survey on risk factors of benign vocal fold lesions].
Huang, Dong-Yan; Yang, Wei-Yan; Yu, Ping; He, Yao; Han, Dong-Yi
2008-02-01
To investigate the risk factors that may relate with benign vocal fold lesions including vocal fold nodule, vocal fold polyp, chronic laryngitis and Reinke's edema In present series, 321 cases who were performed laryngoscope were invited to participate the survey. Among them 168 cases with benign vocal fold lesions composed the case group. Another 153 cases with normal larynx composed the control group. Each case were undertook the same questionnaire. Logistic regression analysis was preformed to investigate the possible risk factors. The result demonstrated the occurring of benign vocal fold lesions positively correlated to five factors, including occupation, work or residence environment noise, alcohol-consuming, voice-using hours per day and abuse of voice. Occupations with intensive voice-use were more vulnerable to developing these disorders. Occurring risk of occupations type II with moderate voice-use was 1.934 times than that of occupations type I with lesser voice-use (OR = 1.934). And risk of occupations type III with upper voice-use was 2.633 times than that of type I. Risk raised 1.302 times with each more hour of voice use per day. OR of the following factors of voice abuse, environment noise, alcohol-consuming was 4.744, 2.115 and 2.177, respectively. The result suggested that people should abstain from alcohol, lowering the environment noise, prevent overuse and abuse of voice in order to decrease the prevalence of these disorders, which is especially important for the professional voice users, e. g. teachers or managers. The essential therapy for these disorders is to correct bad phonation habits.
On the definition and interpretation of voice selective activation in the temporal cortex
Bethmann, Anja; Brechmann, André
2014-01-01
Regions along the superior temporal sulci and in the anterior temporal lobes have been found to be involved in voice processing. It has even been argued that parts of the temporal cortices serve as voice-selective areas. Yet, evidence for voice-selective activation in the strict sense is still missing. The current fMRI study aimed at assessing the degree of voice-specific processing in different parts of the superior and middle temporal cortices. To this end, voices of famous persons were contrasted with widely different categories, which were sounds of animals and musical instruments. The argumentation was that only brain regions with statistically proven absence of activation by the control stimuli may be considered as candidates for voice-selective areas. Neural activity was found to be stronger in response to human voices in all analyzed parts of the temporal lobes except for the middle and posterior STG. More importantly, the activation differences between voices and the other environmental sounds increased continuously from the mid-posterior STG to the anterior MTG. Here, only voices but not the control stimuli excited an increase of the BOLD response above a resting baseline level. The findings are discussed with reference to the function of the anterior temporal lobes in person recognition and the general question on how to define selectivity of brain regions for a specific class of stimuli or tasks. In addition, our results corroborate recent assumptions about the hierarchical organization of auditory processing building on a processing stream from the primary auditory cortices to anterior portions of the temporal lobes. PMID:25071527
NASA Technical Reports Server (NTRS)
Sword, A. J.; Park, W. T.
1975-01-01
A teleoperator system with a computer for manipulator control to combine the capabilities of both man and computer to accomplish a task is described. This system allows objects in unpredictable locations to be successfully located and acquired. By using a method of characterizing the work-space together with man's ability to plan a strategy and coarsely locate an object, the computer is provided with enough information to complete the tedious part of the task. In addition, the use of voice control is shown to be a useful component of the man/machine interface.
Nagar, Ruchit; Venkat, Preethi; Stone, Logan D; Engel, Kyle A; Sadda, Praneeth; Shahnawaz, Mohammed
2017-11-18
Five hundred thousand children under the age of 5 die from vaccine preventable diseases in India every year. More than just improving coverage, increasing timeliness of immunizations is critical to ensuring infant health in the first year of life. Novel, culturally appropriate community engagement strategies are worth exploring to close the immunization gap. In our study, a digital NFC (Near Field Communication) pendant worn on black thread and voice call reminder system was tested for the effectiveness in improving DTP3 adherence within 2 monthly camps from DTP1 administration. A cluster randomized controlled trial was conducted in which 96 village health camps were randomized to 3 arms: NFC sticker, NFC pendant, and NFC pendant with voice call reminder in local dialect. Randomization was done across 5 blocks in the Udaipur District serviced by Seva Mandir from August 2015 to April 2016. In terms of our three primary outcomes related to DTP3 adherence, point estimates show conflicting results. Two outcomes presented adherence in the control. DTP3 completion within two camps after DTP1 showed higher adherence in the Control (Sticker) (74.2%) arm compared to the Pendant (67.2%) and Pendant and Voice arms (69.3%). Likewise, the estimate for DTP3 completion within 180 days of birth in the Control (Sticker) (69.4%) arm was higher than estimates in the Pendant (57.4%) and Pendant and Voice arms (58.7%). However, one outcome displayed higher adherence in the intervention. DTP3 completion within two months from the time of registration was higher in the Pendant (37.7%) and Pendant and Voice arms (38.7%) compared to the Control (Sticker) arm (27.4%). In all primary outcomes, differences in adherence were statistically insignificant both before and after controlling for confounding factors. In terms of secondary outcomes, our results suggest that providing a necklace generated significant community discussion (H = 8.8796, df = 2, p = .0118), had strong satisfaction among users (χ2=26.039, df = 4, p < .0001), and resulted in increased visibility within families (grandmothers:χ2=34.023, df = 2, p < .0001, fathers: χ2=34.588, df = 2, p < .0001). Neither the NFC necklace nor the necklace with additional voice call reminders in the local dialect directly resulted in an increase in infant immunization timeliness through DTP3, the primary outcome. Still our process outcomes suggest that our culturally symbolic necklace has potential to be an assistive tool in immunization campaigns. Follow-on work will seek to examine whether positive behavior change towards vaccines can be fostered with earlier engagement of this platform beginning in the prenatal stage, under a continuum of care framework. Copyright © 2017. Published by Elsevier Ltd.
Risk and protective factors for spasmodic dysphonia: a case-control investigation.
Tanner, Kristine; Roy, Nelson; Merrill, Ray M; Kimber, Kamille; Sauder, Cara; Houtz, Daniel R; Doman, Darrin; Smith, Marshall E
2011-01-01
Spasmodic dysphonia (SD) is a chronic, incurable, and often disabling voice disorder of unknown pathogenesis. The purpose of this study was to identify possible endogenous and exogenous risk and protective factors uniquely associated with SD. Prospective, exploratory, case-control investigation. One hundred fifty patients with SD and 150 medical controls (MCs) were interviewed regarding their personal and family histories, environmental exposures, illnesses, injuries, voice use patterns, and general health using a previously vetted and validated epidemiologic questionnaire. Odds ratios and multiple logistic regression analyses (α<0.15) identified several factors that significantly increased the likelihood of having SD. These factors included (1) a personal history of mumps, blepharospasm, tremor, intense occupational and avocational voice use, and a family history of voice disorders; (2) an immediate family history of meningitis, tremor, tics, cancer, and compulsive behaviors; and (3) an extended family history of tremor and cancer. SD is likely multifactorial in etiology, involving both genetic and environmental factors. Viral infections/exposures, along with intense voice use, may trigger the onset of SD in genetically predisposed individuals. Future studies should examine the interaction among genetic and environmental factors to determine the pathogenesis of SD. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Current Hale ROA Voice and Control Communication Practices and Performance: White Paper
NASA Technical Reports Server (NTRS)
2007-01-01
The objective of this white paper is to help achieve the ACCESS 5 goal by sharing the UNITE members knowledge of current HALE ROA communication systems with other ACCESS 5 participants so that all interested parties start from a common understanding as we begin the clarification of requirements for voice and C2 communication. This white paper is also intended to describe the point of departure for any future developments that need to be realized to achieve the long term ACCESS 5 goal. Although this white paper describes the current systems, the functional and performance requirements that are also being developed under ACCESS 5 may not require the same levels of functionality and performance as currently exist. The paper addresses the following: 1) A description of a typical current HALE ROA communications system, 2) HALE ROA communications systems performance metrics, 3) HALE ROA communications systems performance, and 5) A comparison of current HALE ROA communications systems with current regulations.
Automating CapCom Using Mobile Agents and Robotic Assistants
NASA Technical Reports Server (NTRS)
Clancey, William J.; Sierhuis, Maarten; Alena, Richard L.; Graham, Jeffrey S.; Tyree, Kim S.; Hirsh, Robert L.; Garry, W. Brent; Semple, Abigail; Shum, Simon J. Buckingham; Shadbolt, Nigel;
2007-01-01
Mobile Agents (MA) is an advanced Extra-Vehicular Activity (EVA) communications and computing system to increase astronaut self-reliance and safety, reducing dependence on continuous monitoring and advising from mission control on Earth. MA is voice controlled and provides information verbally to the astronauts through programs called "personal agents." The system partly automates the role of CapCom in Apollo-including monitoring and managing navigation, scheduling, equipment deployment, telemetry, health tracking, and scientific data collection. Data are stored automatically in a shared database in the habitat/vehicle and mirrored to a site accessible by a remote science team. The program has been developed iteratively in authentic work contexts, including six years of ethnographic observation of field geology. Analog field experiments in Utah enabled empirically discovering requirements and testing alternative technologies and protocols. We report on the 2004 system configuration, experiments, and results, in which an EVA robotic assistant (ERA) followed geologists approximately 150 m through a winding, narrow canyon. On voice command, the ERA took photographs and panoramas and was directed to serve as a relay on the wireless network.
NASA Astrophysics Data System (ADS)
Patankar, Manoj Shashikant
Federal Aviation Regulations require Aviation Maintenance Technicians (AMTs) to refer to approved maintenance manuals when performing maintenance on airworthy aircraft. Because these manuals are paper-based, larger the size of the aircraft, more cumbersome are the manuals. Federal Aviation Administration (FAA) recognized the difficulties associated with the use of large manuals and conducted studies on the use of electronic media as an alternative to the traditional paper format. However, these techniques do not employ any artificial intelligence technologies and the user interface is limited to either a keyboard or a stylus pen. The primary emphasis of this research was to design a generic framework that would allow future development of voice-activated, intelligent, and hypermedia-based aircraft maintenance manuals. A prototype (VIHAMS-Voice-activated, Intelligent, and Hypermedia-based Aircraft Maintenance System) was developed, as a secondary emphasis, using the design and development techniques that evolved from this research. An evolutionary software design approach was used to design the proposed framework and the structured rapid prototyping technique was used to produce the VIHAMS prototype. VoiceAssist by Creative Labs was used to provide the voice interface so that the users (AMTs) could keep their hands free to work on the aircraft while maintaining complete control over the computer through discrete voice commands. KnowledgePro for Windows sp{TM}, an expert system shell, provided "intelligence" to the prototype. As a result of this intelligence, the system provided expert guidance to the user. The core information contained in conventional manuals was available in a hypermedia format. The prototype's operating hardware included a notebook computer with a fully functional audio system. An external microphone and the built-in speaker served as the input and output devices (along with the color monitor), respectively. Federal Aviation Administration estimates the United States air carriers to operate 3,991 large jet aircraft in the year 1996 (FAA Aviation Forecasts, 1987-1998). With an estimate of seventy manuals per such aircraft, the development of intelligent manuals is expected to impact 279,370 manuals in this country. Soon, over 55 thousand maintenance technicians will be able to carry the seven pound system to an aircraft, use voice commands to access the aircraft's files on the system, seek assistance from the expert system to diagnose the fault, and obtain instructions on how to rectify the fault. The evolutionary design approach and the rapid prototyping techniques were very well suited for the spiral testing strategy. Therefore, this strategy was used to test the structural and functional validity of this research. Professors Darrell Anderson and Brian Stout (Aviation faculty at San Jose State University) and Mr. Gregory Shea (a United Airlines mechanic and SJSU student) are representatives of the real-world users of the final product. Therefore, they conducted the alpha test of this prototype. Mr. Daniel Neal and Mr. Stephen Harms have been actively involved in light aircraft maintenance for more than ten years. They evaluated the prototype's usability. All the above evaluators used standard testing tools and evaluated the prototype under field conditions. The evaluators concluded that the VIHAMS prototype used a valid fault diagnosis strategy, the system architecture could be used to develop similar systems using off-the-shelf tools, and the voice input system could be refined to improve its usability.
Coprevalence of Anxiety and Depression With Spasmodic Dysphonia: A Case-Control Study
White, Laura J.; Hapner, Edie R.; Klein, Adam M.; Delgaudio, John M.; Hanfelt, John J.; Jinnah, H. A.; Johns, Michael M.
2012-01-01
Summary Introduction There is evidence supporting an association between depression and anxiety in patients with chronic disease. Spasmodic dysphonia (SD) is a chronic, incurable, and disabling voice disorder. Reported rates of depression and anxiety in SD range from 7.1% to 72%, with a maximum number of 18 patients. The goal of this study was to define the coprevalence of depression and anxiety with SD. Materials and Methods A single-institution case-control study was performed from May to July 2010. Consecutive patients with SD and benign voice disorders were enrolled prospectively. On enrollment, patients were asked to fill out a questionnaire that reviewed the duration of the voice disorder and personal history of anxiety and depression, including current and lifetime diagnosis. Results One hundred forty-six controls with benign voice disorders and 128 patients with SD were enrolled. Patients with SD were no more likely to be diagnosed with depression or anxiety than those of the control group (odds ratio [OR] = 0.985, 95% confidence interval [CI] = 0.59–1.63; and OR = 1.314; 95% CI 0.75–2.3, respectively). Additionally, duration of disease was a risk factor for depression in both the SD group and the control group, and the association was not significantly different between groups. Conclusion Patients with SD were no more likely to have depression or anxiety than those with other voice disorders. It is important for otolaryngologists to be aware of the increased rates of depression in patients diagnosed with chronic diseases, including voice disorders, and to refer to a psychiatrist when appropriate. PMID:22209056
NASA Astrophysics Data System (ADS)
Poock, G. K.; Martin, B. J.
1984-02-01
This was an applied investigation examining the ability of a speech recognition system to recognize speakers' inputs when the speakers were under different stress levels. Subjects were asked to speak to a voice recognition system under three conditions: (1) normal office environment, (2) emotional stress, and (3) perceptual-motor stress. Results indicate a definite relationship between voice recognition system performance and the type of low stress reference patterns used to achieve recognition.
The effectiveness of a voice treatment approach for teachers with self-reported voice problems.
Gillivan-Murphy, Patricia; Drinnan, Michael J; O'Dwyer, Tadhg P; Ridha, Hayder; Carding, Paul
2006-09-01
Teachers are considered the professional group most at risk of developing voice-problems, but limited treatment effectiveness evidence exists. We studied prospectively the effectiveness of a 6-week combined treatment approach using vocal function exercises (VFEs) and vocal hygiene (VH) education with 20 teachers with self-reported voice problems. Twenty subjects were randomly assigned to a no-treatment control (n = 11) and a treatment group (n = 9). Fibreoptic endoscopic evaluation was carried out on all subjects before randomization. Two self-report voice outcome measures were used: the Voice-Related Quality of Life (VRQOL) and the Voice Symptom Severity Scale (VoiSS). A Voice Care Knowledge Visual Analogue Scale (VAS), developed specifically for the study, was also used to evaluate change in selected voice knowledge areas. A Student unpaired t test revealed a statistically significant (P < 0.05) improvement in the treatment group as measured by the VoiSS. There was not a significant improvement in the treatment group as measured by the V-RQOL. The difference in voice care knowledge areas was also significant for the treatment group (P < 0.05). This study suggests that a voice treatment approach of VFEs and VH education improved self-reported voice symptoms and voice care knowledge in a group of teachers.
Easterling, Caryn
2008-09-01
Age-related sarcopenia or muscle wasting contributes to changes in the ability to perform activities of daily living, changes in deglutition, and changes in vocal function. The Shaker Exercise, an isometric and isokinetic exercise, has been shown to strengthen suprahyoid muscles and increase deglutitive anteroposterior (AP) upper esophageal sphincter (UES) opening diameter. The aim of this study was to determine if this exercise has an effect on the age-related changes in vocal function and deglutition in healthy older adults. Eleven females and 10 males, aged 65-78 years (mean = 70 +/- 4 years) and with a negative history for dysphagia and voice disorders, participated by exercising three times per day for 6 weeks. Five age-matched controls did not perform the exercise. Acoustic analysis of voice and biomechanical analysis of deglutition were performed before and after 6 weeks of exercise. Controls participated in voice analysis only. Dysphonia Severity Index (DSI), a multivariate voice index, was used to compare voice production initially and after 6 weeks. Deglutitive biomechanical measures increased and DSI scores improved in 10 of 21 participants following 6 weeks of the exercise. DSI for controls did not change over the 6-week period. Ten of 21 exercise participants experienced improved deglutitive biomechanics and DSI scores. Accuracy of exercise performance, compliance, and/or disclosed alterations in health status may contribute to the lack of deglutitive and DSI change in the participants who did not experience change in function. A large randomized control study, including periodic monitoring of health status, exercise performance accuracy, and compliance, is warranted to evaluate the affect of this exercise on deglutition as well as voice. The Shaker Exercise could be recommended as a preventative measure to diminish the effect of sarcopenia on the muscles used in deglutition and voice and alter the progression of the characteristic senescent voice and swallow changes.
Remote voice training: A case study on space shuttle applications, appendix C
NASA Technical Reports Server (NTRS)
Mollakarimi, Cindy; Hamid, Tamin
1990-01-01
The Tile Automation System includes applications of automation and robotics technology to all aspects of the Shuttle tile processing and inspection system. An integrated set of rapid prototyping testbeds was developed which include speech recognition and synthesis, laser imaging systems, distributed Ada programming environments, distributed relational data base architectures, distributed computer network architectures, multi-media workbenches, and human factors considerations. Remote voice training in the Tile Automation System is discussed. The user is prompted over a headset by synthesized speech for the training sequences. The voice recognition units and the voice output units are remote from the user and are connected by Ethernet to the main computer system. A supervisory channel is used to monitor the training sequences. Discussions include the training approaches as well as the human factors problems and solutions for this system utilizing remote training techniques.
Period for Normalization of Voice Acoustic Parameters in Indian Pediatric Cochlear Implantees.
Joy, Jeena V; Deshpande, Shweta; Vaid, Dr Neelam
2017-05-01
The purpose of this study was to investigate the duration required by children with cochlear implants to approximate the norms of voice acoustic parameters. The study design is retrospective. Thirty children with cochlear implants (chronological ages ranging between 4.1 and 6.7 years) were divided into three groups, based on the postimplantation duration. Ten normal-hearing children (chronological ages ranging between 4 and 7 years) were selected as the control group. All implanted children underwent an objective voice analysis using Dr. Speech software (Tiger DRS, Inc., Seattle, WA, USA) at 6 months and at 1 and 2 years of implant use. Voice analysis was done for the children in the control group and means were derived for all the parameters analyzed to obtain the normal values. Habitual fundamental frequency (HFF), jitter (frequency variation), and shimmer (amplitude variation) were the voice acoustic parameters analyzed for the vowels |a|, |i|, and |u|. The obtained values of these parameters were then compared with the norms. HFF for the children with implant use for 6 months and 1 year did significantly differ from the control group. However, there was no significant difference (P > 0.5) observed in the children with implant use for 2 years, thus matching the norms. Jitter and shimmer showed a significant difference (P < 0.5) even at 2 years of implant use when compared with the control group. The findings of the study divulge that children with cochlear implants approximate age-matched normal-hearing kids with respect to the voice acoustic parameter of HFF by 2 years of implant use. However, jitter and shimmer were not found to stabilize for the duration studied. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Speaking in Character: Voice Communication in Virtual Worlds
NASA Astrophysics Data System (ADS)
Wadley, Greg; Gibbs, Martin R.
This chapter summarizes 5 years of research on the implications of introducing voice communication systems to virtual worlds. Voice introduces both benefits and problems for players of fast-paced team games, from better coordination of groups and greater social presence of fellow players on the positive side, to negative features such as channel congestion, transmission of noise, and an unwillingness by some to use voice with strangers online. Similarly, in non-game worlds like Second Life, issues related to identity and impression management play important roles, as voice may build greater trust that is especially important for business users, yet it erodes the anonymity and ability to conceal social attributes like gender that are important for other users. A very different mixture of problems and opportunities exists when users conduct several simultaneous conversations in multiple text and voice channels. Technical difficulties still exist with current systems, including the challenge of debugging and harmonizing all the participants' voice setups. Different groups use virtual worlds for very different purposes, so a single modality may not suit all.
Epidemiology of Voice Disorders in Latvian School Teachers.
Trinite, Baiba
2017-07-01
The prevalence of voice disorders in the teacher population in Latvia has not been studied so far and this is the first epidemiological study whose goal is to investigate the prevalence of voice disorders and their risk factors in this professional group. A wide cross-sectional study using stratified sampling methodology was implemented in the general education schools of Latvia. The self-administered voice risk factor questionnaire and the Voice Handicap Index were completed by 522 teachers. Two teachers groups were formed: the voice disorders group which included 235 teachers with actual voice problems or problems during the last 9 months; and the control group which included 174 teachers without voice disorders. Sixty-six percent of teachers gave a positive answer to the following question: Have you ever had problems with your voice? Voice problems are more often found in female than male teachers (68.2% vs 48.8%). Music teachers suffer from voice disorders more often than teachers of other subjects. Eighty-two percent of teachers first faced voice problems in their professional carrier. The odds of voice disorders increase if the following risk factors exist: extra vocal load, shouting, throat clearing, neglecting of personal health, background noise, chronic illnesses of the upper respiratory tract, allergy, job dissatisfaction, and regular stress in the working place. The study findings indicated a high risk of voice disorders among Latvian teachers. The study confirmed data concerning the multifactorial etiology of voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Building VoiceXML-Based Applications
2002-01-01
basketball games. The Busline systems were pri- y developed using an early implementation of VoiceXML he NBA Update Line was developed using VoiceXML...traveling in and out of Pittsburgh’s rsity neighborhood. The second project is the NBA Up- Line, which provides callers with real-time information NBA ... NBA UPDATE LINE The target user of this system is a fairly knowledgeable basket- ball fan; the system must therefore be able to provide detailed
Voice stress analysis and evaluation
NASA Astrophysics Data System (ADS)
Haddad, Darren M.; Ratley, Roy J.
2001-02-01
Voice Stress Analysis (VSA) systems are marketed as computer-based systems capable of measuring stress in a person's voice as an indicator of deception. They are advertised as being less expensive, easier to use, less invasive in use, and less constrained in their operation then polygraph technology. The National Institute of Justice have asked the Air Force Research Laboratory for assistance in evaluating voice stress analysis technology. Law enforcement officials have also been asking questions about this technology. If VSA technology proves to be effective, its value for military and law enforcement application is tremendous.
Broadening the interface bandwidth in simulation based training
NASA Technical Reports Server (NTRS)
Somers, Larry E.
1989-01-01
Currently most computer based simulations rely exclusively on computer generated graphics to create the simulation. When training is involved, the method almost exclusively used to display information to the learner is text displayed on the cathode ray tube. MICROEXPERT Systems is concentrating on broadening the communications bandwidth between the computer and user by employing a novel approach to video image storage combined with sound and voice output. An expert system is used to combine and control the presentation of analog video, sound, and voice output with computer based graphics and text. Researchers are currently involved in the development of several graphics based user interfaces for NASA, the U.S. Army, and the U.S. Navy. Here, the focus is on the human factors considerations, software modules, and hardware components being used to develop these interfaces.
Speech-recognition interfaces for music information retrieval
NASA Astrophysics Data System (ADS)
Goto, Masataka
2005-09-01
This paper describes two hands-free music information retrieval (MIR) systems that enable a user to retrieve and play back a musical piece by saying its title or the artist's name. Although various interfaces for MIR have been proposed, speech-recognition interfaces suitable for retrieving musical pieces have not been studied. Our MIR-based jukebox systems employ two different speech-recognition interfaces for MIR, speech completion and speech spotter, which exploit intentionally controlled nonverbal speech information in original ways. The first is a music retrieval system with the speech-completion interface that is suitable for music stores and car-driving situations. When a user only remembers part of the name of a musical piece or an artist and utters only a remembered fragment, the system helps the user recall and enter the name by completing the fragment. The second is a background-music playback system with the speech-spotter interface that can enrich human-human conversation. When a user is talking to another person, the system allows the user to enter voice commands for music playback control by spotting a special voice-command utterance in face-to-face or telephone conversations. Experimental results from use of these systems have demonstrated the effectiveness of the speech-completion and speech-spotter interfaces. (Video clips: http://staff.aist.go.jp/m.goto/MIR/speech-if.html)
Voice and choice in health care in England: understanding citizen responses to dissatisfaction.
Dowding, Keith; John, Peter
2011-01-01
Using data from a five-year online survey the paper examines the effects of relative satisfaction with health services on individuals' voice-and-choice activity in the English public health care system. Voice is considered in three parts – individual voice (complaints), collective voice voting and participation (collective action). Exercising choice is seen in terms of complete exit (not using health care), internal exit (choosing another public service provider) and private exit (using private health care). The interaction of satisfaction and forms of voice and choice are analysed over time. Both voice and choice are correlated with dissatisfaction with those who are unhappy with the NHS more likely to privately voice and to plan to take up private health care. Those unable to choose private provision are likely to use private voice. These factors are not affected by items associated with social capital – indeed, being more trusting leads to lower voice activity.
A Randomized Controlled Trial of Two Semi-Occluded Vocal Tract Voice Therapy Protocols
Hunter, Eric J.; Kirkham, Kimberly; Cox, Karin; Titze, Ingo R.
2015-01-01
Purpose Although there is a long history of use of semi-occluded vocal tract gestures in voice therapy, including phonation through thin tubes or straws, the efficacy of phonation through tubes has not been established. This study compares results from a therapy program on the basis of phonation through a flow-resistant tube (FRT) with Vocal Function Exercises (VFE), an established set of exercises that utilize oral semi-occlusions. Method Twenty subjects (16 women, 4 men) with dysphonia and/or vocal fatigue were randomly assigned to 1 of 4 treatment conditions: (a) immediate FRT therapy, (b) immediate VFE therapy, (c) delayed FRT therapy, or (d) delayed VFE therapy. Subjects receiving delayed therapy served as a no-treatment control group. Results Voice Handicap Index (Jacobson et al., 1997) scores showed significant improvement for both treatment groups relative to the no-treatment group. Comparison of the effect sizes suggests FRT therapy is noninferior to VFE in terms of reduction in Voice Handicap Index scores. Significant reductions in Roughness on the Consensus Auditory-Perceptual Evaluation of Voice (Kempster, Gerratt, Verdolini Abbott, Barkmeier-Kraemer, & Hillman, 2009) were found for the FRT subjects, with no other significant voice quality findings. Conclusions VFE and FRT therapy may improve voice quality of life in some individuals with dysphonia. FRT therapy was noninferior to VFE in improving voice quality of life in this study. PMID:25675335
Ali, Zulfiqar; Alsulaiman, Mansour; Muhammad, Ghulam; Elamvazuthi, Irraivan; Al-Nasheri, Ahmed; Mesallam, Tamer A; Farahat, Mohamed; Malki, Khalid H
2017-05-01
A large population around the world has voice complications. Various approaches for subjective and objective evaluations have been suggested in the literature. The subjective approach strongly depends on the experience and area of expertise of a clinician, and human error cannot be neglected. On the other hand, the objective or automatic approach is noninvasive. Automatic developed systems can provide complementary information that may be helpful for a clinician in the early screening of a voice disorder. At the same time, automatic systems can be deployed in remote areas where a general practitioner can use them and may refer the patient to a specialist to avoid complications that may be life threatening. Many automatic systems for disorder detection have been developed by applying different types of conventional speech features such as the linear prediction coefficients, linear prediction cepstral coefficients, and Mel-frequency cepstral coefficients (MFCCs). This study aims to ascertain whether conventional speech features detect voice pathology reliably, and whether they can be correlated with voice quality. To investigate this, an automatic detection system based on MFCC was developed, and three different voice disorder databases were used in this study. The experimental results suggest that the accuracy of the MFCC-based system varies from database to database. The detection rate for the intra-database ranges from 72% to 95%, and that for the inter-database is from 47% to 82%. The results conclude that conventional speech features are not correlated with voice, and hence are not reliable in pathology detection. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
McAllister, Anita; Brandt, Signe Kofoed
2012-09-01
A well-controlled recording in a studio is fundamental in most voice rehabilitation. However, this laboratory like recording method has been questioned because voice use in a natural environment may be quite different. In children's natural environment, high background noise levels are common and are an important factor contributing to voice problems. The primary noise source in day-care centers is the children themselves. The aim of the present study was to compare perceptual evaluations of voice quality and acoustic measures from a controlled recording with recordings of spontaneous speech in children's natural environment in a day-care setting. Eleven 5-year-old children were recorded three times during a day at the day care. The controlled speech material consisted of repeated sentences. Matching sentences were selected from the spontaneous speech. All sentences were repeated three times. Recordings were randomized and analyzed acoustically and perceptually. Statistic analyses showed that fundamental frequency was significantly higher in spontaneous speech (P<0.01) as was hyperfunction (P<0.001). The only characteristic the controlled sentences shared with spontaneous speech was degree of hoarseness (Spearman's rho=0.564). When data for boys and girls were analyzed separately, a correlation was found for the parameter breathiness (rho=0.551) for boys, and for girls the correlation for hoarseness remained (rho=0.752). Regarding acoustic data, none of the measures correlated across recording conditions for the whole group. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Voice Relative Fundamental Frequency via Neck-Skin Acceleration in Individuals with Voice Disorders
ERIC Educational Resources Information Center
Lien, Yu-An S.; Calabrese, Carolyn R.; Michener, Carolyn M.; Murray, Elizabeth Heller; Van Stan, Jarrad H.; Mehta, Daryush D.; Hillman, Robert E.; Noordzij, J. Pieter; Stepp, Cara E.
2015-01-01
Purpose: This study investigated the use of neck-skin acceleration for relative fundamental frequency (RFF) analysis. Method: Forty individuals with voice disorders associated with vocal hyperfunction and 20 age- and sex-matched control participants were recorded with a subglottal neck-surface accelerometer and a microphone while producing speech…
Feigned Depression and Feigned Sleepiness: A Voice Acoustical Analysis
ERIC Educational Resources Information Center
Reilly, Nicole; Cannizzaro, Michael S.; Harel, Brian T.; Snyder, Peter J.
2004-01-01
We sought to profile the voice acoustical correlates of simulated, or feigned depression by neurologically and psychiatrically healthy control subjects. We also sought to identify the voice acoustical correlates of feigned sleepiness for these same subjects. Twenty-two participants were asked to speak freely about a cartoon, to count from 1 to 10,…
Burnett, Greg C.; Holzrichter, John F.; Ng, Lawrence C.
2002-01-01
Low power EM waves are used to detect motions of vocal tract tissues of the human speech system before, during, and after voiced speech. A voiced excitation function is derived. The excitation function provides speech production information to enhance speech characterization and to enable noise removal from human speech.
Performance of wavelet analysis and neural networks for pathological voices identification
NASA Astrophysics Data System (ADS)
Salhi, Lotfi; Talbi, Mourad; Abid, Sabeur; Cherif, Adnane
2011-09-01
Within the medical environment, diverse techniques exist to assess the state of the voice of the patient. The inspection technique is inconvenient for a number of reasons, such as its high cost, the duration of the inspection, and above all, the fact that it is an invasive technique. This study focuses on a robust, rapid and accurate system for automatic identification of pathological voices. This system employs non-invasive, non-expensive and fully automated method based on hybrid approach: wavelet transform analysis and neural network classifier. First, we present the results obtained in our previous study while using classic feature parameters. These results allow visual identification of pathological voices. Second, quantified parameters drifting from the wavelet analysis are proposed to characterise the speech sample. On the other hand, a system of multilayer neural networks (MNNs) has been developed which carries out the automatic detection of pathological voices. The developed method was evaluated using voice database composed of recorded voice samples (continuous speech) from normophonic or dysphonic speakers. The dysphonic speakers were patients of a National Hospital 'RABTA' of Tunis Tunisia and a University Hospital in Brussels, Belgium. Experimental results indicate a success rate ranging between 75% and 98.61% for discrimination of normal and pathological voices using the proposed parameters and neural network classifier. We also compared the average classification rate based on the MNN, Gaussian mixture model and support vector machines.
Pavlikhin, O G; Romanenko, S G; Krasnikova, D I; Lesogorova, E V; Yakovlev, V S
The objective of the present study was to evaluate the clinical and functional condition of the voice apparatus in the elderly patients and to elaborate recommendations for the prevention of disturbances of the vocal function in the professional voice users. This comprehensive study involved 95 patients including the active professional voice users (n=48) and 45 non-occupational voice users at the age from 61 to 82 years with the employment history varying from 32 to 51 years. The study was designed to obtain the voice characteristics by means of the subjective auditory assessment, microlaryngoscopy, video laryngostroboscopy, determination of maximum phonation time (MPT), and computer-assisted acoustic analysis of the voice with the use of the MDVP Kay Pentaxy system. The level of anxiety of the patients was estimated based on the results of the HADS questionnaire study. It is concluded that the majority of the disturbances of the vocal function in the professional voice users have the functional nature. It is concluded that the method of neuro-muscular electrophonopedic stimulation (NMEPS) of laryngeal muscles is the method of choice for the diagnostics of the vocal function of the voice users in the late adulthood. It is recommended that the professional vocal load for such subjects should not exceed 12-14 hours per week. Rational psychotherapy must constitute an important component of the system of measures intended to support the working capacity of the voice users belonging to this age group.
Effectiveness of voice therapy in functional dysphonia: where are we now?
Bos-Clark, Marianne; Carding, Paul
2011-06-01
To review the recent literature since the 2009 Cochrane review regarding the effectiveness of voice therapy for patients with functional dysphonia. A range of articles report on the effects of voice therapy treatment for functional dysphonia, with a wide range of interventions described. Only one study is a randomized controlled trial. A number of excellent review articles have extended the knowledge base. In primary research, methodological issues persist: studies are small, and not adequately controlled. Studies show improved standards of outcome measurement and of description of the content of voice therapy. There is a continued need for larger, methodologically sound clinical effectiveness studies. Future studies need to be replicable and generalizable in order to inform and elucidate clinical practice.
Infant face interest is associated with voice information and maternal psychological health.
Taylor, Gemma; Slade, Pauline; Herbert, Jane S
2014-11-01
Early infant interest in their mother's face is driven by an experience based face processing system, and is associated with maternal psychological health, even within a non clinical community sample. The present study examined the role of the voice in eliciting infants' interest in mother and stranger faces and in the association between infant face interest and maternal psychological health. Infants aged 3.5-months were shown photographs of their mother's and a stranger's face paired with an audio recording of their mother's and a stranger's voice that was either matched (e.g., mother's face and voice) or mismatched (e.g., mother's face and stranger's voice). Infants spent more time attending to the stranger's matched face and voice than the mother's matched face and voice and the mismatched faces and voices. Thus, infants demonstrated an earlier preference for a stranger's face when given voice information than when the face is presented alone. In the present sample, maternal psychological health varied with 56.7% of mothers reporting mild mood symptoms (depression, anxiety or stress response to childbirth). Infants of mothers with significant mild maternal mood symptoms looked longer at the faces and voices compared to infants of mothers who did not report mild maternal mood symptoms. In sum, infants' experience based face processing system is sensitive to their mothers' maternal psychological health and the multimodal nature of faces. Copyright © 2014 Elsevier Inc. All rights reserved.
A simulation system for Space Station extravehicular activity
NASA Technical Reports Server (NTRS)
Marmolejo, Jose A.; Shepherd, Chip
1993-01-01
America's next major step into space will be the construction of a permanently manned Space Station which is currently under development and scheduled for full operation in the mid-1990's. Most of the construction of the Space Station will be performed over several flights by suited crew members during an extravehicular activity (EVA) from the Space Shuttle. Once fully operational, EVA's will be performed from the Space Station on a routine basis to provide, among other services, maintenance and repair operations of satellites currently in Earth orbit. Both voice recognition and helmet-mounted display technologies can improve the productivity of workers in space by potentially reducing the time, risk, and cost involved in performing EVA. NASA has recognized this potential and is currently developing a voice-controlled information system for Space Station EVA. Two bench-model helmet-mounted displays and an EVA simulation program have been developed to demonstrate the functionality and practicality of the system.
Team Training through Communications Control
1982-02-01
training * operational environment * team training research issues * training approach * team communications * models of operator beharior e...on the market soon, it certainly would be investigated carefully for its applicability to the team training problem. ce A text-to-speech voice...generation system. Votrax has recently marketed such a device, and others may soon follow suit. ’ d. A speech replay system designed to produce speech from
A survey of the state-of-the-art and focused research in range systems, task 1
NASA Technical Reports Server (NTRS)
Omura, J. K.
1986-01-01
This final report presents the latest research activity in voice compression. We have designed a non-real time simulation system that is implemented around the IBM-PC where the IBM-PC is used as a speech work station for data acquisition and analysis of voice samples. A real-time implementation is also proposed. This real-time Voice Compression Board (VCB) is built around the Texas Instruments TMS-3220. The voice compression algorithm investigated here was described in an earlier report titled, Low Cost Voice Compression for Mobile Digital Radios, by the author. We will assume the reader is familiar with the voice compression algorithm discussed in this report. The VCB compresses speech waveforms at data rates ranging from 4.8 K bps to 16 K bps. This board interfaces to the IBM-PC 8-bit bus, and plugs into a single expansion slot on the mother board.
Effect of Auditory-Perceptual Training With Natural Voice Anchors on Vocal Quality Evaluation.
Dos Santos, Priscila Campos Martins; Vieira, Maurílio Nunes; Sansão, João Pedro Hallack; Gama, Ana Cristina Côrtes
2018-01-10
To analyze the effects of auditory-perceptual training with anchor stimuli of natural voices on inter-rater agreement during the assessment of vocal quality. This is a quantitative nature study. An auditory-perceptual training site was developed consisting of Programming Interface A, an auditory training activity, and Programming Interface B, a control activity. Each interface had three stages: pre-training/pre-interval evaluation, training/interval, and post-training/post-interval evaluation. Two experienced evaluators classified 381 voices according to the GRBASI scale (G-grade, R-roughness, B-breathiness, A-asthenia, S-strain, I-instability). Voices were selected that received the same evaluation by both evaluators: 57 voices for evaluation and 56 for training were selected, with varying degrees of deviation across parameters. Fifteen inexperienced evaluators were then selected. In the pre-, post-training, pre-, and postinterval stages, evaluators listened to the voices and classified them via the GRBASI scale. In the stage interval evaluators read a text. In the stage training each parameter was trained separately. Evaluators analyzed the degrees of deviation of the GRBASI parameters based on anchor stimuli, and could only advance after correctly classifying the voices. To quantify inter-rater agreement and provide statistical analyses, the AC1 coefficient, confidence intervals, and percentage variation of agreement were employed. Except for the asthenia parameter, decreased agreement was observed in the control condition. Improved agreement was observed with auditory training, but this improvement did not achieve statistical significance. Training with natural voice anchors suggest an increased inter-rater agreement during perceptual voice analysis, potentially indicating that new internal references were established. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
1992-03-01
sites and support facilities are located on the islands of Niihau and Oahu. Figure 1 depicts the overall layout of PMRF. [Ref. 4: p. 2] In addition...the HIANG facility at Kokee: • a wideband microwave system serving Niihau Island remotely controls operation of the AN/APS-134 surveillance radar, and...provides relay of digitized radar data, control data and voice between the remotely operated, unmanned radar on Niihau Island and Barking Sands
Transmasculine People's Voice Function: A Review of the Currently Available Evidence.
Azul, David; Nygren, Ulrika; Södersten, Maria; Neuschaefer-Rube, Christiane
2017-03-01
This study aims to evaluate the currently available discursive and empirical data relating to those aspects of transmasculine people's vocal situations that are not primarily gender-related, to identify restrictions to voice function that have been observed in this population, and to make suggestions for future voice research and clinical practice. We conducted a comprehensive review of the voice literature. Publications were identified by searching six electronic databases and bibliographies of relevant articles. Twenty-two publications met inclusion criteria. Discourses and empirical data were analyzed for factors and practices that impact on voice function and for indications of voice function-related problems in transmasculine people. The quality of the evidence was appraised. The extent and quality of studies investigating transmasculine people's voice function was found to be limited. There was mixed evidence to suggest that transmasculine people might experience restrictions to a range of domains of voice function, including vocal power, vocal control/stability, glottal function, pitch range/variability, vocal endurance, and voice quality. More research into the different factors and practices affecting transmasculine people's voice function that takes account of a range of parameters of voice function and considers participants' self-evaluations is needed to establish how functional voice production can be best supported in this population. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Doppler compensation by shifting transmitted object frequency within limits
NASA Technical Reports Server (NTRS)
Laughlin, C. R., Jr.; Hollenbaugh, R. C.; Allen, W. K. (Inventor)
1973-01-01
A system and method are disclosed for position locating, deriving centralized air traffic control data, and communicating via voice and digital signals between a multiplicity of remote aircraft, including supersonic transports, and a central station. Such communication takes place through a synchronous satellite relay station. Side tone ranging patterns, as well as the digital and voice signals, are modulated on a carrier transmitted from the central station and received on all of the supersonic transports. Each aircraft communicates with the ground stations via a different frequency multiplexed spectrum. Supersonic transport position is derived from a computer at the central station and supplied to a local air traffic controller. Position is determined in response to variable phase information imposed on the side tones at the aircrafts. Common to all of the side tone techniques is Doppler compensation for the supersonic transport velocity.
Twenty-Channel Voice Response System
DOT National Transportation Integrated Search
1981-06-01
This report documents the design and implementation of a Voice Response System, which provides Direct-User Access to the FAA's aviation-weather data base. This system supports 20 independent audio channels, and as of this report, speaks three weather...
An intelligent control and virtual display system for evolutionary space station workstation design
NASA Technical Reports Server (NTRS)
Feng, Xin; Niederjohn, Russell J.; Mcgreevy, Michael W.
1992-01-01
Research and development of the Advanced Display and Computer Augmented Control System (ADCACS) for the space station Body-Ported Cupola Virtual Workstation (BP/VCWS) were pursued. The potential applications were explored of body ported virtual display and intelligent control technology for the human-system interfacing applications is space station environment. The new system is designed to enable crew members to control and monitor a variety of space operations with greater flexibility and efficiency than existing fixed consoles. The technologies being studied include helmet mounted virtual displays, voice and special command input devices, and microprocessor based intelligent controllers. Several research topics, such as human factors, decision support expert systems, and wide field of view, color displays are being addressed. The study showed the significant advantages of this uniquely integrated display and control system, and its feasibility for human-system interfacing applications in the space station command and control environment.
Human voice quality measurement in noisy environments.
Ueng, Shyh-Kuang; Luo, Cheng-Ming; Tsai, Tsung-Yu; Yeh, Hsuan-Chen
2015-01-01
Computerized acoustic voice measurement is essential for the diagnosis of vocal pathologies. Previous studies showed that ambient noises have significant influences on the accuracy of voice quality assessment. This paper presents a voice quality assessment system that can accurately measure qualities of voice signals, even though the input voice data are contaminated by low-frequency noises. The ambient noises in our living rooms and laboratories are collected and the frequencies of these noises are analyzed. Based on the analysis, a filter is designed to reduce noise level of the input voice signal. Then, improved numerical algorithms are employed to extract voice parameters from the voice signal to reveal the health of the voice signal. Compared with MDVP and Praat, the proposed method outperforms these two widely used programs in measuring fundamental frequency and harmonic-to-noise ratio, and its performance is comparable to these two famous programs in computing jitter and shimmer. The proposed voice quality assessment method is resistant to low-frequency noises and it can measure human voice quality in environments filled with noises from air-conditioners, ceiling fans and cooling fans of computers.
Rousseau, Bernard; Gutmann, Michelle L; Mau, Theodore; Francis, David O; Johnson, Jeffrey P; Novaleski, Carolyn K; Vinson, Kimberly N; Garrett, C Gaelyn
2015-03-01
This randomized trial investigated voice rest and supplemental text-to-speech communication versus voice rest alone on visual analog scale measures of communication effectiveness and magnitude of voice use. Randomized clinical trial. Multicenter outpatient voice clinics. Thirty-seven patients undergoing phonomicrosurgery. Patients undergoing phonomicrosurgery were randomized to voice rest and supplemental text-to-speech communication or voice rest alone. The primary outcome measure was the impact of voice rest on ability to communicate effectively over a 7-day period. Pre- and postoperative magnitude of voice use was also measured as an observational outcome. Patients randomized to voice rest and supplemental text-to-speech communication reported higher median communication effectiveness on each postoperative day compared to those randomized to voice rest alone, with significantly higher median communication effectiveness on postoperative days 3 (P=.03) and 5 (P=.01). Magnitude of voice use did not differ on any preoperative (P>.05) or postoperative day (P>.05), nor did patients significantly decrease voice use as the surgery date approached (P>.05). However, there was a significant reduction in median voice use pre- to postoperatively across patients (P<.001) with median voice use ranging from 0 to 3 throughout the postoperative week. Supplemental text-to-speech communication increased patient-perceived communication effectiveness on postoperative days 3 and 5 over voice rest alone. With the prevalence of smartphones and the widespread use of text messaging, supplemental text-to-speech communication may provide an accessible and cost-effective communication option for patients on vocal restrictions. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2015.
NASA Technical Reports Server (NTRS)
Fisher, Scott S.
1986-01-01
A head-mounted, wide-angle, stereoscopic display system controlled by operator position, voice and gesture has been developed for use as a multipurpose interface environment. The system provides a multisensory, interactive display environment in which a user can virtually explore a 360-degree synthesized or remotely sensed environment and can viscerally interact with its components. Primary applications of the system are in telerobotics, management of large-scale integrated information systems, and human factors research. System configuration, application scenarios, and research directions are described.
Acoustic and Perceived Measurements Certifying Tango as Voice Treatment Method.
Tafiadis, Dionysios; Kosma, Evangelia I; Chronopoulos, Spyridon K; Papadopoulos, Aggelos; Toki, Eugenia I; Vassiliki, Siafaka; Ziavra, Nausica
2018-03-01
Voice disorders are affecting everyday life in many levels, and their prevalence has been studied extensively in certain and general populations. Notably, several factors have a cohesive influence on voice disorders and voice characteristics. Several studies report that health and environmental and psychological etiologies can serve as risk factors for voice disorders. Many diagnostic protocols, in the literature, evaluate voice and its parameters leading to direct or indirect treatment intervention. This study was designed to examine the effect of tango on adult acoustic voice parameters. Fifty-two adults (26 male and 26 female) were recruited and divided into four subgroups (male dancers, female dancers, male nondancers, and female nondancers). The participants were asked to answer two questionnaires (Voice Handicap Index and Voice Evaluation Form), and their voices were recorded before and after the tango dance session. Moreover, water consumption was investigated. The study's results indicated that the voices' acoustic characteristics were different between tango dancers and the control group. The beneficial results are far from prominent as they prove that tango dance can serve stand-alone as voice therapy without the need for hydration. Also, more research is imperative to be conducted on a longitudinal basis to obtain a more accurate result on the required time for the proposed therapy. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
1988-05-01
Seeciv Limited- System for varying Senses term filter capacity output until some Figure 2. Original limited-capacity channel model (Frim Broadbent, 1958) S...2 Figure 2. Original limited-capacity channel model (From Broadbent, 1958) .... 10 Figure 3. Experimental...unlimited variety of human voices for digital recording sources. Synthesis by Analysis Analysis-synthesis methods electronically model the human voice
ERIC Educational Resources Information Center
Bruce, Carolyn; Edmundson, Anne; Coleman, Michael
2003-01-01
Background: People with aphasia may experience difficulties that prevent them from demonstrating in writing what they know and can produce orally. Voice recognition systems that allow the user to speak into a microphone and see their words appear on a computer screen have the potential to assist written communication. Aim: This study investigated…
Weinstein, Ronald S; López, Ana Mariá; Barker, Gail P; Krupinski, Elizabeth A; Beinar, Sandra J; Major, Janet; Skinner, Tracy; Holcomb, Michael J; McNeely, Richard A
2007-10-01
The Institute for Advanced Telemedicine and Telehealth (i.e., T-Health Institute), a division of the state-wide Arizona Telemedicine Program (ATP), specializes in the creation of innovative health care education programs. This paper describes a first-of-a-kind video amphitheater specifically designed to promote communication within heterogeneous student groups training in the various health care professions. The amphitheater has an audio-video system that facilitates the assembly of ad hoc "in-the-room" electronic interdisciplinary student groups. Off-site faculty members and students can be inserted into groups by video conferencing. When fully implemented, every student will have a personal video camera trained on them, a head phone/microphone, and a personal voice channel. A command and control system will manage the video inputs of the individual participant's head-and-shoulder video images. An audio mixer will manage the separate voice channels of the individual participants and mix them into individual group-specific voice channels for use by the groups' participants. The audio-video system facilitates the easy reconfiguration of the interprofessional electronic groups, viewed on the video wall, without the individual participants in the electronic groups leaving their seats. The amphitheater will serve as a classroom as well as a unique education research laboratory.
Prevalence and risk factors for voice problems among telemarketers.
Jones, Katherine; Sigmon, Jason; Hock, Lynette; Nelson, Eric; Sullivan, Marsha; Ogren, Frederic
2002-05-01
To investigate whether there is an increased prevalence of voice problems among telemarketers compared with the general population and if these voice problems affect productivity and are associated with the presence of known risk factors for voice problems. Cross-sectional survey study. One outbound telemarketing firm, 3 reservations firms, 1 messaging firm, 1 survey research firm, and 1 community college. Random and cluster sampling identified 373 employees of the 6 firms; 304 employees completed the survey. A convenience sample of 187 community college students similar in age, sex, education level, and smoking prevalence served as a control group. Demographic, vocational, personality, and biological risk factors for voice problems; symptoms of vocal attrition; and effects of symptoms on work. Telemarketers were twice as likely to report 1 or more symptoms of vocal attrition compared with controls after adjusting for age, sex, and smoking status (P<.001). Of those surveyed, 31% reported that their work was affected by an average of 5.0 symptoms These respondents tended to be women (P<.001) and were more likely to smoke (P =.02); take drying medications (P<.001); have sinus problems (P =.04), frequent colds (P<.001), and dry mouth (P<.001); and be sedentary (P<.001). Telemarketers have a higher prevalence of voice problems than the control group. These problems affect productivity and are associated with modifiable risk factors. Evaluation of occupational voice disorders must encompass all of the determinants of health status, and treatment must focus on modifiable risk factors, not just the reduction of occupational vocal load.
Technology for Work, Home, and Leisure. Tech Use Guide: Using Computer Technology.
ERIC Educational Resources Information Center
Williams, John M.
This guide provides a brief introduction to several types of technological devices useful to individuals with disabilities and illustrates how some individuals are applying technology in the workplace and at home. Devices described include communication aids, low-vision products, voice-activated systems, environmental controls, and aids for…
47 CFR 0.551 - Purpose and scope; definitions.
Code of Federal Regulations, 2010 CFR
2010-10-01
...: (1) Individual means a citizen of the United States or an alien lawfully admitted for permanent... maintained by the Commission, including but not limited to, such individual's education, financial... finger or voice print or a photograph. (3) System of Records means a group of records under the control...
Hu, Xueping; Wang, Xiangpeng; Gu, Yan; Luo, Pei; Yin, Shouhang; Wang, Lijun; Fu, Chao; Qiao, Lei; Du, Yi; Chen, Antao
2017-10-01
Numerous behavioral studies have found a modulation effect of phonological experience on voice discrimination. However, the neural substrates underpinning this phenomenon are poorly understood. Here we manipulated language familiarity to test the hypothesis that phonological experience affects voice discrimination via mediating the engagement of multiple perceptual and cognitive resources. The results showed that during voice discrimination, the activation of several prefrontal regions was modulated by language familiarity. More importantly, the same effect was observed concerning the functional connectivity from the fronto-parietal network to the voice-identity network (VIN), and from the default mode network to the VIN. Our findings indicate that phonological experience could bias the recruitment of cognitive control and information retrieval/comparison processes during voice discrimination. Therefore, the study unravels the neural substrates subserving the modulation effect of phonological experience on voice discrimination, and provides new insights into studying voice discrimination from the perspective of network interactions. Copyright © 2017. Published by Elsevier Inc.
Moradi, Negin; Pourshahbaz, Abbas; Soltani, Majid; Javadipour, Shiva; Hashemi, Hedieh; Soltaninejad, Nasibeh
2013-03-01
Quality of life is one of the important aspects in the assessment of health and treatment data output. The purpose of this study was to adapt and determine reliability and validity of Voice Handicap Index (VHI) in Persian. The subjects were 80 patients with voice disorders and 80 volunteers without any voice disorders as a control group. All subjects filled in the Persian version of VHI. The test was repeated 2 weeks later. The reliability and validity were studied. All items had significant discrimination coefficient. The internal consistency and reliability of test and retest in VHI total score and three subtests were achieved. It seems that the Persian version of VHI is a valid and reliable questionnaire, which voice therapists may use for completing their evaluation for patients with voice disorders, and it gives more information about the nature of voice disorder to specialists. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Design of remote control alarm system by microwave detection
NASA Astrophysics Data System (ADS)
Wang, Junli
2018-04-01
A microwave detection remote control alarm system is designed, which is composed of a Microwave detectors, a radio receiving/transmitting module and a digital encoding/decoding IC. When some objects move into the surveillance area, microwave detectors will generate a control signal to start transmitting system. A radio control signal will be spread by the transmitting module, once the signal can be received, and it will be disposed by some circuits, arousing some voices that awake the watching people. The whole device is a modular configuration, it not only has some advantage of frequency stable, but also reliable and adjustment-free, and it is suitable for many kinds of demands within the distance of 100m.
A human factors approach to range scheduling for satellite control
NASA Technical Reports Server (NTRS)
Wright, Cameron H. G.; Aitken, Donald J.
1991-01-01
Range scheduling for satellite control presents a classical problem: supervisory control of a large-scale dynamic system, with unwieldy amounts of interrelated data used as inputs to the decision process. Increased automation of the task, with the appropriate human-computer interface, is highly desirable. The development and user evaluation of a semi-automated network range scheduling system is described. The system incorporates a synergistic human-computer interface consisting of a large screen color display, voice input/output, a 'sonic pen' pointing device, a touchscreen color CRT, and a standard keyboard. From a human factors standpoint, this development represents the first major improvement in almost 30 years to the satellite control network scheduling task.
Analysis of the Auditory Feedback and Phonation in Normal Voices.
Arbeiter, Mareike; Petermann, Simon; Hoppe, Ulrich; Bohr, Christopher; Doellinger, Michael; Ziethe, Anke
2018-02-01
The aim of this study was to investigate the auditory feedback mechanisms and voice quality during phonation in response to a spontaneous pitch change in the auditory feedback. Does the pitch shift reflex (PSR) change voice pitch and voice quality? Quantitative and qualitative voice characteristics were analyzed during the PSR. Twenty-eight healthy subjects underwent transnasal high-speed video endoscopy (HSV) at 8000 fps during sustained phonation [a]. While phonating, the subjects heard their sound pitched up for 700 cents (interval of a fifth), lasting 300 milliseconds in their auditory feedback. The electroencephalography (EEG), acoustic voice signal, electroglottography (EGG), and high-speed-videoendoscopy (HSV) were analyzed to compare feedback mechanisms for the pitched and unpitched condition of the phonation paradigm statistically. Furthermore, quantitative and qualitative voice characteristics were analyzed. The PSR was successfully detected within all signals of the experimental tools (EEG, EGG, acoustic voice signal, HSV). A significant increase of the perturbation measures and an increase of the values of the acoustic parameters during the PSR were observed, especially for the audio signal. The auditory feedback mechanism seems not only to control for voice pitch but also for voice quality aspects.
Bell, Imogen H; Fielding-Smith, Sarah F; Hayward, Mark; Rossell, Susan L; Lim, Michelle H; Farhall, John; Thomas, Neil
2018-05-02
Smartphone-based ecological momentary assessment and intervention (EMA/I) show promise for enhancing psychological treatments for psychosis. EMA has the potential to improve assessment and formulation of experiences which fluctuate day-to-day, and EMI may be used to prompt use of therapeutic strategies in daily life. The current study is an examination of these capabilities in the context of a brief, coping-focused intervention for distressing voice hearing experiences. This is a rater-blinded, pilot randomised controlled trial comparing a four-session intervention in conjunction with use of smartphone EMA/I between sessions, versus treatment-as-usual. The recruitment target is 34 participants with persisting and distressing voice hearing experiences, recruited through a Voices Clinic based in Melbourne, Australia, and via wider advertising. Allocation will be made using minimisation procedure, balancing of the frequency of voices between groups. Assessments are completed at baseline and 8 weeks post-baseline. The primary outcomes of this trial will focus on feasibility and acceptability of the intervention and trial methodology, with secondary outcomes examining preliminary clinical effects related to overall voice severity, the emotional and functional impact of the voices, and emotional distress. This study offers a highly novel examination of specific smartphone capabilities and their integration with traditional psychological treatment for distressing voices. Such technology has potential to enhance psychological interventions and promote adaptation to distressing experiences. Australian New Zealand Clinical Trial Registry, ACTRN12617000348358 . Registered on 7 March 2017.
NASA Astrophysics Data System (ADS)
Choinzonov, E. L.; Balatskaya, L. N.; Chizhevskaya, S. Yu.; Meshcheryakov, R. V.; Kostyuchenko, E. Yu.; Ivanova, T. A.
2016-08-01
The aim of the research is to develop and introduce a new technique of post-laryngectomy voice rehabilitation of laryngeal and hypopharyngeal carcinoma patients. The study involves comparing and analyzing 82 cases of voice function restoration by using biological feedback based on mathematical modeling of voice production. The advantage of the modern technology-based method in comparison with the conventional one is proved. Restoration of voice function using biofeedback allows taking into account patient's abilities, adjusting parameters of voice trainings, and controlling their efficiency in real-time mode. The data obtained indicate that the new method contributes to the rapid inclusion of self-regulation mechanisms of the body and results in the overall success rate of voice rehabilitation in totally laryngectomized patients reaching 92%, which reduces the rehabilitation period to 18 days, compared to 86% and 38 days in the control group, respectively. Restoration of disturbed functions after successful treatment is an important task of rehabilitation and is crucial in terms of the quality of cancer patients' lives. To assess life quality of laryngeal cancer patients, the EORTC Quality of Life Core Questionnaire (QLQ-C30), and head and neck module (QLQ-H&N35) were used. The analyzed results proved that the technique of biofeedback voice restoration significantly improves the quality of life of laryngectomized patients. It allows reducing the number of disabled people, restoring patients' ability to work-related activities, and significantly improving social adaptation of these patients.
Guo, Zhiqiang; Wu, Xiuqin; Li, Weifeng; Jones, Jeffery A; Yan, Nan; Sheft, Stanley; Liu, Peng; Liu, Hanjun
2017-10-25
Although working memory (WM) is considered as an emergent property of the speech perception and production systems, the role of WM in sensorimotor integration during speech processing is largely unknown. We conducted two event-related potential experiments with female and male young adults to investigate the contribution of WM to the neurobehavioural processing of altered auditory feedback during vocal production. A delayed match-to-sample task that required participants to indicate whether the pitch feedback perturbations they heard during vocalizations in test and sample sequences matched, elicited significantly larger vocal compensations, larger N1 responses in the left middle and superior temporal gyrus, and smaller P2 responses in the left middle and superior temporal gyrus, inferior parietal lobule, somatosensory cortex, right inferior frontal gyrus, and insula compared with a control task that did not require memory retention of the sequence of pitch perturbations. On the other hand, participants who underwent extensive auditory WM training produced suppressed vocal compensations that were correlated with improved auditory WM capacity, and enhanced P2 responses in the left middle frontal gyrus, inferior parietal lobule, right inferior frontal gyrus, and insula that were predicted by pretraining auditory WM capacity. These findings indicate that WM can enhance the perception of voice auditory feedback errors while inhibiting compensatory vocal behavior to prevent voice control from being excessively influenced by auditory feedback. This study provides the first evidence that auditory-motor integration for voice control can be modulated by top-down influences arising from WM, rather than modulated exclusively by bottom-up and automatic processes. SIGNIFICANCE STATEMENT One outstanding question that remains unsolved in speech motor control is how the mismatch between predicted and actual voice auditory feedback is detected and corrected. The present study provides two lines of converging evidence, for the first time, that working memory cannot only enhance the perception of vocal feedback errors but also exert inhibitory control over vocal motor behavior. These findings represent a major advance in our understanding of the top-down modulatory mechanisms that support the detection and correction of prediction-feedback mismatches during sensorimotor control of speech production driven by working memory. Rather than being an exclusively bottom-up and automatic process, auditory-motor integration for voice control can be modulated by top-down influences arising from working memory. Copyright © 2017 the authors 0270-6474/17/3710324-11$15.00/0.
Golan, Ofer; Baron-Cohen, Simon; Hill, Jacqueline
2006-02-01
Adults with Asperger Syndrome (AS) can recognise simple emotions and pass basic theory of mind tasks, but have difficulties recognising more complex emotions and mental states. This study describes a new battery of tasks, testing recognition of 20 complex emotions and mental states from faces and voices. The battery was given to males and females with AS and matched controls. Results showed the AS group performed worse than controls overall, on emotion recognition from faces and voices and on 12/20 specific emotions. Females recognised faces better than males regardless of diagnosis, and males with AS had more difficulties recognising emotions from faces than from voices. The implications of these results are discussed in relation to social functioning in AS.
Hazell, Cassie M; Hayward, Mark; Cavanagh, Kate; Jones, Anna-Marie; Strauss, Clara
2016-07-26
Cognitive behavior therapy for psychosis (CBTp) is an effective intervention for people who hear distressing voices (auditory hallucinations). However, there continues to be a problem of poor access to CBTp. Constraints on health care funding require this problem to be addressed without a substantial increase in funding. One solution is to develop guided self-help forms of CBTp to improve access, and a symptom-specific focus on, for example, distressing voices (auditory verbal hallucinations) has the potential to enhance effectiveness. We term this cognitive behavior therapy for distressing voices (CBTv). This trial is an external pilot randomized controlled trial comparing the effects of 12-week guided self-help CBTv (with eight therapist support sessions) with a wait list control condition. Informed consent will be obtained from each participant. Half of the 30 participants will be randomized to receive guided self-help CBTv immediately; the remaining half will receive the intervention after a 12-week delay. All participants will continue with their usual treatment throughout the study. Outcomes will be assessed using questionnaires completed at baseline and 12 weeks postrandomization. Interviews will be offered to all those who receive therapy immediately to explore their experiences with the intervention. The outcomes of this trial, both quantitative and qualitative, will inform the design of a definitive randomized controlled trial of guided self-help CBTv. If this intervention is effective, it could help to increase access to CBT for those who hear distressing voices. ISRCTN registration number ISRCTN77762753 . Registered on 23 July 2015.
ERIC Educational Resources Information Center
Fisher, Ryan A.
2014-01-01
The purposes of the study are to describe characteristics of the voice change in sixth-, seventh-, and eighth-grade choir students using Cooksey's voice-change classification system and to determine if the singing self-efficacy of adolescent males is affected by the voice change, grade level, and experience. Participants (N = 80) consisted of…
Voice and gesture-based 3D multimedia presentation tool
NASA Astrophysics Data System (ADS)
Fukutake, Hiromichi; Akazawa, Yoshiaki; Okada, Yoshihiro
2007-09-01
This paper proposes a 3D multimedia presentation tool that allows the user to manipulate intuitively only through the voice input and the gesture input without using a standard keyboard or a mouse device. The authors developed this system as a presentation tool to be used in a presentation room equipped a large screen like an exhibition room in a museum because, in such a presentation environment, it is better to use voice commands and the gesture pointing input rather than using a keyboard or a mouse device. This system was developed using IntelligentBox, which is a component-based 3D graphics software development system. IntelligentBox has already provided various types of 3D visible, reactive functional components called boxes, e.g., a voice input component and various multimedia handling components. IntelligentBox also provides a dynamic data linkage mechanism called slot-connection that allows the user to develop 3D graphics applications by combining already existing boxes through direct manipulations on a computer screen. Using IntelligentBox, the 3D multimedia presentation tool proposed in this paper was also developed as combined components only through direct manipulations on a computer screen. The authors have already proposed a 3D multimedia presentation tool using a stage metaphor and its voice input interface. This time, we extended the system to make it accept the user gesture input besides voice commands. This paper explains details of the proposed 3D multimedia presentation tool and especially describes its component-based voice and gesture input interfaces.
Reliability of human-supervised formant-trajectory measurement for forensic voice comparison.
Zhang, Cuiling; Morrison, Geoffrey Stewart; Ochoa, Felipe; Enzinger, Ewald
2013-01-01
Acoustic-phonetic approaches to forensic voice comparison often include human-supervised measurement of vowel formants, but the reliability of such measurements is a matter of concern. This study assesses the within- and between-supervisor variability of three sets of formant-trajectory measurements made by each of four human supervisors. It also assesses the validity and reliability of forensic-voice-comparison systems based on these measurements. Each supervisor's formant-trajectory system was fused with a baseline mel-frequency cepstral-coefficient system, and performance was assessed relative to the baseline system. Substantial improvements in validity were found for all supervisors' systems, but some supervisors' systems were more reliable than others.
1980-06-01
34 LIST OF ILLUSTRATIONS FIGURE PAGE 1 Block Diagram of DLMS Voice Recognition System .............. S 2 Flowchart of DefaulV...particular are a speech preprocessor and a minicomputer. In the VRS, as shown in the block diagram of Fig. 1, the preprocessor is a TTI model 8040 and...Data General 6026 Magnetic Zo 4 Tape Unit Display L-- - Equipment Cabinet Fig. 1 block Diagram of DIMS Voice Recognition System qS 2. Flexible Disk
A robotic voice simulator and the interactive training for hearing-impaired people.
Sawada, Hideyuki; Kitani, Mitsuki; Hayashi, Yasumori
2008-01-01
A talking and singing robot which adaptively learns the vocalization skill by means of an auditory feedback learning algorithm is being developed. The robot consists of motor-controlled vocal organs such as vocal cords, a vocal tract and a nasal cavity to generate a natural voice imitating a human vocalization. In this study, the robot is applied to the training system of speech articulation for the hearing-impaired, because the robot is able to reproduce their vocalization and to teach them how it is to be improved to generate clear speech. The paper briefly introduces the mechanical construction of the robot and how it autonomously acquires the vocalization skill in the auditory feedback learning by listening to human speech. Then the training system is described, together with the evaluation of the speech training by auditory impaired people.
NASA Astrophysics Data System (ADS)
Meiyanti, R.; Subandi, A.; Fuqara, N.; Budiman, M. A.; Siahaan, A. P. U.
2018-03-01
A singer doesn’t just recite the lyrics of a song, but also with the use of particular sound techniques to make it more beautiful. In the singing technique, more female have a diverse sound registers than male. There are so many registers of the human voice, but the voice registers used while singing, among others, Chest Voice, Head Voice, Falsetto, and Vocal fry. Research of speech recognition based on the female’s voice registers in singing technique is built using Borland Delphi 7.0. Speech recognition process performed by the input recorded voice samples and also in real time. Voice input will result in weight energy values based on calculations using Hankel Transformation method and Macdonald Functions. The results showed that the accuracy of the system depends on the accuracy of sound engineering that trained and tested, and obtained an average percentage of the successful introduction of the voice registers record reached 48.75 percent, while the average percentage of the successful introduction of the voice registers in real time to reach 57 percent.
Automatic speech recognition in air-ground data link
NASA Technical Reports Server (NTRS)
Armstrong, Herbert B.
1989-01-01
In the present air traffic system, information presented to the transport aircraft cockpit crew may originate from a variety of sources and may be presented to the crew in visual or aural form, either through cockpit instrument displays or, most often, through voice communication. Voice radio communications are the most error prone method for air-ground data link. Voice messages can be misstated or misunderstood and radio frequency congestion can delay or obscure important messages. To prevent proliferation, a multiplexed data link display can be designed to present information from multiple data link sources on a shared cockpit display unit (CDU) or multi-function display (MFD) or some future combination of flight management and data link information. An aural data link which incorporates an automatic speech recognition (ASR) system for crew response offers several advantages over visual displays. The possibility of applying ASR to the air-ground data link was investigated. The first step was to review current efforts in ASR applications in the cockpit and in air traffic control and evaluated their possible data line application. Next, a series of preliminary research questions is to be developed for possible future collaboration.
Information technologies for Marine Corps combat medicine.
Carey, N B; Rattelman, C R; Nguyen, H Q
1998-09-01
Future Marine Corps warfighting concepts will make it more difficult to locate casualties, which will complicate casualty evacuation, lengthen casualty wait times, and require infantrymen or corpsmen to provide more extensive treatment. In these future scenarios, information flow and communications will be critical to medical functions. We asked, for Navy medical support to the Marines, what information will future combat medicine require and what technologies should supply those information needs? Based on analyses of patient data streams, focus groups of Navy medical personnel, and our estimates of the cost and feasibility of communications systems, we recommend the following: (1) increase medical training for some fraction of Marines, especially in hemorrhage control; (2) augment corpsmen's training; (3) furnish data systems for evacuation and supply that would provide in-transit visibility and simplify requests; (4) provide all ground medical personnel with access to treatment information systems and limited voice communications; and (5) exploit e-mail systems to reduce reliance on voice communications. Implementation time frames are discussed.
Case-control study of risk factors for spasmodic dysphonia: A comparison with other voice disorders.
Tanner, Kristine; Roy, Nelson; Merrill, Ray M; Sauder, Cara; Houtz, Daniel R; Smith, Marshall E
2012-05-01
This epidemiology study examined risk factors uniquely associated with spasmodic dysphonia (SD). Case-control. A questionnaire was administered to 150 patients with SD (with and without coexisting vocal tremor) and 136 patients with other structural, neurological, and functional voice disorders (excluding SD and vocal tremor). Questions included personal and family medical histories, environmental exposures, trauma, illnesses, voice use habits, and the Short Form 36. Several factors were uniquely associated with SD (α = .05), including: 1) a personal history of cervical dystonia, sinus and throat illnesses, mumps, rubella, dust exposure, and frequent volunteer voice use, 2) a family history of voice disorders, 3) an immediate family history of vocal tremor and meningitis, and 4) an extended family history of head and neck tremor, ocular disease, and meningitis. Vocal tremor coexisted with SD in 29% of cases. Measles and mumps vaccines were protective for SD. SD is likely multifactorial and associated with several endogenous and exogenous factors. Certain viral exposures, voice use patterns, and familial neurological conditions may contribute to the onset of SD later in life. Copyright © 2011 The American Laryngological, Rhinological, and Otological Society, Inc.
Perceptual evaluation and acoustic analysis of pneumatic artificial larynx.
Xu, Jie Jie; Chen, Xi; Lu, Mei Ping; Qiao, Ming Zhe
2009-12-01
To investigate the perceptual and acoustic characteristics of the pneumatic artificial larynx (PAL) and evaluate its speech ability and clinical value. Prospective study. The study was conducted in the Voice Lab, Department of Otorhinolaryngology, The First Affiliated Hospital of Nanjing Medical University. Forty-six laryngectomy patients using the PAL were rated for intelligibility and fluency of speech. The voice signals of sustained vowel /a/ for 40 healthy controls and 42 successful patients using the PAL were measured by a computer system. The acoustic parameters and sound spectrographs were analyzed and compared between the two groups. Forty-two of 46 patients using the PAL (91.3%) acquired successful speech capability. The intelligibility scores of 42 successful PAL speakers ranged from 71 to 95 percent, and the intelligibility range of four unsuccessful speakers was 30 to 50 percent. The fluency was judged as good or excellent in 42 successful patients, and poor or fair in four unsuccessful patients. There was no significant difference in average fundamental frequency, maximum intensity, jitter, shimmer, and normalized noise energy (NNE) between 42 successful PAL speakers and 40 healthy controls, while the maximum phonation time (MPT) of PAL speakers was slightly lower than that of the controls. The sound spectrographs of the patients using the PAL approximated those of the healthy controls. The PAL has the advantage of a high percentage of successful vocal rehabilitation. PAL speech is fluent and intelligible. The acoustic characteristics of the PAL are similar to those of a normal voice.
Rousseau, Bernard; Gutmann, Michelle L.; Mau, I-fan Theodore; Francis, David O.; Johnson, Jeffrey P.; Novaleski, Carolyn K.; Vinson, Kimberly N.; Garrett, C. Gaelyn
2015-01-01
Objective This randomized trial investigated voice rest and supplemental text-to-speech communication versus voice rest alone on visual analog scale measures of communication effectiveness and magnitude of voice use. Study Design Randomized clinical trial. Setting Multicenter outpatient voice clinics. Subjects Thirty-seven patients undergoing phonomicrosurgery. Methods Patients undergoing phonomicrosurgery were randomized to voice rest and supplemental text-to-speech communication or voice rest alone. The primary outcome measure was the impact of voice rest on ability to communicate effectively over a seven-day period. Pre- and post-operative magnitude of voice use was also measured as an observational outcome. Results Patients randomized to voice rest and supplemental text-to-speech communication reported higher median communication effectiveness on each post-operative day compared to those randomized to voice rest alone, with significantly higher median communication effectiveness on post-operative day 3 (p = 0.03) and 5 (p = 0.01). Magnitude of voice use did not differ on any pre-operative (p > 0.05) or post-operative day (p > 0.05), nor did patients significantly decrease voice use as the surgery date approached (p > 0.05). However, there was a significant reduction in median voice use pre- to post-operatively across patients (p < 0.001) with median voice use ranging from 0–3 throughout the post-operative week. Conclusion Supplemental text-to-speech communication increased patient perceived communication effectiveness on post-operative days 3 and 5 over voice rest alone. With the prevalence of smartphones and the widespread use of text messaging, supplemental text-to-speech communication may provide an accessible and cost-effective communication option for patients on vocal restrictions. PMID:25605690
United States Homeland Security and National Biometric Identification
2002-04-09
security number. Biometrics is the use of unique individual traits such as fingerprints, iris eye patterns, voice recognition, and facial recognition to...technology to control access onto their military bases using a Defense Manpower Management Command developed software application. FACIAL Facial recognition systems...installed facial recognition systems in conjunction with a series of 200 cameras to fight street crime and identify terrorists. The cameras, which are
Evaluating effectiveness of dynamic soundfield system in the classroom.
da Cruz, Aline Duarte; Alves Silvério, Kelly Cristina; Da Costa, Aline Roberta Aceituno; Moret, Adriane Lima Mortari; Lauris, José Roberto Pereira; de Souza Jacob, Regina Tangerino
2016-01-01
Research has reported on the use of soundfield amplification devices in the classroom. However, no study has used standardized tests to determine the potential advantages of the dynamic soundfield system for normally hearing students and for the teacher's voice. Our aim was to evaluate the impact of using dynamic soundfield system on the noise of the classroom, teacher's voice and students' academic performance. This was a prospective cohort study in which 20 student participants enrolled in the third year of basic education were divided into two groups (i.e., control and experimental); their teacher participated. The experimental group was exposed to the dynamic soundfield system for 3 consecutive months. The groups were assessed using standardized tests to evaluate their academic performance. Further, questionnaires and statements were collected on the participants' experience of using the soundfield system. We statistically analyzed the results to compare the academic performance of the control group with that of the experimental group. In all cases, a significance level of P < .05 was adopted. Use of the dynamic soundfield system was effective for improving the students' academic performance on standardized tests for reading, improving the teacher's speech intelligibility, and reducing the teacher's vocal strain. The dynamic soundfield system minimizes the impact of noise in the classroom as demonstrated by the mensuration of the signal-to-noise ratio (SNR) and pupil performance on standardized tests for reading and student and teacher ratings of amplification system effectiveness.
Voice Response System Statistics Program : Operational Handbook.
DOT National Transportation Integrated Search
1980-06-01
This report documents the Voice Response System (VRS) Statistics Program developed for the preflight weather briefing VRS. It describes the VRS statistical report format and contents, the software program structure, and the program operation.
Generation of surgical pathology report using a 5,000-word speech recognizer.
Tischler, A S; Martin, M R
1989-10-01
Pressures to decrease both turnaround time and operating costs simultaneously have placed conflicting demands on traditional forms of medical transcription. The new technology of voice recognition extends the promise of enabling the pathologist or other medical professional to dictate a correct report and have it printed and/or transmitted to a database immediately. The usefulness of voice recognition systems depends on several factors, including ease of use, reliability, speed, and accuracy. These in turn depend on the general underlying design of the systems and inclusion in the systems of a specific knowledge base appropriate for each application. Development of a good knowledge base requires close collaboration between a domain expert and a knowledge engineer with expertise in voice recognition. The authors have recently completed a knowledge base for surgical pathology using the Kurzweil VoiceReport 5,000-word system.
Functional Dysphonia during Mental Imagery: Testing the Trait Theory of Voice Disorders
ERIC Educational Resources Information Center
van Mersbergen, Miriam; Patrick, Christopher; Glaze, Leslie
2008-01-01
Purpose: Previous research has proposed that persons with functional dysphonia (FD) present with temperamental traits that predispose them to their voice disorder. We investigated this theory in a controlled experiment and compared them with social anxiety (SA) and healthy control (HC) groups. Method: Twelve participants with FD, 19 participants…
Maternal Voice and Short-Term Outcomes in Preterm Infants
Krueger, Charlene; Parker, Leslie; Chiu, Sheau-Huey; Theriaque, Douglas
2013-01-01
This study explored effects of exposure to maternal voice on short-term outcomes in very low birth weight preterm infants cared for within an neonatal intensive care unit (NICU) without an ongoing program of developmental care. Using a comparative design, 53 infants born during their 27th to 28th postmenstrual week were sampled by convenience. Experimental groups were exposed to maternal voice during two developmental time periods. Group 1 listened to a recording of their mothers reciting a rhyme from 28 to 34 postmenstrual weeks. Group 2 waited 4 weeks and heard the recording from 32 to 34 weeks. The control group received routine care. The primary analysis of combined experimental groups compared to the control group revealed that the experimental infants experienced significantly fewer episodes of feeding intolerance and achieved full enteral feeds quicker compared to the control group. Further, in an analysis evaluating all three groups separately, it was noted that Group 1 experienced significantly fewer episodes of feeding intolerance compared to the control group. Study findings warrant further investigation of exposure to maternal voice and the developmental timing at which exposure is begun. PMID:20112262
Space Shuttle Orbiter audio subsystem. [to communication and tracking system
NASA Technical Reports Server (NTRS)
Stewart, C. H.
1978-01-01
The selection of the audio multiplex control configuration for the Space Shuttle Orbiter audio subsystem is discussed and special attention is given to the evaluation criteria of cost, weight and complexity. The specifications and design of the subsystem are described and detail is given to configurations of the audio terminal and audio central control unit (ATU, ACCU). The audio input from the ACCU, at a signal level of -12.2 to 14.8 dBV, nominal range, at 1 kHz, was found to have balanced source impedance and a balanced local impedance of 6000 + or - 600 ohms at 1 kHz, dc isolated. The Lyndon B. Johnson Space Center (JSC) electroacoustic test laboratory, an audio engineering facility consisting of a collection of acoustic test chambers, analyzed problems of speaker and headset performance, multiplexed control data coupled with audio channels, and the Orbiter cabin acoustic effects on the operational performance of voice communications. This system allows technical management and project engineering to address key constraining issues, such as identifying design deficiencies of the headset interface unit and the assessment of the Orbiter cabin performance of voice communications, which affect the subsystem development.
Payne, Thomas H; Alonso, W David; Markiel, J Andrew; Lybarger, Kevin; White, Andrew A
2018-01-01
We describe the development and design of a smartphone app-based system to create inpatient progress notes using voice, commercial automatic speech recognition software, with text processing to recognize spoken voice commands and format the note, and integration with a commercial EHR. This new system fits hospital rounding workflow and was used to support a randomized clinical trial testing whether use of voice to create notes improves timeliness of note availability, note quality, and physician satisfaction with the note creation process. The system was used to create 709 notes which were placed in the corresponding patient's EHR record. The median time from pressing the Send button to appearance of the formatted note in the Inbox was 8.8 min. It was generally very reliable, accepted by physician users, and secure. This approach provides an alternative to use of keyboard and templates to create progress notes and may appeal to physicians who prefer voice to typing. Copyright © 2017 Elsevier Inc. All rights reserved.
Niebudek-Bogusz, Ewa; Sliwińska-Kowalska, Mariola
2006-01-01
An assessment of the vocal system, as a part of the medical certification of occupational diseases, should be objective and reliable. Therefore, interest in the method of acoustic voice analysis enabling objective assessment of voice parameters is still growing. The aim of the present study was to evaluate the applicability of acoustic analysis with vocal loading test to the diagnostics of occupational voice disorders. The results of acoustic voice analysis were compared using IRIS software for phoniatrics, before and after a 30-min vocal loading test in 35 female teachers with diagnosed occupational voice disorders (group I) and in 31 female teachers with functional dysphonia (group II). In group I, vocal effort produced significant abnormalities in voice acoustic parameters, compared to group II. These included significantly increased mean fundamental frequency (Fo) value (by 11 Hz) and worsened jitter, shimmer and NHR parameters. Also, the percentage of subjects showing abnormalities in voice acoustic analysis was higher in this group. Conducting voice acoustic analysis before and after the vocal loading test makes it possible to objectively confirm irreversible voice impairments in persons with work-related pathologies of the larynx, which is essential for medical certification of occupational voice diseases.
Virtual workstation - A multimodal, stereoscopic display environment
NASA Astrophysics Data System (ADS)
Fisher, S. S.; McGreevy, M.; Humphries, J.; Robinett, W.
1987-01-01
A head-mounted, wide-angle, stereoscopic display system controlled by operator position, voice and gesture has been developed for use in a multipurpose interface environment. The system provides a multisensory, interactive display environment in which a user can virtually explore a 360-degree synthesized or remotely sensed environment and can viscerally interact with its components. Primary applications of the system are in telerobotics, management of large-scale integrated information systems, and human factors research. System configuration, application scenarios, and research directions are described.
Aydınlı, Fatma Esen; Özcebe, Esra; Kulak Kayıkçı, Maviş E; Yılmaz, Taner; Özgür, Fatma F
2016-11-01
The aim was to investigate the effects of glottal stop productions (GS) on voice in children with cleft palate using multidimensional voice assessment methods. This is a prospective case-control study. Children with repaired cleft palate (n = 34) who did not have any vocal fold lesions were separated into two groups based on the results of the articulation test. The glottal stop group (GSG) consisted of 17 children who had GS. The control group (CG) consisted of an equal number of age- and gender-matched children who did not have GS. The voice evaluation protocol included acoustic analysis, Pediatric Voice Handicap Index (pVHI), and perceptual analysis (Grade, Roughness, Breathiness, Asthenia, Strain method). The velopharyngeal statuses of the groups were compared using the nasopharyngoscopy and the nasometer. The total pVHI score and the subscales of the pVHI were found to be significantly higher in the GSG. The F0, jitter, and shimmer were found to be numerically higher in the GSG with the difference being statistically significant in jitter (P < 0.05). Audioperceptual analysis revealed a difference in overall voice quality and roughness between the groups. Greater incidence of significant velopharyngeal insufficiency and higher nasalance scores were found in the GSG (P < 0.05). These results may indicate that the vocal quality characteristics of children with GS differ from children who do not have this type of production. It is suggested that children with cleft palate who have GS should receive a comprehensive speech and language pathology intervention including voice therapy techniques. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Current trends in small vocabulary speech recognition for equipment control
NASA Astrophysics Data System (ADS)
Doukas, Nikolaos; Bardis, Nikolaos G.
2017-09-01
Speech recognition systems allow human - machine communication to acquire an intuitive nature that approaches the simplicity of inter - human communication. Small vocabulary speech recognition is a subset of the overall speech recognition problem, where only a small number of words need to be recognized. Speaker independent small vocabulary recognition can find significant applications in field equipment used by military personnel. Such equipment may typically be controlled by a small number of commands that need to be given quickly and accurately, under conditions where delicate manual operations are difficult to achieve. This type of application could hence significantly benefit by the use of robust voice operated control components, as they would facilitate the interaction with their users and render it much more reliable in times of crisis. This paper presents current challenges involved in attaining efficient and robust small vocabulary speech recognition. These challenges concern feature selection, classification techniques, speaker diversity and noise effects. A state machine approach is presented that facilitates the voice guidance of different equipment in a variety of situations.
The interaction of tone with voicing and foot structure: evidence from Kera phonetics and phonology
NASA Astrophysics Data System (ADS)
Pearce, Mary Dorothy
This thesis uses acoustic measurements as a basis for the phonological analysis of the interaction of tone with voicing and foot structure in Kera (a Chadic language). In both tone spreading and vowel harmony, the iambic foot acts as a domain for spreading. Further evidence for the foot comes from measurements of duration, intensity and vowel quality. Kera is unusual in combining a tone system with a partially independent metrical system based on iambs. In words containing more than one foot, the foot is the tone bearing unit (TBU), but in shorter words, the TBU is the syllable. In perception and production experiments, results show that Kera speakers, unlike English and French, use the fundamental frequency as the principle cue to 'Voicing" contrast. Voice onset time (VOT) has only a minor role. Historically, tones probably developed from voicing through a process of tonogenesis, but synchronically, the feature voice is no longer contrastive and VOT is used in an enhancing role. Some linguists have claimed that Kera is a key example for their controversial theory of long-distance voicing spread. But as voice is not part of Kera phonology, this thesis gives counter-evidence to the voice spreading claim. An important finding from the experiments is that the phonological grammars are different between village women, men moving to town and town men. These differences are attributed to French contact. The interaction between Kera tone and voicing and contact with French have produced changes from a 2-way voicing contrast, through a 3-way tonal contrast, to a 2-way voicing contrast plus another contrast with short VOT. These diachronic and synchronic tone/voicing facts are analysed using laryngeal features and Optimality Theory. This thesis provides a body of new data, detailed acoustic measurements, and an analysis incorporating current theoretical issues in phonology, which make it of interest to Africanists and theoreticians alike.
[Signs and symptoms of autonomic dysfunction in dysphonic individuals].
Park, Kelly; Behlau, Mara
2011-01-01
To verify the occurrence of signs and symptoms of autonomic nervous system dysfunction in individuals with behavioral dysphonia, and to compare it with the results obtained by individuals without vocal complaints. Participants were 128 adult individuals with ages between 14 and 74 years, divided into two groups: behavioral dysphonia (61 subjects) and without vocal complaints (67 subjects). It was administered the Protocol of Autonomic Dysfunction, containing 46 questions: 22 related to the autonomic nervous system and had no direct relationship with voice, 16 related to both autonomic nervous system and voice, six non-relevant questions, and two reliability questions. There was a higher occurrence of reported neurovegetative signs in the group with behavioral dysphonia, in questions related to voice, such as frequent throat clearing, frequent swallowing need, fatigability when speaking, and sore throat. In questions not directly related to voice, dysphonic individuals presented greater occurrence of three out of 22 symptoms: gas, tinnitus and aerophagia. Both groups presented similar results in questions non-relevant to the autonomic nervous system. Reliability questions needed reformulation. Individuals with behavioral dysphonia present higher occurrence of neurovegetative signs and symptoms, particularly those with direct relationship with voice, indicating greater lability of the autonomic nervous system in these subjects.
Evaluating iPhone recordings for acoustic voice assessment.
Lin, Emily; Hornibrook, Jeremy; Ormond, Tika
2012-01-01
This study examined the viability of using iPhone recordings for acoustic measurements of voice quality. Acoustic measures were compared between voice signals simultaneously recorded from 11 normal speakers (6 females and 5 males) through an iPhone (model A1303, Apple, USA) and a comparison recording system. Comparisons were also conducted between the pre- and post-operative voices recorded from 10 voice patients (4 females and 6 males) through the iPhone. Participants aged between 27 and 79 years. Measures from iPhone and comparison signals were found to be highly correlated. Findings of the effects of vowel type on the selected measures were consistent between the two recording systems and congruent with previous findings. Analysis of the patient data revealed that a selection of acoustic measures, such as vowel space area and voice perturbation measures, consistently demonstrated a positive change following phonosurgery. The present findings indicated that the iPhone device tested was useful for tracking voice changes for clinical management. Preliminary findings regarding factors such as gender and type of pathology suggest that intra-subject, instead of norm-referenced, comparisons of acoustic measures would be more useful in monitoring the progression of a voice disorder or tracking the treatment effect. Copyright © 2012 S. Karger AG, Basel.
Bauer, Jay J; Mittal, Jay; Larson, Charles R; Hain, Timothy C
2006-04-01
The present study tested whether subjects respond to unanticipated short perturbations in voice loudness feedback with compensatory responses in voice amplitude. The role of stimulus magnitude (+/- 1,3 vs 6 dB SPL), stimulus direction (up vs down), and the ongoing voice amplitude level (normal vs soft) were compared across compensations. Subjects responded to perturbations in voice loudness feedback with a compensatory change in voice amplitude 76% of the time. Mean latency of amplitude compensation was 157 ms. Mean response magnitudes were smallest for 1-dB stimulus perturbations (0.75 dB) and greatest for 6-dB conditions (0.98 dB). However, expressed as gain, responses for 1-dB perturbations were largest and almost approached 1.0. Response magnitudes were larger for the soft voice amplitude condition compared to the normal voice amplitude condition. A mathematical model of the audio-vocal system captured the main features of the compensations. Previous research has demonstrated that subjects can respond to an unanticipated perturbation in voice pitch feedback with an automatic compensatory response in voice fundamental frequency. Data from the present study suggest that voice loudness feedback can be used in a similar manner to monitor and stabilize voice amplitude around a desired loudness level.
The Johns Hopkins Medical Institutions' Premise Distribution Plan
Barta, Wendy; Buckholtz, Howard; Johnston, Mark; Lenhard, Raymond; Tolchin, Stephen; Vienne, Donald
1987-01-01
A Premise Distribution Plan is being developed to address the growing voice and data communications needs at Johns Hopkins Medical Institutions. More specifically, the use of a rapidly expanding Ethernet computer network and a new Integrated Services Digital Network (ISDN) Digital Centrex system must be planned to provide easy, reliable and cost-effective data and voice communications services. Existing Premise Distribution Systems are compared along with voice and data technologies which would use them.
Voice disorders and mental health in teachers: a cross-sectional nationwide study.
Nerrière, Eléna; Vercambre, Marie-Noël; Gilbert, Fabien; Kovess-Masféty, Viviane
2009-10-02
Teachers, as professional voice users, are at particular risk of voice disorders. Among contributing factors, stress and psychological tension could play a role but epidemiological data on this problem are scarce. The aim of this study was to evaluate prevalence and cofactors of voice disorders among teachers in the French National Education system, with particular attention paid to the association between voice complaint and psychological status. The source data come from an epidemiological postal survey on physical and mental health conducted in a sample of 20,099 adults (in activity or retired) selected at random from the health plan records of the national education system. Overall response rate was 53%. Of the 10,288 respondents, 3,940 were teachers in activity currently giving classes to students. In the sample of those with complete data (n = 3,646), variables associated with voice disorders were investigated using logistic regression models. Studied variables referred to demographic characteristics, socio-professional environment, psychological distress, mental health disorders (DSM-IV), and sick leave. One in two female teachers reported voice disorders (50.0%) compared to one in four males (26.0%). Those who reported voice disorders presented higher level of psychological distress. Sex- and age-adjusted odds ratios [95% confidence interval] were respectively 1.8 [1.5-2.2] for major depressive episode, 1.7 [1.3-2.2] for general anxiety disorder, and 1.6 [1.2-2.2] for phobia. A significant association between voice disorders and sick leave was also demonstrated (1.5 [1.3-1.7]). Voice disorders were frequent among French teachers. Associations with psychiatric disorders suggest that a situation may exist which is more complex than simple mechanical failure. Further longitudinal research is needed to clarify the comorbidity between voice and psychological disorders.
[Research on Barrier-free Home Environment System Based on Speech Recognition].
Zhu, Husheng; Yu, Hongliu; Shi, Ping; Fang, Youfang; Jian, Zhuo
2015-10-01
The number of people with physical disabilities is increasing year by year, and the trend of population aging is more and more serious. In order to improve the quality of the life, a control system of accessible home environment for the patients with serious disabilities was developed to control the home electrical devices with the voice of the patients. The control system includes a central control platform, a speech recognition module, a terminal operation module, etc. The system combines the speech recognition control technology and wireless information transmission technology with the embedded mobile computing technology, and interconnects the lamp, electronic locks, alarms, TV and other electrical devices in the home environment as a whole system through a wireless network node. The experimental results showed that speech recognition success rate was more than 84% in the home environment.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kazi, Rehan; Institute of Cancer Research, London; Venkitaraman, Ramachandran
2008-02-01
Purpose: To conduct prospective electroglottographic analyses of voice outcomes after radical chemoradiotherapy for locally advanced laryngopharyngeal cancers and to compare them with patients who have undergone total laryngectomy (TL). Patients and Methods: Twenty-one patients (19 male, 2 female, median age [range] 65 [50-85] years) with Stage III/IV laryngopharyngeal cancer received induction chemotherapy followed by radical chemoradiotherapy. Electroglottography, using the sustained vowel /i/ and connected speech, was performed before treatment and 1, 6, and 12 months after treatment. In addition, single voice recordings were taken from 21 patients (16 male, 5 female, aged 65 [50-84] years) who had undergone TL andmore » surgical voice restoration and from 21 normal controls (18 male, 3 female, aged 65 [33-80] years). Results: Before treatment the vocal measures for the chemoradiotherapy patients were significantly different from normal controls in jitter (p = 0.02), maximum phonation time (MPT) (p = 0.001), and words per minute (WPM) (p = 0.01). At 12 months after treatment MPT and WPM had normalized, but jitter and normalized noise energy were significantly worse than in normal controls. Comparison of voice outcomes at 12 months for chemoradiotherapy patients revealed superiority over the TL group in all parameters except MPT (18.2 s vs. 10.4 s, p = 0.06). Analysis of the recovery of voice up to 12 months after treatment revealed progressive improvement in most electroglottographic measures. Conclusions: This prospective study demonstrates significantly better outcome for patients treated with chemoradiotherapy as compared with TL. Progressive normalization of many voice parameters occurs over the 12 months following chemoradiotherapy.« less
The singer's formant and speaker's ring resonance: a long-term average spectrum analysis.
Lee, Sang-Hyuk; Kwon, Hee-Jun; Choi, Hyun-Jin; Lee, Nam-Hun; Lee, Sung-Jin; Jin, Sung-Min
2008-06-01
We previously showed that a trained tenor's voice has the conventional singer's formant at the region of 3 kHz and another energy peak at 8-9 kHz. Singers in other operatic voice ranges are assumed to have the same peak in their singing and speaking voice. However, to date, no specific measurement of this has been made. Tenors, baritones, sopranos and mezzo sopranos were chosen to participate in this study of the singer's formant and the speaker's ring resonance. Untrained males (n=15) and females (n=15) were included in the control group. Each subject was asked to produce successive /a/ vowel sounds in their singing and speaking voice. For singing, the low pitch was produced in the chest register and the high notes in the head register. We collected the data on the long-term average spectra of the speaking and singing voices of the trained singers and the control groups. For the sounds produced from the head register, a significant energy concentration was seen in both 2.2-3.4 kHz and 7.5-8.4 kHz regions (except for the voices of the mezzo sopranos) in the trained singer group when compared to the control groups. Also, the chest register had a significant energy concentration in the 4 trained singer groups at the 2.2-3.1 kHz and 7.8-8.4 kHz. For speaking sound, all trained singers had a significant energy concentration at 2.2-5.3 kHz and sopranos had another energy concentration at 9-10 kHz. The results of this study suggest that opera singers have more energy concentration in the singer's formant/speaker's ring region, in both singing and speaking voices. Furthermore, another region of energy concentration was identified in opera singer's singing sound and in sopranos' speaking sound at 8-9 kHz. The authors believe that these energy concentrations may contribute to the rich voice of trained singers.
ERIC Educational Resources Information Center
Davis, Barbara D.; Tisdale, Judy Jones; Krapels, Roberta H.
2001-01-01
Surveys corporate use of voice message systems by interviewing employees in four different companies. Finds that all four companies viewed their voicemail systems as a supplement to personal contact (not a replacement) and provided training, but had no formal method to assess customer satisfaction with their system. Suggests business communication…
Impact of voice- and knowledge-enabled clinical reporting--US example.
Bushko, Renata G; Havlicek, Penny L; Deppert, Edward; Epner, Stephen
2002-01-01
This study shows qualitative and quantitative estimates of the national and the clinic level impact of utilizing voice and knowledge enabled clinical reporting systems. Using common sense estimation methodology, we show that the delivery of health care can experience a dramatic improvement in four areas as a result of the broad use of voice and knowledge enabled clinical reporting: (1) Process Quality as measured by cost savings, (2) Organizational Quality as measured by compliance, (3) Clinical Quality as measured by clinical outcomes and (4) Service Quality as measured by patient satisfaction. If only 15 percent of US physicians replaced transcription with modem clinical reporting voice-based methodology, about one half billion dollars could be saved. $6.7 Billion could be saved annually if all medical reporting currently transcribed was handled with voice-and knowledge-enabled dictation and reporting systems.
Lechner, William J; MacGlashan, James; Wray, Tyler B; Littman, Michael L
2017-01-01
Background Computer-delivered interventions have been shown to be effective in reducing alcohol consumption in heavy drinking college students. However, these computer-delivered interventions rely on mouse, keyboard, or touchscreen responses for interactions between the users and the computer-delivered intervention. The principles of motivational interviewing suggest that in-person interventions may be effective, in part, because they encourage individuals to think through and speak aloud their motivations for changing a health behavior, which current computer-delivered interventions do not allow. Objective The objective of this study was to take the initial steps toward development of a voice-based computer-delivered intervention that can ask open-ended questions and respond appropriately to users’ verbal responses, more closely mirroring a human-delivered motivational intervention. Methods We developed (1) a voice-based computer-delivered intervention that was run by a human controller and that allowed participants to speak their responses to scripted prompts delivered by speech generation software and (2) a text-based computer-delivered intervention that relied on the mouse, keyboard, and computer screen for all interactions. We randomized 60 heavy drinking college students to interact with the voice-based computer-delivered intervention and 30 to interact with the text-based computer-delivered intervention and compared their ratings of the systems as well as their motivation to change drinking and their drinking behavior at 1-month follow-up. Results Participants reported that the voice-based computer-delivered intervention engaged positively with them in the session and delivered content in a manner consistent with motivational interviewing principles. At 1-month follow-up, participants in the voice-based computer-delivered intervention condition reported significant decreases in quantity, frequency, and problems associated with drinking, and increased perceived importance of changing drinking behaviors. In comparison to the text-based computer-delivered intervention condition, those assigned to voice-based computer-delivered intervention reported significantly fewer alcohol-related problems at the 1-month follow-up (incident rate ratio 0.60, 95% CI 0.44-0.83, P=.002). The conditions did not differ significantly on perceived importance of changing drinking or on measures of drinking quantity and frequency of heavy drinking. Conclusions Results indicate that it is feasible to construct a series of open-ended questions and a bank of responses and follow-up prompts that can be used in a future fully automated voice-based computer-delivered intervention that may mirror more closely human-delivered motivational interventions to reduce drinking. Such efforts will require using advanced speech recognition capabilities and machine-learning approaches to train a program to mirror the decisions made by human controllers in the voice-based computer-delivered intervention used in this study. In addition, future studies should examine enhancements that can increase the perceived warmth and empathy of voice-based computer-delivered intervention, possibly through greater personalization, improvements in the speech generation software, and embodying the computer-delivered intervention in a physical form. PMID:28659259
[The lombard reflex as a test of vocal function (author's transl)].
Schultz-Coulon, H J; Fues, C P
1976-06-01
Any impairment of audio-phonatory control by background noise is followed by an increase in both the intensity and pitch of the speaking voice (Lombard reflex, 1911), thus increasing vocal strain. As a consequence, it might be anticipated that persons reacting to noise with marked changes in voice might be more liable to develop dysphonia. 22 singers, 34 normal controls, and 22 patients with hyperfunctional dysphonia where studied. In all patients, both ears were gradually masked with white noise. The change of the mean intensity level and of the mean pitch level of the speaking voice were then measured objectively with a special fundamental frequency analyzer (Fedders and Schultz-Coulon, 1975). Results show that the increase of intensity is comparable in all subjects, whereas the elevation of the mean pitch level differs significantly: trained voices (singers) react with the least pitch increment whereas dysphonic patients react with the most. The following conclusions were made from the present investigation: 1. Extreme increments in pitch level can be considered to be a more significant etiological factor of dysphonia than intensity increments; 2. Vocal therapy and voice training may have a favorable effect on the Lombard reflex (probably by improvement of the kinesthetic control mechanism) so that the speaking voice in a noisy environment is raised less with less vocal strain. The study also indicates that measurement of pitch changes during binaural masking can provide important information for the diagnosis, therapy and prophylaxis of dysphonia.
Reimer, Bryan; Mehler, Bruce; Reagan, Ian; Kidd, David; Dobres, Jonathan
2016-12-01
There is limited research on trade-offs in demand between manual and voice interfaces of embedded and portable technologies. Mehler et al. identified differences in driving performance, visual engagement and workload between two contrasting embedded vehicle system designs (Chevrolet MyLink and Volvo Sensus). The current study extends this work by comparing these embedded systems with a smartphone (Samsung Galaxy S4). None of the voice interfaces eliminated visual demand. Relative to placing calls manually, both embedded voice interfaces resulted in less eyes-off-road time than the smartphone. Errors were most frequent when calling contacts using the smartphone. The smartphone and MyLink allowed addresses to be entered using compound voice commands resulting in shorter eyes-off-road time compared with the menu-based Sensus but with many more errors. Driving performance and physiological measures indicated increased demand when performing secondary tasks relative to 'just driving', but were not significantly different between the smartphone and embedded systems. Practitioner Summary: The findings show that embedded system and portable device voice interfaces place fewer visual demands on the driver than manual interfaces, but they also underscore how differences in system designs can significantly affect not only the demands placed on drivers, but also the successful completion of tasks.
Embracing the Complexity of Educational Programmes
ERIC Educational Resources Information Center
Govers, Elly
2016-01-01
Systems of monitoring and control have left many educators and organisations in the field of post-compulsory education struggling to find ways to meet the needs of an increasingly diverse society. Education is complex. Many educators would agree that it is influenced by many, often contradictory, voices and power structures. Based on the findings…
Xu, Qingsong
2013-05-01
Limited-angle rotary micropositioning stages are required in precision engineering applications where an ultrahigh-precision rotational motion within a restricted range is needed. This paper presents the design, fabrication, and control of a compliant rotary micropositioning stage dedicated to the said applications. To tackle the challenge of achieving both a large rotational range and a compact size, a new idea of multi-stage compound radial flexure is proposed. A compact rotary stage is devised to deliver an over 10° rotational range while possessing a negligible magnitude of center shift. The stage is driven by a linear voice coil motor and its output motion is measured by laser displacement sensors. Analytical models are derived to facilitate the parametric design, which is validated by conducting finite element analysis. The actuation and sensing issues are addressed to guarantee the stage performance. A prototype is fabricated and a proportional-integral-derivative control is implemented to achieve a precise positioning. Experimental results demonstrate a resolution of 2 μrad over 10° rotational range as well as a low level of center shift of the rotary micropositioning system.
Sherrard, Heather; Duchesne, Lloyd; Wells, George; Kearns, Sharon Ann; Struthers, Christine
2015-01-01
There is evidence from large clinical trials that compliance with standardized best practice guidelines (BPGs) improves survival of acute coronary syndrome (ACS) patients. However, their application is often suboptimal. In this study, the researchers evaluated whether the use of an interactive voice response (IVR) follow-up system improved ACS BPG compliance. This was a single-centre randomized control trial (RCT) of 1,608 patients (IVR=803; usual care=805). The IVR group received five automated calls in 12 months. The primary composite outcome was increased medication compliance and decreased adverse events. A significant improvement of 60% in the IVR group for the primary composite outcome was found (RR 1.60, 95% CI: 1.29 to 2.00, p <0.001). There was significant improvement in medication compliance (p <0.001) and decrease in unplanned medical visits (p = 0.023). At one year, the majority of patients ( 85%) responded positively to using the system again. Follow-up by IVR produced positive outcomes in ACS patients.
National Voice Response System (VRS) Implementation Plan Alternatives Study
DOT National Transportation Integrated Search
1979-07-01
This study examines the alternatives available to implement a national Voice Response System (VRS) for automated preflight weather briefings and flight plan filing. Four major hardware configurations are discussed. A computerized analysis model was d...
Color and texture associations in voice-induced synesthesia
Moos, Anja; Simmons, David; Simner, Julia; Smith, Rachel
2013-01-01
Voice-induced synesthesia, a form of synesthesia in which synesthetic perceptions are induced by the sounds of people's voices, appears to be relatively rare and has not been systematically studied. In this study we investigated the synesthetic color and visual texture perceptions experienced in response to different types of “voice quality” (e.g., nasal, whisper, falsetto). Experiences of three different groups—self-reported voice synesthetes, phoneticians, and controls—were compared using both qualitative and quantitative analysis in a study conducted online. Whilst, in the qualitative analysis, synesthetes used more color and texture terms to describe voices than either phoneticians or controls, only weak differences, and many similarities, between groups were found in the quantitative analysis. Notable consistent results between groups were the matching of higher speech fundamental frequencies with lighter and redder colors, the matching of “whispery” voices with smoke-like textures, and the matching of “harsh” and “creaky” voices with textures resembling dry cracked soil. These data are discussed in the light of current thinking about definitions and categorizations of synesthesia, especially in cases where individuals apparently have a range of different synesthetic inducers. PMID:24032023
Cross-cultural Adaption and Validation of the Danish Voice Handicap Index.
Sorensen, Jesper Roed; Printz, Trine; Mehlum, Camilla Slot; Heidemann, Christian Hamilton; Groentved, Aagot Moeller; Godballe, Christian
2018-02-02
We aimed to assess psychometric properties, including internal consistency, reliability, and clinical validity of the Danish version of the Voice Handicap Index (VHI). A cross-sectional survey study was carried out. For validation, the existing nonvalidated Danish version of the VHI was used. Data from 208 patients with voice disorders of different etiology (neurogenic, functional, and structural) and a control group of 85 vocally healthy individuals were included. A test-retest reliability analysis of 42 patients and 45 control persons was performed. The internal consistency, test-retest reliability, and clinical validity of the questionnaire were assessed. Internal consistency was high with a Cronbach α >0.90 for both the patient and control group. Test-retest reliability measured as intraclass correlation coefficient was good with 0.93 (95% confidence interval [95% confidence interval]: 0.87-0.96) for patients and 0.78 (95% confidence interval: 0.63-0.87) for the control group which indicates sufficient reliability of the questionnaire. The Danish VHI has good clinical validity as it has a strong correlation between patient's perception of the severity of their voice disorder and the VHI score from the Spearman correlation of 0.69. The existing Danish version of the VHI has been thoroughly validated and found to be in line with the original VHI from Jacobsen et al. It showed good internal consistency, test-retest reliability, and clinical validity. It is suitable for use in daily practice and in research projects as it is able to assess patients' perception of their voice disorder severity. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Tamplin, Jeanette; Baker, Felicity A; Grocke, Denise; Brazzale, Danny J; Pretto, Jeffrey J; Ruehland, Warren R; Buttifant, Mary; Brown, Douglas J; Berlowitz, David J
2013-03-01
To explore the effects of singing training on respiratory function, voice, mood, and quality of life for people with quadriplegia. Randomized controlled trial. Large, university-affiliated public hospital, Victoria, Australia. Participants (N=24) with chronic quadriplegia (C4-8, American Spinal Injury Association grades A and B). The experimental group (n=13) received group singing training 3 times weekly for 12 weeks. The control group (n=11) received group music appreciation and relaxation for 12 weeks. Assessments were conducted pre, mid-, immediately post-, and 6-months postintervention. Standard respiratory function testing, surface electromyographic activity from accessory respiratory muscles, sound pressure levels during vocal tasks, assessments of voice quality (Perceptual Voice Profile, Multidimensional Voice Profile), and Voice Handicap Index, Profile of Mood States, and Assessment of Quality of Life instruments. The singing group increased projected speech intensity (P=.028) and maximum phonation length (P=.007) significantly more than the control group. Trends for improvements in respiratory function, muscle strength, and recruitment were also evident for the singing group. These effects were limited by small sample sizes with large intersubject variability. Both groups demonstrated an improvement in mood (P=.002), which was maintained in the music appreciation and relaxation group after 6 months (P=.017). Group music therapy can have a positive effect on not only physical outcomes, but also can improve mood, energy, social participation, and quality of life for an at-risk population, such as those with quadriplegia. Specific singing therapy can augment these general improvements by improving vocal intensity. Copyright © 2013 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
A disturbance observer-based adaptive control approach for flexure beam nano manipulators.
Zhang, Yangming; Yan, Peng; Zhang, Zhen
2016-01-01
This paper presents a systematic modeling and control methodology for a two-dimensional flexure beam-based servo stage supporting micro/nano manipulations. Compared with conventional mechatronic systems, such systems have major control challenges including cross-axis coupling, dynamical uncertainties, as well as input saturations, which may have adverse effects on system performance unless effectively eliminated. A novel disturbance observer-based adaptive backstepping-like control approach is developed for high precision servo manipulation purposes, which effectively accommodates model uncertainties and coupling dynamics. An auxiliary system is also introduced, on top of the proposed control scheme, to compensate the input saturations. The proposed control architecture is deployed on a customized-designed nano manipulating system featured with a flexure beam structure and voice coil actuators (VCA). Real time experiments on various manipulating tasks, such as trajectory/contour tracking, demonstrate precision errors of less than 1%. Copyright © 2015 ISA. Published by Elsevier Ltd. All rights reserved.
Voice Recognition Software Accuracy with Second Language Speakers of English.
ERIC Educational Resources Information Center
Coniam, D.
1999-01-01
Explores the potential of the use of voice-recognition technology with second-language speakers of English. Involves the analysis of the output produced by a small group of very competent second-language subjects reading a text into the voice recognition software Dragon Systems "Dragon NaturallySpeaking." (Author/VWL)
Vogel, Adam P; Fletcher, Janet; Snyder, Peter J; Fredrickson, Amy; Maruff, Paul
2011-03-01
Assessment of the voice for supporting classifications of central nervous system (CNS) impairment requires a different practical, methodological, and statistical framework compared with assessment of the voice to guide decisions about change in the CNS. In experimental terms, an understanding of the stability and sensitivity to change of an assessment protocol is required to guide decisions about CNS change. Five experiments (N = 70) were conducted using a set of commonly used stimuli (eg, sustained vowel, reading, extemporaneous speech) and easily acquired measures (eg, f₀-f₄, percent pause). Stability of these measures was examined through their repeated application in healthy adults over brief and intermediate retest intervals (ie, 30 seconds, 2 hours, and 1 week). Those measures found to be stable were then challenged using an experimental model that reliably changes voice acoustic properties (ie, the Lombard effect). Finally, adults with an established CNS-related motor speech disorder (dysarthria) were compared with healthy controls. Of the 61 acoustic variables studied, 36 showed good stability over all three stability experiments (eg, number of pauses, total speech time, speech rate, f₀-f₄. Of the measures with good stability, a number of frequency measures showed a change in response to increased vocal effort resulting from the Lombard effect challenge. Furthermore, several timing measures significantly separated the control and motor speech impairment groups. Measures with high levels of stability within healthy adults, and those that show sensitivity to change and impairment may prove effective for monitoring changes in CNS functioning. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
WES (Waterways Experiment Station) Communications Plan for Voice and Data
1989-01-01
modem on a leased line, and two wideband HDLC 56K connections not used on the Honeywell. 30. Honeywell DPS-8 configuration, as of October 1987, is as...based voice system to support additional asynchronous dial-up modem traffic. In June 1987, Dr. N. Radhakhrishnan of the WES Information Technology...voice system (PBX) and very low-speed data communications by the laboratories using 1,200/2,400-baud asynchronous modems over analog phone lines, and
Electromyographic activity of strap and cricothyroid muscles in pitch change.
Roubeau, B; Chevrie-Muller, C; Lacau Saint Guily, J
1997-05-01
The EMG activity of the cricothyroid muscle (CT) and the three extrinsic laryngeal muscles (thyohyoid, TH; sternothyroid, ST, and sternohyoid, SH) were recorded throughout the voice range of one female and one male subject, both untrained singers. The voice range was examined using rising and falling glissandos (production of a sustained sound with progressive and continuous variation of fundamental frequency). Muscle activity was observed at various pitches during the glissandos. The strap muscle activity during the production of glissandos appears to be synergistic. At the lowest frequency, the CT is inactive but strap muscles (TH, ST, SH) are active. As frequency increases, strap muscle activity decreases while the CT controls frequency in the middle of the range. At higher frequencies the strap muscles once again become active. This activity might depend on the vocal vibratory mechanism involved. The role of the strap muscles at high pitches is a widely debated point but it seems that in some way they control the phenomena relevant to the rising pitch. The phasic-type strap muscle activity contrasts with the tonic-type activity of the CT. The CT closely controls the frequency, while the straps are not directly linked to the pitch but rather to the evolution of the frequency of voice production (speaking voice, singing voice, held notes, glissandos, trillo, vibrato, etc.).
Design of rapid prototype of UAV line-of-sight stabilized control system
NASA Astrophysics Data System (ADS)
Huang, Gang; Zhao, Liting; Li, Yinlong; Yu, Fei; Lin, Zhe
2018-01-01
The line-of-sight (LOS) stable platform is the most important technology of UAV (unmanned aerial vehicle), which can reduce the effect to imaging quality from vibration and maneuvering of the aircraft. According to the requirement of LOS stability system (inertial and optical-mechanical combined method) and UAV's structure, a rapid prototype is designed using based on industrial computer using Peripheral Component Interconnect (PCI) and Windows RTX to exchange information. The paper shows the control structure, and circuit system including the inertial stability control circuit with gyro and voice coil motor driven circuit, the optical-mechanical stability control circuit with fast-steering-mirror (FSM) driven circuit and image-deviation-obtained system, outer frame rotary follower, and information-exchange system on PC. Test results show the stability accuracy reaches 5μrad, and prove the effectiveness of the combined line-of-sight stabilization control system, and the real-time rapid prototype runs stable.
Telecommunications Systems Career Ladder, AFSC 307XO.
1981-01-01
standard test tone levels perform impulse noise tests make in-service or out-of- service quality check.s on composite signal transmission levels Even...service or out-of- service quality control (QC) reports maintain trouble and restoration record forms (DD Form 1443) direct circuit or system checks...include: perform fault isolation on analog circuits make in-service or out-of- service quality checks on voice frequency carrier telegraph (VFCT) terminals
Trainor, Laurel J; Marie, Céline; Bruce, Ian C; Bidelman, Gavin M
2014-02-01
Natural auditory environments contain multiple simultaneously-sounding objects and the auditory system must parse the incoming complex sound wave they collectively create into parts that represent each of these individual objects. Music often similarly requires processing of more than one voice or stream at the same time, and behavioral studies demonstrate that human listeners show a systematic perceptual bias in processing the highest voice in multi-voiced music. Here, we review studies utilizing event-related brain potentials (ERPs), which support the notions that (1) separate memory traces are formed for two simultaneous voices (even without conscious awareness) in auditory cortex and (2) adults show more robust encoding (i.e., larger ERP responses) to deviant pitches in the higher than in the lower voice, indicating better encoding of the former. Furthermore, infants also show this high-voice superiority effect, suggesting that the perceptual dominance observed across studies might result from neurophysiological characteristics of the peripheral auditory system. Although musically untrained adults show smaller responses in general than musically trained adults, both groups similarly show a more robust cortical representation of the higher than of the lower voice. Finally, years of experience playing a bass-range instrument reduces but does not reverse the high voice superiority effect, indicating that although it can be modified, it is not highly neuroplastic. Results of new modeling experiments examined the possibility that characteristics of middle-ear filtering and cochlear dynamics (e.g., suppression) reflected in auditory nerve firing patterns might account for the higher-voice superiority effect. Simulations show that both place and temporal AN coding schemes well-predict a high-voice superiority across a wide range of interval spacings and registers. Collectively, we infer an innate, peripheral origin for the higher-voice superiority observed in human ERP and psychophysical music listening studies. Copyright © 2013 Elsevier B.V. All rights reserved.
A Multimodal Emotion Detection System during Human-Robot Interaction
Alonso-Martín, Fernando; Malfaz, María; Sequeira, João; Gorostiza, Javier F.; Salichs, Miguel A.
2013-01-01
In this paper, a multimodal user-emotion detection system for social robots is presented. This system is intended to be used during human–robot interaction, and it is integrated as part of the overall interaction system of the robot: the Robotics Dialog System (RDS). Two modes are used to detect emotions: the voice and face expression analysis. In order to analyze the voice of the user, a new component has been developed: Gender and Emotion Voice Analysis (GEVA), which is written using the Chuck language. For emotion detection in facial expressions, the system, Gender and Emotion Facial Analysis (GEFA), has been also developed. This last system integrates two third-party solutions: Sophisticated High-speed Object Recognition Engine (SHORE) and Computer Expression Recognition Toolbox (CERT). Once these new components (GEVA and GEFA) give their results, a decision rule is applied in order to combine the information given by both of them. The result of this rule, the detected emotion, is integrated into the dialog system through communicative acts. Hence, each communicative act gives, among other things, the detected emotion of the user to the RDS so it can adapt its strategy in order to get a greater satisfaction degree during the human–robot dialog. Each of the new components, GEVA and GEFA, can also be used individually. Moreover, they are integrated with the robotic control platform ROS (Robot Operating System). Several experiments with real users were performed to determine the accuracy of each component and to set the final decision rule. The results obtained from applying this decision rule in these experiments show a high success rate in automatic user emotion recognition, improving the results given by the two information channels (audio and visual) separately. PMID:24240598
33 CFR 157.136 - Two-way voice communications.
Code of Federal Regulations, 2010 CFR
2010-07-01
... OIL IN BULK Crude Oil Washing (COW) System on Tank Vessels Design, Equipment, and Installation § 157.136 Two-way voice communications. Each tank vessel having a COW system under § 157.10(e), § 157.10a(a...
Decision support system in an international-voice-services business company
NASA Astrophysics Data System (ADS)
Hadianti, R.; Uttunggadewa, S.; Syamsuddin, M.; Soewono, E.
2017-01-01
We consider a problem facing by an international telecommunication services company in maximizing its profit. From voice services by controlling cost and business partnership. The competitiveness in this industry is very high, so that any efficiency from controlling cost and business partnership can help the company to survive in the very high competitiveness situation. The company trades voice traffic with a large number of business partners. There are four trading schemes that can be chosen by this company, namely, flat rate, class tiering, volume commitment, and revenue capped. Each scheme has a specific characteristic on the rate and volume deal, where the last three schemes are regarded as strategic schemes to be offered to business partner to ensure incoming traffic volume for both parties. This company and each business partner need to choose an optimal agreement in a certain period of time that can maximize the company’s profit. In this agreement, both parties agree to use a certain trading scheme, rate and rate/volume/revenue deal. A decision support system is then needed in order to give a comprehensive information to the sales officers to deal with the business partners. This paper discusses the mathematical model of the optimal decision for incoming traffic volume control, which is a part of the analysis needed to build the decision support system. The mathematical model is built by first performing data analysis to see how elastic the incoming traffic volume is. As the level of elasticity is obtained, we then derive a mathematical modelling that can simulate the impact of any decision on trading to the revenue of the company. The optimal decision can be obtained from these simulations results. To evaluate the performance of the proposed method we implement our decision model to the historical data. A software tool incorporating our methodology is currently in construction.
A new VOX technique for reducing noise in voice communication systems. [voice operated keying
NASA Technical Reports Server (NTRS)
Morris, C. F.; Morgan, W. C.; Shack, P. E.
1974-01-01
A VOX technique for reducing noise in voice communication systems is described which is based on the separation of voice signals into contiguous frequency-band components with the aid of an adaptive VOX in each band. It is shown that this processing scheme can effectively reduce both wideband and narrowband quasi-periodic noise since the threshold levels readjust themselves to suppress noise that exceeds speech components in each band. Results are reported for tests of the adaptive VOX, and it is noted that improvements can still be made in such areas as the elimination of noise pulses, phoneme reproduction at high-noise levels, and the elimination of distortion introduced by phase delay.
Dragon Stream Cipher for Secure Blackbox Cockpit Voice Recorder
NASA Astrophysics Data System (ADS)
Akmal, Fadira; Michrandi Nasution, Surya; Azmi, Fairuz
2017-11-01
Aircraft blackbox is a device used to record all aircraft information, which consists of Flight Data Recorder (FDR) and Cockpit Voice Recorder (CVR). Cockpit Voice Recorder contains conversations in the aircraft during the flight.Investigations on aircraft crashes usually take a long time, because it is difficult to find the aircraft blackbox. Then blackbox should have the ability to send information to other places. Aircraft blackbox must have a data security system, data security is a very important part at the time of information exchange process. The system in this research is to perform the encryption and decryption process on Cockpit Voice Recorder by people who are entitled by using Dragon Stream Cipher algorithm. The tests performed are time of data encryption and decryption, and avalanche effect. Result in this paper show us time encryption and decryption are 0,85 seconds and 1,84 second for 30 seconds Cockpit Voice Recorder data witn an avalanche effect 48,67 %.
Baker, Janet
2002-01-01
The projected speaking voice and the singing voice are highly sensitive to external and internal influences, and teachers of spoken voice and singing are in a unique position to identify subtle and more serious vocal difficulties in their students. Persistent anomalies may herald early onset of changes in vocal fold structure, neurophysiological control, or emotional stability. Two cases are presented to illustrate the benefits of a collaborative approach to diagnosis and management. The first, a 21-year-old male drama and singing student with an abnormally high speaking voice and falsetto singing voice was found to have a psychogenic dysphonia referred to as "puberphonia" or "mutational falsetto". The second, a 34-year-old female alto with strained phonation and perceived stutter of the vocal folds was diagnosed with "adductor spasmodic dysphonia" or "focal laryngeal dystonia" of neurological origin.
Obligatory and facultative brain regions for voice-identity recognition
Roswandowitz, Claudia; Kappes, Claudia; Obrig, Hellmuth; von Kriegstein, Katharina
2018-01-01
Abstract Recognizing the identity of others by their voice is an important skill for social interactions. To date, it remains controversial which parts of the brain are critical structures for this skill. Based on neuroimaging findings, standard models of person-identity recognition suggest that the right temporal lobe is the hub for voice-identity recognition. Neuropsychological case studies, however, reported selective deficits of voice-identity recognition in patients predominantly with right inferior parietal lobe lesions. Here, our aim was to work towards resolving the discrepancy between neuroimaging studies and neuropsychological case studies to find out which brain structures are critical for voice-identity recognition in humans. We performed a voxel-based lesion-behaviour mapping study in a cohort of patients (n = 58) with unilateral focal brain lesions. The study included a comprehensive behavioural test battery on voice-identity recognition of newly learned (voice-name, voice-face association learning) and familiar voices (famous voice recognition) as well as visual (face-identity recognition) and acoustic control tests (vocal-pitch and vocal-timbre discrimination). The study also comprised clinically established tests (neuropsychological assessment, audiometry) and high-resolution structural brain images. The three key findings were: (i) a strong association between voice-identity recognition performance and right posterior/mid temporal and right inferior parietal lobe lesions; (ii) a selective association between right posterior/mid temporal lobe lesions and voice-identity recognition performance when face-identity recognition performance was factored out; and (iii) an association of right inferior parietal lobe lesions with tasks requiring the association between voices and faces but not voices and names. The results imply that the right posterior/mid temporal lobe is an obligatory structure for voice-identity recognition, while the inferior parietal lobe is only a facultative component of voice-identity recognition in situations where additional face-identity processing is required. PMID:29228111
Obligatory and facultative brain regions for voice-identity recognition.
Roswandowitz, Claudia; Kappes, Claudia; Obrig, Hellmuth; von Kriegstein, Katharina
2018-01-01
Recognizing the identity of others by their voice is an important skill for social interactions. To date, it remains controversial which parts of the brain are critical structures for this skill. Based on neuroimaging findings, standard models of person-identity recognition suggest that the right temporal lobe is the hub for voice-identity recognition. Neuropsychological case studies, however, reported selective deficits of voice-identity recognition in patients predominantly with right inferior parietal lobe lesions. Here, our aim was to work towards resolving the discrepancy between neuroimaging studies and neuropsychological case studies to find out which brain structures are critical for voice-identity recognition in humans. We performed a voxel-based lesion-behaviour mapping study in a cohort of patients (n = 58) with unilateral focal brain lesions. The study included a comprehensive behavioural test battery on voice-identity recognition of newly learned (voice-name, voice-face association learning) and familiar voices (famous voice recognition) as well as visual (face-identity recognition) and acoustic control tests (vocal-pitch and vocal-timbre discrimination). The study also comprised clinically established tests (neuropsychological assessment, audiometry) and high-resolution structural brain images. The three key findings were: (i) a strong association between voice-identity recognition performance and right posterior/mid temporal and right inferior parietal lobe lesions; (ii) a selective association between right posterior/mid temporal lobe lesions and voice-identity recognition performance when face-identity recognition performance was factored out; and (iii) an association of right inferior parietal lobe lesions with tasks requiring the association between voices and faces but not voices and names. The results imply that the right posterior/mid temporal lobe is an obligatory structure for voice-identity recognition, while the inferior parietal lobe is only a facultative component of voice-identity recognition in situations where additional face-identity processing is required. © The Author (2017). Published by Oxford University Press on behalf of the Guarantors of Brain.
Lundeborg, Inger; Hultcrantz, Elisabeth; Ericsson, Elisabeth; McAllister, Anita
2012-07-01
To evaluate outcome of two types of tonsil surgery (tonsillectomy [TE]+adenoidectomy or tonsillotomy [TT]+adenoidectomy) on vocal function perceptually and acoustically. Sixty-seven children, aged 50-65 months, on waiting list for tonsil surgery were randomized to TE (n=33) or TT (n=34). Fifty-seven age- and gender-matched healthy preschool children were controls. Twenty-eight of them, aged 48-59 months, served as control group before surgery, and 29, aged 60-71 months, served as control group after surgery. Before surgery and 6 months postoperatively, the children were recorded producing three sustained vowels (/ɑ/, /u/, and /i/) and 14 words. The control groups were recorded only once. Three trained speech and language pathologists performed the perceptual analysis using visual analog scale for eight voice quality parameters. Acoustic analysis from sustained vowels included average fundamental frequency, jitter percent, shimmer percent, noise-to-harmonic ratio, and the center frequencies of formants 1-3. Before surgery, the children were rated to have more hyponasality and compressed/throaty voice (P<0.05) and lower mean pitch (P<0.01) in comparison to the control group. They also had higher perturbation measures and lower frequencies of the second and third formants. After surgery, there were no differences perceptually. Perturbation measures decreased but were still higher compared with those of control group (P<0.05). Differences in formant frequencies for /i/ and /u/ remained. No differences were found between the two surgical methods. Voice quality is affected perceptually and acoustically by adenotonsillar hypertrophy. After surgery, the voice is perceptually normalized but acoustic differences remain. Outcome was equal for both surgical methods. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
MMIC Phased Array Demonstrations with ACTS
NASA Technical Reports Server (NTRS)
Raquet, Charles A. (Compiler); Martzaklis, Konstantinos (Compiler); Zakrajsek, Robert J. (Compiler); Andro, Monty (Compiler); Turtle, John P.
1996-01-01
Over a one year period from May 1994 to May 1995, a number of demonstrations were conducted by the NASA Lewis Research Center (LeRC) in which voice, data, and/or video links were established via NASA's advanced communications technology satellite (ACTS) between the ACTS link evaluation terminal (LET) in Cleveland, OH, and aeronautical and mobile or fixed Earth terminals having monolithic microwave integrated circuit (MMIC) phased array antenna systems. This paper describes four of these. In one, a duplex voice link between an aeronautical terminal on the LeRC Learjet and the ACTS was achieved. Two others demonstrated duplex voice (and in one case video as well) links between the ACTS and an Army vehicle. The fourth demonstrated a high data rate downlink from ACTS to a fixed terminal. Array antenna systems used in these demonstrations were developed by LeRC and featured LeRC and Air Force experimental arrays using gallium arsenide MMIC devices at each radiating element for electronic beam steering and distributed power amplification. The single 30 GHz transmit array was developed by NASA/LeRC and Texas Instruments. The three 20 GHz receive arrays were developed in a cooperative effort with the Air Force Rome Laboratory, taking advantage of existing Air Force array development contracts with Boeing and Lockheed Martin. The paper describes the four proof-of-concept arrays and the array control system. The system configured for each of the demonstrations is described, and results are discussed.
"Who's in Charge Here?": Teaching Narrative Voice in Frank O'Connor's "My Oedipus Complex."
ERIC Educational Resources Information Center
Wentworth, Michael
2001-01-01
Considers how Frank O'Connor's "My Oedipus Complex" provides a good introduction to the subtleties of narrative voice and control. Concludes by considering the notion of control and its relation to the narrative point of view in O'Connor's story and how it bears directly upon the value of reading literature and the reader's role. (SG)
Inferring speaker attributes in adductor spasmodic dysphonia: ratings from unfamiliar listeners.
Isetti, Derek; Xuereb, Linnea; Eadie, Tanya L
2014-05-01
To determine whether unfamiliar listeners' perceptions of speakers with adductor spasmodic dysphonia (ADSD) differ from control speakers on the parameters of relative age, confidence, tearfulness, and vocal effort and are related to speaker-rated vocal effort or voice-specific quality of life. Twenty speakers with ADSD (including 6 speakers with ADSD plus tremor) and 20 age- and sex-matched controls provided speech recordings, completed a voice-specific quality-of-life instrument (Voice Handicap Index; Jacobson et al., 1997), and rated their own vocal effort. Twenty listeners evaluated speech samples for relative age, confidence, tearfulness, and vocal effort using rating scales. Listeners judged speakers with ADSD as sounding significantly older, less confident, more tearful, and more effortful than control speakers (p < .01). Increased vocal effort was strongly associated with decreased speaker confidence (rs = .88-.89) and sounding more tearful (rs = .83-.85). Self-rated speaker effort was moderately related (rs = .45-.52) to listener impressions. Listeners' perceptions of confidence and tearfulness were also moderately associated with higher Voice Handicap Index scores (rs = .65-.70). Unfamiliar listeners judge speakers with ADSD more negatively than control speakers, with judgments extending beyond typical clinical measures. The results have implications for counseling and understanding the psychosocial effects of ADSD.
D'ALATRI, L.
2014-01-01
SUMMARY This study was carried out to compare the vocal limits obtained by speech range profile (SRP) with those of voice range profile (VRP) in untrained healthy and dysphonic females. Forty-six healthy voice volunteers (control group) and 148 dysphonic patients (dysphonic group) were evaluated using videolaryngostroboscopic assessment and phonetography for voice measurements. For VRP, subjects were asked to sustain the vowel /a/ as soft and as loud possible from the lowest to the highest frequencies using an automated procedure. The SRP was obtained by recording the speaking voice (SV) and the shouting voice (ShV) asking subjects to read a list of sentences aloud and to shout / ehi/ as loud as they could, respectively. All subjects in the control and dysphonic groups were able to perform SRP. fourty of 46 (85%) and 102 of 148 (68.91%) cases, respectively in control and dysphonic groups, were able to perform VRP. Most frequently, the VRP was not recorded because of the inability to perform or, especially in the dysphonic group, for inadequacy of the vocal signal. In the control group, there were no significant differences between the mean values of Fmin, Fmax, Imin and number of semitones (st) of the VRP and those of the SRP (p > 0.05). In the dysphonic group, the mean values of Fmin, Fmax and st SV+ShV for SRP were significantly higher than those of VRP. Our preliminary results suggest that the SRP may be a useful, alternative tool to assess vocal limits in both euphonic and dysphonic females. PMID:25210219
Micro-Based Speech Recognition: Instructional Innovation for Handicapped Learners.
ERIC Educational Resources Information Center
Horn, Carin E.; Scott, Brian L.
A new voice based learning system (VBLS), which allows the handicapped user to interact with a microcomputer by voice commands, is described. Speech or voice recognition is the computerized process of identifying a spoken word or phrase, including those resulting from speech impediments. This new technology is helpful to the severely physically…
Listener Perception of Respiratory-Induced Voice Tremor
ERIC Educational Resources Information Center
Farinella, Kimberly A.; Hixon, Thomas J.; Hoit, Jeannette D.; Story, Brad H.; Jones, Patricia A.
2006-01-01
Purpose: The purpose of this study was to determine the relation of respiratory oscillation to the perception of voice tremor. Method: Forced oscillation of the respiratory system was used to simulate variations in alveolar pressure such as are characteristic of voice tremor of respiratory origin. Five healthy men served as speakers, and 6…
Speech Motor Development during Acquisition of the Voicing Contrast
ERIC Educational Resources Information Center
Grigos, Maria I.; Saxman, John H.; Gordon, Andrew M.
2005-01-01
Lip and jaw movements were studied longitudinally in 19-month-old children as they acquired the voicing contrast for /p/ and /b/. A movement tracking system obtained lip and jaw kinematics as participants produced the target utterances /papa/ and /baba/. Laryngeal adjustments were also tracked through acoustically recorded voice onset time (VOT)…
Wireless infrared communications for space and terrestrial applications
NASA Technical Reports Server (NTRS)
Crimmins, James W.
1993-01-01
Voice and data communications via wireless (and fiberless) optical means has been commonplace for many years. However, continuous advances in optoelectronics and microelectronics have resulted in significant advances in wireless optical communications over the last decade. Wilton has specialized in diffuse infrared voice and data communications since 1979. In 1986, NASA Johnson Space Center invited Wilton to apply its wireless telecommunications and factory floor technology to astronaut voice communications aboard the shuttle. In September, 1988 a special infrared voice communications system flew aboard a 'Discovery' Shuttle mission as a flight experiment. Since then the technology has been further developed, resulting in a general purpose of 2Mbs wireless voice/data LAN which has been tested for a variety of applications including use aboard Spacelab. Funds for Wilton's wireless IR development were provided in part by NASA's Technology Utilization Office and by the NASA Small Business Innovative Research Program. As a consequence, Wilton's commercial product capability has been significantly enhanced to include diffuse infrared wireless LAN's as well as wireless infrared telecommunication systems for voice and data.
Perceptual and Acoustic Analyses of Good Voice Quality in Male Radio Performers.
Warhurst, Samantha; Madill, Catherine; McCabe, Patricia; Ternström, Sten; Yiu, Edwin; Heard, Robert
2017-03-01
Good voice quality is an asset to professional voice users, including radio performers. We examined whether (1) voices could be reliably categorized as good for the radio and (2) these categories could be predicted using acoustic measures. Male radio performers (n = 24) and age-matched male controls performed "The Rainbow Passage" as if presenting on the radio. Voice samples were rated using a three-stage paired-comparison paradigm by 51 naive listeners and perceptual categories were identified (Study 1), and then analyzed for fundamental frequency, long-term average spectrum, cepstral peak prominence, and pause or spoken-phrase duration (Study 2). Study 1: Good inter-judge reliability was found for perceptual judgments of the best 15 voices (good for radio category, 14/15 = radio performers), but agreement on the remaining 33 voices (unranked category) was poor. Study 2: Discriminant function analyses showed that the SD standard deviation of sounded portion duration, equivalent sound level, and smoothed cepstral peak prominence predicted membership of categories with moderate accuracy (R 2 = 0.328). Radio performers are heterogeneous for voice quality; good voice quality was judged reliably in only 14 out of 24 radio performers. Current acoustic analyses detected some of the relevant signal properties that were salient in these judgments. More refined perceptual analysis and the use of other perceptual methods might provide more information on the complex nature of judging good voices. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Defazio, Giovanni; Guerrieri, Marta; Liuzzi, Daniele; Gigante, Angelo Fabio; di Nicola, Vincenzo
2016-03-01
Changes in voice and speech are thought to involve 75-90% of people with PD, but the impact of PD progression on voice/speech parameters is not well defined. In this study, we assessed voice/speech symptoms in 48 parkinsonian patients staging <3 on the modified Hoehn and Yahr scale and 37 healthy subjects using the Robertson dysarthria profile (a clinical-perceptual method exploring all components potentially involved in speech difficulties), the Voice handicap index (a validated measure of the impact of voice symptoms on quality of life) and the speech evaluation parameter contained in the Unified Parkinson's Disease Rating Scale part III (UPDRS-III). Accuracy and metric properties of the Robertson dysarthria profile were also measured. On Robertson dysarthria profile, all parkinsonian patients yielded lower scores than healthy control subjects. Differently, the Voice Handicap Index and the speech evaluation parameter contained in the UPDRS-III could detect speech/voice disturbances in 10 and 75% of PD patients, respectively. Validation procedure in Parkinson's disease patients showed that the Robertson dysarthria profile has acceptable reliability, satisfactory internal consistency and scaling assumptions, lack of floor and ceiling effects, and partial correlations with UPDRS-III and Voice Handicap Index. We concluded that speech/voice disturbances are widely identified by the Robertson dysarthria profile in early parkinsonian patients, even when the disturbances do not carry a significant level of disability. Robertson dysarthria profile may be a valuable tool to detect speech/voice disturbances in Parkinson's disease.
Matching Speaking to Singing Voices and the Influence of Content.
Peynircioğlu, Zehra F; Rabinovitz, Brian E; Repice, Juliana
2017-03-01
We tested whether speaking voices of unfamiliar people could be matched to their singing voices, and, if so, whether the content of the utterances would influence this matching performance. Our hypothesis was that enough acoustic features would remain the same between speaking and singing voices such that their identification as belonging to the same or different individuals would be possible even upon a single hearing. We also hypothesized that the contents of the utterances would influence this identification process such that voices uttering words would be easier to match than those uttering vowels. We used a within-participant design with blocked stimuli that were counterbalanced using a Latin square design. In one block, mode (speaking vs singing) was manipulated while content was held constant; in another block, content (word vs syllable) was manipulated while mode was held constant, and in the control block, both mode and content were held constant. Participants indicated whether the voices in any given pair of utterances belonged to the same person or to different people. Cross-mode matching was above chance level, although mode-congruent performance was better. Further, only speaking voices were easier to match when uttering words. We can identify speaking and singing voices as the same or different even on just a single hearing. However, content interacts with mode such that words benefit matching of speaking voices but not of singing voices. Results are discussed within an attentional framework. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
A new voice rating tool for clinical practice.
Gould, James; Waugh, Jessica; Carding, Paul; Drinnan, Michael
2012-07-01
Perceptual rating of voice quality is a key component in the comprehensive assessment of voice, but there are practical difficulties in making reliable measurements. We have developed the Newcastle Audio Ranking (NeAR) test, a new referential system for the rating of voice parameters. In this article, we present our first results using NeAR. We asked five experts and 11 naive raters to assess 15 male and 15 female voices using the NeAR test. We assessed: validity with respect to the GRBAS scale; interrater reliability; sensitivity to subtle voice differences; and the performance of expert versus naïve raters. There was a uniformly excellent agreement with GRBAS (r=0.87) and interrater agreement (intraclass correlation coefficient=0.86). Considering each GRBAS grade of voice separately, there was still good interrater agreement in NeAR, implying it has good sensitivity to subtle changes. All these results were equally true for expert and naive raters. The NeAR test is a promising new tool in the assessment of voice disorders. Copyright © 2012 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Simon, Doerte; Becker, Michael; Mothes-Lasch, Martin; Miltner, Wolfgang H R; Straube, Thomas
2017-03-01
Angry expressions of both voices and faces represent disorder-relevant stimuli in social anxiety disorder (SAD). Although individuals with SAD show greater amygdala activation to angry faces, previous work has failed to find comparable effects for angry voices. Here, we investigated whether voice sound-intensity, a modulator of a voice's threat-relevance, affects brain responses to angry prosody in SAD. We used event-related functional magnetic resonance imaging to explore brain responses to voices varying in sound intensity and emotional prosody in SAD patients and healthy controls (HCs). Angry and neutral voices were presented either with normal or high sound amplitude, while participants had to decide upon the speaker's gender. Loud vs normal voices induced greater insula activation, and angry vs neutral prosody greater orbitofrontal cortex activation in SAD as compared with HC subjects. Importantly, an interaction of sound intensity, prosody and group was found in the insula and the amygdala. In particular, the amygdala showed greater activation to loud angry voices in SAD as compared with HC subjects. This finding demonstrates a modulating role of voice sound-intensity on amygdalar hyperresponsivity to angry prosody in SAD and suggests that abnormal processing of interpersonal threat signals in amygdala extends beyond facial expressions in SAD. © The Author (2016). Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
Gelfer, Marylou Pausewang; Tice, Ruthanne M
2013-05-01
The present study examined how effectively listeners' perceptions of gender could be changed from male to female for male-to-female (MTF) transgender (TG) clients based on the voice signal alone, immediately after voice therapy and at long-term follow-up. Short- and long-term changes in masculinity and femininity ratings and acoustic measures of speaking fundamental frequency (SFF) and vowel formant frequencies were also investigated. Prospective treatment study. Five MTF TG clients, five control female speakers, and five control male speakers provided a variety of speech samples for later analysis. The TG clients then underwent 8 weeks of voice therapy. Voice samples were collected immediately at the termination of therapy and again 15 months later. Two groups of listeners were recruited to evaluate gender and provide masculinity and femininity ratings. Perceptual results revealed that TG subjects were perceived as female 1.9% of the time in the pretest, 50.8% of the time in the immediate posttest, and 33.1% of the time in the long-term posttest. The TG speakers were also perceived as significantly less masculine and more feminine in the immediate posttest and the long-term posttest compared with the pre-test. Some acoustic measures showed significant differences between the pretest and the immediate posttest and long-term posttest. It appeared that 8 weeks of voice therapy could result in vocal changes in MTF TG individuals that persist at least partially for up to 15 months. However, some TG subjects were more successful with voice feminization than others. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Barillari, Maria Rosaria; Volpe, Umberto; Mirra, Giuseppina; Giugliano, Francesco; Barillari, Umberto
2017-05-01
Phonomicrosurgery is generally considered to be the treatment of choice for removing vocal fold polyps. However, specific techniques of voice therapy may represent, in selected cases and under certain conditions, a noninvasive therapeutic option for the treatment of such laryngeal lesions. The aim of the present study is to longitudinally assess, in terms of clinical outcomes and quality of life, two groups of patients with cordal polyps, treated either with standard surgery plus standard voice therapy or with a specific training of voice therapy alone, which we have called "Voice Therapy Expulsion." This study is a randomized controlled trial. A total of 150 patients with vocal fold polyps were randomly assigned to either standard surgery or "voice therapy expulsion" protocol. The trial was carried out at the Division of Phoniatrics and Audiology of the Second University of Naples and at the Division of Communication Disorders of Local Health Unit (3 Naples South) from January 2010 to December 2013. A thorough phoniatric evaluation, including laryngostroboscopy, acoustic voice analysis, global grade of dysphonia, instability, roughness, breathiness, asthenia, and strain scale, Voice Handicap Index, and Voice-Related Quality of Life, was performed by using standardized tools, at baseline, at the end of the treatment, and up to 1 year after treatment. We found no significant differences between the two experimental groups in terms of clinical outcomes and personal satisfaction. However, "Voice Therapy Expulsion" was associated with higher scores for quality of life at endpoint evaluation. Besides phonosurgery, this specific "Voice Therapy Expulsion" technique should be considered as a valid, noninvasive, and well-tolerated therapeutic option for the treatment of selected patients with vocal fold polyps. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Simon, Doerte; Becker, Michael; Mothes-Lasch, Martin; Miltner, Wolfgang H.R.
2017-01-01
Abstract Angry expressions of both voices and faces represent disorder-relevant stimuli in social anxiety disorder (SAD). Although individuals with SAD show greater amygdala activation to angry faces, previous work has failed to find comparable effects for angry voices. Here, we investigated whether voice sound-intensity, a modulator of a voice’s threat-relevance, affects brain responses to angry prosody in SAD. We used event-related functional magnetic resonance imaging to explore brain responses to voices varying in sound intensity and emotional prosody in SAD patients and healthy controls (HCs). Angry and neutral voices were presented either with normal or high sound amplitude, while participants had to decide upon the speaker’s gender. Loud vs normal voices induced greater insula activation, and angry vs neutral prosody greater orbitofrontal cortex activation in SAD as compared with HC subjects. Importantly, an interaction of sound intensity, prosody and group was found in the insula and the amygdala. In particular, the amygdala showed greater activation to loud angry voices in SAD as compared with HC subjects. This finding demonstrates a modulating role of voice sound-intensity on amygdalar hyperresponsivity to angry prosody in SAD and suggests that abnormal processing of interpersonal threat signals in amygdala extends beyond facial expressions in SAD. PMID:27651541
Control of voice fundamental frequency in speaking versus singing
NASA Astrophysics Data System (ADS)
Natke, Ulrich; Donath, Thomas M.; Kalveram, Karl Th.
2003-03-01
In order to investigate control of voice fundamental frequency (F0) in speaking and singing, 24 adults had to utter the nonsense word ['ta:tatas] repeatedly, while in selected trials their auditory feedback was frequency-shifted by 100 cents downwards. In the speaking condition the target speech rate and prosodic pattern were indicated by a rhythmic sequence made of white noise. In the singing condition the sequence consisted of piano notes, and subjects were instructed to match the pitch of the notes. In both conditions a response in voice F0 begins with a latency of about 150 ms. As predicted, response magnitude is greater in the singing condition (66 cents) than in the speaking condition (47 cents). Furthermore the singing condition seems to prolong the after-effect which is a continuation of the response in trials after the frequency shift. In the singing condition, response magnitude and the ability to match the target F0 correlate significantly. Results support the view that in speaking voice F0 is monitored mainly supra-segmentally and controlled less tightly than in singing.
Control of voice fundamental frequency in speaking versus singing.
Natke, Ulrich; Donath, Thomas M; Kalveram, Karl Th
2003-03-01
In order to investigate control of voice fundamental frequency (F0) in speaking and singing, 24 adults had to utter the nonsense word ['ta:tatas] repeatedly, while in selected trials their auditory feedback was frequency-shifted by 100 cents downwards. In the speaking condition the target speech rate and prosodic pattern were indicated by a rhythmic sequence made of white noise. In the singing condition the sequence consisted of piano notes, and subjects were instructed to match the pitch of the notes. In both conditions a response in voice F0 begins with a latency of about 150 ms. As predicted, response magnitude is greater in the singing condition (66 cents) than in the speaking condition (47 cents). Furthermore the singing condition seems to prolong the after-effect which is a continuation of the response in trials after the frequency shift. In the singing condition, response magnitude and the ability to match the target F0 correlate significantly. Results support the view that in speaking voice F0 is monitored mainly supra-segmentally and controlled less tightly than in singing.
Evaluating effectiveness of dynamic soundfield system in the classroom
da Cruz, Aline Duarte; Alves Silvério, Kelly Cristina; Da Costa, Aline Roberta Aceituno; Moret, Adriane Lima Mortari; Lauris, José Roberto Pereira; de Souza Jacob, Regina Tangerino
2016-01-01
Research has reported on the use of soundfield amplification devices in the classroom. However, no study has used standardized tests to determine the potential advantages of the dynamic soundfield system for normally hearing students and for the teacher's voice. Our aim was to evaluate the impact of using dynamic soundfield system on the noise of the classroom, teacher's voice and students’ academic performance. This was a prospective cohort study in which 20 student participants enrolled in the third year of basic education were divided into two groups (i.e., control and experimental); their teacher participated. The experimental group was exposed to the dynamic soundfield system for 3 consecutive months. The groups were assessed using standardized tests to evaluate their academic performance. Further, questionnaires and statements were collected on the participants’ experience of using the soundfield system. We statistically analyzed the results to compare the academic performance of the control group with that of the experimental group. In all cases, a significance level of P < .05 was adopted. Use of the dynamic soundfield system was effective for improving the students’ academic performance on standardized tests for reading, improving the teacher's speech intelligibility, and reducing the teacher's vocal strain. The dynamic soundfield system minimizes the impact of noise in the classroom as demonstrated by the mensuration of the signal-to-noise ratio (SNR) and pupil performance on standardized tests for reading and student and teacher ratings of amplification system effectiveness. PMID:26780961
Translational Systems Biology and Voice Pathophysiology
Li, Nicole Y. K.; Abbott, Katherine Verdolini; Rosen, Clark; An, Gary; Hebda, Patricia A.; Vodovotz, Yoram
2011-01-01
Objectives/Hypothesis Personalized medicine has been called upon to tailor healthcare to an individual's needs. Evidence-based medicine (EBM) has advocated using randomized clinical trials with large populations to evaluate treatment effects. However, due to large variations across patients, the results are likely not to apply to an individual patient. We suggest that a complementary, systems biology approach using computational modeling may help tackle biological complexity in order to improve ultimate patient care. The purpose of the article is: 1) to review the pros and cons of EBM, and 2) to discuss the alternative systems biology method and present its utility in clinical voice research. Study Design Tutorial Methods Literature review and discussion. Results We propose that translational systems biology can address many of the limitations of EBM pertinent to voice and other health care domains, and thus complement current health research models. In particular, recent work using mathematical modeling suggests that systems biology has the ability to quantify the highly complex biologic processes underlying voice pathophysiology. Recent data support the premise that this approach can be applied specifically in the case of phonotrauma and surgically induced vocal fold trauma, and may have particular power to address personalized medicine. Conclusions We propose that evidence around vocal health and disease be expanded beyond a population-based method to consider more fully issues of complexity and systems interactions, especially in implementing personalized medicine in voice care and beyond. PMID:20025041
Reimer, Bryan; Mehler, Bruce; Reagan, Ian; Kidd, David; Dobres, Jonathan
2016-01-01
Abstract There is limited research on trade-offs in demand between manual and voice interfaces of embedded and portable technologies. Mehler et al. identified differences in driving performance, visual engagement and workload between two contrasting embedded vehicle system designs (Chevrolet MyLink and Volvo Sensus). The current study extends this work by comparing these embedded systems with a smartphone (Samsung Galaxy S4). None of the voice interfaces eliminated visual demand. Relative to placing calls manually, both embedded voice interfaces resulted in less eyes-off-road time than the smartphone. Errors were most frequent when calling contacts using the smartphone. The smartphone and MyLink allowed addresses to be entered using compound voice commands resulting in shorter eyes-off-road time compared with the menu-based Sensus but with many more errors. Driving performance and physiological measures indicated increased demand when performing secondary tasks relative to ‘just driving’, but were not significantly different between the smartphone and embedded systems. Practitioner Summary: The findings show that embedded system and portable device voice interfaces place fewer visual demands on the driver than manual interfaces, but they also underscore how differences in system designs can significantly affect not only the demands placed on drivers, but also the successful completion of tasks. PMID:27110964
Close up view of the Commander's Seat on the Flight ...
Close up view of the Commander's Seat on the Flight Deck of the Orbiter Discovery. Toward the right of the view and in front of te seat is the commander's Rotational Hand Controller. The pilot station has an identical controller. These control the acceleration in the roll pitch and yaw directions via the reaction control system and/or the orbiter maneuvering system while outside of Earth's atmosphere or via the orbiter's aerosurfaces wile in Earth's atmosphere when the atmospheric density permits the surfaces to be effective. There are a number of switches on the controller, most notably a trigger switch which is a push-to-talk switch for voice communication and a large button on top of the controller which is a switch to engage the backup flight system. This view was taken at Kennedy Space Center. - Space Transportation System, Orbiter Discovery (OV-103), Lyndon B. Johnson Space Center, 2101 NASA Parkway, Houston, Harris County, TX
A TDM link with channel coding and digital voice.
NASA Technical Reports Server (NTRS)
Jones, M. W.; Tu, K.; Harton, P. L.
1972-01-01
The features of a TDM (time-division multiplexed) link model are described. A PCM telemetry sequence was coded for error correction and multiplexed with a digitized voice channel. An all-digital implementation of a variable-slope delta modulation algorithm was used to digitize the voice channel. The results of extensive testing are reported. The measured coding gain and the system performance over a Gaussian channel are compared with theoretical predictions and computer simulations. Word intelligibility scores are reported as a measure of voice channel performance.
Haselden, Karen; Powell, Theresa; Drinnan, Mike; Carding, Paul
2009-11-01
Locus of Control (LoC) refers to an individuals' perception of whether they are in control of life events. Health Locus of Control refers to whether someone feels they have influence over their health. Health Locus of Control has not been studied in any depth in voice-disordered patients. The objective of this study was to examine Health Locus of Control in three patient groups: (1) Spasmodic Dysphonia, (2) Functional Dysphonia and (3) a nondysphonic group with Nonlaryngeal Dystonia. LoC was measured and compared in a total of 57 patients using the Multidimensional Health Locus of Control Scales (diagnostic specific) Form C. Internal, Chance, and Powerful others LoC were measured and comparisons were made using one-way analysis of variance. Contrary to expectations Internal LoC was found to be significantly higher in the Functional Dysphonia group when compared to the other two groups. There was no significant difference between the groups in Chance or Powerful others LoC. The two organic groups, Spasmodic Dysphonia and Nonlaryngeal Dystonia, were more alike in Internal Health Locus of Control than the Functional Dysphonia group. The diagnostic nature of the groups was reflected in their LoC scores rather than their voice loss. These results contribute to the debate about the etiology of Spasmodic Dysphonia and will be of interest to those involved in the psychology of voice and those managing voice-disordered patients.
Voice Interactive Analysis System Study. Final Report, August 28, 1978 through March 23, 1979.
ERIC Educational Resources Information Center
Harry, D. P.; And Others
The Voice Interactive Analysis System study continued research and development of the LISTEN real-time, minicomputer based connected speech recognition system, within NAVTRAEQUIPCEN'S program of developing automatic speech technology in support of training. An attempt was made to identify the most effective features detected by the TTI-500 model…
Ha, Mana, Leo (Breath, Spirit, Voice): Kanaka Maoli Empowerment through Literature. Commentaries
ERIC Educational Resources Information Center
Ho'omanawanui, Ku'ualoha
2004-01-01
Until American Calvinist missionaries created a Hawaiian alphabet and writing system after their arrival to the Hawaiian islands in 1819, all Kanaka Maoli literature was oral. By inventing a Hawaiian alphabet, the missionaries were able to teach Kanaka Maoli reading and writing; by the early 1830s the first missionary-controlled printing press was…
Compairing Picture Exchange and Voice Output Communication Aids in Young Children with Autism
ERIC Educational Resources Information Center
Lorah, Elizabeth R.
2012-01-01
The Center for Disease Control estimates that one in 88 births result in a diagnosis of autism (CDC, 2012). Of those individuals diagnosed with autism approximately 25-61% fail to develop vocal output capabilities (Weitxz, Dexter, & Moore, 1997). The use of Augmentative and Alternative Communication (AAC) systems, such as Picture Exchange (PE)…
Construction site Voice Operated Information System (VOIS) test
NASA Astrophysics Data System (ADS)
Lawrence, Debbie J.; Hettchen, William
1991-01-01
The Voice Activated Information System (VAIS), developed by USACERL, allows inspectors to verbally log on-site inspection reports on a hand held tape recorder. The tape is later processed by the VAIS, which enters the information into the system's database and produces a written report. The Voice Operated Information System (VOIS), developed by USACERL and Automated Sciences Group, through a ESACERL cooperative research and development agreement (CRDA), is an improved voice recognition system based on the concepts and function of the VAIS. To determine the applicability of the VOIS to Corps of Engineers construction projects, Technology Transfer Test Bad (T3B) funds were provided to the Corps of Engineers National Security Agency (NSA) Area Office (Fort Meade) to procure and implement the VOIS, and to train personnel in its use. This report summarizes the NSA application of the VOIS to quality assurance inspection of radio frequency shielding and to progress payment logs, and concludes that the VOIS is an easily implemented system that can offer improvements when applied to repetitive inspection procedures. Use of VOIS can save time during inspection, improve documentation storage, and provide flexible retrieval of stored information.
Response time effects of alerting tone and semantic context for synthesized voice cockpit warnings
NASA Technical Reports Server (NTRS)
Simpson, C. A.; Williams, D. H.
1980-01-01
Some handbooks and human factors design guides have recommended that a voice warning should be preceded by a tone to attract attention to the warning. As far as can be determined from a search of the literature, no experimental evidence supporting this exists. A fixed-base simulator flown by airline pilots was used to test the hypothesis that the total 'system-time' to respond to a synthesized voice cockpit warning would be longer when the message was preceded by a tone because the voice itself was expected to perform both the alerting and the information transfer functions. The simulation included realistic ATC radio voice communications, synthesized engine noise, cockpit conversation, and realistic flight routes. The effect of a tone before a voice warning was to lengthen response time; that is, responses were slower with an alerting tone. Lengthening the voice warning with another work, however, did not increase response time.
Zhou, Zhou; Ge, Pingjiang; Liu, Qian; Liu, Ming; Zhang, Wei
2015-08-01
To investigate the applicability of the eysphonia severity index (DSI) in evaluating effects of surgery between before and after groups of vocal polyp patients. Analyses of measurement data pre and pro-surgery of 70 vocal polyp patients and 35 no voice disorders volunteers (control group). The voice quality was measured subjectively with the voice handicap index (VHI), the GRBAS and fiber electronic laryngoscopy. Measures of maximum phonation time (MPT), shimmer and jitter were obtained for each subject by using DiVAS 2.30 (XION, Germany). The DiVAS 2.30 had spotanenously calculate the scores of DSI. Using SPSS 17.0 to find the differences of DSI scores among the three groups by one-way ANOVA variance analysis. And finding out of the correlation with DSI scores and VHI scores, GRBAS, MPT, jitter and shimmer. DSI improved significantly after surgery in the vocal polyps group (mean difference DSI -2.92 and 1.87, respectively) and also in the control group (mean difference DSI -2.92 and 2.30, respectively). However, no significant difference between the control group and the after surgery group. By using Pearson correlation analysis, this study observed a strong correlation between the DSI scores and the VHI scores, the values of GRBAS, shimmer (P < 0.01). DSI is an effective and high accuracy multi-parameter system for evaluation of vocal cord polyp patients as an independent assessment of dysphonia. DSI also can be used in evaluation of the effects of the vocal polyps surgery.
Virtual interface environment workstations
NASA Technical Reports Server (NTRS)
Fisher, S. S.; Wenzel, E. M.; Coler, C.; Mcgreevy, M. W.
1988-01-01
A head-mounted, wide-angle, stereoscopic display system controlled by operator position, voice and gesture has been developed at NASA's Ames Research Center for use as a multipurpose interface environment. This Virtual Interface Environment Workstation (VIEW) system provides a multisensory, interactive display environment in which a user can virtually explore a 360-degree synthesized or remotely sensed environment and can viscerally interact with its components. Primary applications of the system are in telerobotics, management of large-scale integrated information systems, and human factors research. System configuration, research scenarios, and research directions are described.
The Human Voice in Speech and Singing
NASA Astrophysics Data System (ADS)
Lindblom, Björn; Sundberg, Johan
This chapter
The Human Voice in Speech and Singing
NASA Astrophysics Data System (ADS)
Lindblom, Björn; Sundberg, Johan
This chapter describes various aspects of the human voice as a means of communication in speech and singing. From the point of view of function, vocal sounds can be regarded as the end result of a three stage process: (1) the compression of air in the respiratory system, which produces an exhalatory airstream, (2) the vibrating vocal folds' transformation of this air stream to an intermittent or pulsating air stream, which is a complex tone, referred to as the voice source, and (3) the filtering of this complex tone in the vocal tract resonator. The main function of the respiratory system is to generate an overpressure of air under the glottis, or a subglottal pressure. Section 16.1 describes different aspects of the respiratory system of significance to speech and singing, including lung volume ranges, subglottal pressures, and how this pressure is affected by the ever-varying recoil forces. The complex tone generated when the air stream from the lungs passes the vibrating vocal folds can be varied in at least three dimensions: fundamental frequency, amplitude and spectrum. Section 16.2 describes how these properties of the voice source are affected by the subglottal pressure, the length and stiffness of the vocal folds and how firmly the vocal folds are adducted. Section 16.3 gives an account of the vocal tract filter, how its form determines the frequencies of its resonances, and Sect. 16.4 gives an account for how these resonance frequencies or formants shape the vocal sounds by imposing spectrum peaks separated by spectrum valleys, and how the frequencies of these peaks determine vowel and voice qualities. The remaining sections of the chapter describe various aspects of the acoustic signals used for vocal communication in speech and singing. The syllable structure is discussed in Sect. 16.5, the closely related aspects of rhythmicity and timing in speech and singing is described in Sect. 16.6, and pitch and rhythm aspects in Sect. 16.7. The impressive control of all these acoustic characteristics of vocal signals is discussed in Sect. 16.8, while Sect. 16.9 considers expressive aspects of vocal communication.
17 Ways to Say Yes: Toward Nuanced Tone of Voice in AAC and Speech Technology
Pullin, Graham; Hennig, Shannon
2015-01-01
Abstract People with complex communication needs who use speech-generating devices have very little expressive control over their tone of voice. Despite its importance in human interaction, the issue of tone of voice remains all but absent from AAC research and development however. In this paper, we describe three interdisciplinary projects, past, present and future: The critical design collection Six Speaking Chairs has provoked deeper discussion and inspired a social model of tone of voice; the speculative concept Speech Hedge illustrates challenges and opportunities in designing more expressive user interfaces; the pilot project Tonetable could enable participatory research and seed a research network around tone of voice. We speculate that more radical interactions might expand frontiers of AAC and disrupt speech technology as a whole. PMID:25965913
ERP correlates of motivating voices: quality of motivation and time-course matters
Zougkou, Konstantina; Weinstein, Netta
2017-01-01
Abstract Here, we conducted the first study to explore how motivations expressed through speech are processed in real-time. Participants listened to sentences spoken in two types of well-studied motivational tones (autonomy-supportive and controlling), or a neutral tone of voice. To examine this, listeners were presented with sentences that either signaled motivations through prosody (tone of voice) and words simultaneously (e.g. ‘You absolutely have to do it my way’ spoken in a controlling tone of voice), or lacked motivationally biasing words (e.g. ‘Why don’t we meet again tomorrow’ spoken in a motivational tone of voice). Event-related brain potentials (ERPs) in response to motivations conveyed through words and prosody showed that listeners rapidly distinguished between motivations and neutral forms of communication as shown in enhanced P2 amplitudes in response to motivational when compared with neutral speech. This early detection mechanism is argued to help determine the importance of incoming information. Once assessed, motivational language is continuously monitored and thoroughly evaluated. When compared with neutral speech, listening to controlling (but not autonomy-supportive) speech led to enhanced late potential ERP mean amplitudes, suggesting that listeners are particularly attuned to controlling messages. The importance of controlling motivation for listeners is mirrored in effects observed for motivations expressed through prosody only. Here, an early rapid appraisal, as reflected in enhanced P2 amplitudes, is only found for sentences spoken in controlling (but not autonomy-supportive) prosody. Once identified as sounding pressuring, the message seems to be preferentially processed, as shown by enhanced late potential amplitudes in response to controlling prosody. Taken together, results suggest that motivational and neutral language are differentially processed; further, the data suggest that listening to cues signaling pressure and control cannot be ignored and lead to preferential, and more in-depth processing mechanisms. PMID:28525641
ERP correlates of motivating voices: quality of motivation and time-course matters.
Zougkou, Konstantina; Weinstein, Netta; Paulmann, Silke
2017-10-01
Here, we conducted the first study to explore how motivations expressed through speech are processed in real-time. Participants listened to sentences spoken in two types of well-studied motivational tones (autonomy-supportive and controlling), or a neutral tone of voice. To examine this, listeners were presented with sentences that either signaled motivations through prosody (tone of voice) and words simultaneously (e.g. 'You absolutely have to do it my way' spoken in a controlling tone of voice), or lacked motivationally biasing words (e.g. 'Why don't we meet again tomorrow' spoken in a motivational tone of voice). Event-related brain potentials (ERPs) in response to motivations conveyed through words and prosody showed that listeners rapidly distinguished between motivations and neutral forms of communication as shown in enhanced P2 amplitudes in response to motivational when compared with neutral speech. This early detection mechanism is argued to help determine the importance of incoming information. Once assessed, motivational language is continuously monitored and thoroughly evaluated. When compared with neutral speech, listening to controlling (but not autonomy-supportive) speech led to enhanced late potential ERP mean amplitudes, suggesting that listeners are particularly attuned to controlling messages. The importance of controlling motivation for listeners is mirrored in effects observed for motivations expressed through prosody only. Here, an early rapid appraisal, as reflected in enhanced P2 amplitudes, is only found for sentences spoken in controlling (but not autonomy-supportive) prosody. Once identified as sounding pressuring, the message seems to be preferentially processed, as shown by enhanced late potential amplitudes in response to controlling prosody. Taken together, results suggest that motivational and neutral language are differentially processed; further, the data suggest that listening to cues signaling pressure and control cannot be ignored and lead to preferential, and more in-depth processing mechanisms. © The Author (2017). Published by Oxford University Press.
NASA Technical Reports Server (NTRS)
Parker, J. F., Jr.; Duffy, J. W.; Christensen, D. G.
1981-01-01
A Flight Data Console simulation of a digital communication link to replace the current voice communication system used in air traffic control (ATC) was developed. The study determined how a digital communications system reduces cockpit workload, improve, flight proficiency, and is acceptable to general aviation pilots. It is shown that instrument flight, including approach and landing, can be accomplished by using a digital data link system for ATC communication.
Status Report on Speech Research, 1 April-30 June 1981.
1981-01-01
Fredericka Bell-Berti* Donald Hailey Steven Eady Catherine Best* Terry Halwes Jo Estill Gloria J. Borden* Sabina D. Koroluk Laurie B. Feldman Susan...folds in voicing control is whether activity of CT is associated with abduction or adduction. Stevens ’ model of glottal activity suggests that the...aerodynamic model provided for the efficient transfer of energy from the aerodynamic system to the mechanical system ( Stevens , 1977), given the nature of
Listening to Young Children's Voices: The Evaluation of a Coding System
ERIC Educational Resources Information Center
Tertoolen, Anja; Geldens, Jeannette; van Oers, Bert; Popeijus, Herman
2015-01-01
Listening to young children's voices is an issue with increasing relevance for many researchers in the field of early childhood research. At the same time, teachers and researchers are faced with challenges to provide children with possibilities to express their notions, and to find ways of comprehending children's voices. In our research we aim…
ERIC Educational Resources Information Center
Plotnik, Ronit
2013-01-01
Parenthood is a concrete experience that develops while having a psychological existence in its background. It is heard in two voices simultaneously: the overt, concrete one versus the covert, psychological one. It moves between four intersecting axes, which together create the "Parental Voice" model. Axis 1--Parenthood between fantasy…
ERIC Educational Resources Information Center
Matthews, Sharon Elizabeth
2010-01-01
This study investigated the extent to which there were statistically significant relationships between school administrators' systemic implementation of student voice work and student perceptions (i.e. achievement, motivation, attachment and school climate) and PLAN performance. Student voice was defined as students being equal partners in school…
Enhanced Living by Assessing Voice Pathology Using a Co-Occurrence Matrix
Muhammad, Ghulam; Alhamid, Mohammed F.; Hossain, M. Shamim; Almogren, Ahmad S.; Vasilakos, Athanasios V.
2017-01-01
A large number of the population around the world suffers from various disabilities. Disabilities affect not only children but also adults of different professions. Smart technology can assist the disabled population and lead to a comfortable life in an enhanced living environment (ELE). In this paper, we propose an effective voice pathology assessment system that works in a smart home framework. The proposed system takes input from various sensors, and processes the acquired voice signals and electroglottography (EGG) signals. Co-occurrence matrices in different directions and neighborhoods from the spectrograms of these signals were obtained. Several features such as energy, entropy, contrast, and homogeneity from these matrices were calculated and fed into a Gaussian mixture model-based classifier. Experiments were performed with a publicly available database, namely, the Saarbrucken voice database. The results demonstrate the feasibility of the proposed system in light of its high accuracy and speed. The proposed system can be extended to assess other disabilities in an ELE. PMID:28146069
Enhanced Living by Assessing Voice Pathology Using a Co-Occurrence Matrix.
Muhammad, Ghulam; Alhamid, Mohammed F; Hossain, M Shamim; Almogren, Ahmad S; Vasilakos, Athanasios V
2017-01-29
A large number of the population around the world suffers from various disabilities. Disabilities affect not only children but also adults of different professions. Smart technology can assist the disabled population and lead to a comfortable life in an enhanced living environment (ELE). In this paper, we propose an effective voice pathology assessment system that works in a smart home framework. The proposed system takes input from various sensors, and processes the acquired voice signals and electroglottography (EGG) signals. Co-occurrence matrices in different directions and neighborhoods from the spectrograms of these signals were obtained. Several features such as energy, entropy, contrast, and homogeneity from these matrices were calculated and fed into a Gaussian mixture model-based classifier. Experiments were performed with a publicly available database, namely, the Saarbrucken voice database. The results demonstrate the feasibility of the proposed system in light of its high accuracy and speed. The proposed system can be extended to assess other disabilities in an ELE.
NASA Technical Reports Server (NTRS)
1972-01-01
A Tracking and Data Relay Satellite System (TDRSS) concept for service of low and medium data rate user spacecraft has been defined. The TDRS system uses two geosynchronous dual spin satellites compatible with Delta 2914 to provide command, tracking, and telemetry service between multiple low earth orbiting users and a centrally located ground station. The low data rate user service capability via each TDRS is as follows: (1) forward link at UHF: voice to one user, commands to 20 users (sequential), range and range rate service, and (2) return link at VHF: voice from one user, data from 20 users (simultaneous), range and range rate return signals. The medium data rate user service via each TDRS is as follows: (1) forward link at S band: voice or command and tracking signals to one user, and (2) return link at S band: voice, data and tracking signals from one user "order wire" for high priority service requests (implemented with an earth coverage antenna).
Kahler, Christopher W; Lechner, William J; MacGlashan, James; Wray, Tyler B; Littman, Michael L
2017-06-28
Computer-delivered interventions have been shown to be effective in reducing alcohol consumption in heavy drinking college students. However, these computer-delivered interventions rely on mouse, keyboard, or touchscreen responses for interactions between the users and the computer-delivered intervention. The principles of motivational interviewing suggest that in-person interventions may be effective, in part, because they encourage individuals to think through and speak aloud their motivations for changing a health behavior, which current computer-delivered interventions do not allow. The objective of this study was to take the initial steps toward development of a voice-based computer-delivered intervention that can ask open-ended questions and respond appropriately to users' verbal responses, more closely mirroring a human-delivered motivational intervention. We developed (1) a voice-based computer-delivered intervention that was run by a human controller and that allowed participants to speak their responses to scripted prompts delivered by speech generation software and (2) a text-based computer-delivered intervention that relied on the mouse, keyboard, and computer screen for all interactions. We randomized 60 heavy drinking college students to interact with the voice-based computer-delivered intervention and 30 to interact with the text-based computer-delivered intervention and compared their ratings of the systems as well as their motivation to change drinking and their drinking behavior at 1-month follow-up. Participants reported that the voice-based computer-delivered intervention engaged positively with them in the session and delivered content in a manner consistent with motivational interviewing principles. At 1-month follow-up, participants in the voice-based computer-delivered intervention condition reported significant decreases in quantity, frequency, and problems associated with drinking, and increased perceived importance of changing drinking behaviors. In comparison to the text-based computer-delivered intervention condition, those assigned to voice-based computer-delivered intervention reported significantly fewer alcohol-related problems at the 1-month follow-up (incident rate ratio 0.60, 95% CI 0.44-0.83, P=.002). The conditions did not differ significantly on perceived importance of changing drinking or on measures of drinking quantity and frequency of heavy drinking. Results indicate that it is feasible to construct a series of open-ended questions and a bank of responses and follow-up prompts that can be used in a future fully automated voice-based computer-delivered intervention that may mirror more closely human-delivered motivational interventions to reduce drinking. Such efforts will require using advanced speech recognition capabilities and machine-learning approaches to train a program to mirror the decisions made by human controllers in the voice-based computer-delivered intervention used in this study. In addition, future studies should examine enhancements that can increase the perceived warmth and empathy of voice-based computer-delivered intervention, possibly through greater personalization, improvements in the speech generation software, and embodying the computer-delivered intervention in a physical form. ©Christopher W Kahler, William J Lechner, James MacGlashan, Tyler B Wray, Michael L Littman. Originally published in JMIR Mental Health (http://mental.jmir.org), 28.06.2017.
A baseline maritime satellite communication system
NASA Technical Reports Server (NTRS)
Durrani, S. H.; Mcgregor, D. N.
1974-01-01
This paper describes a baseline system for maritime communications via satellite during the 1980s. The system model employs three geostationary satellites with global coverage antennas. Access to the system is controlled by a master station; user access is based on time-ordered polling or random access. Each Thor-Delta launched satellite has an RF power of 100 W (spinner) or 250 W (three-axis stabilized), and provides 10 equivalent duplex voice channels for up to 1500 ships with average waiting times of approximately 2.5 minutes. The satellite capacity is bounded by the available bandwidth to 50 such channels, which can serve up to 10,000 ships with an average waiting time of 5 minutes. The ships must have peak antenna gains of approximately 15.5 dB or 22.5 dB for the two cases (10 or 50 voice channels) when a spinner satellite is used; the required gains are 4 dB lower if a three-axis stabilized satellite is used. The ship antenna requirements can be reduced by 8 to 10 dB by employing a high-gain multi-beam phased array antenna on the satellite.
The Singer's Formant and Speaker's Ring Resonance: A Long-Term Average Spectrum Analysis
Lee, Sang-Hyuk; Kwon, Hee-Jun; Choi, Hyun-Jin; Lee, Nam-Hun; Lee, Sung-Jin
2008-01-01
Objectives We previously showed that a trained tenor's voice has the conventional singer's formant at the region of 3 kHz and another energy peak at 8-9 kHz. Singers in other operatic voice ranges are assumed to have the same peak in their singing and speaking voice. However, to date, no specific measurement of this has been made. Methods Tenors, baritones, sopranos and mezzo sopranos were chosen to participate in this study of the singer's formant and the speaker's ring resonance. Untrained males (n=15) and females (n=15) were included in the control group. Each subject was asked to produce successive /a/ vowel sounds in their singing and speaking voice. For singing, the low pitch was produced in the chest register and the high notes in the head register. We collected the data on the long-term average spectra of the speaking and singing voices of the trained singers and the control groups. Results For the sounds produced from the head register, a significant energy concentration was seen in both 2.2-3.4 kHz and 7.5-8.4 kHz regions (except for the voices of the mezzo sopranos) in the trained singer group when compared to the control groups. Also, the chest register had a significant energy concentration in the 4 trained singer groups at the 2.2-3.1 kHz and 7.8-8.4 kHz. For speaking sound, all trained singers had a significant energy concentration at 2.2-5.3 kHz and sopranos had another energy concentration at 9-10 kHz. Conclusion The results of this study suggest that opera singers have more energy concentration in the singer's formant/speaker's ring region, in both singing and speaking voices. Furthermore, another region of energy concentration was identified in opera singer's singing sound and in sopranos' speaking sound at 8-9 kHz. The authors believe that these energy concentrations may contribute to the rich voice of trained singers. PMID:19434279
Optimized delivery radiological reports: applying Six Sigma methodology to a radiology department.
Cavagna, Enrico; Berletti, Riccardo; Schiavon, Francesco; Scarsi, Barbara; Barbato, Giuseppe
2003-03-01
To optimise the process of reporting and delivering radiological examinations with a view to achieving 100% service delivery within 72 hours to outpatients and 36 hours to inpatients. To this end, we used the Six Sigma method which adopts a systematic approach and rigorous statistical analysis to analyse and improve processes, by reducing variability and minimising errors. More specifically, our study focused on the process of radiological report creation, from the end of the examination to the time when the report is made available to the patient, to examine the bottlenecks and identify the measures to be taken to improve the process. Six Sigma uses a five-step problem-solving process called DMAIC, an acronym for Define, Measure, Analyze, Improve and Control. The first step is to define the problem and the elements crucial to quality, in terms of Total Quality Control. Next, the situation is analysed to identify the root causes of the problem and determine which of these is most influential. The situation is then improved by implementing change. Finally, to make sure that the change is long-lasting, measures are taken to sustain the improvements and obtain long-term control. In our case we analysed all of the phases the report passes through before reaching the user, and studied the impact of voice-recognition reporting on the speed of the report creation process. Analysis of the information collected showed that the tools available for report creation (dictaphone, voice-recognition system) and the transport of films and reports were the two critical elements on which to focus our efforts. Of all the phases making up the process, reporting (from end of examination to end of reporting) and distribution (from the report available to administrative staff to report available to the patient) account for 90% of process variability (73% and 17%, respectively). We further found that the reports dictated into a voice-recognition reporting system are delivered in 45 hours (median), whereas those dictated using a dictaphone take 96 hours: voice-recognition reporting systems therefore improve performance by 50 hours. Unfortunately, 38% of our reports are delivered within longer timeframes than the 72h for outpatients and 36h for inpatients agreed with the service users. Reports for inpatients have much faster delivery times and lower variability, as 95% of these examinations are reported using voice-recognition reporting (as a result of the greater sensitivity of physicians to the problem of inpatient waiting times). For conventional radiology examinations, numerically greater than CT or MRI, there is a stronger tendency to use the dictaphone which allows for faster dictation as it is unburdened by administrative tasks such as entering examination codes, correcting errors, etc. Freelance status has no impact on report delivery times, service delivery being the same as in the institutional setting. The subprocess of reporting is strongly affected by the choice of reporting method (voice-recognition system or dictaphone), whereas report delivery is affected by the individual's behaviour patterns and ultimately by habits generated by the lack of a clearly charted process (lack of synchronisation among the various phases), and therefore potentially avoidable. The analytical study of the various phases of examination reporting, from writing to delivery, allowed us to identify the process bottlenecks and take corrective measures. Regardless of imaging modality and individual physician, examination reporting consistently takes longer when a dictaphone is used instead of a voice-recognition reporting system, as this makes the process more complex. To improve the two critical subprocesses whilst maintaining constant resources, a first step is to abandon the dictaphone in favour of the voice-recognition system. In addition, we are experimenting other measures to improve the collection and sorting of examinations and the delivery of reports: the technical staff take the films from the examination rooms to the reporting rooms three times a day; the radiologists collect their examinations and prepare the reports, possibly on the same day; the radiologists leave their signed reports on the table in the central reporting room; the administrative staff collect the signed reports three times a day in the morning and afternoon to be able to deliver them on the same day. This project has allowed us to become familiar with the principles of total quality, to better understand our internal processes and to take effective measures to optimise them. This has resulted in enhanced satisfaction of all the department staff and has laid the grounds for further measures in the future.
Flow Control and Routing in an Integrated Voice and Data Communication Network
1981-08-01
require continuous and almost real - time delivery; they are very sensitive to delay. Data conversations, on the other hand, are generally intolerant of...packets arrive in time to be delivered to the sink. However, this is not the solution we seek. We have noted that voice conversations require almost real ...by long messages that require continuous real - time delivery; e.g. voice facsimile, video. Class II: characterized by short discrete messages that
Cross-Cultural Adaptation and Validation of the Voice Handicap Index into Thai.
Jaruchinda, Pariyanan; Suwanwarangkool, Thadchai
2015-12-01
The voice handicap index (VHI) is one of the most utilized instruments for measuring a patient's self-assessment of voice severity. The VHI has been translated into several languages, but not in Thai. To examine the psychometric properties of a Thai translation of the voice Handicap Index (VHI) and assess the applicability in the screening diagnosis. After receiving permission from the American Speech Language Hearing Association (ASHA), the original VHI had been translated and adapted to Thai by forward and backward standard translation. Eighty-five patients with voice disorders, divided in four groups according to the etiology of the diseases (neurogenic, structural, functional, and inflammatory), and 30 asymptomatic subjects were included in the present study. Internal consistency was analyzed through Cronbach's a coefficient. For the VHI test-retest reliability analysis, the Thai VHI was completed twice by 22 patients and assessed through the intraclass correlation coefficient. For clinical validity evaluation, the VHI scores from the pathological group were compared with the control group and compared among the four different pathological groups. The cutoff point for distinguishing the normal from the patient group was assessed by ROC analysis. Effects of age and gender on VHI scores were also evaluated. The Thai VHI showed a significant high internal consistency and test-retest reliability (Cronbach's α = 0.96 and r = 0.843, respectively). Mann-Whitney U test was used to compare the control group and pathological groups and revealed significant difference in total scores and its three domains scores (p < 0.001). ROC analysis demonstrated that a VHI score of 13 should be considered the threshold for revealing the impact of quality of life in voice disorder patients. Age and gender were not affect the VHI scores in both control and patient groups. The Thai VHI has high reliability and validity. The Thai version of VHI is considered to be a self-assessment tool for the severity of voice disorders in Thai patients.
47 CFR 90.353 - LMS operations in the 902-928 MHz band.
Code of Federal Regulations, 2012 CFR
2012-10-01
... band. (b) LMS systems are authorized to transmit status and instructional messages, either voice or non-voice, so long as they are related to the location or monitoring functions of the system. (c) LMS... subparts B and C of this part. (d) Multilateration LMS systems will be authorized on a primary basis within...
47 CFR 90.353 - LMS operations in the 902-928 MHz band.
Code of Federal Regulations, 2010 CFR
2010-10-01
... band. (b) LMS systems are authorized to transmit status and instructional messages, either voice or non-voice, so long as they are related to the location or monitoring functions of the system. (c) LMS... subparts B and C of this part. (d) Multilateration LMS systems will be authorized on a primary basis within...
47 CFR 90.353 - LMS operations in the 902-928 MHz band.
Code of Federal Regulations, 2011 CFR
2011-10-01
... band. (b) LMS systems are authorized to transmit status and instructional messages, either voice or non-voice, so long as they are related to the location or monitoring functions of the system. (c) LMS... subparts B and C of this part. (d) Multilateration LMS systems will be authorized on a primary basis within...
47 CFR 90.353 - LMS operations in the 902-928 MHz band.
Code of Federal Regulations, 2014 CFR
2014-10-01
... band. (b) LMS systems are authorized to transmit status and instructional messages, either voice or non-voice, so long as they are related to the location or monitoring functions of the system. (c) LMS... subparts B and C of this part. (d) Multilateration LMS systems will be authorized on a primary basis within...
47 CFR 90.353 - LMS operations in the 902-928 MHz band.
Code of Federal Regulations, 2013 CFR
2013-10-01
... band. (b) LMS systems are authorized to transmit status and instructional messages, either voice or non-voice, so long as they are related to the location or monitoring functions of the system. (c) LMS... subparts B and C of this part. (d) Multilateration LMS systems will be authorized on a primary basis within...
Reliability and validity of the Chinese pediatric voice handicap index.
Liu, Kena; Liu, Shaofeng; Zhou, Zhou; Ren, Qinyi; Zhong, Jie; Luo, Renzhong; Qin, Huabiao; Zhang, Siyi; Ge, Pingjiang
2018-02-01
To evaluate the reliability and validity of the Chinese version of pediatric voice handicap index (pVHI). The original English version-pVHI was translated into Chinese. Parents of 52 children with voice dysphonia and 43 children with no history or symptoms of voice problems were asked to fill the Chinese pVHI questionnaires twice with an interval of 2 weeks. GRB (Grade, Roughness, Breathiness) scale was used for perceptual assessment by two otolaryngologists and one speech pathologist for each child's voice. The internal consistency was assessed using Cronbach's alpha coefficient. Pearson's correlation coefficient was used to evaluate the test-retest reliability. The Kendall's coefficient of concordance W was used to assess the consistency of GRB scores of 3 voice specialists. The nonparametric Mann-Whitney test was used to assess the differences between the dysphonia group and controls. The correlation between pVHI and GRB scores were assessed using Pearson's correlation coefficient. The internal consistency of total score and three subscales scores of Chinese pVHI were 0.788-0.944. The test-retest reliability was 0.631-0.887(P < .001). The pVHI scores of control group significantly were lower than the pathological group (P = .000). The GRB scores of 3 voice specialists have an excellent consistency (W = 0.694-0.807, P = .000). The pVHI scores positively correlated with GRB assessment (P < .01). The Chinese version of pVHI had a good reliability and validity. It can be applicable and useful supplementary tool for evaluating parents' perception of their children's dysphonia. Copyright © 2017. Published by Elsevier B.V.
Sullivan, Jessica R.; Assmann, Peter F.; Hossain, Shaikat; Schafer, Erin C.
2017-01-01
Two experiments explored the role of differences in voice gender in the recognition of speech masked by a competing talker in cochlear implant simulations. Experiment 1 confirmed that listeners with normal hearing receive little benefit from differences in voice gender between a target and masker sentence in four- and eight-channel simulations, consistent with previous findings that cochlear implants deliver an impoverished representation of the cues for voice gender. However, gender differences led to small but significant improvements in word recognition with 16 and 32 channels. Experiment 2 assessed the benefits of perceptual training on the use of voice gender cues in an eight-channel simulation. Listeners were assigned to one of four groups: (1) word recognition training with target and masker differing in gender; (2) word recognition training with same-gender target and masker; (3) gender recognition training; or (4) control with no training. Significant improvements in word recognition were observed from pre- to post-test sessions for all three training groups compared to the control group. These improvements were maintained at the late session (one week following the last training session) for all three groups. There was an overall improvement in masked word recognition performance provided by gender mismatch following training, but the amount of benefit did not differ as a function of the type of training. The training effects observed here are consistent with a form of rapid perceptual learning that contributes to the segregation of competing voices but does not specifically enhance the benefits provided by voice gender cues. PMID:28372046
Bolbol, Sarah A; Zalat, Marwa M; Hammam, Rehab A M; Elnakeb, Nasser L
2017-03-01
Even though many studies have explored the problem of voice disorders among teachers worldwide, this problem is still not adequately studied in Egypt. The following study was conducted to investigate the risk factors of voice disorders among an Egyptian sample of school teachers, to measure the effect of a vocal hygiene awareness program on them, and to investigate their vocal cord lesions. One hundred fifty-six teachers working in public schools and 180 administrative workers in the Faculty of Medicine in the same city participated in this study. They completed a self-administered questionnaire investigating voice disorders, and were subjected to a voice awareness program and a clinical examination. Voice-related symptoms and Voice Handicap Index were statistically significantly higher among teachers compared with the control subjects. Work duration and high frequency of classes per week of ≥15 were the most statistically significant indicators influencing a teacher's voice. Three months after application of vocal hygiene awareness program, the teachers who were studied showed a statistically significant increase in their awareness about vocal hygiene tips. Egyptian teachers working in public schools are dealing with classes that include a great number of students per class. They also have to deal with unprofessional facilities and limited assisting resources. Therefore, they are highly exposed to the risk of voice-related disorders. Increasing awareness about healthy behavior with the voice in their occupations will help in improving their quality of work and in minimizing any permanent impairments and/or disability. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
ATC/pilot voice communications : a survey of the literature.
DOT National Transportation Integrated Search
1993-11-01
The first radio-equipped control tower in the United States opened at the Cleveland Municipal Airport in 1930. From that time to the present, voice radio communications have played a primary role in air safety. Verbal communications in air traffic co...
Implementation of the MSAT network in Canada
NASA Astrophysics Data System (ADS)
Roscoe, Orest S.
1991-10-01
MSAT will be launched into geostationary orbit at 106.5 degrees west longitude. It will have an aggregate EIRP in excess of 57 dBw, making it the highest powered mobile satellite launched or planned to date. The MSAT ground segment will comprise hundreds of thousands of mobile terminals, as well as numerous feederlink earth stations. This requires a large scale network control system which is unprecedented in a satellite environment. The network control system must support circuit-switched voice and data services and packet-switched data services. In addition to assignment of capacity on demand, the control system must be able to support a variety of service features required by customers, as well as record information for billing. It must operate in a multi-beam, multi-satellite environment as the system grows.
A 4.8 kbps code-excited linear predictive coder
NASA Technical Reports Server (NTRS)
Tremain, Thomas E.; Campbell, Joseph P., Jr.; Welch, Vanoy C.
1988-01-01
A secure voice system STU-3 capable of providing end-to-end secure voice communications (1984) was developed. The terminal for the new system will be built around the standard LPC-10 voice processor algorithm. The performance of the present STU-3 processor is considered to be good, its response to nonspeech sounds such as whistles, coughs and impulse-like noises may not be completely acceptable. Speech in noisy environments also causes problems with the LPC-10 voice algorithm. In addition, there is always a demand for something better. It is hoped that LPC-10's 2.4 kbps voice performance will be complemented with a very high quality speech coder operating at a higher data rate. This new coder is one of a number of candidate algorithms being considered for an upgraded version of the STU-3 in late 1989. The problems of designing a code-excited linear predictive (CELP) coder to provide very high quality speech at a 4.8 kbps data rate that can be implemented on today's hardware are considered.
The speech range profile (SRP): an easy and useful tool to assess vocal limits.
D'Alatri, L; Marchese, M R
2014-08-01
This study was carried out to compare the vocal limits obtained by speech range profile (SRP) with those of voice range profile (VRP) in untrained healthy and dysphonic females. Forty-six healthy voice volunteers (control group) and 148 dysphonic patients (dysphonic group) were evaluated using videolaryngostroboscopic assessment and phonetography for voice measurements. For VRP, subjects were asked to sustain the vowel /a/ as soft and as loud possible from the lowest to the highest frequencies using an automated procedure. The SRP was obtained by recording the speaking voice (SV) and the shouting voice (ShV) asking subjects to read a list of sentences aloud and to shout / ehi/ as loud as they could, respectively. All subjects in the control and dysphonic groups were able to perform SRP. fourty of 46 (85%) and 102 of 148 (68.91%) cases, respectively in control and dysphonic groups, were able to perform VRP. Most frequently, the VRP was not recorded because of the inability to perform or, especially in the dysphonic group, for inadequacy of the vocal signal. In the control group, there were no significant differences between the mean values of Fmin, Fmax, Imin and number of semitones (st) of the VRP and those of the SRP (p > 0.05). In the dysphonic group, the mean values of Fmin, Fmax and st SV+ShV for SRP were significantly higher than those of VRP. Our preliminary results suggest that the SRP may be a useful, alternative tool to assess vocal limits in both euphonic and dysphonic females.
Tamplin, Jeanette; Brazzale, Danny J; Pretto, Jeffrey J; Ruehland, Warren R; Buttifant, Mary; Brown, Douglas J; Berlowitz, David J
2011-02-01
To explore how respiratory impairment after cervical spinal cord injury affects vocal function, and to explore muscle recruitment strategies used during vocal tasks after quadriplegia. It was hypothesized that to achieve the increased respiratory support required for singing and loud speech, people with quadriplegia use different patterns of muscle recruitment and control strategies compared with control subjects without spinal cord injury. Matched, parallel-group design. Large university-affiliated public hospital. Consenting participants with motor-complete C5-7 quadriplegia (n=6) and able-bodied age-matched controls (n=6) were assessed on physiologic and voice measures during vocal tasks. Not applicable. Standard respiratory function testing, surface electromyographic activity from accessory respiratory muscles, sound pressure levels during vocal tasks, the Voice Handicap Index, and the Perceptual Voice Profile. The group with quadriplegia had a reduced lung capacity (vital capacity, 71% vs 102% of predicted; P=.028), more perceived voice problems (Voice Handicap Index score, 22.5 vs 6.5; P=.046), and greater recruitment of accessory respiratory muscles during both loud and soft volumes (P=.028) than the able-bodied controls. The group with quadriplegia also demonstrated higher accessory muscle activation in changing from soft to loud speech (P=.028). People with quadriplegia have impaired vocal ability and use different muscle recruitment strategies during speech than the able-bodied. These findings will enable us to target specific measurements of respiratory physiology for assessing functional improvements in response to formal therapeutic singing training. Copyright © 2011 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Reproducibility of Automated Voice Range Profiles, a Systematic Literature Review.
Printz, Trine; Rosenberg, Tine; Godballe, Christian; Dyrvig, Anne-Kirstine; Grøntved, Ågot Møller
2018-05-01
Reliable voice range profiles are of great importance when measuring effects and side effects from surgery affecting voice capacity. Automated recording systems are increasingly used, but the reproducibility of results is uncertain. Our objective was to identify and review the existing literature on test-retest accuracy of the automated voice range profile assessment. Systematic review. PubMed, Scopus, Cochrane Library, ComDisDome, Embase, and CINAHL (EBSCO). We conducted a systematic literature search of six databases from 1983 to 2016. The following keywords were used: phonetogram, voice range profile, and acoustic voice analysis. Inclusion criteria were automated recording procedure, healthy voices, and no intervention between test and retest. Test-retest values concerning fundamental frequency and voice intensity were reviewed. Of 483 abstracts, 231 full-text articles were read, resulting in six articles included in the final results. The studies found high reliability, but data are few and heterogeneous. The reviewed articles generally reported high reliability of the voice range profile, and thus clinical usefulness, but uncertainty remains because of low sample sizes and different procedures for selecting, collecting, and analyzing data. More data are needed, and clinical conclusions must be drawn with caution. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Facial biases on vocal perception and memory.
Boltz, Marilyn G
2017-06-01
Does a speaker's face influence the way their voice is heard and later remembered? This question was addressed through two experiments where in each, participants listened to middle-aged voices accompanied by faces that were either age-appropriate, younger or older than the voice or, as a control, no face at all. In Experiment 1, participants evaluated each voice on various acoustical dimensions and speaker characteristics. The results showed that facial displays influenced perception such that the same voice was heard differently depending on the age of the accompanying face. Experiment 2 further revealed that facial displays led to memory distortions that were age-congruent in nature. These findings illustrate that faces can activate certain social categories and preconceived stereotypes that then influence vocal and person perception in a corresponding fashion. Processes of face/voice integration are very similar to those of music/film, indicating that the two areas can mutually inform one another and perhaps, more generally, reflect a centralized mechanism of cross-sensory integration. Copyright © 2017 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Macken-Horarik, Mary; Morgan, Wendy
2011-01-01
This paper considers the development of voicing in the writing of secondary English students influenced by post-structuralist approaches to literature. It investigates students' growing capacity not only to voice their own responses to literature but also to relate these to a range of theoretical discourses. Drawing on systemic functional…
NASA Astrophysics Data System (ADS)
Modegi, Toshio
Using our previously developed audio to MIDI code converter tool “Auto-F”, from given vocal acoustic signals we can create MIDI data, which enable to playback the voice-like signals with a standard MIDI synthesizer. Applying this tool, we are constructing a MIDI database, which consists of previously converted simple harmonic structured MIDI codes from a set of 71 Japanese male and female syllable recorded signals. And we are developing a novel voice synthesizing system based on harmonically synthesizing musical sounds, which can generate MIDI data and playback voice signals with a MIDI synthesizer by giving Japanese plain (kana) texts, referring to the syllable MIDI code database. In this paper, we propose an improved MIDI converter tool, which can produce temporally higher-resolution MIDI codes. Then we propose an algorithm separating a set of 20 consonant and vowel phoneme MIDI codes from 71 syllable MIDI converted codes in order to construct a voice synthesizing system. And, we present the evaluation results of voice synthesizing quality between these separated phoneme MIDI codes and their original syllable MIDI codes by our developed 4-syllable word listening tests.
Martins, Regina Helena Garcia; Tavares, Elaine Lara Mendes; Pessin, Adriana Bueno Benito
2017-05-01
Reinke's edema is a benign lesion of the vocal folds that affects chronic smokers, especially women. The voice becomes hoarse and virilized, and the treatment is microsurgery. However, even after surgery and smoking cessation, many patients remain with a deep and hoarse voice. The aim of the present study was to compare pre- and postoperative acoustic and perceptual-auditory vocal analyses of women with Reinke's edema and of women in the control group, who were non-smokers. A total of 20 women with videolaryngoscopy diagnosis of Reinke's edema who underwent laryngeal microsurgery were evaluated pre- and postoperatively (6 months) by videolaryngoscopy, acoustic voice, and perceptual-auditory analyses (General degree of dysphonia, Roughness, Breathiness, Asthenia, Strain, and Instability [GRBASI] scale), and the maximum phonation times were calculated. The pre- and postoperative parameters of the women with Reinke's edema were compared with those of the control group of women with no laryngeal lesions, smoking habit, or vocal symptoms. Acoustic vocal perceptual-auditory analyses and the maximum phonation time of women with Reinke's edema improved significantly in the postoperative evaluations; nevertheless, 6 months after surgery, their voices became worse than the voices of the women from the control group. Abnormalities caused by smoking in Reinke's edema in women are not fully reversible with surgery and smoking cessation. One explanation would be the presence of possible structural alterations in fibroblasts caused by the toxicity of cigarette components, resulting in the uncontrolled production of fibrous matrix in the lamina propria, and preventing complete vocal recovery. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Doarn, Charles R; Zacharias, Stephanie; Keck, Casey Stewart; Tabangin, Meredith; DeAlarcon, Alessandro; Kelchner, Lisa
2018-06-05
This article describes the design and implementation of a web-based portal developed to provide supported home practice between weekly voice therapy sessions delivered through telehealth to children with voice disorders. This in-between care consisted of supported home practice that was remotely monitored by speech-language pathologists (SLPs). A web-based voice therapy portal (VTP) was developed as a platform so participants could complete voice therapy home practice by an interdisciplinary team of SLPs (specialized in pediatric voice therapy), telehealth specialists, biomedical informaticians, and interface designers. The VTP was subsequently field tested in a group of children with voice disorders, participating in a larger telehealth study. Building the VTP for supported home practice for pediatric voice therapy was challenging, but successful. Key interactive features of the final site included 11 vocal hygiene questions, traditional voice therapy exercises grouped into levels, audio/visual voice therapy demonstrations, a store-and-retrieval system for voice samples, message/chat function, written guidelines for weekly therapy exercises, and questionnaires for parents to complete after each therapy session. Ten participants (9-14 years of age) diagnosed with a voice disorder were enrolled for eight weekly telehealth voice therapy sessions with follow-up in-between care provided using the VTP. The development and implementation of the VTP as a novel platform for the delivery of voice therapy home practice sessions were effective. We found that a versatile individual, who can work with all project staff (speak the language of both SLPs and information technologists), is essential to the development process. Once the website was established, participants and SLPs effectively utilized the web-based VTP. They found it feasible and useful for needed in-between care and reinforcement of therapeutic exercises.
Yanagida, Saori; Nishizawa, Noriko; Mizoguchi, Kenji; Hatakeyama, Hiromitsu; Fukuda, Satoshi
2015-07-01
Voice onset time (VOT) for word-initial voiceless consonants in adductor spasmodic dysphonia (ADSD) and abductor spasmodic dysphonia (ABSD) patients were measured to determine (1) which acoustic measures differed from the controls and (2) whether acoustic measures were related to the pause or silence between the test word and the preceding word. Forty-eight patients with ADSD and nine patients with ABSD, as well as 20 matched normal controls read a story in which the word "taiyo" (the sun) was repeated three times, each differentiated by the position of the word in the sentence. The target of measurement was the VOT for the word-initial voiceless consonant /t/. When the target syllable appeared in a sentence following a comma, or at the beginning of a sentence following a period, the ABSD patients' VOTs were significantly longer than those of the ADSD patients and controls. Abnormal prolongation of the VOTs was related to the pause or silence between the test word and the preceding word. VOTs in spasmodic dysphonia (SD) may vary according to the SD subtype or speaking conditions. VOT measurement was suggested to be a useful method for quantifying voice symptoms in SD. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Federal Register 2010, 2011, 2012, 2013, 2014
2011-11-23
... Appraiser Roster regulations by replacing the obsolete references to the Credit Alert Interactive Voice Response System (CAIVRS) with references to its successor, the online-based Credit Alert Verification... propose the elimination references to the Credit Alert Interactive Voice Response System (CAIVRS). On July...
Federal Register 2010, 2011, 2012, 2013, 2014
2011-07-14
... the FHA Appraiser Roster by replacing the obsolete references to the Credit Alert Interactive Voice Response System with references to its successor, the online-based Credit Alert Verification Reporting...'s Limited Denial of Participation list, or in HUD's Credit Alert Interactive Voice Response System...
Miller, Nick; Nath, Uma; Noble, Emma; Burn, David
2017-06-01
To determine if perceptual speech measures distinguish people with Parkinson's disease (PD), multiple system atrophy with predominant parkinsonism (MSA-P) and progressive supranuclear palsy (PSP). Speech-language therapists blind to patient characteristics employed clinical rating scales to evaluate speech/voice in 24 people with clinically diagnosed PD, 17 with PSP and 9 with MSA-P, matched for disease duration (mean 4.9 years, standard deviation 2.2). No consistent intergroup differences appeared on specific speech/voice variables. People with PD were significantly less impaired on overall speech/voice severity. Analyses by severity suggested further investigation around laryngeal, resonance and fluency changes may characterize individual groups. MSA-P and PSP compared with PD were distinguished by severity of speech/voice deterioration, but individual speech/voice parameters failed to consistently differentiate groups.
ERIC Educational Resources Information Center
Mahoney, Diane Feeney; Tarlow, Barbara J.; Jones, Richard N.
2003-01-01
Purpose: We determine the main outcome effects of a 12-month computer-mediated automated interactive voice response (IVR) intervention designed to assist family caregivers managing persons with disruptive behaviors related to Alzheimer's disease (AD). Design and Methods: We conducted a randomized controlled study of 100 caregivers, 51 in the usual…
Satellite voice broadcase system study. Volume 1: Executive summary
NASA Technical Reports Server (NTRS)
Horstein, M.
1985-01-01
The feasibility of providing Voice of America (VOA) broadcasts by satellite relay was investigated. Satellite voice broadcast systems are described for three different frequency bands: HF, FHV, and L-band. Geostationary satellite configurations are considered for both frequency bands. A system of subsynchronous, circular satellites with an orbit period of 8 hours was developed for the HF band. The VHF broadcasts are provided by a system of Molniya satellites. The satellite designs are limited in size and weight to the capability of the STS/Centaur launch vehicle combination. At L-band, only four geostationary satellites are needed to meet the requirements of the complete broadcast schedule. These satellites are comparable in size and weight to current satellites designed for the direct broadcast of video program material.
Micro-video display with ocular tracking and interactive voice control
NASA Technical Reports Server (NTRS)
Miller, James E.
1993-01-01
In certain space-restricted environments, many of the benefits resulting from computer technology have been foregone because of the size, weight, inconvenience, and lack of mobility associated with existing computer interface devices. Accordingly, an effort to develop a highly miniaturized and 'wearable' computer display and control interface device, referred to as the Sensory Integrated Data Interface (SIDI), is underway. The system incorporates a micro-video display that provides data display and ocular tracking on a lightweight headset. Software commands are implemented by conjunctive eye movement and voice commands of the operator. In this initial prototyping effort, various 'off-the-shelf' components have been integrated into a desktop computer and with a customized menu-tree software application to demonstrate feasibility and conceptual capabilities. When fully developed as a customized system, the interface device will allow mobile, 'hand-free' operation of portable computer equipment. It will thus allow integration of information technology applications into those restrictive environments, both military and industrial, that have not yet taken advantage of the computer revolution. This effort is Phase 1 of Small Business Innovative Research (SBIR) Topic number N90-331 sponsored by the Naval Undersea Warfare Center Division, Newport. The prime contractor is Foster-Miller, Inc. of Waltham, MA.
Human neuroethology of emotion.
Ploog, D
1989-01-01
1. Based on ethological theory, the question of what is the difference between human and nonhuman primate emotionality is investigated. 2. The anatomical basis for this difference is the greater number of neurons in the anterior thalamic nuclei in humans than in monkeys and apes. This may represent an increased differentiation of the limbic message being sent to the cortex. 3. Only humans can report about experiences and subjective feelings in certain motivational states. The two most general states are wakefulness and sleep. The subjective aspect of (desynchronized) sleep is dreaming. The causal relationship between dreaming and certain lower brain stem mechanisms is analysed. 4. Whereas the motor system is usually blocked during desynchronized sleep, there are individuals who voice their emotions and speak while sleeping. As there are essential differences in the substrates for the voluntary control of the voice in the human and nonhuman primates there are essential differences in the voluntary control of emotions. 5. Similar to the motor matching theory of speech perception a motor matching process of affect perception is suggested. 6. The evolutionary change in the human motivational system is thought to be one of several prerequisites for the evolution of language.
Alternative Voice Switching and Control System Display Panel Format Simulation and Evaluation.
1984-02-01
hear DA statusF Dvoice FW1 3 voice FUTTER , push IA, DA statue DA Status DA Ole OFF d . ot e tI .1F S A-i 5, %P*-.*~. *.S**~ S a V * C. a6 4c Cu v V V c...UU 0 C C .4. 4c c I.. lbC (W. ki a A-2o -. ... -. - J6 ’... Ii I1N INDIMIIi A0tlS tAil.|: INIIIAIt, Hilli. IrumINAT.vs At..WlUNi’|L.. I. v’enecimon...ALTERNATIVE 2 RADIO SCREEN Displayed below is a frequency pair and its legend 127.100TM 317.7 HRM .. tI I11 I If-> M-main I It S-standby I II These appear
Laryngoscopic, acoustic, perceptual, and functional assessment of voice in rock singers.
Guzman, Marco; Barros, Macarena; Espinoza, Fernanda; Herrera, Alejandro; Parra, Daniela; Muñoz, Daniel; Lloyd, Adam
2013-01-01
The present study aimed to vocally assess a group of rock singers who use growl voice and reinforced falsetto. A group of 21 rock singers and a control group of 18 pop singers were included. Singing and speaking voice was assessed through acoustic, perceptual, functional and laryngoscopic analysis. No significant differences were observed between groups in most of the analyses. Acoustic and perceptual analysis of the experimental group demonstrated normality of speaking voice. Endoscopic evaluation showed that most rock singers presented during singing voice a high vertical laryngeal position, pharyngeal compression and laryngeal supraglottic compression. Supraglottic activity during speaking voice tasks was also observed. However, overall vocal fold integrity was demonstrated in most of the participants. Slightly abnormal observations were demonstrated in few of them. Singing voice handicap index revealed that the most affected variable was the physical sphere, followed by the social and emotional spheres. Although growl voice and reinforced falsetto represent laryngeal and pharyngeal hyperfunctional activity, they did not seem to contribute to the presence of any major vocal fold disorder in our subjects. Nevertheless, we cannot rule out the possibility that more evident vocal fold disorders could be found in singers who use these techniques more often and during a longer period of time.
Doyle-Thomas, Krissy A.R.; Goldberg, Jeremy; Szatmari, Peter; Hall, Geoffrey B.C.
2013-01-01
Despite successful performance on some audiovisual emotion tasks, hypoactivity has been observed in frontal and temporal integration cortices in individuals with autism spectrum disorders (ASD). Little is understood about the neurofunctional network underlying this ability in individuals with ASD. Research suggests that there may be processing biases in individuals with ASD, based on their ability to obtain meaningful information from the face and/or the voice. This functional magnetic resonance imaging study examined brain activity in teens with ASD (n = 18) and typically developing controls (n = 16) during audiovisual and unimodal emotion processing. Teens with ASD had a significantly lower accuracy when matching an emotional face to an emotion label. However, no differences in accuracy were observed between groups when matching an emotional voice or face-voice pair to an emotion label. In both groups brain activity during audiovisual emotion matching differed significantly from activity during unimodal emotion matching. Between-group analyses of audiovisual processing revealed significantly greater activation in teens with ASD in a parietofrontal network believed to be implicated in attention, goal-directed behaviors, and semantic processing. In contrast, controls showed greater activity in frontal and temporal association cortices during this task. These results suggest that in the absence of engaging integrative emotional networks during audiovisual emotion matching, teens with ASD may have recruited the parietofrontal network as an alternate compensatory system. PMID:23750139
a Study of Multiplexing Schemes for Voice and Data.
NASA Astrophysics Data System (ADS)
Sriram, Kotikalapudi
Voice traffic variations are characterized by on/off transitions of voice calls, and talkspurt/silence transitions of speakers in conversations. A speaker is known to be in silence for more than half the time during a telephone conversation. In this dissertation, we study some schemes which exploit speaker silences for an efficient utilization of the transmission capacity in integrated voice/data multiplexing and in digital speech interpolation. We study two voice/data multiplexing schemes. In each scheme, any time slots momentarily unutilized by the voice traffic are made available to data. In the first scheme, the multiplexer does not use speech activity detectors (SAD), and hence the voice traffic variations are due to call on/off only. In the second scheme, the multiplexer detects speaker silences using SAD and transmits voice only during talkspurts. The multiplexer with SAD performs digital speech interpolation (DSI) as well as dynamic channel allocation to voice and data. The performance of the two schemes is evaluated using discrete-time modeling and analysis. The data delay performance for the case of English speech is compared with that for the case of Japanese speech. A closed form expression for the mean data message delay is derived for the single-channel single-talker case. In a DSI system, occasional speech losses occur whenever the number of speakers in simultaneous talkspurt exceeds the number of TDM voice channels. In a buffered DSI system, speech loss is further reduced at the cost of delay. We propose a novel fixed-delay buffered DSI scheme. In this scheme, speech fill-in/hangover is not required because there are no variable delays. Hence, all silences that naturally occur in speech are fully utilized. Consequently, a substantial improvement in the DSI performance is made possible. The scheme is modeled and analyzed in discrete -time. Its performance is evaluated in terms of the probability of speech clipping, packet rejection ratio, DSI advantage, and the delay.
47 CFR 22.1007 - Channels for offshore radiotelephone systems.
Code of Federal Regulations, 2012 CFR
2012-10-01
... emergency auto alarm and voice transmission pertaining to emergency conditions only. Central Subscriber 488... fixed, surface and/or airborne mobile) as indicated, for emergency auto alarm and voice transmission...
47 CFR 22.1007 - Channels for offshore radiotelephone systems.
Code of Federal Regulations, 2013 CFR
2013-10-01
... emergency auto alarm and voice transmission pertaining to emergency conditions only. Central Subscriber 488... fixed, surface and/or airborne mobile) as indicated, for emergency auto alarm and voice transmission...
47 CFR 22.1007 - Channels for offshore radiotelephone systems.
Code of Federal Regulations, 2014 CFR
2014-10-01
... emergency auto alarm and voice transmission pertaining to emergency conditions only. Central Subscriber 488... fixed, surface and/or airborne mobile) as indicated, for emergency auto alarm and voice transmission...
Voice Recognition Vocabulary Lists for the Army’s TACFIRE System.
1983-01-01
reasons for considering the impl-men-ation of vcice control to TACFIR3. Threshold Tnc. was contarted and the researchers were told that there was nothing...thra section on the lactical Fire Control Function. The next section will establish the v3cabalary for the message associated with thS Non -nuclear Fire...Professor Department of Operations Research E. F. Roland Rolands and Associates Reviewed by: Released by: K. T. Mrshl l, ermn Willia M. Tolles Department of
1994-05-01
services developed for transmission using Data Link technology . Candidate designs for five ATC services and functions for the new Initial Sector Suite System...clarify suspected confusions to avoid further congestion. 1.2.2 Data Link Communications . Data Link is a digital communications technology which is being... developed as a supplement to traditional voice radio for two-way, air-ground ATC communications and other applications. As shown in figure 1, Data Link
[Mechanism of neoglottic adjustment for voice variation in tracheoesophageal speech].
Fujimoto, T; Kinishi, M; Mohri, M; Amatsu, M
1994-06-01
Over the past 17 years, we have been performing tracheoesophageal (TE) fistulization for voice restoration following total laryngectomy. The purpose of this technique is to divert the exhaled air through the TE fistula into the hypopharynx where the inferior constrictor muscle forms the retropharyngeal prominence on which the neoglottis is located. It is generally accepted that both pulmonary power and laryngeal adjustment control voice frequency and intensity change in laryngeal phonation. Regularity at various pitches and voice intensities was seen in TE phonation, despite laryngeal adjustment being lost. Regular voice production with various pitches and intensities requires a regulatory mechanism for both pulmonary power and the neoglottis. This study was designed to clarify the mechanism of neoglottic adjustment in TE phonation. Ten speakers with TE fistula were subjected to aerodynamic and electrophysiological investigations. Tracheal pressure, fundamental frequency, intensity, and airflow rate were measured for easy phonation, a high-pitched voice, and a loud voice. Resistance and efficiency of the neoglottis were calculated from the data obtained. Electromyograms of the inferior constrictor muscle and tracheal pressure were simultaneously recorded when the pitch or intensity of the voice increased. Six of the ten subjects examined were able to produce a high-pitched voice. Tracheal pressure increased in all six, the airflow rate in four, and neoglottal resistance in five, as compared with the data obtained during easy phonation. Nine of the ten subjects examined were able to produce a loud voice. In all nine, both tracheal pressure and the airflow rate increased as compared with the values measured during easy phonation. Neoglottal resistance had no definite pattern in relation to voice intensity changes. Electrophysiological study demonstrated that the activity of the inferior constrictor muscle increased as tracheal pressure increased so as to raise the pitch or increase the intensity of the voice. These results indicate that the adjustment of neoglottic closure and stiffness produced by the inferior constrictor muscle has the role of varying the frequency or intensity of the voice.
A system for analysis and classification of voice communications
NASA Technical Reports Server (NTRS)
Older, H. J.; Jenney, L. L.; Garland, L.
1973-01-01
A method for analysis and classification of verbal communications typically associated with manned space missions or simulations was developed. The study was carried out in two phases. Phase 1 was devoted to identification of crew tasks and activities which require voice communication for accomplishment or reporting. Phase 2 entailed development of a message classification system and a preliminary test of its feasibility. The classification system permits voice communications to be analyzed to three progressively more specific levels of detail and to be described in terms of message content, purpose, and the participants in the information exchange. A coding technique was devised to allow messages to be recorded by an eight-digit number.
Assessment of voice, speech and communication changes associated with cervical spinal cord injury.
Johansson, Kerstin; Seiger, Åke; Forsén, Malin; Holmgren Nilsson, Jeanette; Hartelius, Lena; Schalling, Ellika
2018-02-24
Respiratory muscle impairment following cervical spinal cord injury (CSCI) may lead to reduced voice function, although the individual variation is large. Voice problems in this population may not always receive attention since individuals with CSCI face other, more acute and life-threatening issues that need/receive attention. Currently there is no consensus on the tasks suitable to identify the specific voice impairments and functional voice changes experienced by individuals with CSCI. To examine which voice/speech tasks identify the specific voice and communication changes associated with CSCI, habitual and maximum speech performance of a group with CSCI was compared with that of a healthy control group (CG), and the findings were related to respiratory function and to self-reported voice problems. Respiratory, aerodynamic, acoustic and self-reported voice data from 19 individuals (nine women and 10 men, aged 23-59 years, heights = 153-192 cm) with CSCI (levels C3-C7) were compared with data from a CG consisting of 19 carefully matched non-injured people (nine women and 10 men, aged 19-59 years, heights = 152-187 cm). Despite considerable variability of performance, highly significant differences between the group with CSCI and the CG were found in maximum phonation time, maximum duration of breath phrases, maximum sound pressure level and maximum voice area in voice-range profiles (all p = .000). Subglottal pressure was lower and phonatory stability was reduced in some of the individuals with CSCI, but differences between the groups were not statistically significant. Six of 19 had voice handicap index (VHI) scores above 20 (the cut-off for voice disorder). Individuals with a vital capacity below 50% of the expected for an equivalent reference individual performed significantly worse than participants with more normal vital capacity. Completeness and level of injury seemed to impact vocal function in some individuals. A combination of maximum performance speech tasks, respiratory tasks and self-reported information on voice problems help to identify individuals with reduced voice function following CSCI. Early identification of individuals with voice changes post-CSCI, and introducing appropriate rehabilitation strategies, may help to minimize development of maladaptive voice behaviours such as vocal strain, which can lead to further impairments and limitations to communication participation. © 2018 Royal College of Speech and Language Therapists.
Correlational Analysis of Speech Intelligibility Tests and Metrics for Speech Transmission
2017-12-04
frequency scale (male voice; normal voice effort) ............................... 4 Fig. 2 Diagram of a speech communication system (Letowski...languages. Consonants contain mostly high frequency (above 1500 Hz) speech energy, but this energy is relatively small in comparison to that of the whole...voices (Letowski et al. 1993). Since the mid- frequency spectral region contains mostly vowel energy while consonants are high frequency sounds, an
Changes in brain activity following intensive voice treatment in children with cerebral palsy.
Bakhtiari, Reyhaneh; Cummine, Jacqueline; Reed, Alesha; Fox, Cynthia M; Chouinard, Brea; Cribben, Ivor; Boliek, Carol A
2017-09-01
Eight children (3 females; 8-16 years) with motor speech disorders secondary to cerebral palsy underwent 4 weeks of an intensive neuroplasticity-principled voice treatment protocol, LSVT LOUD ® , followed by a structured 12-week maintenance program. Children were asked to overtly produce phonation (ah) at conversational loudness, cued-phonation at perceived twice-conversational loudness, a series of single words, and a prosodic imitation task while being scanned using fMRI, immediately pre- and post-treatment and 12 weeks following a maintenance program. Eight age- and sex-matched controls were scanned at each of the same three time points. Based on the speech and language literature, 16 bilateral regions of interest were selected a priori to detect potential neural changes following treatment. Reduced neural activity in the motor areas (decreased motor system effort) before and immediately after treatment, and increased activity in the anterior cingulate gyrus after treatment (increased contribution of decision making processes) were observed in the group with cerebral palsy compared to the control group. Using graphical models, post-treatment changes in connectivity were observed between the left supramarginal gyrus and the right supramarginal gyrus and the left precentral gyrus for the children with cerebral palsy, suggesting LSVT LOUD enhanced contributions of the feedback system in the speech production network instead of high reliance on feedforward control system and the somatosensory target map for regulating vocal effort. Network pruning indicates greater processing efficiency and the recruitment of the auditory and somatosensory feedback control systems following intensive treatment. Hum Brain Mapp 38:4413-4429, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Al-Nasheri, Ahmed; Muhammad, Ghulam; Alsulaiman, Mansour; Ali, Zulfiqar; Mesallam, Tamer A; Farahat, Mohamed; Malki, Khalid H; Bencherif, Mohamed A
2017-01-01
Automatic voice-pathology detection and classification systems may help clinicians to detect the existence of any voice pathologies and the type of pathology from which patients suffer in the early stages. The main aim of this paper is to investigate Multidimensional Voice Program (MDVP) parameters to automatically detect and classify the voice pathologies in multiple databases, and then to find out which parameters performed well in these two processes. Samples of the sustained vowel /a/ of normal and pathological voices were extracted from three different databases, which have three voice pathologies in common. The selected databases in this study represent three distinct languages: (1) the Arabic voice pathology database; (2) the Massachusetts Eye and Ear Infirmary database (English database); and (3) the Saarbruecken Voice Database (German database). A computerized speech lab program was used to extract MDVP parameters as features, and an acoustical analysis was performed. The Fisher discrimination ratio was applied to rank the parameters. A t test was performed to highlight any significant differences in the means of the normal and pathological samples. The experimental results demonstrate a clear difference in the performance of the MDVP parameters using these databases. The highly ranked parameters also differed from one database to another. The best accuracies were obtained by using the three highest ranked MDVP parameters arranged according to the Fisher discrimination ratio: these accuracies were 99.68%, 88.21%, and 72.53% for the Saarbruecken Voice Database, the Massachusetts Eye and Ear Infirmary database, and the Arabic voice pathology database, respectively. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Effect of hydration treatments on laryngeal nodules and polyps and related voice measures.
Verdolini-Marston, K; Sandage, M; Titze, I R
1994-03-01
In this study, a double-blind, placebo-controlled approach was used in assessing the effectiveness of hydration treatments in the clinical management of selected voice disorders. Six adult female patients with laryngeal nodules or polyps each received 5 consecutive days of hydration treatment and 5 consecutive days of placebo/control treatment. The combined results indicated improvements in voice and in laryngeal appearance following both placebo/control and hydration treatments as compared with baseline. However, the greatest improvements were obtained following the hydration treatment. Although caution about generalization of the effects to the typical clinical situation is emphasized, the study provides preliminary evidence of a therapeutic benefit from hydration treatments in patients with nodules or polyps. Based on previous theoretical work, hydration effects may be related to reductions in the viscosity of vocal fold tissue, although other explanations are also possible.
A hybrid voice/data modulation for the VHF aeronautical channels
NASA Technical Reports Server (NTRS)
Akos, Dennis M.
1993-01-01
A method of improving the spectral efficiency of the existing Very High Frequency (VHF) Amplitude Modulation (AM) voice communication channels is proposed. The technique is to phase modulate the existing voice amplitude modulated carrier with digital data. This allows the transmission of digital information over an existing AM voice channel with no change to the existing AM signal format. There is no modification to the existing AM receiver to demodulate the voice signal and an additional receiver module can be added for processing of the digital data. The existing VHF AM transmitter requires only a slight modification for the addition of the digital data signal. The past work in the area is summarized and presented together with an improved system design and the proposed implementation.
D'haeseleer, Evelien; Claeys, Sofie; Bettens, Kim; Leemans, Laura; Van Calster, Ann-Sophie; Van Damme, Nina; Thijs, Zoë; Daelman, Julie; Leyns, Clara; Van Lierde, Kristiane
2017-07-01
The purpose of this study was to measure the objective and subjective vocal quality in women aged between 60 and 75 years. Secondly, the impact of a teaching or singing career on the vocal quality was investigated by comparing the vocal quality of retired women with different careers. This is a case-control study. Seventy-three retired women between 60 and 75 years (mean age: 67 years, standard deviation: 4.49) participated in the study and were divided into three groups: women with a teaching career (n = 21), choir singers with a singing career (n = 12), and women with a non-vocal career (n = 40). All subjects underwent the same assessment protocol consisting of objective (aerodynamic, maximum performance, vocal range, acoustic measurements, and the Dysphonia Severity Index) and subjective (the Voice Handicap Index, auditory-perceptual evaluations by three listeners) voice measurements. In all three groups, objective and perceptual voice analysis showed a mild dysphonia. No differences in the Dysphonia Severity Index were found between the three groups. The voices of choir singers with a singing career were perceived significantly less rough than voices of the women with a non-vocal career. Additionally, the lowest frequency of the frequency range was significantly lower in the retired teachers and choir singers than in the controls. The results of this study prudently suggest that a singing or a teaching career compared with a non-vocal career has a positive impact on the vocal frequency range, and that singing has a positive impact on the perceptual vocal quality of the older female voice. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Özcebe, Esra; Aydinli, Fatma Esen; Tiğrak, Tuğçe Karahan; İncebay, Önal; Yilmaz, Taner
2018-01-11
The main purpose of this study was to culturally adapt the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) to Turkish and to evaluate its internal consistency, validity, and reliability. The Turkish version of CAPE-V was developed, and with the use of a prospective case-control design, the voice recordings of 130 participants were collected according to CAPE-V protocol. Auditory-perceptual evaluation was conducted according to CAPE-V and Grade, Roughness, Breathiness, Asthenia, and Strain (GRBAS) scale by two ear, nose, and throat specialists and two speech and language therapists. The different types of voice disorders, classified as organic and functional disorders, were compared in terms of their CAPE-V scores. The overall severity parameter had the highest intrarater and inter-reliability values for all the participants. For all four raters, the differences in the six CAPE-V parameters between the study and the control groups were found to be statistically significant. Among the correlations for the comparable parameters of the CAPE-V and the GRBAS scales, the highest correlation was found between the overall severity-grade parameters. There was no difference found between the organic and functional voice disorders in terms of the CAPE-V scores. The Turkish version of CAPE-V has been proven to be a reliable and valid instrument to use in the auditory-perceptual evaluation of voice. For the future application of this study, it would be important to investigate whether cepstral measures correlate with the auditory-perceptual judgments of dysphonia severity collected by a Turkish version of the CAPE-V. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Voice Habits and Behaviors: Voice Care Among Flamenco Singers.
Garzón García, Marina; Muñoz López, Juana; Y Mendoza Lara, Elvira
2017-03-01
The purpose of this study is to analyze the vocal behavior of flamenco singers, as compared with classical music singers, to establish a differential vocal profile of voice habits and behaviors in flamenco music. Bibliographic review was conducted, and the Singer's Vocal Habits Questionnaire, an experimental tool designed by the authors to gather data regarding hygiene behavior, drinking and smoking habits, type of practice, voice care, and symptomatology perceived in both the singing and the speaking voice, was administered. We interviewed 94 singers, divided into two groups: the flamenco experimental group (FEG, n = 48) and the classical control group (CCG, n = 46). Frequency analysis, a Likert scale, and discriminant and exploratory factor analysis were used to obtain a differential profile for each group. The FEG scored higher than the CCG in speaking voice symptomatology. The FEG scored significantly higher than the CCG in use of "inadequate vocal technique" when singing. Regarding voice habits, the FEG scored higher in "lack of practice and warm-up" and "environmental habits." A total of 92.6% of the subjects classified themselves correctly in each group. The Singer's Vocal Habits Questionnaire has proven effective in differentiating flamenco and classical singers. Flamenco singers are exposed to numerous vocal risk factors that make them more prone to vocal fatigue, mucosa dehydration, phonotrauma, and muscle stiffness than classical singers. Further research is needed in voice training in flamenco music, as a means to strengthen the voice and enable it to meet the requirements of this musical genre. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
F0 Characteristics of Newsreaders on Varied Emotional Texts in Tamil Language.
Gunasekaran, Nishanthi; Boominathan, Prakash; Seethapathy, Jayashree
2017-12-26
The objective of this study was to profile speaking F 0 and its variations in newsreaders on varied emotional texts. This study has a prospective, case-control study design. Fifteen professional newsreaders and 15 non-newsreaders were the participants. The participants read the news bulletin that conveyed different emotions (shock, neutral, happy, and sad) in a habitual and "newsreading" voice. Speaking fundamental frequency (SFF) and F 0 variations were extracted from 1620 tokens using Praat software (version 5.2.32) on the opening lines, headlines, news stories, and closing lines of each news item. Paired t test, independent t test, and Friedman test were used for statistical analysis. Both male and female newsreaders had significantly (P ≤ 0.05) higher SFFs and standard deviations (SDs) of SFF in newsreading voice than speaking voice. Female non-newsreaders demonstrated significantly higher SFF and SD of SFF in newsreading voice, whereas no significant differences were noticed in the frequency parameters for male non-newsreaders. No significant difference was noted in the frequency parameters of speaking and newsreading voice between male newsreaders and male non-newsreaders. A significant difference in the SD of SFF was noticed between female newsreaders and female non-newsreaders in newsreading voice. Female newsreaders had a higher frequency range in both speaking voice and newsreading voice when compared with non-newsreaders. F 0 characteristics and frequency range determine the amount of frequency changes exercised by newsreaders while reading bulletins. This information is highly pedagogic for training voices in this profession. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Sequoyah Foreign Language Translation System - Business Case Analysis
2007-12-01
Interactive Natural Dialogue System (S-MINDS)..................................................................20 j. Voice Response Translator ( VRT ...20 Figure 8. U.S. Marine Military Policeman Demonstrating VRT (From: Ref. U.S...www.languagerealm.com/Files/usmc_mt_test_2004.pdf. 21 j. Voice Response Translator ( VRT ) The VRT is a S2S human language translation device that uses
Concept and implementation of the Globalstar mobile satellite system
NASA Technical Reports Server (NTRS)
Schindall, Joel
1995-01-01
Globalstar is a satellite-based mobile communications system which provides quality wireless communications (voice and/or data) anywhere in the world except the polar regions. The Globalstar system concept is based upon technological advancements in Low Earth Orbit (LEO) satellite technology and in cellular telephone technology, including the commercial application of Code Division Multiple Access (CDMA) technologies. The Globalstar system uses elements of CDMA and Frequency Division Multiple Access (FDMA), combined with satellite Multiple Beam Antenna (MBA) technology and advanced variable-rate vocoder technology to arrive at one of the most efficient modulation and multiple access systems ever proposed for a satellite communications system. The technology used in Globalstar includes the following techniques in obtaining high spectral efficiency and affordable cost per channel: (1) CDMA modulation with efficient power control; (2) high efficiency vocoder with voice activity factor; (3) spot beam antenna for increased gain and frequency reuse; (4) weighted satellite antenna gain for broad geographic coverage; (5) multisatellite user links (diversity) to enhance communications reliability; and (6) soft hand-off between beams and satellites. Initial launch is scheduled in 1997 and the system is scheduled to be operational in 1998. The Globalstar system utilizes frequencies in L-, S- and C-bands which have the potential to offer worldwide availability with authorization by the appropriate regulatory agencies.
Distress, omnipotence, and responsibility beliefs in command hallucinations.
Ellett, Lyn; Luzon, Olga; Birchwood, Max; Abbas, Zarina; Harris, Abi; Chadwick, Paul
2017-09-01
Command hallucinations are considered to be one of the most distressing and disturbing symptoms of schizophrenia. Building on earlier studies, we compare key attributes in the symptomatic, affective, and cognitive profiles of people diagnosed with schizophrenia and hearing voices that do (n = 77) or do not (n = 74) give commands. The study employed a cross-sectional design, in which we assessed voice severity, distress and control (PSYRATs), anxiety and depression (HADS), beliefs about voices (BAVQ-R), and responsibility beliefs (RIQ). Clinical and demographic variables were also collected. Command hallucinations were found to be more distressing and controlling, perceived as more omnipotent and malevolent, linked to higher anxiety and depression, and resisted more than hallucinations without commands. Commanding voices were also associated with higher conviction ratings for being personally responsible for preventing harm. The findings suggest key differences in the affective and cognitive profiles of people who hear commanding voices, which have important implications for theory and psychological interventions. Command hallucinations are associated with higher distress, malevolence, and omnipotence. Command hallucinations are associated with higher responsibility beliefs for preventing harm. Responsibility beliefs are associated with voice-related distress. Future psychological interventions for command hallucinations might benefit from focussing not only on omnipotence, but also on responsibility beliefs, as is done in psychological therapies for obsessive compulsive disorder. Limitations The cross-sectional design does not assess issues of causality. We did not measure the presence or severity of delusions. © 2017 The British Psychological Society.
NASA Astrophysics Data System (ADS)
Xia, Weiwei; Shen, Lianfeng
We propose two vertical handoff schemes for cellular network and wireless local area network (WLAN) integration: integrated service-based handoff (ISH) and integrated service-based handoff with queue capabilities (ISHQ). Compared with existing handoff schemes in integrated cellular/WLAN networks, the proposed schemes consider a more comprehensive set of system characteristics such as different features of voice and data services, dynamic information about the admitted calls, user mobility and vertical handoffs in two directions. The code division multiple access (CDMA) cellular network and IEEE 802.11e WLAN are taken into account in the proposed schemes. We model the integrated networks by using multi-dimensional Markov chains and the major performance measures are derived for voice and data services. The important system parameters such as thresholds to prioritize handoff voice calls and queue sizes are optimized. Numerical results demonstrate that the proposed ISHQ scheme can maximize the utilization of overall bandwidth resources with the best quality of service (QoS) provisioning for voice and data services.