Supporting Graduate Student Writers with VoiceThread
ERIC Educational Resources Information Center
Gonzalez, Michelle; Moore, Noreen S.
2018-01-01
This qualitative case study examined the influence of the use of VoiceThread technology on the feedback process for thesis writing in two online asynchronous graduate courses. The influence on instructor feedback process and graduate student writers' perceptions of the use of VoiceThread were the foci of the study. Master's-level students (n = 18)…
What does voice-processing technology support today?
Nakatsu, R; Suzuki, Y
1995-01-01
This paper describes the state of the art in applications of voice-processing technologies. In the first part, technologies concerning the implementation of speech recognition and synthesis algorithms are described. Hardware technologies such as microprocessors and DSPs (digital signal processors) are discussed. The software development environment, a key technology in developing application software ranging from DSP software to support software, is also described. In the second part, the state of the art of algorithms from the standpoint of applications is discussed. Several issues concerning evaluation of speech recognition/synthesis algorithms are covered, as well as issues concerning the robustness of algorithms in adverse conditions. PMID:7479720
New Directions in Space Operations Services in Support of Interplanetary Exploration
NASA Technical Reports Server (NTRS)
Bradford, Robert N.
2005-01-01
To gain access to the necessary operational processes and data in support of NASA's Lunar/Mars Exploration Initiative, new services, adequate levels of computing cycles and access to myriad forms of data must be provided to onboard spacecraft and ground based personnel/systems (earth, lunar and Martian) to enable interplanetary exploration by humans. These systems, cycles and access to vast amounts of development, test and operational data will be required to provide a new level of services not currently available to existing spacecraft, on board crews and other operational personnel. Although current voice, video and data systems in support of current space based operations have been adequate, new highly reliable and autonomous processes and services will be necessary for future space exploration activities. These services will range from the more mundane voice in LEO to voice in interplanetary travel, which because of the high latencies will require new voice processes and standards. New services, like component failure predictions based on data mining of significant quantities of data, located at disparate locations, will be required. 3D or holographic representation of onboard components, systems or family members will greatly improve maintenance, operations and service restoration, not to mention crew morale. Current operational systems and standards, like the Internet Protocol, will not be able to provide the level of service required end to end from an end point on the Martian surface, like a scientific instrument, to a researcher at a university. Ground operations, whether earth, lunar or Martian, and in-flight operations to the moon and especially to Mars will require significant autonomy that will require access to highly reliable processing capabilities and data storage based on network storage technologies. Significant processing cycles will be needed onboard but could be borrowed from other locations either ground based or onboard other spacecraft.
Reliability will be a key factor with onboard and distributed backup processing an absolutely necessary requirement. Current cluster processing/Grid technologies may provide the basis for providing these services. An overview of existing services, future services that will be required and the technologies and standards required to be developed will be presented. The purpose of this paper will be to initiate a technological roadmap, albeit at a high level, of current voice, video, data and network technologies and standards (which show promise for adaptation or evolution) to what technologies and standards need to be redefined, adjusted or areas where new ones require development. The roadmap should begin the differentiation between non-manned and manned processes/services where applicable. The paper will be based in part on the activities of the CCSDS Monitor and Control working group, which is beginning the process of standardization of these processes. Another element of the paper will be based on an analysis of current technologies supporting space flight processes and services at JSC, MSFC, GSFC and to a lesser extent at KSC. Work being accomplished in areas such as Grid computing, data mining and network storage at ARC, IBM and the University of Alabama at Huntsville will be researched and analyzed.
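The latency concern raised in this abstract can be made concrete with a rough light-time calculation. The sketch below is illustrative only and is not taken from the paper: the Earth-Mars distances are approximate round figures assumed for the example, and the function name is hypothetical.

```python
# Rough one-way signal delay between Earth and Mars.
# The distances are approximate, assumed figures (not values from the paper).
SPEED_OF_LIGHT_KM_S = 299_792.458


def one_way_delay_minutes(distance_km: float) -> float:
    """Return the one-way light-time in minutes for a given distance in km."""
    return distance_km / SPEED_OF_LIGHT_KM_S / 60.0


APPROX_DISTANCES_KM = {
    "closest approach (approx.)": 55e6,
    "farthest separation (approx.)": 400e6,
}

for label, km in APPROX_DISTANCES_KM.items():
    print(f"{label}: {one_way_delay_minutes(km):.1f} min one way")
```

Under these assumed distances the one-way delay is on the order of minutes, which rules out conversational turn-taking and is consistent with the abstract's call for new voice processes and standards.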
ERIC Educational Resources Information Center
Jones, Monty; Dexter, Sara
2018-01-01
This mixed-methods study examined the technology integration learning activities of four teachers throughout one year using weekly quantitative surveys and a series of three qualitative individual interviews. Through the teachers' own voices an illustration of their learning processes is presented, and the gap between what is supported by their…
Technology-assisted psychoanalysis.
Scharff, Jill Savege
2013-06-01
Teleanalysis, that is, remote psychoanalysis by telephone, voice over internet protocol (VoIP), or videoteleconference (VTC), has been thought of as a distortion of the frame that cannot support authentic analytic process. Yet it can augment continuity, permit optimum frequency of analytic sessions for in-depth analytic work, and enable outreach to analysands in areas far from specialized psychoanalytic centers. Theoretical arguments against teleanalysis are presented and countered and its advantages and disadvantages discussed. Vignettes of analytic process from teleanalytic sessions are presented, and indications, contraindications, and ethical concerns are addressed. The aim is to provide material from which to judge the authenticity of analytic process supported by technology.
Advanced Electronic Technology
1977-11-15
Electronics 15 III. Materials Research 15 IV. Microelectronics 16 V. Surface-Wave Technology 16 DATA SYSTEMS DIVISION 2 INTRODUCTION This...Processing Digital Voice Processing Packet Speech Wideband Integrated Voice/Data Technology Radar Signal Processing Technology Nuclear Safety Designs...facilities make it possible to track the status of these jobs, retrieve their job control language listings, and direct a copy of printed or punched
Network Speech Systems Technology Program
NASA Astrophysics Data System (ADS)
Weinstein, C. J.
1980-09-01
This report documents work performed during FY 1980 on the DCA-sponsored Network Speech Systems Technology Program. The areas of work reported are: (1) communication systems studies in Demand-Assignment Multiple Access (DAMA), voice/data integration, and adaptive routing, in support of the evolving Defense Communications System (DCS) and Defense Switched Network (DSN); (2) a satellite/terrestrial integration design study including the functional design of voice and data interfaces to interconnect terrestrial and satellite network subsystems; and (3) voice-conferencing efforts dealing with support of the Secure Voice and Graphics Conferencing (SVGC) Test and Evaluation Program. Progress in definition and planning of experiments for the Experimental Integrated Switched Network (EISN) is detailed separately in an FY 80 Experiment Plan Supplement.
Seelbach, C
1995-01-01
The Colloquium on Human-Machine Communication by Voice highlighted the global technical community's focus on the problems and promise of voice-processing technology, particularly, speech recognition and speech synthesis. Clearly, there are many areas in both the research and development of these technologies that can be advanced significantly. However, it is also true that there are many applications of these technologies that are capable of commercialization now. Early successful commercialization of new technology is vital to ensure continuing interest in its development. This paper addresses efforts to commercialize speech technologies in two markets: telecommunications and aids for the handicapped. PMID:7479814
Voice loops as coordination aids in space shuttle mission control.
Patterson, E S; Watts-Perotti, J; Woods, D D
1999-01-01
Voice loops, an auditory groupware technology, are essential coordination support tools for experienced practitioners in domains such as air traffic management, aircraft carrier operations and space shuttle mission control. They support synchronous communication on multiple channels among groups of people who are spatially distributed. In this paper, we suggest reasons for why the voice loop system is a successful medium for supporting coordination in space shuttle mission control based on over 130 hours of direct observation. Voice loops allow practitioners to listen in on relevant communications without disrupting their own activities or the activities of others. In addition, the voice loop system is structured around the mission control organization, and therefore directly supports the demands of the domain. By understanding how voice loops meet the particular demands of the mission control environment, insight can be gained for the design of groupware tools to support cooperative activity in other event-driven domains.
Understanding the mechanisms of familiar voice-identity recognition in the human brain.
Maguinness, Corrina; Roswandowitz, Claudia; von Kriegstein, Katharina
2018-03-31
Humans have a remarkable skill for voice-identity recognition: most of us can remember many voices that surround us as 'unique'. In this review, we explore the computational and neural mechanisms which may support our ability to represent and recognise a unique voice-identity. We examine the functional architecture of voice-sensitive regions in the superior temporal gyrus/sulcus, and bring together findings on how these regions may interact with each other, and additional face-sensitive regions, to support voice-identity processing. We also contrast findings from studies on neurotypicals and clinical populations which have examined the processing of familiar and unfamiliar voices. Taken together, the findings suggest that representations of familiar and unfamiliar voices might dissociate in the human brain. Such an observation does not fit well with current models for voice-identity processing, which by-and-large assume a common sequential analysis of the incoming voice signal, regardless of voice familiarity. We provide a revised audio-visual integrative model of voice-identity processing which brings together traditional and prototype models of identity processing. This revised model includes a mechanism of how voice-identity representations are established and provides a novel framework for understanding and examining the potential differences in familiar and unfamiliar voice processing in the human brain. Copyright © 2018 Elsevier Ltd. All rights reserved.
Voice Over Internet Protocol (VoIP) in a Control Center Environment
NASA Technical Reports Server (NTRS)
Pirani, Joseph; Calvelage, Steven
2010-01-01
The technology of transmitting voice over data networks has been available for over 10 years. Mass market VoIP services for consumers to make and receive standard telephone calls over broadband Internet networks have grown in the last 5 years. While operational costs are less with VoIP implementations as opposed to time division multiplexing (TDM) based voice switches, is it still advantageous to convert a mission control center's voice system to this newer technology? Marshall Space Flight Center (MSFC) Huntsville Operations Support Center (HOSC) has converted its mission voice services to a commercial product that utilizes VoIP technology. Results from this testing, design, and installation have shown unique considerations that must be addressed before user operations. There are many factors to consider for a control center voice design. Technology advantages and disadvantages were investigated as they relate to cost. There were integration concerns which could lead to complex failure scenarios but simpler integration for the mission infrastructure. MSFC HOSC will benefit from this voice conversion with less product replacement cost, less operations cost and a more integrated mission services environment.
Wolfe, Jace; Morais, Mila; Schafer, Erin; Agrawal, Smita; Koch, Dawn
2015-05-01
Cochlear implant recipients often experience difficulty with understanding speech in the presence of noise. Cochlear implant manufacturers have developed sound processing algorithms designed to improve speech recognition in noise, and research has shown these technologies to be effective. Remote microphone technology utilizing adaptive, digital wireless radio transmission has also been shown to provide significant improvement in speech recognition in noise. There are no studies examining the potential improvement in speech recognition in noise when these two technologies are used simultaneously. The goal of this study was to evaluate the potential benefits and limitations associated with the simultaneous use of a sound processing algorithm designed to improve performance in noise (Advanced Bionics ClearVoice) and a remote microphone system that incorporates adaptive, digital wireless radio transmission (Phonak Roger). A two-by-two repeated-measures design was used to examine performance differences obtained without these technologies compared to the use of each technology separately as well as the simultaneous use of both technologies. Participants were eleven Advanced Bionics (AB) cochlear implant recipients, ages 11 to 68 years. AzBio sentence recognition was measured in quiet and in the presence of classroom noise ranging in level from 50 to 80 dBA in 5-dB steps. Performance was evaluated in four conditions: (1) No ClearVoice and no Roger, (2) ClearVoice enabled without the use of Roger, (3) ClearVoice disabled with Roger enabled, and (4) simultaneous use of ClearVoice and Roger. Speech recognition in quiet was better than speech recognition in noise for all conditions. Use of ClearVoice and Roger each provided significant improvement in speech recognition in noise. The best performance in noise was obtained with the simultaneous use of ClearVoice and Roger. ClearVoice and Roger technology each improve speech recognition in noise, particularly when used at the same time.
Because ClearVoice does not degrade performance in quiet settings, clinicians should consider recommending ClearVoice for routine, full-time use for AB implant recipients. Roger should be used in all instances in which remote microphone technology may assist the user in understanding speech in the presence of noise. American Academy of Audiology.
Two-voice fundamental frequency estimation
NASA Astrophysics Data System (ADS)
de Cheveigné, Alain
2002-05-01
An algorithm is presented that estimates the fundamental frequencies of two concurrent voices or instruments. The algorithm models each voice as a periodic function of time, and jointly estimates both periods by cancellation according to a previously proposed method [de Cheveigné and Kawahara, Speech Commun. 27, 175-185 (1999)]. The new algorithm improves on the old in several respects: it allows an unrestricted search range, effectively avoids harmonic and subharmonic errors, is more accurate (it uses two-dimensional parabolic interpolation), and is computationally less costly. It remains subject to unavoidable errors when periods are in certain simple ratios and the task is inherently ambiguous. The algorithm is evaluated on a small database including speech, singing voice, and instrumental sounds. It can be extended in several ways: to decide the number of voices, to handle amplitude variations, and to estimate more than two voices (at the expense of increased processing cost and decreased reliability). It makes no use of instrument models, learned or otherwise, although it could usefully be combined with such models. [Work supported by the Cognitique programme of the French Ministry of Research and Technology.]
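The idea of jointly estimating two periods by cancellation can be illustrated with a brute-force sketch. This is not the algorithm evaluated in the abstract: it assumes integer periods, searches every lag pair exhaustively, and omits the parabolic interpolation and error-avoidance steps the abstract mentions. The function name and the synthetic test signal are hypothetical.

```python
import numpy as np


def estimate_two_periods(x, max_lag):
    """Search integer lag pairs (t1 <= t2) for the cascade of two comb
    filters, (1 - z^-t1)(1 - z^-t2), that leaves the least residual power."""
    x = np.asarray(x, dtype=float)
    start = 2 * max_lag          # earliest sample with a full history
    n = len(x) - start           # same window length for every lag pair
    if n <= 0:
        raise ValueError("signal too short for the requested max_lag")
    best_pair, best_power = None, np.inf
    for t1 in range(1, max_lag + 1):
        for t2 in range(t1, max_lag + 1):
            # Output of the two cascaded cancellation filters.
            residual = (x[start:start + n]
                        - x[start - t1:start - t1 + n]
                        - x[start - t2:start - t2 + n]
                        + x[start - t1 - t2:start - t1 - t2 + n])
            power = np.mean(residual ** 2)
            if power < best_power:
                best_pair, best_power = (t1, t2), power
    return best_pair


# Synthetic mixture of two harmonic "voices" with periods of 50 and 80 samples.
n = np.arange(1000)
voice_a = np.sin(2 * np.pi * n / 50) + 0.5 * np.sin(2 * np.pi * 2 * n / 50)
voice_b = np.sin(2 * np.pi * n / 80) + 0.5 * np.sin(2 * np.pi * 3 * n / 80)
print(estimate_two_periods(voice_a + voice_b, max_lag=90))
```

In this sketch `max_lag` is kept below twice the shorter period so that multiples of the true periods, which cancel the mixture equally well, fall outside the search range. A practical estimator would need an explicit rule for such subharmonic candidates and for non-integer periods.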
Scientific bases of human-machine communication by voice.
Schafer, R W
1995-01-01
The scientific bases for human-machine communication by voice are in the fields of psychology, linguistics, acoustics, signal processing, computer science, and integrated circuit technology. The purpose of this paper is to highlight the basic scientific and technological issues in human-machine communication by voice and to point out areas of future research opportunity. The discussion is organized around the following major issues in implementing human-machine voice communication systems: (i) hardware/software implementation of the system, (ii) speech synthesis for voice output, (iii) speech recognition and understanding for voice input, and (iv) usability factors related to how humans interact with machines. PMID:7479802
ERIC Educational Resources Information Center
Halpern, Angela E.; Ramig, Lorraine O.; Matos, Carlos E. C.; Petska-Cable, Jill A.; Spielman, Jennifer L.; Pogoda, Janice M.; Gilley, Phillip M.; Sapir, Shimon; Bennett, John K.; McFarland, David H.
2012-01-01
Purpose: To assess the feasibility and effectiveness of a newly developed assistive technology system, Lee Silverman Voice Treatment Companion (LSVT[R] Companion[TM], hereafter referred to as "Companion"), to support the delivery of LSVT[R]LOUD, an efficacious speech intervention for individuals with Parkinson disease (PD). Method: Sixteen…
Students' Voices about Information and Communication Technology in Upper Secondary Schools
ERIC Educational Resources Information Center
Olofsson, Anders D.; Lindberg, Ola J.; Fransson, Göran
2018-01-01
Purpose: The purpose of this paper is to explore upper secondary school students' voices on how information and communication technology (ICT) could structure and support their everyday activities and time at school. Design/methodology/approach: In all, 11 group interviews were conducted with a total of 46 students from three upper secondary…
Using Voice Boards: Pedagogical Design, Technological Implementation, Evaluation and Reflections
ERIC Educational Resources Information Center
Yaneske, Elisabeth; Oates, Briony
2011-01-01
We present a case study to evaluate the use of a Wimba Voice Board to support asynchronous audio discussion. We discuss the learning strategy and pedagogic rationale when a Voice Board was implemented within an MA module for language learners, enabling students to create learning objects and facilitating peer-to-peer learning. Previously students…
Using Voice Boards: Pedagogical Design, Technological Implementation, Evaluation and Reflections
ERIC Educational Resources Information Center
Yaneske, Elisabeth; Oates, Briony
2010-01-01
We present a case study to evaluate the use of a Wimba Voice Board to support asynchronous audio discussion. We discuss the learning strategy and pedagogic rationale when a Voice Board was implemented within an MA module for language learners, enabling students to create learning objects and facilitating peer-to-peer learning. Previously students…
Researching the Use of Voice Recognition Writing Software.
ERIC Educational Resources Information Center
Honeycutt, Lee
2003-01-01
Notes that voice recognition technology (VRT) has become accurate and fast enough to be useful in a variety of writing scenarios. Contends that little is known about how this technology might affect writing process or perceptions of silent writing. Explores future use of VRT by examining past research in the technology of dictation. (PM)
ERIC Educational Resources Information Center
Liu, Min; Horton, Lucas; Lee, Jaejin; Kang, Jina; Rosenblum, Jason; O'Hair, Matthew; Lu, Chu-Wei
2014-01-01
This paper describes the design and development process used to create Alien Rescue, a multimedia-enhanced learning environment that supports problem-based learning (PBL) in middle school science. The goal of the project is to further our understandings of technology, pedagogy, and instructional theories as they relate to the application of PBL…
Voice Technologies in Libraries: A Look into the Future.
ERIC Educational Resources Information Center
Lange, Holley R., Ed.; And Others
1991-01-01
Discussion of synthesized speech and voice recognition focuses on a forum that addressed the potential for speech technologies in libraries. Topics discussed by three contributors include possible library applications in technical processing, book receipt, circulation control, and database access; use by disabled and illiterate users; and problems…
ERIC Educational Resources Information Center
DiGiovanni, Lisa Marie
2013-01-01
The American Heart Association's HeartCode[TM] Healthcare Provider (HCP) Basic Life Support (BLS) e-learning program with voice-advisory manikins was implemented in an acute care hospital as the only teaching method offered for BLS certification. On course evaluations, healthcare provider staff commented that the VAM technology for skills practice…
Voice Enabled Framework to Support Post-Surgical Discharge Monitoring
Blansit, Kevin; Marmor, Rebecca; Zhao, Beiqun; Tien, Dan
2017-01-01
Unplanned surgical readmissions pose a challenging problem for the American healthcare system. We propose to combine consumer electronic voice recognition technology with the FHIR standard to create a post-surgical discharge monitoring app to identify and alert physicians to a patient’s deteriorating status. PMID:29854267
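As a rough illustration of how a transcribed voice response might be packaged for a FHIR-based workflow, the sketch below builds a minimal Observation-shaped record and applies an alert rule. The abstract does not describe the app's design, so everything here is an assumption: the function names, the choice of a pain score as the monitored value, and the alert threshold are hypothetical, and the record uses only a free-text `code` rather than any coded terminology.

```python
import json
from datetime import datetime, timezone

PAIN_ALERT_THRESHOLD = 7  # hypothetical cutoff, not a clinical recommendation


def pain_observation(patient_id: str, score: int, when: datetime) -> dict:
    """Build a minimal FHIR-style Observation for a spoken 0-10 pain score."""
    if not 0 <= score <= 10:
        raise ValueError("pain score must be between 0 and 10")
    return {
        "resourceType": "Observation",
        "status": "final",
        "code": {"text": "Patient-reported pain score (0-10)"},
        "subject": {"reference": f"Patient/{patient_id}"},
        "effectiveDateTime": when.isoformat(),
        "valueInteger": score,
    }


def needs_alert(observation: dict) -> bool:
    """Flag observations at or above the assumed alert threshold."""
    return observation["valueInteger"] >= PAIN_ALERT_THRESHOLD


obs = pain_observation("example", 8, datetime(2017, 1, 1, tzinfo=timezone.utc))
print(json.dumps(obs, indent=2))
print("alert physician:", needs_alert(obs))
```

A real deployment would validate the record against the FHIR specification and use coded values agreed with the receiving system; the sketch only shows the general shape of the data flow from voice input to alert.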
Cognitive Attachment Model of Voices: Evidence Base and Future Implications
Berry, Katherine; Varese, Filippo; Bucci, Sandra
2017-01-01
There is a robust association between hearing voices and exposure to traumatic events. Identifying mediating mechanisms for this relationship is key to theories of voice hearing and the development of therapies for distressing voices. This paper outlines the Cognitive Attachment model of Voices (CAV), a theoretical model to understand the relationship between earlier interpersonal trauma and distressing voice hearing. The model builds on attachment theory and well-established cognitive models of voices and argues that attachment and dissociative processes are key psychological mechanisms that explain how trauma influences voice hearing. Following the presentation of the model, the paper will review the current state of evidence regarding the proposed mechanisms of vulnerability to voice hearing and maintenance of voice-related distress. This review will include evidence from studies supporting associations between dissociation and voices, followed by details of our own research supporting the role of dissociation in mediating the relationship between trauma and voices and evidence supporting the role of adult attachment in influencing beliefs and relationships that voice hearers can develop with voices. The paper concludes by outlining the key questions that future research needs to address to fully test the model and the clinical implications that arise from the work. PMID:28713292
NASA Technical Reports Server (NTRS)
Voorhees, J. W.; Bucher, N. M.
1983-01-01
The cockpit has been one of the most rapidly changing areas of new aircraft design over the past thirty years. In connection with these developments, a pilot can now be considered a decision maker/system manager as well as a vehicle controller. There is, however, a trend towards an information overload in the cockpit, and information processing problems begin to occur for the rotorcraft pilot. One approach to overcome the arising difficulties is based on the utilization of voice technology to improve the information transfer rate in the cockpit with respect to both input and output. Attention is given to the background of speech technology, the application of speech technology within the cockpit, voice interactive electronic warning system (VIEWS) simulation, and methodology. Information subsystems are considered along with a dynamic simulation study, and data collection.
Micro-Based Speech Recognition: Instructional Innovation for Handicapped Learners.
ERIC Educational Resources Information Center
Horn, Carin E.; Scott, Brian L.
A new voice based learning system (VBLS), which allows the handicapped user to interact with a microcomputer by voice commands, is described. Speech or voice recognition is the computerized process of identifying a spoken word or phrase, including those resulting from speech impediments. This new technology is helpful to the severely physically…
Hands-free human-machine interaction with voice
NASA Astrophysics Data System (ADS)
Juang, B. H.
2004-05-01
Voice is a natural communication interface between a human and a machine. The machine, when placed in today's communication networks, may be configured to provide automation to save substantial operating cost, as demonstrated in AT&T's VRCP (Voice Recognition Call Processing), or to facilitate intelligent services, such as virtual personal assistants, to enhance individual productivity. These intelligent services often need to be accessible anytime, anywhere (e.g., in cars when the user is in a hands-busy-eyes-busy situation or during meetings where constantly talking to a microphone is either undesirable or impossible), and thus call for advanced signal processing and automatic speech recognition techniques which support what we call "hands-free" human-machine communication. These techniques entail a broad spectrum of technical ideas, ranging from use of directional microphones and acoustic echo cancellation to robust speech recognition. In this talk, we highlight a number of key techniques that were developed for hands-free human-machine communication in the mid-1990s after Bell Labs became a unit of Lucent Technologies. A video clip will be played to demonstrate the accomplishment.
Voice Interactive Analysis System Study. Final Report, August 28, 1978 through March 23, 1979.
ERIC Educational Resources Information Center
Harry, D. P.; And Others
The Voice Interactive Analysis System study continued research and development of the LISTEN real-time, minicomputer based connected speech recognition system, within NAVTRAEQUIPCEN'S program of developing automatic speech technology in support of training. An attempt was made to identify the most effective features detected by the TTI-500 model…
WES (Waterways Experiment Station) Communications Plan for Voice and Data
1989-01-01
modem on a leased line, and two wideband HDLC 56K connections not used on the Honeywell. 30. Honeywell DPS-8 configuration, as of October 1987, is as...based voice system to support additional asynchronous dial-up modem traffic. In June 1987, Dr. N. Radhakhrishnan of the WES Information Technology...voice system (PBX) and very low-speed data communications by the laboratories using 1,200/2,400-baud asynchronous modems over analog phone lines, and
Native voice, self-concept and the moral case for personalized voice technology.
Nathanson, Esther
2017-01-01
Purpose: (1) To explore the role of native voice and effects of voice loss on self-concept and identity, and survey the state of assistive voice technology; (2) to establish the moral case for developing personalized voice technology. Methods: This narrative review examines published literature on the human significance of voice, the impact of voice loss on self-concept and identity, and the strengths and limitations of current voice technology. Based on the impact of voice loss on self and identity, and voice technology limitations, the moral case for personalized voice technology is developed. Results: Given the richness of information conveyed by voice, loss of voice constrains expression of the self, but the full impact is poorly understood. Augmentative and alternative communication (AAC) devices facilitate communication but, despite advances in this field, voice output cannot yet express the unique nuances of individual voice. The ethical principles of autonomy, beneficence and equality of opportunity establish the moral responsibility to invest in accessible, cost-effective, personalized voice technology. Conclusions: Although further research is needed to elucidate the full effects of voice loss on self-concept, identity and social functioning, current understanding of the profoundly negative impact of voice loss establishes the moral case for developing personalized voice technology. Implications for Rehabilitation: Rehabilitation of voice-disordered patients should facilitate self-expression, interpersonal connectedness and social/occupational participation. Proactive questioning about the psychological and social experiences of patients with voice loss is a valuable entry point for rehabilitation planning. Personalized voice technology would enhance sense of self, communicative participation and autonomy and promote shared healthcare decision-making.
Further research is needed to identify the best strategies to preserve and strengthen identity and sense of self.
ERIC Educational Resources Information Center
Ludwig, Timothy D.; Goomas, David T.
2007-01-01
A field study was conducted in auto-parts after-market distribution centers where selectors used handheld computers to receive instructions and feedback about their product selection process. A wireless voice-interaction technology was then implemented in a multiple baseline fashion across three departments of a warehouse (N = 14) and was associated…
ERIC Educational Resources Information Center
Hardison, Debra M.; Sonchaeng, Chayawan
2005-01-01
This paper provides a sequence of specific techniques and examples for implementing theatre voice training and technology in teaching ESL/EFL oral skills. A layered approach is proposed based on information processing theory in which the focus of learner attention is shifted in stages from the physiological to the linguistic and then to the…
Value driven innovation in medical device design: a process for balancing stakeholder voices.
de Ana, F J; Umstead, K A; Phillips, G J; Conner, C P
2013-09-01
The innovation process has often been represented as a linear process which funnels customer needs through various business and process filters. This method may be appropriate for some consumer products, but in the medical device industry there are some inherent limitations to the traditional innovation funnel approach. In the medical device industry, there are a number of stakeholders who need to have their voices heard throughout the innovation process. Each stakeholder has diverse and unique needs relating to the medical device; the needs of one may strongly affect the needs of another, and the relationships between stakeholders may be tenuous. This paper describes the application of a spiral innovation process to the development of a medical device which considers three distinct stakeholder voices: the Voice of the Customer, the Voice of the Business and the Voice of the Technology. The process is presented as a case study focusing on the front-end redesign of a class III medical device for an orthopedics company. Starting from project initiation and scope alignment, the process describes four phases, Discover, Envision, Create, and Refine, and concludes with value assessment of the final design features.
Evaluating a voice recognition system: finding the right product for your department.
Freeh, M; Dewey, M; Brigham, L
2001-06-01
The Department of Radiology at the University of Utah Health Sciences Center has been in the process of transitioning from the traditional film-based department to a digital imaging department for the past 2 years. The department is now transitioning from the traditional method of dictating reports (dictation by radiologist to transcription to review and signing by radiologist) to a voice recognition system. The transition to digital operations will not be complete until we have the ability to directly interface the dictation process with the image review process. Voice recognition technology has advanced to the level where it can and should be an integral part of the new way of working in radiology and is an integral part of an efficient digital imaging department. The transition to voice recognition requires the task of identifying the product and the company that will best meet a department's needs. This report introduces the methods we used to evaluate the vendors and the products available as we made our purchasing decision. We discuss our evaluation method and provide a checklist that can be used by other departments to assist with their evaluation process. The criteria used in the evaluation process fall into the following major categories: user operations, technical infrastructure, medical dictionary, system interfaces, service support, cost, and company strength. Conclusions drawn from our evaluation process will be detailed, with the intention being to shorten the process for others as they embark on a similar venture. As more and more organizations investigate the many products and services that are now being offered to enhance the operations of a radiology department, it becomes increasingly important that solid methods are used to most effectively evaluate the new products. This report should help others complete the task of evaluating a voice recognition system and may be adaptable to other products as well.
Tippey, Kathryn G; Sivaraj, Elayaraj; Ferris, Thomas K
2017-06-01
This study evaluated the individual and combined effects of voice (vs. manual) input and head-up (vs. head-down) display in a driving and device interaction task. Advances in wearable technology offer new possibilities for in-vehicle interaction but also present new challenges for managing driver attention and regulating device usage in vehicles. This research investigated how driving performance is affected by interface characteristics of devices used for concurrent secondary tasks. A positive impact on driving performance was expected when devices included voice-to-text functionality (reducing demand for visual and manual resources) and a head-up display (HUD) (supporting greater visibility of the driving environment). Driver behavior and performance were compared in a texting-while-driving task set during a driving simulation. The texting task was completed with and without voice-to-text using a smartphone and with voice-to-text using Google Glass's HUD. Driving task performance degraded with the addition of the secondary texting task. However, voice-to-text input supported relatively better performance in both driving and texting tasks compared to using manual entry. HUD functionality further improved driving performance compared to conditions using a smartphone and often was not significantly worse than performance without the texting task. This study suggests that despite the performance costs of texting-while-driving, voice input methods improve performance over manual entry, and head-up displays may further extend those performance benefits. This study can inform designers and potential users of wearable technologies as well as policymakers tasked with regulating the use of these technologies while driving.
van Leer, Eva; Connor, Nadine P.
2012-01-01
Objectives/Hypotheses: There are many documented barriers to successful adherence to voice therapy. However, methods for facilitating adherence are not well understood. The purpose of this study was to determine if patient adherence could be improved by providing patients with practice support between sessions using mobile treatment videos. Methods: Thirteen voice therapy participants were provided with portable media players containing videos of voice exercises exemplified by their therapists and themselves. A randomized crossover design of two conditions was used: (1) standard of care voice therapy where participants were provided with written homework descriptions; and (2) video-enhanced voice therapy where participants received a portable digital media player with clinician and self-videos. The duration of each condition was 1 week. Results: Practice of voice exercises was significantly greater in the video-enhanced voice therapy condition than in the standard of care “written” condition (P < 0.05). Three aspects of participant motivation for practice (overall commitment to practice, importance of practice, and confidence in the ability to practice) were also significantly greater after the video-enhanced condition than after the standard of care condition. Conclusion: These results support the use of video examples and portable digital media players in voice therapy for individuals who are comfortable using such technology. PMID:21840169
Exploring expressivity and emotion with artificial voice and speech technologies.
Pauletto, Sandra; Balentine, Bruce; Pidcock, Chris; Jones, Kevin; Bottaci, Leonardo; Aretoulaki, Maria; Wells, Jez; Mundy, Darren P; Balentine, James
2013-10-01
Emotion in audio-voice signals, as synthesized by text-to-speech (TTS) technologies, was investigated to formulate a theory of expression for user interface design. Emotional parameters were specified with markup tags, and the resulting audio was further modulated with post-processing techniques. Software was then developed to link a selected TTS synthesizer with an automatic speech recognition (ASR) engine, producing a chatbot that could speak and listen. Using these two artificial voice subsystems, investigators explored both artistic and psychological implications of artificial speech emotion. Goals of the investigation were interdisciplinary, with interest in musical composition, augmentative and alternative communication (AAC), commercial voice announcement applications, human-computer interaction (HCI), and artificial intelligence (AI). The work-in-progress points towards an emerging interdisciplinary ontology for artificial voices. As one study output, HCI tools are proposed for future collaboration.
Future Educators' Explaining Voices
ERIC Educational Resources Information Center
de Oliveira, Janaina Minelli; Caballero, Pablo Buenestado; Camacho, Mar
2013-01-01
Teacher education programs must offer pre-service students innovative technology-supported learning environments, guiding them in the revision of their preconceptions on literacy and technology. This paper presents a case study that uses podcasts to inquire into future educators' views on technology and the digital age. Results show future…
Utilization of Internet Protocol-Based Voice Systems in Remote Payload Operations
NASA Technical Reports Server (NTRS)
Chamberlain, Jim; Bradford, Bob; Best, Susan; Nichols, Kelvin
2002-01-01
Due to limited crew availability to support science and the large number of experiments to be operated simultaneously, telescience is key to a successful International Space Station (ISS) science program. Crew, operations personnel at NASA centers, and researchers at universities and companies around the world must work closely together to perform scientific experiments on-board ISS. The deployment of reliable high-speed Internet Protocol (IP)-based networks promises to greatly enhance telescience capabilities. These networks are now being used to cost-effectively extend the reach of remote mission support systems. They reduce the need for dedicated leased lines and travel while improving distributed workgroup collaboration capabilities. NASA has initiated use of Voice over Internet Protocol (VoIP) to supplement the existing mission voice communications system used by researchers at their remote sites. The Internet Voice Distribution System (IVoDS) connects remote researchers to mission support "loops" or conferences via NASA networks and Internet 2. Researchers use IVoDS software on personal computers to talk with operations personnel at NASA centers. IVoDS also has the capability, if authorized, to allow researchers to communicate with the ISS crew during experiment operations. IVoDS was developed by Marshall Space Flight Center with contractors & Technology, First Virtual Communications, Lockheed-Martin, and VoIP Group. IVoDS is currently undergoing field-testing with full deployment for up to 50 simultaneous users expected in 2002. Research is being performed in parallel with IVoDS deployment for a next-generation system to qualitatively enhance communications among ISS operations personnel. In addition to the current voice capability, video and data/application-sharing capabilities are being investigated. IVoDS technology is also being considered for mission support systems for programs such as Space Launch Initiative and Homeland Defense.
NASA Astrophysics Data System (ADS)
Attention is given to aspects of quality assurance methodologies in development life cycles, optical intercity transmission systems, multiaccess protocols, system and technology aspects in the case of regional/domestic satellites, advances in SSB-AM radio transmission over terrestrial and satellite networks, and development environments for telecommunications systems. Other subjects studied are concerned with business communication networks for voice and data, VLSI in local network and communication protocol, product evaluation and support, an update regarding Videotex, topics in communication theory, topics in radio propagation, a status report regarding societal effects of technology in the workplace, digital image processing, and adaptive signal processing for communications. The management of the reliability function in the development process is considered along with Giga-bit technologies for long distance large capacity optical transmission equipment. Also covered are the application of gallium arsenide analog and digital integrated circuits for high-speed fiber optical communications and a simple algorithm for image data coding.
Interference effects of vocalization on dual task performance
NASA Astrophysics Data System (ADS)
Owens, J. M.; Goodman, L. S.; Pianka, M. J.
1984-09-01
Voice command and control systems have been proposed as a potential means of off-loading the typically overburdened visual information processing system. However, prior to introducing novel human-machine interfacing technologies in high workload environments, consideration must be given to the integration of the new technologies within existing task structures to ensure that no new sources of workload or interference are systematically introduced. This study examined the use of voice interactive systems technology in the joint performance of two cognitive information processing tasks requiring continuous memory and choice reaction wherein a basis for intertask interference might be expected. Stimuli for the continuous memory task were presented aurally and either voice or keyboard responding was required in the choice reaction task. Performance was significantly degraded in each task when voice responding was required in the choice reaction time task. Performance degradation was evident in higher error scores for both the choice reaction and continuous memory tasks. Performance decrements observed under conditions of high intertask stimulus similarity were not statistically significant. The results signal the need to consider further the task requirements for verbal short-term memory when applying speech technology in multitask environments.
NASA Astrophysics Data System (ADS)
Huang, Zhaohui; Huang, Xiemin
2018-04-01
This paper first introduces the trend toward integrating multi-channel interactions in automotive HMI (Human Machine Interface), starting from the complex information models faced by existing automotive HMI, and describes various interaction modes. By comparing voice interaction with touch screen, gesture, and other interaction modes, the potential and feasibility of voice interaction in automotive HMI experience design are established. Then, the related theories of voice interaction, identification technologies, human beings' cognitive models of voices, and voice design methods are further explored. The research priority of this paper is then proposed, i.e., how to design voice interaction to create more humane task-oriented dialogue scenarios to enhance interactive experiences of automotive HMI. The specific scenarios in driving behaviors suitable for the use of voice interaction are studied and classified, and the usability principles and key elements for automotive HMI voice design are proposed according to the scenario features. Then, through a user participatory usability testing experiment, the dialogue processes of voice interaction in automotive HMI are defined. The logics and grammars in voice interaction are classified according to the experimental results, and the mental models in the interaction processes are analyzed. Finally, a voice interaction design method to create humane task-oriented dialogue scenarios in the driving environment is proposed.
Centering Students in School-Based Support Processes: Critical Inquiries and Shifting Perspectives
ERIC Educational Resources Information Center
Brion-Meisels, Gretchen
2015-01-01
Drawing on data from two qualitative studies, this chapter argues that both school organizations and individual students will benefit from centering youth voices in student support systems. To do this, the author shares data from adolescents' narratives that demonstrate how young people's voices might (re)shape the central practices of…
7 CFR 2.98 - Director, Management Services.
Code of Federal Regulations, 2011 CFR
2011-01-01
... management services; information technology services related to end user office automation, desktop computers, enterprise networking support, handheld devices and voice telecommunications; with authority to take actions...
7 CFR 2.98 - Director, Management Services.
Code of Federal Regulations, 2013 CFR
2013-01-01
... management services; information technology services related to end user office automation, desktop computers, enterprise networking support, handheld devices and voice telecommunications; with authority to take actions...
7 CFR 2.98 - Director, Management Services.
Code of Federal Regulations, 2012 CFR
2012-01-01
... management services; information technology services related to end user office automation, desktop computers, enterprise networking support, handheld devices and voice telecommunications; with authority to take actions...
Environment Tied to Successful Learning.
ERIC Educational Resources Information Center
Cash, Carol S.; And Others
1997-01-01
Technology available to schools includes a broad spectrum of voice, data and video equipment. Planners need to consider the following subsystems individually and collectively: (1) technology-based products; (2) communications and power distribution to support the equipment; and (3) ergonomics, lighting, acoustics, environmental controls, and…
The Voice as Computer Interface: A Look at Tomorrow's Technologies.
ERIC Educational Resources Information Center
Lange, Holley R.
1991-01-01
Discussion of voice as the communications device for computer-human interaction focuses on voice recognition systems for use within a library environment. Voice technologies are described, including voice response and voice recognition; examples of voice systems in use in libraries are examined; and further possibilities, including use with…
Is there an effect of dysphonic teachers' voices on children's processing of spoken language?
Rogerson, Jemma; Dodd, Barbara
2005-03-01
There is a vast body of literature on the causes, prevalence, implications, and issues of vocal dysfunction in teachers. However, the educational effect of teacher vocal impairment is largely unknown. The purpose of this study was to investigate the effect of impaired voice quality on children's processing of spoken language. One hundred and seven children (age range, 9.2 to 10.6, mean 9.8, SD 3.76 months) listened to three video passages, one read in a control voice, one in a mild dysphonic voice, and one in a severe dysphonic voice. After each video passage, children were asked to answer six questions, with multiple-choice answers. The results indicated that children's perceptions of speech across the three voice qualities differed, regardless of gender, IQ, and school attended. Performance in the control voice passages was better than performance in the mild and severe dysphonic voice passages. No difference was found between performance in the mild and severe dysphonic voice passages, highlighting that any form of vocal impairment is detrimental to children's speech processing and is therefore likely to have a negative educational effect. These findings, in light of the high rate of vocal dysfunction in teachers, further support the implementation of specific voice care education for those in the teaching profession.
Doarn, Charles R; Zacharias, Stephanie; Keck, Casey Stewart; Tabangin, Meredith; DeAlarcon, Alessandro; Kelchner, Lisa
2018-06-05
This article describes the design and implementation of a web-based portal developed to provide supported home practice between weekly voice therapy sessions delivered through telehealth to children with voice disorders. This in-between care consisted of supported home practice that was remotely monitored by speech-language pathologists (SLPs). A web-based voice therapy portal (VTP) was developed by an interdisciplinary team of SLPs (specialized in pediatric voice therapy), telehealth specialists, biomedical informaticians, and interface designers as a platform so participants could complete voice therapy home practice. The VTP was subsequently field tested in a group of children with voice disorders, participating in a larger telehealth study. Building the VTP for supported home practice for pediatric voice therapy was challenging, but successful. Key interactive features of the final site included 11 vocal hygiene questions, traditional voice therapy exercises grouped into levels, audio/visual voice therapy demonstrations, a store-and-retrieval system for voice samples, message/chat function, written guidelines for weekly therapy exercises, and questionnaires for parents to complete after each therapy session. Ten participants (9-14 years of age) diagnosed with a voice disorder were enrolled for eight weekly telehealth voice therapy sessions with follow-up in-between care provided using the VTP. The development and implementation of the VTP as a novel platform for the delivery of voice therapy home practice sessions were effective. We found that a versatile individual, who can work with all project staff (speak the language of both SLPs and information technologists), is essential to the development process. Once the website was established, participants and SLPs effectively utilized the web-based VTP. They found it feasible and useful for needed in-between care and reinforcement of therapeutic exercises.
ERIC Educational Resources Information Center
Consortium for School Networking (NJ1), 2006
2006-01-01
The Consortium for School Networking (CoSN) is the country's premier voice in education technology leadership with a mission to serve as the national organization for K-12 technology leaders who use technology strategically to ultimately improve teaching and learning. CoSN provides products and services to support and nurture leadership…
ERIC Educational Resources Information Center
Bainbridge, Marc
2005-01-01
The Individuals with Disabilities Education Act (IDEA-PL 101-478) of 1990 assures the right to assistive technology for individuals with disabilities. Later versions of the law increased spending, support, and consumer decision making and encouraged research and development for a range of assistive instructional technologies. As a result, the…
Use of speech-to-text technology for documentation by healthcare providers.
Ajami, Sima
2016-01-01
Medical records are a critical component of a patient's treatment. However, documentation of patient-related information is considered a secondary activity in the provision of healthcare services, often leading to incomplete medical records and patient data of low quality. Advances in information technology (IT) in the health system and registration of information in electronic health records (EHR) using speech-to-text conversion software have facilitated service delivery. This narrative review is a literature search with the help of libraries, books, conference proceedings, databases of Science Direct, PubMed, Proquest, Springer, SID (Scientific Information Database), and search engines such as Yahoo and Google. I used the following keywords and their combinations: speech recognition, automatic report documentation, voice to text software, healthcare, information, and voice recognition. Due to lack of knowledge of other languages, I searched all texts in English or Persian with no time limits. Of a total of 70, only 42 articles were selected. Speech-to-text conversion technology offers opportunities to improve the documentation process of medical records, reduce cost and time of recording information, enhance the quality of documentation, improve the quality of services provided to patients, and support healthcare providers in legal matters. Healthcare providers should recognize the impact of this technology on service delivery.
ERIC Educational Resources Information Center
Villano, Matt
2006-01-01
The benefits of deploying a communications system that runs over the Internet Protocol are well documented. Sending voice over the Internet, a process commonly known as VoIP, has been shown to save money on long distance calls, make voice mail more accessible, and enable users to answer their phones from anywhere. The technology also makes adding…
Predicting and influencing voice therapy adherence using social-cognitive factors and mobile video.
van Leer, Eva; Connor, Nadine P
2015-05-01
Patient adherence to voice therapy is an established challenge. The purpose of this study was (a) to examine whether adherence to treatment could be predicted from three social-cognitive factors measured at treatment onset: self-efficacy, goal commitment, and the therapeutic alliance, and (b) to test whether the provision of clinician, self-, and peer model mobile treatment videos on MP4 players would influence the same triad of social cognitive factors and the adherence behavior of patients. Forty adults with adducted hyperfunction with and without benign lesions were prospectively randomized to either 4 sessions of voice therapy enhanced by MP4 support or without MP4 support. Adherence between sessions was assessed through self-report. Social cognitive factors and voice outcomes were assessed at the beginning and end of therapy. Utility of MP4 support was assessed via interviews. Self-efficacy and the therapeutic alliance predicted a significant amount of adherence variance. MP4 support significantly increased generalization, self-efficacy for generalization, and the therapeutic alliance. An interaction effect demonstrated that MP4 support was particularly effective for patients who started therapy with poor self-efficacy for generalization. Adherence may be predicted and influenced via social-cognitive means. Mobile technology can extend therapy to extraclinical settings.
Functional selectivity for face processing in the temporal voice area of early deaf individuals
van Ackeren, Markus J.; Rabini, Giuseppe; Zonca, Joshua; Foa, Valentina; Baruffaldi, Francesca; Rezk, Mohamed; Pavani, Francesco; Rossion, Bruno; Collignon, Olivier
2017-01-01
Brain systems supporting face and voice processing both contribute to the extraction of important information for social interaction (e.g., person identity). How does the brain reorganize when one of these channels is absent? Here, we explore this question by combining behavioral and multimodal neuroimaging measures (magneto-encephalography and functional imaging) in a group of early deaf humans. We show enhanced selective neural response for faces and for individual face coding in a specific region of the auditory cortex that is typically specialized for voice perception in hearing individuals. In this region, selectivity to face signals emerges early in the visual processing hierarchy, shortly after typical face-selective responses in the ventral visual pathway. Functional and effective connectivity analyses suggest reorganization in long-range connections from early visual areas to the face-selective temporal area in individuals with early and profound deafness. Altogether, these observations demonstrate that regions that typically specialize for voice processing in the hearing brain preferentially reorganize for face processing in born-deaf people. Our results support the idea that cross-modal plasticity in the case of early sensory deprivation relates to the original functional specialization of the reorganized brain regions. PMID:28652333
Internet Based Remote Operations
NASA Technical Reports Server (NTRS)
Chamberlain, James
1999-01-01
This is the Final Report for the Internet Based Remote Operations Contract, under which payload operations research support tasks were performed from March 1999 through September 1999. These tasks support the GSD goal of developing a secure, inexpensive data, voice, and video mission communications capability between remote payload investigators and the NASA payload operations team in the International Space Station (ISS) era. AZTek has provided feedback from the NASA payload community by utilizing its extensive payload development and operations experience to test and evaluate remote payload operations systems. AZTek has focused on use of the "public Internet" and inexpensive, commercial off-the-shelf (COTS) Internet-based tools that would most benefit "small" (e.g., $2 million or less) payloads and small developers without permanent remote operations facilities. Such projects have limited budgets to support installation and development of high-speed dedicated communications links and high-end, custom ground support equipment and software. The primary conclusions of the study are as follows: (1) The trend of using Internet technology for "live" collaborative applications such as telescience will continue. The GSD-developed data and voice capabilities continued to work well over the "public" Internet during this period. (2) Transmitting multiple voice streams from a voice-conferencing server to a client PC to be mixed and played on the PC is feasible. (3) There are two classes of voice vendors in the market: large traditional phone equipment vendors pursuing integration of PSTN with the Internet, and small Internet startups. The key to selecting a vendor will be to find a company sufficiently large and established to provide a base voice-conferencing software product line for the next several years.
Payne, Thomas H; Alonso, W David; Markiel, J Andrew; Lybarger, Kevin; White, Andrew A
2018-01-01
We describe the development and design of a smartphone app-based system to create inpatient progress notes using voice, commercial automatic speech recognition software, with text processing to recognize spoken voice commands and format the note, and integration with a commercial EHR. This new system fits hospital rounding workflow and was used to support a randomized clinical trial testing whether use of voice to create notes improves timeliness of note availability, note quality, and physician satisfaction with the note creation process. The system was used to create 709 notes which were placed in the corresponding patient's EHR record. The median time from pressing the Send button to appearance of the formatted note in the Inbox was 8.8 min. It was generally very reliable, accepted by physician users, and secure. This approach provides an alternative to use of keyboard and templates to create progress notes and may appeal to physicians who prefer voice to typing. Copyright © 2017 Elsevier Inc. All rights reserved.
The value of visualizing tone of voice.
Pullin, Graham; Cook, Andrew
2013-10-01
Whilst most of us have an innate feeling for tone of voice, it is an elusive quality that even phoneticians struggle to describe with sufficient subtlety. For people who cannot speak themselves this can have particularly profound repercussions. Augmentative communication often involves text-to-speech, a technology that only supports a basic choice of prosody based on punctuation. Given how inherently difficult it is to talk about more nuanced tone of voice, there is a risk that its absence from current devices goes unremarked and unchallenged. Looking ahead optimistically to more expressive communication aids, their design will need to involve more subtle interactions with tone of voice: interactions that the people using them can understand and engage with. Interaction design can play a role in making tone of voice visible, tangible, and accessible. Two projects that have already catalysed interdisciplinary debate in this area, Six Speaking Chairs and Speech Hedge, are introduced together with responses. A broader role for design is advocated, as a means to opening up speech technology research to a wider range of disciplinary perspectives, and also to the contributions and influence of people who use it in their everyday lives.
Military and Government Applications of Human-Machine Communication by Voice
NASA Astrophysics Data System (ADS)
Weinstein, Clifford J.
1995-10-01
This paper describes a range of opportunities for military and government applications of human-machine communication by voice, based on visits and contacts with numerous user organizations in the United States. The applications include some that appear to be feasible by careful integration of current state-of-the-art technology and others that will require a varying mix of advances in speech technology and in integration of the technology into applications environments. Applications that are described include (1) speech recognition and synthesis for mobile command and control; (2) speech processing for a portable multifunction soldier's computer; (3) speech- and language-based technology for naval combat team tactical training; (4) speech technology for command and control on a carrier flight deck; (5) control of auxiliary systems, and alert and warning generation, in fighter aircraft and helicopters; and (6) voice check-in, report entry, and communication for law enforcement agents or special forces. A phased approach for transfer of the technology into applications is advocated, where integration of applications systems is pursued in parallel with advanced research to meet future needs.
The influence of nationality on the accuracy of face and voice recognition.
Doty, N D
1998-01-01
Sixty English and U.S. citizens were tested to determine the effect of nationality on accuracy in recognizing previously witnessed faces and voices. Subjects viewed a frontal facial photograph and were then asked to select that face from a set of 10 oblique facial photographs. Subjects listened to a recorded voice and were then asked to select the same voice from a set of 10 voice recordings. This process was repeated 7 more times, such that subjects identified a male and female face and voice from England, France, Belize, and the United States. Subjects demonstrated better accuracy recognizing the faces and voices of their own nationality. Subgroup analysis further supported the other-nationality effect as well as the previously documented other-race effect.
Practical applications of interactive voice technologies: Some accomplishments and prospects
NASA Technical Reports Server (NTRS)
Grady, Michael W.; Hicklin, M. B.; Porter, J. E.
1977-01-01
A technology assessment of the application of computers and electronics to complex systems is presented. Three existing systems which utilize voice technology (speech recognition and speech generation) are described. Future directions in voice technology are also described.
Voice-processing technologies--their application in telecommunications.
Wilpon, J G
1995-01-01
As the telecommunications industry evolves over the next decade to provide the products and services that people will desire, several key technologies will become commonplace. Two of these, automatic speech recognition and text-to-speech synthesis, will provide users with more freedom on when, where, and how they access information. While these technologies are currently in their infancy, their capabilities are rapidly increasing and their deployment in today's telephone network is expanding. The economic impact of just one application, the automation of operator services, is well over $100 million per year. Yet there still are many technical challenges that must be resolved before these technologies can be deployed ubiquitously in products and services throughout the worldwide telephone network. These challenges include: (i) High level of accuracy. The technology must be perceived by the user as highly accurate, robust, and reliable. (ii) Easy to use. Speech is only one of several possible input/output modalities for conveying information between a human and a machine, much like a computer terminal or Touch-Tone pad on a telephone. It is not the final product. Therefore, speech technologies must be hidden from the user. That is, the burden of using the technology must be on the technology itself. (iii) Quick prototyping and development of new products and services. The technology must support the creation of new products and services based on speech in an efficient and timely fashion. In this paper I present a vision of the voice-processing industry with a focus on the areas with the broadest base of user penetration: speech recognition, text-to-speech synthesis, natural language processing, and speaker recognition technologies. The current and future applications of these technologies in the telecommunications industry will be examined in terms of their strengths, limitations, and the degree to which user needs have been or have yet to be met. 
Although noteworthy gains have been made in areas with potentially small user bases and in the more mature speech-coding technologies, these subjects are outside the scope of this paper. PMID:7479815
The cancer experience map: an approach to including the patient voice in supportive care solutions.
Hall, Leslie Kelly; Kunz, Breanne F; Davis, Elizabeth V; Dawson, Rose I; Powers, Ryan S
2015-05-28
The perspective of the patient, also called the "patient voice", is an essential element in materials created for cancer supportive care. Identifying that voice, however, can be a challenge for researchers and developers. A multidisciplinary team at a health information company tasked with addressing this issue created a representational model they call the "cancer experience map". This map, designed as a tool for content developers, offers a window into the complex perspectives inside the cancer experience. Informed by actual patient quotes, the map shows common overall themes for cancer patients, concerns at key treatment points, strategies for patient engagement, and targeted behavioral goals. In this article, the team members share the process by which they created the map as well as its first use as a resource for cancer support videos. The article also addresses the broader policy implications of including the patient voice in supportive cancer content, particularly with regard to mHealth apps.
Internet Voice Distribution System (IVoDS) Utilization in Remote Payload Operations
NASA Technical Reports Server (NTRS)
Best, Susan; Bradford, Bob; Chamberlain, Jim; Nichols, Kelvin; Bailey, Darrell (Technical Monitor)
2002-01-01
Due to limited crew availability to support science and the large number of experiments to be operated simultaneously, telescience is key to a successful International Space Station (ISS) science program. Crew, operations personnel at NASA centers, and researchers at universities and companies around the world must work closely together to perform scientific experiments on-board ISS. NASA has initiated use of Voice over Internet Protocol (VoIP) to supplement the existing HVoDS mission voice communications system used by researchers. The Internet Voice Distribution System (IVoDS) connects researchers to mission support "loops" or conferences via Internet Protocol networks such as the high-speed Internet 2. Researchers use IVoDS software on personal computers to talk with operations personnel at NASA centers. IVoDS also has the capability, if authorized, to allow researchers to communicate with the ISS crew during experiment operations. IVoDS was developed by Marshall Space Flight Center with contractors A2 Technology, Inc., FVC, Lockheed-Martin, and VoIP Group. IVoDS is currently undergoing field-testing with full deployment for up to 50 simultaneous users expected in 2002. Research is currently being performed to take full advantage of the digital world - the Personal Computer and Internet Protocol networks - to qualitatively enhance communications among ISS operations personnel. In addition to the current voice capability, video and data-sharing capabilities are being investigated. Major obstacles being addressed include network bandwidth capacity and strict security requirements. Techniques being investigated to reduce and overcome these obstacles include emerging audio-video protocols and network technology including multicast and quality-of-service.
Integration of Voice, Data and Image as a Factor in Information Planning. AIR Forum 1981 Paper.
ERIC Educational Resources Information Center
Sheehan, Bernard S.; McMinn, William B.
Technological advances in computers and telecommunications that have significantly changed higher education are analyzed within a framework that associates technologies, applications, and aspects of institutions that are affected. Two further frameworks, one for information systems suggested by decision support systems study and one which is a map…
Learning with Portable Digital Devices in Australian Schools: 20 Years On!
ERIC Educational Resources Information Center
Newhouse, C. Paul
2014-01-01
Portable computing technologies such as laptops, tablets, smartphones, wireless networking, voice/stylus input, and plug and play peripheral devices, appear to offer the means of finally realising much of the long heralded vision for computers to support learning in schools. There is the possibility for the technology to finally become a…
Digital signal processing algorithms for automatic voice recognition
NASA Technical Reports Server (NTRS)
Botros, Nazeih M.
1987-01-01
This research investigates the digital signal analysis algorithms currently implemented in automatic voice recognition systems. Automatic voice recognition means the capability of a computer to recognize and interact with verbal commands. The focus is on the digital signal analysis, rather than the linguistic analysis, of the speech signal. Several digital signal processing algorithms are available for voice recognition, among them Linear Predictive Coding (LPC), Short-time Fourier Analysis, and Cepstrum Analysis. Of these, LPC is the most widely used. It has a short execution time and does not require large memory storage. However, it has several limitations due to the assumptions used to develop it. The other two algorithms are frequency-domain algorithms that make fewer assumptions, but they are not widely implemented or investigated. With recent advances in digital technology, namely signal processors, these two frequency-domain algorithms may now be investigated for implementation in voice recognition. This research is concerned with real-time, microprocessor-based recognition algorithms.
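To make the LPC step named in this abstract concrete, the following is a minimal sketch of linear prediction using the autocorrelation method and the Levinson-Durbin recursion. It is a textbook formulation, not the report's own implementation; the function names `autocorrelation` and `lpc` and the choice of pure Python are assumptions made for illustration.

```python
def autocorrelation(frame, max_lag):
    """Return autocorrelation values r[0..max_lag] of one speech frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def lpc(frame, order):
    """Estimate LPC coefficients with the Levinson-Durbin recursion.

    Returns (a, err) with a[0] == 1, where the predictor is
    x[n] ~ -(a[1]*x[n-1] + ... + a[order]*x[n-order])
    and err is the remaining prediction-error energy.
    """
    r = autocorrelation(frame, order)
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        if err == 0.0:
            break  # silent or perfectly predicted frame: nothing left to model
        # Reflection coefficient for stage i.
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err
        # Update coefficients 1..i-1 from the previous stage, then append k.
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= (1.0 - k * k)
    return a, err
```

For a frame that decays as x[n] = 0.9 * x[n-1], `lpc(frame, 1)` gives a[1] close to -0.9, i.e. the predictor recovers the decay factor. The recursion needs only the autocorrelation values and a coefficient array of length order + 1, which is consistent with the abstract's remark about short execution time and small memory use.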
ERIC Educational Resources Information Center
Worley, Robin
2011-01-01
Millions of youths in developing countries are described by UNICEF as "invisible and excluded." They live at the margins of society, facing challenges to their daily existence, powerless to make positive changes. But the emergence of citizen journalism and digital storytelling may offer these youths a chance to share their voices and…
ERIC Educational Resources Information Center
Blau, Ina; Shamir-Inbal, Tamar
2018-01-01
"Student voice" (SV) refers to listening to and valuing students' views regarding their learning experiences, as well as treating them as equal partners in the evaluation process. This is expected, in turn, to empower students to take a more active role in shaping their learning. This study explores the role played by digital…
Neural correlates of perceptual narrowing in cross-species face-voice matching.
Grossmann, Tobias; Missana, Manuela; Friederici, Angela D; Ghazanfar, Asif A
2012-11-01
Integrating the multisensory features of talking faces is critical to learning and extracting coherent meaning from social signals. While we know much about the development of these capacities at the behavioral level, we know very little about the underlying neural processes. One prominent behavioral milestone of these capacities is the perceptual narrowing of face-voice matching, whereby young infants match faces and voices across species, but older infants do not. In the present study, we provide neurophysiological evidence for developmental decline in cross-species face-voice matching. We measured event-related brain potentials (ERPs) while 4- and 8-month-old infants watched and listened to congruent and incongruent audio-visual presentations of monkey vocalizations and humans mimicking monkey vocalizations. The ERP results indicated that younger infants distinguished between the congruent and the incongruent faces and voices regardless of species, whereas in older infants, the sensitivity to multisensory congruency was limited to the human face and voice. Furthermore, with development, visual and frontal brain processes and their functional connectivity became more sensitive to the congruence of human faces and voices relative to monkey faces and voices. Our data show the neural correlates of perceptual narrowing in face-voice matching and support the notion that postnatal experience with species identity is associated with neural changes in multisensory processing (Lewkowicz & Ghazanfar, 2009). © 2012 Blackwell Publishing Ltd.
Moerman, Mieke; Martens, Jean-Pierre; Dejonckere, Philippe
2015-04-01
This article is a compilation of the authors' own research performed during the European COoperation in Science and Technology (COST) action 2103: 'Advance Voice Function Assessment', an initiative of voice and speech processing teams consisting of physicists, engineers, and clinicians. This manuscript concerns analyzing largely irregular voicing types, namely substitution voicing (SV) and adductor spasmodic dysphonia (AdSD). A specific perceptual rating scale (IINFVo) was developed, and the Auditory Model Based Pitch Extractor (AMPEX), a piece of software that automatically analyses running speech and generates pitch values in background noise, was applied. The IINFVo perceptual rating scale has been shown to be useful in evaluating SV. The analysis of strongly irregular voices stimulated a modification of the European Laryngological Society's assessment protocol which was originally designed for the common types of (less severe) dysphonia. Acoustic analysis with AMPEX demonstrates that the most informative features are, for SV, the voicing-related acoustic features and, for AdSD, the perturbation measures. Poor correlations between self-assessment and acoustic and perceptual dimensions in the assessment of highly irregular voices argue for a multidimensional approach.
Smaradottir, Berglind; Håland, Jarle; Martinez, Santiago
2017-01-01
A mobile device's touchscreen allows users to use a choreography of hand gestures to interact with the user interface. A screen reader on a mobile device is designed to support the interaction of visually disabled users while using gestures. This paper presents an evaluation of VoiceOver, a screen reader in Apple Inc. products. The evaluation was a part of the research project "Visually impaired users touching the screen - a user evaluation of assistive technology".
Intra-building telecommunications cabling standards for Sandia National Laboratories, New Mexico
DOE Office of Scientific and Technical Information (OSTI.GOV)
Adams, R.L.
1993-08-01
This document establishes a working standard for all telecommunications cable installations at Sandia National Laboratories, New Mexico. It is based on recent national commercial cabling standards. The topics addressed are Secure and Open/Restricted Access telecommunications environments and both twisted-pair and optical-fiber components of communications media. Some of the state-of-the-art technologies that will be supported by the intrabuilding cable infrastructure are Circuit and Packet Switched Networks (PBX/5ESS Voice and Low-Speed Data), Local Area Networks (Ethernet, Token Ring, Fiber and Copper Distributed Data Interface), and Wide Area Networks (Asynchronous Transfer Mode). These technologies can be delivered to every desk and can transport data at rates sufficient to support all existing applications (such as Voice, Text and graphics, Still Images, Full-motion Video), as well as applications to be defined in the future.
Military and government applications of human-machine communication by voice.
Weinstein, C J
1995-01-01
This paper describes a range of opportunities for military and government applications of human-machine communication by voice, based on visits and contacts with numerous user organizations in the United States. The applications include some that appear to be feasible by careful integration of current state-of-the-art technology and others that will require a varying mix of advances in speech technology and in integration of the technology into applications environments. Applications that are described include (1) speech recognition and synthesis for mobile command and control; (2) speech processing for a portable multifunction soldier's computer; (3) speech- and language-based technology for naval combat team tactical training; (4) speech technology for command and control on a carrier flight deck; (5) control of auxiliary systems, and alert and warning generation, in fighter aircraft and helicopters; and (6) voice check-in, report entry, and communication for law enforcement agents or special forces. A phased approach for transfer of the technology into applications is advocated, where integration of applications systems is pursued in parallel with advanced research to meet future needs. PMID:7479718
Giving Canadian Science, Mathematics, and Technology Education an Independent Voice
ERIC Educational Resources Information Center
Hodson, Derek
2015-01-01
It is noted that the "Canadian Journal of Science, Mathematics and Technology Education" (CJSMTE) was founded with the support of a donation of $1.0 million from the Imperial Oil Charitable Foundation. Four goals were uppermost in the thinking behind the journal: first, it should be bilingual; second, it should be cross-disciplinary;…
Storytelling Supported by Technology: An Alternative for EFL Children with Learning Difficulties
ERIC Educational Resources Information Center
Lee, Sy-ying
2012-01-01
This action research aims to investigate how technology improves the conditions of storytelling to help enhance the learning attitude and motivation of EFL children with learning difficulties using PowerPoint designs and an online recording system--VoiceThread (http://voicethread.com/). The use of PowerPoint designs is to assure children of clear…
Physiological characteristics of the supported singing voice. A preliminary study.
Griffin, B; Woo, P; Colton, R; Casper, J; Brewer, D
1995-03-01
The purpose of this study was to develop a definition of the supported singing voice based on physiological characteristics by comparing the subjects' concepts of a supported voice with objective measurements of their supported and unsupported voice. This preliminary report presents findings based on data from eight classically trained singers. Subjects answered questions about their concepts of the characteristics of the supported singing voice and how it is produced. Samples of the supported and unsupported singing voice produced at low, medium, and high pitches at a comfortable loudness level were collected for acoustic, spectral, airflow, electroglottographic, air volume, and stroboscopic analyses. Significant differences between the supported and unsupported voice were found for sound pressure level (SPL), peak airflow, subglottal pressure (Ps), glottal open time, and frequency of the fourth formant (F4). Mean flow and F2 frequency differences were sex and pitch related. Males adjusted laryngeal configuration to produce supported voice, whereas glottal configuration differences were greater in females. Breathing patterns were variable and not significantly different between supported and unsupported voice. Subjects in this study believe that the supported singing voice is resonant, clear, and easy to manage and is produced by correct breath management. Results of data analysis show that the supported singing voice has different spectral characteristics from and higher SPL, peak airflow, and Ps than the unsupported voice. Singers adjust laryngeal and/or glottal configuration to account for these changes, but no significant differences in breathing activity were found.
NWR (National Weather Service) voice synthesis project, phase 1
NASA Astrophysics Data System (ADS)
Sampson, G. W.
1986-01-01
The purpose of the NOAA Weather Radio (NWR) Voice Synthesis Project is to provide a demonstration of the current voice synthesis technology. Phase 1 of this project is presented, providing a complete automation of an hourly surface aviation observation for broadcast over NWR. In examining the products currently available on the market, the decision was made that synthetic voice technology does not have the high quality speech required for broadcast over the NWR. Therefore the system presented uses the phrase concatenation type of technology for a very high quality, versatile, voice synthesis system.
NASA's Physics of the Cosmos and Cosmic Origins technology development programs
NASA Astrophysics Data System (ADS)
Clampin, Mark; Pham, Thai
2014-07-01
NASA's Physics of the Cosmos (PCOS) and Cosmic Origins (COR) Program Offices, established in 2011, reside at the NASA Goddard Space Flight Center (GSFC). The offices serve as the implementation arm for the Astrophysics Division at NASA Headquarters. We present an overview of the programs' technology development activities and technology investment portfolio, funded by NASA's Strategic Astrophysics Technology (SAT) program. We currently fund 19 technology advancements to enable future PCOS and COR missions to help answer the questions "How did our universe begin and evolve?" and "How did galaxies, stars, and planets come to be?" We discuss the process for addressing community-provided technology gaps and Technology Management Board (TMB)-vetted prioritization and investment recommendations that inform the SAT program. The process improves the transparency and relevance of our technology investments, provides the community a voice in the process, and promotes targeted external technology investments by defining needs and identifying customers. The programs' goal is to promote and support technology development needed to enable missions envisioned by the National Research Council's (NRC) "New Worlds, New Horizons in Astronomy and Astrophysics" (NWNH) Decadal Survey report [1] and the Astrophysics Implementation Plan (AIP) [2]. These include technology development for dark energy, gravitational waves, X-ray and inflation probe science, and a 4m-class UV/optical telescope to conduct imaging and spectroscopy studies, as a post-Hubble observatory with significantly improved sensitivity and capability.
NASA's Physics of the Cosmos and Cosmic Origins Technology Development Programs
NASA Technical Reports Server (NTRS)
Clampin, Mark; Pham, Thai
2014-01-01
NASA's Physics of the Cosmos (PCOS) and Cosmic Origins (COR) Program Offices, established in 2011, reside at the NASA Goddard Space Flight Center (GSFC). The offices serve as the implementation arm for the Astrophysics Division at NASA Headquarters. We present an overview of the programs' technology development activities and technology investment portfolio, funded by NASA's Strategic Astrophysics Technology (SAT) program. We currently fund 19 technology advancements to enable future PCOS and COR missions to help answer the questions "How did our universe begin and evolve?" and "How did galaxies, stars, and planets come to be?" We discuss the process for addressing community-provided technology gaps and Technology Management Board (TMB)-vetted prioritization and investment recommendations that inform the SAT program. The process improves the transparency and relevance of our technology investments, provides the community a voice in the process, and promotes targeted external technology investments by defining needs and identifying customers. The programs' goal is to promote and support technology development needed to enable missions envisioned by the National Research Council's (NRC) "New Worlds, New Horizons in Astronomy and Astrophysics" (NWNH) Decadal Survey report [1] and the Astrophysics Implementation Plan (AIP) [2]. These include technology development for dark energy, gravitational waves, X-ray and inflation probe science, and a 4m-class UV/optical telescope to conduct imaging and spectroscopy studies, as a post-Hubble observatory with significantly improved sensitivity and capability.
Design and realization of intelligent tourism service system based on voice interaction
NASA Astrophysics Data System (ADS)
Hu, Lei-di; Long, Yi; Qian, Cheng-yang; Zhang, Ling; Lv, Guo-nian
2008-10-01
Voice technology is an important means of making tourism service systems more intelligent and user-friendly. Drawing on voice technology, the paper examines application needs and system composition and presents an overall framework for an intelligent tourism service system consisting of a presentation layer, a Web services layer, and a tourism application service layer. On this basis, the paper elaborates the implementation of the system and its key technologies, including intelligent voice interaction, seamless integration of multiple data sources, location-aware guide services, and tourism safety control. Finally, a prototype of the tourism services system is realized for the tourism setting of Nanjing.
A long distance voice transmission system based on the white light LED
NASA Astrophysics Data System (ADS)
Tian, Chunyu; Wei, Chang; Wang, Yulian; Wang, Dachi; Yu, Benli; Xu, Feng
2017-10-01
A long-distance voice transmission system based on visible light communication technology (VLCT) is proposed in this paper. The proposed system comprises a transmitter, a receiver, and voice signal processing on a single-chip microcomputer. In the compact LED transmitter, we use on-off keying with non-return-to-zero coding (OOK-NRZ) to realize high-speed modulation easily, which reduces system complexity. A voice transmission system with low noise and a wide modulation band is achieved by designing a high-efficiency receiving optical path and using filters to reduce noise from ambient light. To improve the speed of signal processing, we use the single-chip microcomputer to encode and decode the voice signal. Furthermore, a serial peripheral interface (SPI) is adopted to transmit the voice signal data accurately. Test results show that the transmission distance of the system is more than 100 meters, with a maximum data rate of 1.5 Mbit/s and an SNR of 30 dB. The system has advantages such as simple construction, low cost, and strong practicality. It therefore has broad application prospects in fields such as emergency communication and indoor wireless communication.
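The OOK-NRZ scheme mentioned in this abstract can be illustrated with a short sketch: each bit is held as a constant LED level (on for 1, off for 0) for one bit period, and the receiver thresholds the average level of each period. The function names, the MSB-first bit order, the `samples_per_bit` oversampling factor, and the 0.5 threshold are all assumptions for illustration; the paper's actual framing and hardware interface are not described in the abstract.

```python
def ook_nrz_encode(data, samples_per_bit=4):
    """Map each bit (MSB first) to a constant LED level held for one bit period."""
    levels = []
    for byte in data:
        for shift in range(7, -1, -1):
            bit = (byte >> shift) & 1
            levels.extend([bit] * samples_per_bit)  # 1 = LED on, 0 = LED off
    return levels


def ook_nrz_decode(samples, samples_per_bit=4, threshold=0.5):
    """Average each bit period, compare with a threshold, and repack bytes."""
    bits = []
    for start in range(0, len(samples) - samples_per_bit + 1, samples_per_bit):
        chunk = samples[start:start + samples_per_bit]
        bits.append(1 if sum(chunk) / samples_per_bit > threshold else 0)
    out = bytearray()
    for i in range(0, len(bits) - 7, 8):  # only complete groups of 8 bits
        value = 0
        for bit in bits[i:i + 8]:
            value = (value << 1) | bit
        out.append(value)
    return bytes(out)
```

A round trip such as `ook_nrz_decode(ook_nrz_encode(b"voice"))` returns the original bytes. At the reported maximum rate of 1.5 Mbit/s, one bit period lasts 1 / 1.5e6 s, roughly 0.67 microseconds.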
ERIC Educational Resources Information Center
Malone, Karen
1999-01-01
Presents the story of a community that, with the support of the researcher, endeavored to peel back the layers of debris that had muffled their voices. This constructed debris--social, physical, cultural, and historical--served to reproduce the power of the other through processes of false consciousness. (Author/CCM)
Voice stress analysis and evaluation
NASA Astrophysics Data System (ADS)
Haddad, Darren M.; Ratley, Roy J.
2001-02-01
Voice Stress Analysis (VSA) systems are marketed as computer-based systems capable of measuring stress in a person's voice as an indicator of deception. They are advertised as being less expensive, easier to use, less invasive in use, and less constrained in their operation than polygraph technology. The National Institute of Justice has asked the Air Force Research Laboratory for assistance in evaluating voice stress analysis technology. Law enforcement officials have also been asking questions about this technology. If VSA technology proves to be effective, its value for military and law enforcement application is tremendous.
Nurses using futuristic technology in today's healthcare setting.
Wolf, Debra M; Kapadia, Amar; Kintzel, Jessie; Anton, Bonnie B
2009-01-01
Human-computer interaction (HCI) here refers to nurses using voice-assisted technology within a clinical setting to document patient care in real time, retrieve patient information from care plans, and complete routine tasks. This is a reality for clinicians today in acute and long-term care settings. Voice-assisted documentation provides hands-free and eyes-free accurate documentation while enabling effective communication and task management. The speech technology increases the accuracy of documentation while interfacing directly with the electronic health record (EHR). Using technology consisting of a lightweight headset and a small, fist-sized wireless computer, verbal responses to easy-to-follow cues are converted into a database system, allowing staff to obtain individualized care status reports on demand. To further assist staff in their daily processes, this technology allows staff to send and receive pages as needed. This paper will discuss how this leading-edge, award-winning technology is being integrated within the United States. Collaborative efforts between clinicians and analysts will be discussed, reflecting the interactive design and build functionality. Features such as the system's voice responses and directed cues will be shared, along with how easily data can be documented, viewed, and retrieved. Outcome data will be presented on how the technology affected organizations' quality outcomes, financial reimbursement, and employees' level of satisfaction.
Definition of problems of persons in sheltered care environments
NASA Technical Reports Server (NTRS)
Fetzner, W. N.
1979-01-01
Innovations in health care using aerospace technologies are described. Voice synthesizer and voice recognition technologies were used in developing voice controlled wheel chairs and optacons. Telephone interface modules are also described.
A SOUND SOURCE LOCALIZATION TECHNIQUE TO SUPPORT SEARCH AND RESCUE IN LOUD NOISE ENVIRONMENTS
NASA Astrophysics Data System (ADS)
Yoshinaga, Hiroshi; Mizutani, Koichi; Wakatsuki, Naoto
At some sites of earthquakes and other disasters, rescuers search for people buried under rubble by listening for the sounds which they make. Thus developing a technique to localize sound sources amidst loud noise will support such search and rescue operations. In this paper, we discuss an experiment performed to test an array signal processing technique which searches for unperceivable sound in loud noise environments. Two speakers simultaneously played a noise of a generator and a voice decreased by 20 dB (= 1/100 of power) from the generator noise at an outdoor space where cicadas were making noise. The sound signal was received by a horizontally set linear microphone array 1.05 m in length and consisting of 15 microphones. The direction and the distance of the voice were computed and the sound of the voice was extracted and played back as an audible sound by array signal processing.
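Two quantities in this abstract lend themselves to a small worked sketch: the 20 dB level difference (a power ratio of 1/100) and the geometry of the 15-microphone, 1.05 m linear array. The code below converts decibels to a power ratio and computes far-field steering delays of the kind used in delay-and-sum beamforming. This is a simplification swapped in for illustration: the abstract does not say which array processing method was used, and the paper also estimated distance, which a far-field model cannot do. Uniform microphone spacing, the speed of sound, and all function names are assumptions.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at about 20 degrees C


def db_to_power_ratio(db):
    """Convert a level difference in decibels to a power ratio."""
    return 10.0 ** (db / 10.0)


def mic_positions(length_m=1.05, count=15):
    """Positions along the array axis, assuming uniformly spaced microphones."""
    spacing = length_m / (count - 1)
    return [i * spacing for i in range(count)]


def steering_delays(angle_deg, positions, c=SPEED_OF_SOUND):
    """Far-field plane-wave delays in seconds relative to the first microphone.

    angle_deg is measured from broadside (0 = perpendicular to the array).
    """
    s = math.sin(math.radians(angle_deg))
    return [(x - positions[0]) * s / c for x in positions]
```

Under the uniform-spacing assumption the microphones sit 1.05 / 14 = 0.075 m apart, and `db_to_power_ratio(-20)` gives 0.01, matching the abstract's "1/100 of power". Summing the channels after compensating these delays reinforces sound from the steered direction relative to sound from other directions.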
Architecting Communication Network of Networks for Space System of Systems
NASA Technical Reports Server (NTRS)
Bhasin, Kul B.; Hayden, Jeffrey L.
2008-01-01
The National Aeronautics and Space Administration (NASA) and the Department of Defense (DoD) are planning Space System of Systems (SoS) to address the new challenges of space exploration, defense, communications, navigation, Earth observation, and science. In addition, these complex systems must provide interoperability, enhanced reliability, common interfaces, dynamic operations, and autonomy in system management. Both NASA and the DoD have chosen to meet the new demands with high data rate communication systems and space Internet technologies that bring Internet Protocols (IP), routers, servers, software, and interfaces to space networks to enable as much autonomous operation of those networks as possible. These technologies reduce the cost of operations and, with higher bandwidths, support the expected voice, video, and data needed to coordinate activities at each stage of an exploration mission. In this paper, we discuss, in a generic fashion, how the architectural approaches and processes are being developed and used for defining a hypothetical communication and navigation networks infrastructure to support lunar exploration. Examples are given of the products generated by the architecture development process.
ERIC Educational Resources Information Center
Popyk, Marilyn K.
1986-01-01
Discusses the new automated office and its six major technologies (data processing, word processing, graphics, image, voice, and networking), the information processing cycle (input, processing, output, distribution/communication, and storage and retrieval), ergonomics, and ways to expand office education classes (versus class instruction). (CT)
Processing of speech signals for physical and sensory disabilities.
Levitt, H
1995-01-01
Assistive technology involving voice communication is used primarily by people who are deaf, hard of hearing, or who have speech and/or language disabilities. It is also used to a lesser extent by people with visual or motor disabilities. A very wide range of devices has been developed for people with hearing loss. These devices can be categorized not only by the modality of stimulation [i.e., auditory, visual, tactile, or direct electrical stimulation of the auditory nerve (auditory-neural)] but also in terms of the degree of speech processing that is used. At least four such categories can be distinguished: assistive devices (a) that are not designed specifically for speech, (b) that take the average characteristics of speech into account, (c) that process articulatory or phonetic characteristics of speech, and (d) that embody some degree of automatic speech recognition. Assistive devices for people with speech and/or language disabilities typically involve some form of speech synthesis or symbol generation for severe forms of language disability. Speech synthesis is also used in text-to-speech systems for sightless persons. Other applications of assistive technology involving voice communication include voice control of wheelchairs and other devices for people with mobility disabilities. PMID:7479816
Processing of Speech Signals for Physical and Sensory Disabilities
NASA Astrophysics Data System (ADS)
Levitt, Harry
1995-10-01
Assistive technology involving voice communication is used primarily by people who are deaf, hard of hearing, or who have speech and/or language disabilities. It is also used to a lesser extent by people with visual or motor disabilities. A very wide range of devices has been developed for people with hearing loss. These devices can be categorized not only by the modality of stimulation [i.e., auditory, visual, tactile, or direct electrical stimulation of the auditory nerve (auditory-neural)] but also in terms of the degree of speech processing that is used. At least four such categories can be distinguished: assistive devices (a) that are not designed specifically for speech, (b) that take the average characteristics of speech into account, (c) that process articulatory or phonetic characteristics of speech, and (d) that embody some degree of automatic speech recognition. Assistive devices for people with speech and/or language disabilities typically involve some form of speech synthesis or symbol generation for severe forms of language disability. Speech synthesis is also used in text-to-speech systems for sightless persons. Other applications of assistive technology involving voice communication include voice control of wheelchairs and other devices for people with mobility disabilities.
Barone, Pascal; Chambaudie, Laure; Strelnikov, Kuzma; Fraysse, Bernard; Marx, Mathieu; Belin, Pascal; Deguine, Olivier
2016-10-01
Due to signal distortion, speech comprehension in cochlear-implanted (CI) patients relies strongly on visual information, a compensatory strategy supported by important cortical crossmodal reorganisations. Though crossmodal interactions are evident for speech processing, it is unclear whether a visual influence is observed in CI patients during non-linguistic visual-auditory processing, such as face-voice interactions, which are important in social communication. We analyse and compare visual-auditory interactions in CI patients and normal-hearing subjects (NHS) at equivalent auditory performance levels. Proficient CI patients and NHS performed a voice-gender categorisation in the visual-auditory modality from a morphing-generated voice continuum between male and female speakers, while ignoring the presentation of a male or female visual face. Our data show that during the face-voice interaction, CI deaf patients are strongly influenced by visual information when performing an auditory gender categorisation task, in spite of maximum recovery of auditory speech. No such effect is observed in NHS, even in situations of CI simulation. Our hypothesis is that the functional crossmodal reorganisation that occurs in deafness could influence nonverbal processing, such as face-voice interaction; this is important for patient internal supramodal representation. Copyright © 2016 Elsevier Ltd. All rights reserved.
mHealth: Using Mobile Technology to Support Healthcare
Okuboyejo, Senanu; Eyesan, Omatseyin
2014-01-01
Adherence to long-term therapy in outpatient setting is required to reduce the prevalence of chronic diseases such as HIV/AIDS, Diabetes, Tuberculosis and Malaria. This paper presents a mobile technology-based medical alert system for outpatient adherence in Nigeria. The system makes use of the SMS and voice features of mobile phones. The system has the potential of improving adherence to medication in outpatient setting by reminding patients of dosing schedules and attendance to scheduled appointments through SMS and voice calls. It will also inform patients of benefits and risks associated with adherence. Interventions aimed at improving adherence would provide significant positive return on investment through primary prevention (of risk factors) and secondary prevention of adverse health outcomes. PMID:24678384
Ventriloquising the Voice: Writing in the University
ERIC Educational Resources Information Center
Fulford, Amanda
2009-01-01
In this paper I consider one aspect of how student writing is supported in the university. I focus on the use of the "writing frame", questioning its status as a vehicle for facilitating student voice, and in the process questioning how that notion is itself understood. I illustrate this by using examples from the story of the 1944 Hollywood film…
Bethmann, Anja; Scheich, Henning; Brechmann, André
2012-01-01
It is widely accepted that the perception of human voices is supported by neural structures located along the superior temporal sulci. However, there is an ongoing discussion to what extent the activations found in fMRI studies are evoked by the vocal features themselves or are the result of phonetic processing. To show that the temporal lobes are indeed engaged in voice processing, short utterances spoken by famous and unknown people were presented to healthy young participants whose task it was to identify the familiar speakers. In two event-related fMRI experiments, the temporal lobes were found to differentiate between familiar and unfamiliar voices such that named voices elicited higher BOLD signal intensities than unfamiliar voices. Yet, the temporal cortices did not only discriminate between familiar and unfamiliar voices. Experiment 2, which required overtly spoken responses and allowed to distinguish between four familiarity grades, revealed that there was a fine-grained differentiation between all of these familiarity levels with higher familiarity being associated with larger BOLD signal amplitudes. Finally, we observed a gradual response change such that the BOLD signal differences between unfamiliar and highly familiar voices increased with the distance of an area from the transverse temporal gyri, especially towards the anterior temporal cortex and the middle temporal gyri. Therefore, the results suggest that (the anterior and non-superior portions of) the temporal lobes participate in voice-specific processing independent from phonetic components also involved in spoken speech material. PMID:23112826
McCloskey, Rose; Jarrett, Pamela; Stewart, Connie; Keeping-Burke, Lisa
2015-01-01
Technology has the potential to offer support to older adults after being discharged from geriatric rehabilitation. This article highlights recruitment and retention challenges in a study examining an interactive voice response telephone system designed to monitor and support older adults and their informal caregivers following discharge from a geriatric rehabilitation unit. A prospective longitudinal study was planned to examine the feasibility of an interactive voice telephone system in facilitating the transition from rehabilitation to home for older adults and their family caregivers. Patient participants were required to make daily calls into the system. Using standardized instruments, data was to be collected at baseline and during home visits. Older adults and their caregivers may not be willing to learn how to use new technology at the time of hospital discharge. Poor recruitment and retention rates prevented analysis of findings. The importance of recruitment and retention in any study should never be underestimated. Target users of any intervention need to be included in both the design of the intervention and the study examining its benefit. Identifying the issues associated with introducing technology with a group of older rehabilitation patients should assist others who are interested in exploring the role of technology in facilitating hospital discharge. © 2014 Association of Rehabilitation Nurses.
Lebacq, Jean; Schoentgen, Jean; Cantarella, Giovanna; Bruss, Franz Thomas; Manfredi, Claudia; DeJonckere, Philippe
2017-09-01
Smartphone technology provides new opportunities for recording standardized voice samples of patients and transmitting the audio files to the voice laboratory. This drastically improves the achievement of baseline designs, used in research on efficiency of voice treatments. However, the basic requirement is the suitability of smartphones for recording and digitizing pathologic voices (mainly characterized by period perturbations and noise) without significant distortion. In a previous article, this was tested using realistic synthesized deviant voice samples (/a:/) with three precisely known levels of jitter and of noise in all combinations. High correlations were found between jitter and noise to harmonics ratio measured in (1) recordings via smartphones, (2) direct microphone recordings, and (3) sound files generated by the synthesizer. In the present work, similar experiments were performed (1) in the presence of increasing levels of ambient noise and (2) using synthetic deviant voice samples (/a:/) as well as synthetic voice material simulating a deviant short voiced utterance (/aiuaiuaiu/). Ambient noise levels up to 50 dB A are acceptable. However, signal processing occurs in some smartphones, and this significantly affects estimates of jitter and noise to harmonics ratio when formant changes are introduced in analogy with running speech. The conclusion is that voice material must provisionally be limited to a sustained /a/. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
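The abstract above relies on jitter, the cycle-to-cycle perturbation of the glottal period. As a minimal sketch, and assuming the common "local jitter" definition (mean absolute difference between consecutive periods divided by the mean period), the measure can be computed as follows. The function name and the example period values are hypothetical; the abstract does not state which jitter variant or analysis software the authors used.

```python
from typing import Sequence


def local_jitter(periods: Sequence[float]) -> float:
    """Return local jitter as a fraction (multiply by 100 for percent).

    `periods` holds consecutive glottal cycle durations in seconds.
    This follows the usual "local jitter" definition and is an
    assumption, not necessarily the measure used in the study.
    """
    if len(periods) < 2:
        raise ValueError("at least two periods are required")
    # Absolute differences between each pair of consecutive periods.
    diffs = [abs(b - a) for a, b in zip(periods, periods[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_period = sum(periods) / len(periods)
    return mean_diff / mean_period


if __name__ == "__main__":
    # Hypothetical periods alternating around 10 ms.
    example = [0.009, 0.011, 0.009, 0.011]
    print(f"local jitter: {local_jitter(example):.3f}")
```

A perfectly periodic signal gives a jitter of zero, which is why any smoothing or enhancement applied by the recording device can bias the estimate.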
ERP correlates of motivating voices: quality of motivation and time-course matters
Zougkou, Konstantina; Weinstein, Netta
2017-01-01
Here, we conducted the first study to explore how motivations expressed through speech are processed in real-time. Participants listened to sentences spoken in two types of well-studied motivational tones (autonomy-supportive and controlling), or a neutral tone of voice. To examine this, listeners were presented with sentences that either signaled motivations through prosody (tone of voice) and words simultaneously (e.g. ‘You absolutely have to do it my way’ spoken in a controlling tone of voice), or lacked motivationally biasing words (e.g. ‘Why don’t we meet again tomorrow’ spoken in a motivational tone of voice). Event-related brain potentials (ERPs) in response to motivations conveyed through words and prosody showed that listeners rapidly distinguished between motivations and neutral forms of communication as shown in enhanced P2 amplitudes in response to motivational when compared with neutral speech. This early detection mechanism is argued to help determine the importance of incoming information. Once assessed, motivational language is continuously monitored and thoroughly evaluated. When compared with neutral speech, listening to controlling (but not autonomy-supportive) speech led to enhanced late potential ERP mean amplitudes, suggesting that listeners are particularly attuned to controlling messages. The importance of controlling motivation for listeners is mirrored in effects observed for motivations expressed through prosody only. Here, an early rapid appraisal, as reflected in enhanced P2 amplitudes, is only found for sentences spoken in controlling (but not autonomy-supportive) prosody. Once identified as sounding pressuring, the message seems to be preferentially processed, as shown by enhanced late potential amplitudes in response to controlling prosody. 
Taken together, results suggest that motivational and neutral language are differentially processed; further, the data suggest that listening to cues signaling pressure and control cannot be ignored and lead to preferential, and more in-depth processing mechanisms. PMID:28525641
ERP correlates of motivating voices: quality of motivation and time-course matters.
Zougkou, Konstantina; Weinstein, Netta; Paulmann, Silke
2017-10-01
Here, we conducted the first study to explore how motivations expressed through speech are processed in real-time. Participants listened to sentences spoken in two types of well-studied motivational tones (autonomy-supportive and controlling), or a neutral tone of voice. To examine this, listeners were presented with sentences that either signaled motivations through prosody (tone of voice) and words simultaneously (e.g. 'You absolutely have to do it my way' spoken in a controlling tone of voice), or lacked motivationally biasing words (e.g. 'Why don't we meet again tomorrow' spoken in a motivational tone of voice). Event-related brain potentials (ERPs) in response to motivations conveyed through words and prosody showed that listeners rapidly distinguished between motivations and neutral forms of communication as shown in enhanced P2 amplitudes in response to motivational when compared with neutral speech. This early detection mechanism is argued to help determine the importance of incoming information. Once assessed, motivational language is continuously monitored and thoroughly evaluated. When compared with neutral speech, listening to controlling (but not autonomy-supportive) speech led to enhanced late potential ERP mean amplitudes, suggesting that listeners are particularly attuned to controlling messages. The importance of controlling motivation for listeners is mirrored in effects observed for motivations expressed through prosody only. Here, an early rapid appraisal, as reflected in enhanced P2 amplitudes, is only found for sentences spoken in controlling (but not autonomy-supportive) prosody. Once identified as sounding pressuring, the message seems to be preferentially processed, as shown by enhanced late potential amplitudes in response to controlling prosody. 
Taken together, results suggest that motivational and neutral language are differentially processed; further, the data suggest that listening to cues signaling pressure and control cannot be ignored and lead to preferential, and more in-depth processing mechanisms. © The Author (2017). Published by Oxford University Press.
Prototype app for voice therapy: a peer review.
Lavaissiéri, Paula; Melo, Paulo Eduardo Damasceno
2017-03-09
Voice speech therapy promotes changes in patients' voice-related habits and rehabilitation. Speech-language therapists use a host of materials ranging from pictures to electronic resources and computer tools as aids in this process. Mobile technology is attractive, interactive and a nearly constant feature in the daily routine of a large part of the population and has a growing application in healthcare. The objective was to develop a prototype application for voice therapy, submit it to peer assessment, and improve the initial prototype based on these assessments. A prototype of the Q-Voz application was developed based on Apple's Human Interface Guidelines. The prototype was analyzed by seven speech therapists who work in the voice area. Improvements to the product were made based on these assessments. All features of the application were considered satisfactory by most evaluators. All evaluators found the application very useful; evaluators reported that patients would find it easier to make changes in voice behavior with the application than without it; the evaluators stated they would use this application with their patients with dysphonia and in the process of rehabilitation and that the application offers useful tools for voice self-management. Based on the suggestions provided, six improvements were made to the prototype. The prototype Q-Voz application was developed and evaluated by seven judges and subsequently improved. All evaluators stated they would use the application with their patients undergoing rehabilitation, indicating that the Q-Voz application for mobile devices can be considered an auxiliary tool for voice speech therapy.
Cross-classification of musical and vocal emotions in the auditory cortex.
Paquette, Sébastien; Takerkart, Sylvain; Saget, Shinji; Peretz, Isabelle; Belin, Pascal
2018-05-09
Whether emotions carried by voice and music are processed by the brain using similar mechanisms has long been investigated. Yet neuroimaging studies do not provide a clear picture, mainly due to lack of control over stimuli. Here, we report a functional magnetic resonance imaging (fMRI) study using comparable stimulus material in the voice and music domains-the Montreal Affective Voices and the Musical Emotional Bursts-which include nonverbal short bursts of happiness, fear, sadness, and neutral expressions. We use a multivariate emotion-classification fMRI analysis involving cross-timbre classification as a means of comparing the neural mechanisms involved in processing emotional information in the two domains. We find, for affective stimuli in the violin, clarinet, or voice timbres, that local fMRI patterns in the bilateral auditory cortex and upper premotor regions support above-chance emotion classification when training and testing sets are performed within the same timbre category. More importantly, classifier performance generalized well across timbre in cross-classifying schemes, albeit with a slight accuracy drop when crossing the voice-music boundary, providing evidence for a shared neural code for processing musical and vocal emotions, with possibly a cost for the voice due to its evolutionary significance. © 2018 New York Academy of Sciences.
Redesigning Schools for 21st Century Technologies: A Middle School with the Power to Improve.
ERIC Educational Resources Information Center
Van Dam, Janet M.
1994-01-01
Describes the processes involved in redesigning and renovating Power Middle School (Michigan) for current and future educational technology, particularly for the media center. Topics discussed include planning; time management; wiring infrastructure; voice and video networks; teacher and student multimedia production rooms; and communications…
Implementation of the Intelligent Voice System for Kazakh
NASA Astrophysics Data System (ADS)
Yessenbayev, Zh; Saparkhojayev, N.; Tibeyev, T.
2014-04-01
Modern speech technologies are highly advanced and widely used in day-to-day applications. However, this mostly applies to the languages of well-developed countries, such as English, German, Japanese, and Russian. For Kazakh, the situation is less advanced, and research in this field is only starting to evolve. In this research and application-oriented project, we introduce an intelligent voice system for the fast deployment of call centers and information desks supporting Kazakh speech. The demand for such a system is obvious if the country's large size and small population are considered. Landline and cell phones become the only means of communication for distant villages and suburbs. The system features Kazakh speech recognition and synthesis modules as well as a web GUI for efficient dialog management. For speech recognition we use the CMU Sphinx engine, and for speech synthesis, MaryTTS. The web GUI is implemented in Java, enabling operators to quickly create and manage the dialogs in a user-friendly graphical environment. The call routines are handled by Asterisk PBX and JBoss Application Server. The system supports such technologies and protocols as VoIP, VoiceXML, FastAGI, Java Speech API and J2EE. For the speech recognition experiments we compiled and used the first Kazakh speech corpus with utterances from 169 native speakers. The performance of the speech recognizer is 4.1% WER on isolated word recognition and 6.9% WER on clean continuous speech recognition tasks. The speech synthesis experiments include the training of male and female voices.
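The recognizer results above are reported as word error rate (WER). As a minimal sketch of how that figure is conventionally obtained, WER is the word-level edit distance (substitutions, deletions and insertions) between the reference transcript and the recognizer output, divided by the number of reference words. The function name and the sample strings below are hypothetical, and the abstract does not describe the authors' scoring tool.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (S + D + I) / N using word-level Levenshtein distance.

    N is the number of words in the reference transcript. Tokenization
    by whitespace is an assumption made for this illustration.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical transcripts: one substitution and one deletion.
    print(word_error_rate("a b c d", "a x c"))
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.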
Smith, Chris; Vannak, Uk; Sokhey, Ly; Ngo, Thoai D; Gold, Judy; Free, Caroline
2016-01-05
The objective of this paper is to outline the formative research process used to develop the MOTIF mobile phone-based (mHealth) intervention to support post-abortion family planning in Cambodia. The formative research process involved literature reviews, interviews and focus group discussions with clients, and consultation with clinicians and organisations implementing mHealth activities in Cambodia. This process led to the development of a conceptual framework and the intervention. Key findings from the formative research included identification of the main reasons for non-use of contraception and patterns of mobile phone use in Cambodia. We drew on components of existing interventions and behaviour change theory to develop a conceptual framework. A multi-faceted voice-based intervention was designed to address health concerns and other key determinants of contraception use. Formative research was essential in order to develop an appropriate mHealth intervention to support post-abortion contraception in Cambodia. Each component of the formative research contributed to the final intervention design.
ERIC Educational Resources Information Center
Skouge, James R.; Kelly, Mary L.; Roberts, Kelly D.; Leake, David W.; Stodden, Robert A.
2007-01-01
This paper focuses on "technologies for voice" that are related to the self-determination of youth with developmental disabilities. The authors describe a self-determination model that values family-focused, community-referenced pedagogies employing "new media" to give voice to youth and their families. In line with the adage that a picture is…
Antiles, S; Couris, J; Schweitzer, A; Rosenthal, D; Da Silva, R Q
2000-01-01
Computerized voice recognition systems (VR) can reduce costs and enhance service. The capital outlay required for conversion to a VR system is significant; therefore, it is incumbent on radiology departments to provide cost and service justifications to administrators. Massachusetts General Hospital (MGH) in Boston implemented VR over a two-year period and achieved annual savings of $530,000 and a 50% decrease in report throughput. Those accomplishments required solid planning and implementation strategies, training and sustainment programs. This article walks through the process, step by step, in the hope of providing a tool set for future implementations. Because VR has dramatic implications for workflow, a solid operational plan is needed when assessing vendors and planning for implementation. The goals for implementation should be to minimize operational disruptions and capitalize on efficiencies of the technology. Senior leadership--the department chair or vice-chair--must select the goals to be accomplished and oversee, manage and direct the VR initiative. The importance of this point cannot be overstated, since implementation will require behavior changes from radiologists and others who may not perceive any personal benefits. Training is the pivotal factor affecting the success of voice recognition, and practice is the only way for radiologists to enhance their skills. Through practice, radiologists will discover shortcuts, and their speed and comfort will improve. Measurement and data analysis are critical to changing and improving the voice recognition application and are vital to decision-making. Some of the issues about which valuable data can be collected are technical and educational problems, VR penetration, report turnaround time and annual cost savings. Sustained effort is indispensable to the maintenance of voice recognition. 
Finally, all efforts made and gains achieved may prove to be futile without ongoing sustainment of the system through retraining, education and technical support.
A voice region in the monkey brain.
Petkov, Christopher I; Kayser, Christoph; Steudel, Thomas; Whittingstall, Kevin; Augath, Mark; Logothetis, Nikos K
2008-03-01
For vocal animals, recognizing species-specific vocalizations is important for survival and social interactions. In humans, a voice region has been identified that is sensitive to human voices and vocalizations. As this region also strongly responds to speech, it is unclear whether it is tightly associated with linguistic processing and is thus unique to humans. Using functional magnetic resonance imaging of macaque monkeys (Old World primates, Macaca mulatta) we discovered a high-level auditory region that prefers species-specific vocalizations over other vocalizations and sounds. This region not only showed sensitivity to the 'voice' of the species, but also to the vocal identity of conspecific individuals. The monkey voice region is located on the superior-temporal plane and belongs to an anterior auditory 'what' pathway. These results establish functional relationships with the human voice region and support the notion that, for different primate species, the anterior temporal regions of the brain are adapted for recognizing communication signals from conspecifics.
Exploring multiliteracies, student voice, and scientific practices in two elementary classrooms
NASA Astrophysics Data System (ADS)
Allison, Elizabeth Rowland
This study explored the voices of children in a changing world with evolving needs and new opportunities. The workplaces of rapidly moving capitalist societies value creativity, collaboration, and critical thinking skills which are of growing importance and manifesting themselves in modern K-12 science classroom cultures (Gee, 2000; New London Group, 2000). This study explored issues of multiliteracies and student voice set within the context of teaching and learning in 4th and 5th grade science classrooms. The purpose of the study was to ascertain what and how multiliteracies and scientific practices (NGSS Lead States, 2013c) are implemented, explore how multiliteracies influence students' voices, and investigate teacher and student perceptions of multiliteracies, student voice, and scientific practices. Grounded in a constructivist framework, a multiple case study was employed in two elementary classrooms. Through observations, student focus groups and interviews, and teacher interviews, a detailed narrative was created to describe a range of multiliteracies, student voice, and scientific practices that occurred with the science classroom context. Using grounded theory analysis, data were coded and analyzed to reveal emergent themes. Data analysis revealed that these two classrooms were enriched with multiliteracies that serve metaphorically as breeding grounds for student voice. In the modern classroom, defined as a space where information is instantly accessible through the Internet, multiliteracies can be developed through inquiry-based, collaborative, and technology-rich experiences. Scientific literacy, cultivated through student communication and collaboration, is arguably a multiliteracy that has not been considered in the literature, and should be, as an integral component of overall individual literacy in the 21st century. Findings revealed four themes. 
Three themes suggest that teachers address several modes of multiliteracies in science, but identify barriers to integrating multiliteracies and scientific practices into science teaching. The issues include time, increased standards accountability, and lack of comfort with effective integration of technology. The fourth theme revealed that students have the ability to shape and define their learning while supporting other voices through collaborative science experiences.
Speech transport for packet telephony and voice over IP
NASA Astrophysics Data System (ADS)
Baker, Maurice R.
1999-11-01
Recent advances in packet switching, internetworking, and digital signal processing technologies have converged to allow realizable practical implementations of packet telephony systems. This paper provides a tutorial on transmission engineering for packet telephony covering the topics of speech coding/decoding, speech packetization, packet data network transport, and impairments which may negatively impact end-to-end system quality. Particular emphasis is placed upon Voice over Internet Protocol given the current popularity and ubiquity of IP transport.
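To make the packetization topic in the abstract above concrete, the following minimal sketch computes the payload size, packet rate and on-the-wire bit rate for a constant-bit-rate codec. The numbers are assumptions for illustration and are not taken from the paper: a 64 kbit/s codec (the G.711 rate), 20 ms of speech per packet, and 40 bytes of headers per packet (IPv4 20 + UDP 8 + RTP 12). The function name is hypothetical.

```python
from typing import Tuple


def packet_stats(
    codec_bitrate_bps: float,
    frame_ms: float,
    header_bytes: int = 40,  # assumed IPv4 (20) + UDP (8) + RTP (12)
) -> Tuple[float, float, float]:
    """Return (payload bytes per packet, packets per second, total bit/s).

    Link-layer framing is ignored; only the headers counted in
    `header_bytes` are added to the speech payload.
    """
    if codec_bitrate_bps <= 0 or frame_ms <= 0:
        raise ValueError("bit rate and frame duration must be positive")
    payload_bytes = codec_bitrate_bps * frame_ms / 1000 / 8
    packets_per_second = 1000 / frame_ms
    total_bps = (payload_bytes + header_bytes) * 8 * packets_per_second
    return payload_bytes, packets_per_second, total_bps


if __name__ == "__main__":
    payload, pps, total = packet_stats(64_000, 20)
    print(f"payload={payload:.0f} B, rate={pps:.0f} pkt/s, total={total:.0f} bit/s")
```

Shorter packets reduce packetization delay but raise the share of bandwidth spent on headers, which is one of the trade-offs a transmission engineering tutorial of this kind has to weigh.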
Stroux, Lisa; Martinez, Boris; Ixen, Enma Coyote; King, Nora; Hall-Clifford, Rachel; Rohloff, Peter; Clifford, Gari D.
2016-01-01
Limited funding for medical technology, low levels of education and poor infrastructure for delivering and maintaining technology severely limit medical decision support in low- and middle-income countries. Perinatal and maternal mortality is of particular concern with millions dying every year from potentially treatable conditions. Guatemala has one of the worst maternal mortality ratios, the highest incidence of intrauterine growth restriction (IUGR), and one of the lowest gross national incomes per capita within Latin America. To address the lack of decision support in rural Guatemala, a smartphone-based system is proposed including peripheral sensors, such as a handheld Doppler for the identification of fetal compromise. Designed for use by illiterate birth attendants, the system uses pictograms, audio guidance, local and cloud processing, SMS alerts and voice calling. The initial prototype was evaluated on 22 women in highland Guatemala. Results were fed back into the refinement of the system, currently undergoing RCT evaluation. PMID:27696915
Stroux, Lisa; Martinez, Boris; Coyote Ixen, Enma; King, Nora; Hall-Clifford, Rachel; Rohloff, Peter; Clifford, Gari D
Limited funding for medical technology, low levels of education and poor infrastructure for delivering and maintaining technology severely limit medical decision support in low- and middle-income countries. Perinatal and maternal mortality is of particular concern with millions dying every year from potentially treatable conditions. Guatemala has one of the worst maternal mortality ratios, the highest incidence of intra-uterine growth restriction (IUGR), and one of the lowest gross national incomes per capita within Latin America. To address the lack of decision support in rural Guatemala, a smartphone-based system is proposed including peripheral sensors, such as a handheld Doppler for the identification of foetal compromise. Designed for use by illiterate birth attendants, the system uses pictograms, audio guidance, local and cloud processing, SMS alerts and voice calling. The initial prototype was evaluated on 22 women in highland Guatemala. Results were fed back into the refinement of the system, currently undergoing RCT evaluation.
Space nuclear power: Key to outer solar system exploration
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bennett, G.L.; Allen, D.M.
1998-07-01
In 1995, in response to threatened budget cuts, the American Institute of Aeronautics and Astronautics (AIAA) approved a position paper supporting the maintenance of the technology base for space nuclear power. The position paper contained four recommendations: (1) DOE, NASA, and DoD should develop and support an integrated program that maintains the nuclear option and develops the needed high-payoff technologies; (2) Congress should provide strong, continuing financial and political support for the agencies' program; (3) Government and industry leaders should voice their advocacy for a strong space nuclear power program to support future system requirements; and (4) The US should continue to maintain its cooperation and technical interchanges with other countries to advance nuclear power source technology and to promote nuclear safety.
1999-12-01
compression technology. The ubiquity of routed Internet Protocol (IP) networks, and the desire to trim telephony costs are the major driving forces of the...mid- s, data and voice began to merge, propelled by advances in compression technology. The ubiquity of routed Internet Protocol (IP) networks...transmit voice over IP networks that are privately owned or publicly utilized. If we have the technology to transmit Voice over the Internet then why not
Voice Rest versus Exercise: A Review of the Literature
Ishikawa, Keiko; Thibeault, Susan
2010-01-01
Voice rest is commonly prescribed after vocal fold surgery to promote wound healing of the vocal fold. Currently, there is no standard protocol that is established based on biological evidence. In orthopedic rehabilitation, long-term rest is found to be less effective for connective tissue healing than exercise. Connective tissue healing is also an important factor for successful voice rehabilitation; however, whether this concept can be extrapolated to voice rehabilitation is unknown. The purpose of this article is to review current clinical and basic science literature to examine the effect of voice rest in post-surgical rehabilitation. First, we present a summary of clinical literature that pertains to voice rest. Second, we describe the connective tissues involved in orthopedic and voice rehabilitation (ligament and lamina propria, respectively) and their wound healing processes. Third, we summarize the literature from orthopedic research on the effect of rest versus exercise. Lastly, we summarize in vitro and in vivo studies that examined the effect of mechanical stress on vocal fold tissue. Current literature suggests that there is a lack of clinical evidence that supports a specific type and duration of voice rest, and extrapolation of the findings from orthopedic research may be unreasonable due to the morphological and biochemical differences between the tissues. In order to determine the effect of voice rest, further elucidation of the vocal fold wound healing process and the effect of mechanical stress on vocal fold tissue remodeling is needed. PMID:19660903
Voice Recognition: A New Assessment Tool?
ERIC Educational Resources Information Center
Jones, Darla
2005-01-01
This article presents the results of a study conducted in Anchorage, Alaska, that evaluated the accuracy and efficiency of using voice recognition (VR) technology to collect oral reading fluency data for classroom-based assessments. The primary research question was as follows: Is voice recognition technology a valid and reliable alternative to…
Speech technology and cinema: can they learn from each other?
Pauletto, Sandra
2013-10-01
The voice is the most important sound of a film soundtrack. It represents a character and it carries language. There are different types of cinematic voices: dialogue, internal monologues, and voice-overs. Conventionally, two main characteristics differentiate these voices: lip synchronization and the voice's attributes that make it appropriate for the character (for example, a voice that sounds very close to the audience can be appropriate for a narrator, but not for an onscreen character). What happens, then, if a film character can only speak through an asynchronous machine that produces a 'robot-like' voice? This article discusses the sound-related work and experimentation done by the author for the short film Voice by Choice. It also attempts to discover whether speech technology design can learn from its cinematic representation, and if such uncommon film protagonists can contribute creatively to transform the conventions of cinematic voices.
Remote voice training: A case study on space shuttle applications, appendix C
NASA Technical Reports Server (NTRS)
Mollakarimi, Cindy; Hamid, Tamin
1990-01-01
The Tile Automation System includes applications of automation and robotics technology to all aspects of the Shuttle tile processing and inspection system. An integrated set of rapid prototyping testbeds was developed which include speech recognition and synthesis, laser imaging systems, distributed Ada programming environments, distributed relational data base architectures, distributed computer network architectures, multi-media workbenches, and human factors considerations. Remote voice training in the Tile Automation System is discussed. The user is prompted over a headset by synthesized speech for the training sequences. The voice recognition units and the voice output units are remote from the user and are connected by Ethernet to the main computer system. A supervisory channel is used to monitor the training sequences. Discussions include the training approaches as well as the human factors problems and solutions for this system utilizing remote training techniques.
IP voice over ATM satellite: experimental results over satellite channels
NASA Astrophysics Data System (ADS)
Saraf, Koroush A.; Butts, Norman P.
1999-01-01
IP telephony, a new technology to provide voice communication over traditional data networks, has the potential to revolutionize telephone communication within the modern enterprise. This innovation uses packetization techniques to carry voice conversations over IP networks. This packet-switched technology promises new integrated services and lower-cost long-distance communication compared to traditional circuit-switched telephone networks. Future satellites will need to carry IP traffic efficiently in order to stay competitive in servicing the global data-networking and global telephony infrastructure. However, the effects of Voice over IP over switched satellite channels have not been investigated in detail. To fully understand the effects of satellite channels on Voice over IP quality, several experiments were conducted at Lockheed Martin Telecommunications' Satellite Integration Lab. The results of those experiments, along with suggested improvements for voice communication over satellite, are presented in this document. First, a detailed introduction of IP telephony as a suitable technology for voice communication over future satellites is presented. This is followed by procedures for the experiments, along with results and strategies. In conclusion, we hope that these capability demonstrations will alleviate any uncertainty regarding the applicability of this technology to satellite networks.
Human factors issues associated with the use of speech technology in the cockpit
NASA Technical Reports Server (NTRS)
Kersteen, Z. A.; Damos, D.
1983-01-01
The human factors issues associated with the use of voice technology in the cockpit are summarized. The formulation of the LHX avionics suite is described and the allocation of tasks to voice in the cockpit is discussed. State-of-the-art speech recognition technology is reviewed. Finally, a questionnaire designed to tap pilot opinions concerning the allocation of tasks to voice input and output in the cockpit is presented. This questionnaire was designed to be administered to operational AH-1G Cobra gunship pilots. Half of the questionnaire deals specifically with the AH-1G cockpit and the types of tasks pilots would like to have performed by voice in this existing rotorcraft. The remaining portion of the questionnaire deals with an undefined rotorcraft of the future and is aimed at determining what types of tasks these pilots would like to have performed by voice technology if anything was possible, i.e. if there were no technological constraints.
NASA Astrophysics Data System (ADS)
Coleman, D. F.
2012-12-01
Most research vessels are equipped with satellite Internet services with bandwidths capable of being upgraded to support telepresence technologies and live shore-based participation. This capability can be used for real-time data transmission to shore, where it can be distributed, managed, processed, and archived. The University of Rhode Island Inner Space Center utilizes telepresence technologies and a growing network of command centers on Internet2 to participate live with a variety of research vessels and their ocean observing and sampling systems. High-bandwidth video streaming, voice-over-IP telecommunications, and real-time data feeds and file transfers enable users on shore to take part in the oceanographic expeditions as if they were present on the ship, working in the lab. Telepresence-enabled systematic ocean exploration and similar programs represent a significant and growing paradigm shift that can change the future of seagoing ocean observations using research vessels. The required platform is the ship itself, and users of the technology rely on the ship-based technical teams, but remote and distributed shore-based science users, students, educators, and the general public can now take part by being aboard virtually.
ERIC Educational Resources Information Center
Young, Dan
2005-01-01
Internet Protocol (IP) Telephony, or voice-over IP (VoIP), has proved to be a wise decision for many organizations. This technology crosses the boundaries of public and private networks, enterprise and residential markets, voice and data technologies, as well as local and long-distance services. The convergence of voice and data into a single,…
Empowering Student Voice through Interactive Design and Digital Making
ERIC Educational Resources Information Center
Kim, Yanghee; Searle, Kristin
2017-01-01
Over the last two decades online technology and digital media have provided space for students to participate and express their voices. This paper further explores how new digital technologies, such as humanoid robots and wearable electronics, can be used to offer additional spaces where students' voices are heard. In these spaces, young students…
Audio feature extraction using probability distribution function
NASA Astrophysics Data System (ADS)
Suhaib, A.; Wan, Khairunizam; Aziz, Azri A.; Hazry, D.; Razlan, Zuradzman M.; Shahriman A., B.
2015-05-01
Voice recognition has been one of the popular applications in the robotics field. It has also recently been used for biometric and multimedia information retrieval systems. This technology is the result of successive research on audio feature extraction analysis. The Probability Distribution Function (PDF) is a statistical method which is usually used as one of the processes in complex feature extraction methods such as GMM and PCA. In this paper, a new method for audio feature extraction is proposed which uses only the PDF as a feature extraction method in itself for speech analysis. Certain pre-processing techniques are performed prior to the proposed feature extraction method. Subsequently, the PDF result values for each frame of sampled voice signals obtained from a certain number of individuals are plotted. From the experimental results obtained, it can be seen visually from the plotted data that each individual's voice has comparable PDF values and shapes.
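As an illustration of the per-frame PDF idea described in the abstract above, here is a minimal sketch in Python. It is not the authors' method: the frame length, the number of bins, the amplitude range, the function name `frame_pdfs`, and the synthetic test tone are all assumptions made for this example, and the distribution is estimated with a plain normalized histogram.

```python
import math


def frame_pdfs(samples, frame_len=256, n_bins=16, lo=-1.0, hi=1.0):
    """Return a normalized amplitude histogram for each non-overlapping frame.

    All parameter defaults are illustrative assumptions, not values
    taken from the paper.
    """
    width = (hi - lo) / n_bins
    pdfs = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        counts = [0] * n_bins
        for x in samples[start:start + frame_len]:
            # Map the sample to a bin; clamp out-of-range values
            # into the first or last bin.
            idx = int((x - lo) / width)
            idx = min(max(idx, 0), n_bins - 1)
            counts[idx] += 1
        # Normalize so the bin probabilities of each frame sum to 1.
        pdfs.append([c / frame_len for c in counts])
    return pdfs


# Synthetic stand-in for a sampled voice signal: a 440 Hz tone at 8 kHz.
signal = [0.8 * math.sin(2 * math.pi * 440 * n / 8000) for n in range(1024)]
features = frame_pdfs(signal)
```

Each entry of `features` is one frame's estimated distribution over amplitude bins; plotting these vectors for several speakers would correspond loosely to the visual comparison the abstract describes.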
Early development of polyphonic sound encoding and the high voice superiority effect.
Marie, Céline; Trainor, Laurel J
2014-05-01
Previous research suggests that when two streams of pitched tones are presented simultaneously, adults process each stream in a separate memory trace, as reflected by mismatch negativity (MMN), a component of the event-related potential (ERP). Furthermore, a superior encoding of the higher tone or voice in polyphonic sounds has been found for 7-month-old infants and both musician and non-musician adults in terms of a larger amplitude MMN in response to pitch deviant stimuli in the higher than the lower voice. These results, in conjunction with modeling work, suggest that the high voice superiority effect might originate in characteristics of the peripheral auditory system. If this is the case, the high voice superiority effect should be present in infants younger than 7 months. In the present study we tested 3-month-old infants as there is no evidence at this age of perceptual narrowing or specialization of musical processing according to the pitch or rhythmic structure of music experienced in the infant's environment. We presented two simultaneous streams of tones (high and low) with 50% of trials modified by 1 semitone (up or down), either on the higher or the lower tone, leaving 50% standard trials. Results indicate that like the 7-month-olds, 3-month-old infants process each tone in a separate memory trace and show greater saliency for the higher tone. Although MMN was smaller and later in both voices for the group of sixteen 3-month-olds compared to the group of sixteen 7-month-olds, the size of the difference in MMN for the high compared to low voice was similar across ages. These results support the hypothesis of an innate peripheral origin of the high voice superiority effect. Copyright © 2014 Elsevier Ltd. All rights reserved.
Connecting Generations: Developing Co-Design Methods for Older Adults and Children
ERIC Educational Resources Information Center
Xie, Bo; Druin, Allison; Fails, Jerry; Massey, Sheri; Golub, Evan; Franckel, Sonia; Schneider, Kiki
2012-01-01
As new technologies emerge that can bring older adults together with children, little has been discussed by researchers concerning the design methods used to create these new technologies. Giving both children and older adults a voice in a shared design process comes with many challenges. This paper details an exploratory study focusing on…
Research Guided Practice: Student Online Experiences during Mathematics Class in the Middle School
ERIC Educational Resources Information Center
Mojica-Casey, Maria; Dekkers, John; Thrupp, Rose-Marie
2014-01-01
The approaches to new technologies available to schools, teachers and students largely concern computers and engagement. This requires adoption of alternate and new teaching practices to engage students in the teaching and learning process. This research integrates youth voice about the use of technology. A major motivation for this research is to…
ERIC Educational Resources Information Center
Wilford, Valerie J.; Logan, Lee; Bell, Lori; Cloyes, Kay
The Alliance Library System (ALS) is one of 12 regional library systems in Illinois, providing a full spectrum of support services for 300 member libraries of all types (public, school, academic, and special) located in west central Illinois. This paper describes the process by which ALS implemented a voice/video/data network connecting their four…
Smith, Chris; Ly, Sokhey; Uk, Vannak; Warnock, Ruby; Free, Caroline
2017-06-12
The MObile Technology for Improved Family Planning (MOTIF) trial assessed a mobile phone-based intervention comprising voice messages and counsellor support to increase post-abortion contraception at four Marie Stopes International clinics in Cambodia. The aim of this process evaluation was to assess women's views and experiences of receiving the MOTIF intervention, gain insights into the mechanism of action of the intervention and seek recommendations for improvements. We conducted a qualitative study comprising 15 semi-structured interviews with women who had received the intervention and undertook a simple thematic analysis. We identified themes relating to communication via mobile phone, supporting contraception use, broader post-abortion care, interaction with family and friends and suggestions for improvement. The majority of women were positive about the mobile phone-based intervention to support contraception use and reported it to be a convenient way to ask questions or get advice without going to a health centre, although a few women found the voice messages intrusive. The intervention supported contraception use by provision of information, encouragement, reminders to return to clinic, reassurance and advice for problems and had a positive effect on contraceptive uptake and continuation. Women reported a sense of being cared for and received support for additional physical and emotional issues. Most women thought that the duration of the intervention and frequency of messages were acceptable. The majority of women were positive about the mobile phone-based intervention which provided support for contraception use as well as additional physical and emotional issues. The study provides some insights into how the intervention might have worked and considers how the intervention could be improved.
Telehealth: voice therapy using telecommunications technology.
Mashima, Pauline A; Birkmire-Peters, Deborah P; Syms, Mark J; Holtel, Michael R; Burgess, Lawrence P A; Peters, Leslie J
2003-11-01
Telehealth offers the potential to meet the needs of underserved populations in remote regions. The purpose of this study was a proof-of-concept to determine whether voice therapy can be delivered effectively remotely. Treatment outcomes were evaluated for a vocal rehabilitation protocol delivered under 2 conditions: with the patient and clinician interacting within the same room (conventional group) and with the patient and clinician in separate rooms, interacting in real time via a hard-wired video camera and monitor (video teleconference group). Seventy-two patients with voice disorders served as participants. Based on evaluation by otolaryngologists, 31 participants were diagnosed with vocal nodules, 29 were diagnosed with edema, 9 were diagnosed with unilateral vocal fold paralysis, and 3 presented with vocal hyperfunction with no laryngeal pathology. Fifty-one participants (71%) completed the vocal rehabilitation protocol. Outcome measures included perceptual judgments of voice quality, acoustic analyses of voice, patient satisfaction ratings, and fiber-optic laryngoscopy. There were no differences in outcome measures between the conventional group and the remote video teleconference group. Participants in both groups showed positive changes on all outcome measures after completing the vocal rehabilitation protocol. Reasons for participants discontinuing therapy prematurely provided support for the telehealth model of service delivery.
Preserving Tradition through Technology.
ERIC Educational Resources Information Center
Wakshul, Barbra
2001-01-01
Language is easiest to learn before age 5. The Cherokee Nation supported production of a toy that teaches young children basic Cherokee words. When figures that come with the toy are placed into it, a computer chip activates a voice speaking the name of the figure in Cherokee. Learning takes place on visual, auditory, and tactile levels. (TD)
Sensor technology more than a support.
Olsson, Anna; Persson, Ann-Christine; Bartfai, Aniko; Boman, Inga-Lill
2018-03-01
This interview study is a part of a project that evaluated sensor technology as a support in everyday activities for patients with memory impairment. To explore patients with memory impairment and their partners' experiences of using sensor technology in their homes. Five patients with memory impairment after stroke and three partners were interviewed. Individual semi-structured interviews were analyzed with qualitative content analysis. Installing sensor technology with individually prerecorded voice reminders as memory support in the home had a broad impact on patients' and their families' lives. These effects were both positive and negative. The sensor technology not only supported activities but also influenced the patients by changing behavior, providing a sense of security, independence and increased self-confidence. For the partners, the sensor technology eased daily life, but also gave increased responsibility for maintenance. Technical problems led to frustration and stress for the patients. The results indicate that sensor technology has potential to increase opportunities for persons with memory impairment to perform and participate in activities and to unburden their partners. The results may promote an understanding of how sensor technology can be used to support persons with memory impairment in their homes.
Sidtis, Diana; Kreiman, Jody
2011-01-01
The human voice is described in dialogic linguistics as an embodiment of self in a social context, contributing to expression, perception and mutual exchange of self, consciousness, inner life, and personhood. While these approaches are subjective and arise from phenomenological perspectives, scientific facts about personal vocal identity, and its role in biological development, support these views. It is our purpose to review studies of the biology of personal vocal identity -- the familiar voice pattern -- as providing an empirical foundation for the view that the human voice is an embodiment of self in the social context. Recent developments in the biology and evolution of communication are concordant with these notions, revealing that familiar voice recognition (also known as vocal identity recognition or individual vocal recognition) contributed to survival in the earliest vocalizing species. Contemporary ethology documents the crucial role of familiar voices across animal species in signaling and perceiving internal states and personal identities. Neuropsychological studies of voice reveal multimodal cerebral associations arising across brain structures involved in memory, emotion, attention, and arousal in vocal perception and production, such that the voice represents the whole person. Although its roots are in evolutionary biology, human competence for processing layered social and personal meanings in the voice, as well as personal identity in a large repertory of familiar voice patterns, has achieved an immense sophistication. PMID:21710374
Wilson, Benjamin; Petkov, Christopher I
2011-04-01
Considerable knowledge is available on the neural substrates for speech and language from brain-imaging studies in humans, but until recently there was a lack of data for comparison from other animal species on the evolutionarily conserved brain regions that process species-specific communication signals. To obtain new insights into the relationship of the substrates for communication in primates, we compared the results from several neuroimaging studies in humans with those that have recently been obtained from macaque monkeys and chimpanzees. The recent work in humans challenges the longstanding notion of highly localized speech areas. As a result, the brain regions that have been identified in humans for speech and nonlinguistic voice processing show a striking general correspondence to how the brains of other primates analyze species-specific vocalizations or information in the voice, such as voice identity. The comparative neuroimaging work has begun to clarify evolutionary relationships in brain function, supporting the notion that the brain regions that process communication signals in the human brain arose from a precursor network of regions that is present in nonhuman primates and is used for processing species-specific vocalizations. We conclude by considering how the stage now seems to be set for comparative neurobiology to characterize the ancestral state of the network that evolved in humans to support language.
Introduction and comparison of next-generation mobile wireless technologies
NASA Astrophysics Data System (ADS)
Zaidi, Syed R.; Hussain, Shahab; Ali, M. A.; Sana, Ajaz; Saddawi, Samir; Carranza, Aparicio
2010-01-01
Mobile networks and services have gone further than voice-only communication services and are rapidly developing towards data-centric services. Emerging mobile data services are expected to see the same explosive growth in demand that Internet and wireless voice services have seen in recent years. To support such a rapid increase in traffic, active users, and advanced multimedia services implied by this growth rate, along with the diverse quality of service (QoS) and rate requirements set by these services, mobile operators need to rapidly transition to a simple and cost-effective, flat, all-IP network. This has accelerated the development and deployment of new wireless broadband access technologies including fourth-generation (4G) mobile WiMAX and cellular Long-Term Evolution (LTE). Mobile WiMAX and LTE are two different (but not necessarily competing) technologies that will eventually be used to achieve data speeds of up to 100 Mbps, speeds that are fast enough to potentially replace wired broadband connections with wireless. This paper introduces both of these next-generation technologies and then compares them.
Trainor, Laurel J; Marie, Céline; Bruce, Ian C; Bidelman, Gavin M
2014-02-01
Natural auditory environments contain multiple simultaneously-sounding objects and the auditory system must parse the incoming complex sound wave they collectively create into parts that represent each of these individual objects. Music often similarly requires processing of more than one voice or stream at the same time, and behavioral studies demonstrate that human listeners show a systematic perceptual bias in processing the highest voice in multi-voiced music. Here, we review studies utilizing event-related brain potentials (ERPs), which support the notions that (1) separate memory traces are formed for two simultaneous voices (even without conscious awareness) in auditory cortex and (2) adults show more robust encoding (i.e., larger ERP responses) to deviant pitches in the higher than in the lower voice, indicating better encoding of the former. Furthermore, infants also show this high-voice superiority effect, suggesting that the perceptual dominance observed across studies might result from neurophysiological characteristics of the peripheral auditory system. Although musically untrained adults show smaller responses in general than musically trained adults, both groups similarly show a more robust cortical representation of the higher than of the lower voice. Finally, years of experience playing a bass-range instrument reduces but does not reverse the high voice superiority effect, indicating that although it can be modified, it is not highly neuroplastic. Results of new modeling experiments examined the possibility that characteristics of middle-ear filtering and cochlear dynamics (e.g., suppression) reflected in auditory nerve firing patterns might account for the higher-voice superiority effect. Simulations show that both place and temporal AN coding schemes well-predict a high-voice superiority across a wide range of interval spacings and registers. 
Collectively, we infer an innate, peripheral origin for the higher-voice superiority observed in human ERP and psychophysical music listening studies. Copyright © 2013 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Ellrodt, John Charles; Fico, Maria; Harnett, Susanne; Ramsey, Lori Gerstein; Lopez, Angelina
2014-01-01
The Global Writes (GW) model is a well-designed performing arts integrated literacy program that builds local and global support among students, teachers, and arts partners through the use of innovative technologies. Through local partnerships between schools and arts organizations forged by GW, classroom teachers and local teaching artists build…
Voice Assessment of Student Work: Recent Studies and Emerging Technologies
ERIC Educational Resources Information Center
Eckhouse, Barry; Carroll, Rebecca
2013-01-01
Although relatively little attention has been given to the voice assessment of student work, at least when compared with more traditional forms of text-based review, the attention it has received strongly points to a promising form of review that has been hampered by the limits of an emerging technology. A fresh review of voice assessment in light…
Whispering - The hidden side of auditory communication.
Frühholz, Sascha; Trost, Wiebke; Grandjean, Didier
2016-11-15
Whispering is a unique expression mode that is specific to auditory communication. Individuals switch their vocalization mode to whispering especially when affected by inner emotions in certain social contexts, such as in intimate relationships or intimidating social interactions. Although this context-dependent whispering is adaptive, whispered voices are acoustically far less rich than phonated voices and thus impose higher hearing and neural auditory decoding demands for recognizing their socio-affective value by listeners. The neural dynamics underlying this recognition especially from whispered voices are largely unknown. Here we show that whispered voices in humans are considerably impoverished as quantified by an entropy measure of spectral acoustic information, and this missing information needs large-scale neural compensation in terms of auditory and cognitive processing. Notably, recognizing the socio-affective information from voices was slightly more difficult from whispered voices, probably based on missing tonal information. While phonated voices elicited extended activity in auditory regions for decoding of relevant tonal and time information and the valence of voices, whispered voices elicited activity in a complex auditory-frontal brain network. Our data suggest that a large-scale multidirectional brain network compensates for the impoverished sound quality of socially meaningful environmental signals to support their accurate recognition and valence attribution. Copyright © 2016 Elsevier Inc. All rights reserved.
Using collaborative technologies in remote lab delivery systems for topics in automation
NASA Astrophysics Data System (ADS)
Ashby, Joe E.
Lab exercises are a pedagogically essential component of engineering and technology education. Distance education remote labs are being developed which enable students to access lab facilities via the Internet. Collaboration, students working in teams, enhances learning activity through the development of communication skills, sharing observations and problem solving. Web meeting communication tools are currently used in remote labs. The problem identified for investigation was that no standards of practice or paradigms exist to guide remote lab designers in the selection of collaboration tools that best support learning achievement. The goal of this work was to add to the body of knowledge involving the selection and use of remote lab collaboration tools. Experimental research was conducted where the participants were randomly assigned to three communication treatments and learning achievement was measured via assessments at the completion of each of six remote lab based lessons. Quantitative instruments used for assessing learning achievement were implemented, along with a survey to correlate user preference with collaboration treatments. A total of 53 undergraduate technology students worked in two-person teams, where each team was assigned one of the treatments, namely (a) text messaging chat, (b) voice chat, or (c) webcam video with voice chat. Each had little experience with the subject matter involving automation, but possessed the necessary technical background. Analysis of the assessment score data included mean and standard deviation, confirmation of the homogeneity of variance, a one-way ANOVA test and post hoc comparisons. The quantitative and qualitative data indicated that text messaging chat negatively impacted learning achievement and that text messaging chat was not preferred. The data also suggested that the subjects were equally divided in their preference for voice chat versus webcam video with voice chat.
For the design of collaborative communication tools for remote labs involving automation equipment, the results of this work point to making voice chat the default method of communication, with webcam video with voice chat included as an option. Standards are only beginning to be developed for the design of remote lab systems. Research, design and innovation involving collaboration and presence should be included.
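The one-way ANOVA mentioned in the abstract above compares the variance between treatment groups with the variance within them. The sketch below computes that F statistic in plain Python; the function name `one_way_anova_f` and the three score lists are hypothetical placeholders chosen for easy arithmetic, not data from the study.

```python
def one_way_anova_f(groups):
    """Return the one-way ANOVA F statistic for a list of score lists."""
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]

    # Between-group sum of squares: spread of the group means
    # around the grand mean, weighted by group size.
    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, means))
    # Within-group sum of squares: spread of the scores around
    # their own group mean.
    ss_within = sum((x - m) ** 2
                    for g, m in zip(groups, means) for x in g)

    df_between = len(groups) - 1
    df_within = n_total - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


# Hypothetical scores for three communication treatments
# (placeholder numbers only, NOT results from the study).
text_chat = [1, 2, 3]
voice_chat = [2, 3, 4]
video_chat = [3, 4, 5]
f_stat = one_way_anova_f([text_chat, voice_chat, video_chat])
```

For these placeholder lists the between-group and within-group mean squares are 3 and 1, so `f_stat` is 3.0; a real analysis would go on to compare F against the F distribution with (2, N - 3) degrees of freedom and run post hoc comparisons, as the abstract notes.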
ERIC Educational Resources Information Center
Boardman, Margot
2007-01-01
This study set out to investigate the use of digital cameras and voice recorders to accurately capture essential components of early learners' achievements. The project was undertaken by 29 early childhood educators within kindergarten settings in Tasmania and the Australian Capital Territory. Data collected indicated that digital technologies,…
Tucker, Sean; Chmiel, Nik; Turner, Nick; Hershcovis, M Sandy; Stride, Chris B
2008-10-01
In the present study, we modeled 2 sources of safety support (perceived organizational support for safety and perceived coworker support for safety) as predictors of employee safety voice, that is, speaking out in an attempt to change unsafe working conditions. Drawing on social exchange and social impact theories, we hypothesized and tested a mediated model predicting employee safety voice using a cross-sectional survey of urban bus drivers (n = 213) in the United Kingdom. Hierarchical regression analysis showed that perceived coworker support for safety fully mediated the relationship between perceived organizational support for safety and employee safety voice. This study adds to the employee voice literature by evaluating the important role that coworkers can play in encouraging others to speak out about safety issues. Implications for research and practice related to change-oriented safety communication are discussed.
Secure Recognition of Voice-Less Commands Using Videos
NASA Astrophysics Data System (ADS)
Yau, Wai Chee; Kumar, Dinesh Kant; Weghorn, Hans
Interest in voice recognition technologies for internet applications is growing due to the flexibility of speech-based communication. The major drawback with the use of sound for internet access with computers is that the commands will be audible to other people in the vicinity. This paper examines a secure and voice-less method for recognition of speech-based commands using video without evaluating sound signals. The proposed approach represents mouth movements in the video data using 2D spatio-temporal templates (STT). Zernike moments (ZM) are computed from STT and fed into support vector machines (SVM) to be classified into one of the utterances. The experimental results demonstrate that the proposed technique produces a high accuracy of 98% in a phoneme classification task. The proposed technique is demonstrated to be invariant to global variations of illumination level. Such a system is useful for securely interpreting user commands for internet applications on mobile devices.
Gerhard, Julia; Rosow, David E
2016-05-01
Speech-language pathologists have long used technology for the clinical measurement of the speaking voice, but present research shows that vocal pedagogues and voice students are becoming more accepting of technology in the studio. As a result, the equipment and technology used in singing voice studios by speech-language pathologists and vocal pedagogues are changing. Although guides exist regarding equipment and technology necessary for developing a voice laboratory and private voice studio, there are no data documenting the current implementation of these items and their perceived effectiveness. This study seeks to document current trends in equipment used in voice laboratories and studios. Two separate surveys were distributed to 60 vocologists and approximately 300 student singers representative of the general singing student population. The surveys contained questions about the inventory of items found in voice studios and perceived effectiveness of these items. Data were analyzed using descriptive analyses and statistical analyses when applicable. Twenty-six of 60 potential vocologists responded, and 66 student singers responded. The vocologists reported highly uniform inventories and ratings of studio items. There were wide-ranging differences between the inventories reported by the vocologist and student singer groups. Statistically significant differences between ratings of effectiveness of studio items were found for 11 of the 17 items. In all significant cases, vocologists rated usefulness to be higher than student singers. Although the order of rankings of vocologists and student singers was similar, a much higher percentage of vocologists report the items as being efficient and effective than students. The historically typical studio items, including the keyboard and mirror, were ranked as most effective by both vocologists and student singers. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Normal voice processing after posterior superior temporal sulcus lesion.
Jiahui, Guo; Garrido, Lúcia; Liu, Ran R; Susilo, Tirta; Barton, Jason J S; Duchaine, Bradley
2017-10-01
The right posterior superior temporal sulcus (pSTS) shows a strong response to voices, but the cognitive processes generating this response are unclear. One possibility is that this activity reflects basic voice processing. However, several fMRI and magnetoencephalography findings suggest instead that pSTS serves as an integrative hub that combines voice and face information. Here we investigate whether right pSTS contributes to basic voice processing by testing Faith, a patient whose right pSTS was resected, with eight behavioral tasks assessing voice identity perception and recognition, voice sex perception, and voice expression perception. Faith performed normally on all the tasks. Her normal performance indicates right pSTS is not necessary for intact voice recognition and suggests that pSTS activations to voices reflect higher-level processes. Copyright © 2017 Elsevier Ltd. All rights reserved.
Keys to the Adoption and Use of Voice Recognition Technology in Organizations.
ERIC Educational Resources Information Center
Goette, Tanya
2000-01-01
Presents results from a field study of individuals with disabilities who used voice recognition technology (VRT). Results indicated that task-technology fit, training, the environment, and disability limitations were the differentiating items, and that using VRT for a trial period may be the major factor in successful adoption of the technology.…
Orchestrating 21st Century Learning in Higher Education: A Perspective on Student Voice
ERIC Educational Resources Information Center
Hämäläinen, Raija; Kiili, Carita; Smith, Blaine E.
2017-01-01
For universities to meet the 21st-century learning needs of today's students, it is important they allow students to take an active role in developing pedagogy and sharing their perspective. This paper introduces design-based research aiming to develop a pedagogic approach to support technology-enhanced learning practices at the university level…
2007-09-01
Australian/New Zealand English, Canadian French, Cantonese, European French, German, Italian, Japanese, Jordanian Arabic, Mandarin, Portuguese...Environment Within the congruence model, the environment “includes people, other organizations, social and economic forces, and legal constraints” [28
A Research Program in Computer Technology. Volume 1
1981-08-01
rigidity, sensor networks 10. command and control, digital voice communication, graphic input device for terminal, multimedia communications, portable...satellite channel in the internetwork environment; Distributed Sensor Networks - formulation of algorithms and communication protocols to support the...operation of geographically distributed sensors; Personal Communicator - work intended to result in a demonstration-level portable terminal to test and
Smartphone App for Voice Disorders
... on. Feature: Taste, Smell, Hearing, Language, Voice, Balance Smartphone App for Voice Disorders Past Issues / Fall 2013 ... developed a mobile monitoring device that relies on smartphone technology to gather a week's worth of talking, ...
2014-09-01
power. The wireless infrastructure is an expansion of the current DOD IE which can be leveraged to connect mobile capabilities and technologies. The...DOD must focus on three critical areas central to mobility: the wireless infrastructure, the devices themselves, and the applications the devices use... infrastructure to support mobile devices. – The intent behind this goal is to improve the existing wireless backbone to support secure voice, data, and video
Women as a resource for the flexibility required for high technology innovation
NASA Technical Reports Server (NTRS)
Marlaire, Ruth Dasso
1994-01-01
What do women scientists need to know for career advancement into senior level positions? Our declining economic conditions have been the cause for major political and technological changes. The U.S. Congress is turning toward technology to increase our competitive edge in the world. Allowing women scientists, and women engineers in particular, more voice in the decision making process may be an innovative alternative for the diversity and flexibility needed for the unknown technological problems of the future. But first women scientists need to know how the system measures scientific achievement and how to identify the processes needed to increase our technological capability in order for them to formidably compete and win higher ranking positions.
Wireless infrared communications for space and terrestrial applications
NASA Technical Reports Server (NTRS)
Crimmins, James W.
1993-01-01
Voice and data communications via wireless (and fiberless) optical means have been commonplace for many years. However, continuous advances in optoelectronics and microelectronics have resulted in significant advances in wireless optical communications over the last decade. Wilton has specialized in diffuse infrared voice and data communications since 1979. In 1986, NASA Johnson Space Center invited Wilton to apply its wireless telecommunications and factory floor technology to astronaut voice communications aboard the shuttle. In September 1988, a special infrared voice communications system flew aboard a 'Discovery' Shuttle mission as a flight experiment. Since then the technology has been further developed, resulting in a general-purpose 2 Mb/s wireless voice/data LAN which has been tested for a variety of applications including use aboard Spacelab. Funds for Wilton's wireless IR development were provided in part by NASA's Technology Utilization Office and by the NASA Small Business Innovative Research Program. As a consequence, Wilton's commercial product capability has been significantly enhanced to include diffuse infrared wireless LANs as well as wireless infrared telecommunication systems for voice and data.
Advanced voice function assessment: editorial introduction to this special issue.
Barney, Anna; Kob, Malte
2015-04-01
ICT COST Action 2103 was an EU-funded collaborative network of speech processing engineers, laryngologists, and phoniatricians that started on 19 December 2006 and ended on 18 June 2011. The main objectives were to improve the clinical assessment of voice using new technologies; to encourage clinicians and technologists to work closely together to understand the needs and limitations of each other's fields and, in parallel, to acquire new data with a view to elaborating better voice production models. The papers in this special issue represent some of the outcomes of that partnership. This editorial introduces the background and context for COST Action 2103 and each of the papers. In conclusion we discuss the impact of the Action and what aspects of it may have a lasting effect on practice.
Francis, Alexander L; Driscoll, Courtney
2006-09-01
We examined the effect of perceptual training on a well-established hemispheric asymmetry in speech processing. Eighteen listeners were trained to use a within-category difference in voice onset time (VOT) to cue talker identity. Successful learners (n=8) showed faster response times for stimuli presented only to the left ear than for those presented only to the right. The development of a left-ear/right-hemisphere advantage for processing a prototypically phonetic cue supports a model of speech perception in which lateralization is driven by functional demands (talker identification vs. phonetic categorization) rather than by acoustic stimulus properties alone.
Masculine Voices Predict Well-Being in Female-to-Male Transgender Individuals.
Watt, Seth O; Tskhay, Konstantin O; Rule, Nicholas O
2018-05-01
Voices convey important social information about an individual's identity, including gender. This is especially relevant to transgender individuals, who cite voice alteration as a primary goal of the gender alignment process. Although the voice is a primary target of testosterone therapy among female-to-male (FTM) trans people, little research has explored the effects of such changes on their psychological well-being. Here, we investigated how FTMs' vocal gender related to their well-being. A total of 77 FTMs (M age = 25.45 years, SD = 6.77) provided voice samples and completed measures of their well-being and psychological health. An independent group of 32 naïve raters (M age = 22.16 years, SD = 8.21) subsequently rated the voice samples for masculinity. We found that FTMs whose voices sounded more congruent with their experienced gender (i.e., sounded more masculine) reported greater well-being (better life satisfaction, quality of life, and self-esteem; lower levels of anxiety and depression) than FTMs with less gender congruent (i.e., more feminine) voices (β = .48). The convergence between outwardly perceived vocal gender and gender identity brought about through hormone replacement therapy may therefore support greater well-being for FTMs.
ERIC Educational Resources Information Center
Pautsch, Catherine A.
2010-01-01
Student voice is defined as meaningful opportunities that allow students to take an active role in the decision-making that influences their lives. While there is a growing body of literature linking student voice to increased student engagement, achievement, civic engagement and successful school reform, the role of the principal in supporting…
Point-of-care technology: integration for improved delivery of care.
Gregory, Debbie; Buckner, Martha
2014-01-01
The growing complexity of technology, equipment, and devices involved in patient care delivery can be staggering and overwhelming. Technology is intended to be a tool to help clinicians, but it can also be a frustrating hindrance if not thoughtfully planned and strategically aligned. Critical care nurses are key partners in the collaborations needed to improve safety and quality through health information technology (IT). Nurses must advocate for systems that are interoperable and adapted to the context of care experiences. The involvement and collaboration between clinicians, information technology specialists, biomedical engineers, and vendors has never been more relevant and applicable. Working together strategically with a shared vision can effectively provide a seamless clinical workflow, maximize technology investments, and ultimately improve patient care delivery and outcomes. Developing a strategic integrated clinical and IT roadmap is a critical component of today's health care environment. How can technology strategy be aligned from the executive suite to the bedside caregiver? What is the model for using clinical workflows to drive technology adoption? How can the voice of the critical care nurse strengthen this process? How can success be assured from the initial assessment and selection of technology to a sustainable support model? What is the vendor's role as a strategic partner and "co-caregiver"?
The Lincoln Training System: A Summary Report.
ERIC Educational Resources Information Center
Butman, Robert C.; Frick, Frederick C.
The current status of the Lincoln Training System (LTS) is reported. This document describes LTS as a computer supported microfiche system which: 1) provides random access to voice quality audio and to graphics; 2) supports student-controlled interactive processes; and 3) functions in a variety of environments. The report offers a detailed…
Automatic Speech Acquisition and Recognition for Spacesuit Audio Systems
NASA Technical Reports Server (NTRS)
Ye, Sherry
2015-01-01
NASA has a widely recognized but unmet need for novel human-machine interface technologies that can facilitate communication during astronaut extravehicular activities (EVAs), when loud noises and strong reverberations inside spacesuits make communication challenging. WeVoice, Inc., has developed a multichannel signal-processing method for speech acquisition in noisy and reverberant environments that enables automatic speech recognition (ASR) technology inside spacesuits. The technology reduces noise by exploiting differences between the statistical nature of signals (i.e., speech) and noise that exists in the spatial and temporal domains. As a result, ASR accuracy can be improved to the level at which crewmembers will find the speech interface useful. System components and features include beam forming/multichannel noise reduction, single-channel noise reduction, speech feature extraction, feature transformation and normalization, feature compression, and ASR decoding. Arithmetic complexity models were developed and will help designers of real-time ASR systems select proper tasks when confronted with constraints in computational resources. In Phase I of the project, WeVoice validated the technology. The company further refined the technology in Phase II and developed a prototype for testing and use by suited astronauts.
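The beamforming component listed above can be illustrated with the textbook delay-and-sum technique: time-align the microphone channels toward the talker, then average them so that speech adds coherently while uncorrelated noise partly cancels. This is a minimal sketch under assumptions, not WeVoice's method, which the abstract does not describe; the function name, the integer sample delays, and the toy signals (with deliberately anti-phase noise) are hypothetical.

```python
def delay_and_sum(channels, delays):
    """Align each channel by its integer sample delay, then average.

    channels: list of equal-rate sample lists, one per microphone.
    delays:   samples by which the speech arrives late on each channel.
    """
    if len(channels) != len(delays):
        raise ValueError("one delay per channel is required")
    # Only keep samples for which every delayed channel has data.
    usable = min(len(ch) - d for ch, d in zip(channels, delays))
    output = []
    for n in range(usable):
        total = sum(ch[n + d] for ch, d in zip(channels, delays))
        output.append(total / len(channels))
    return output


# Toy example: the same speech reaches mic_b one sample later than mic_a,
# and the noise on the two microphones happens to be in anti-phase.
speech = [1.0, -1.0, 2.0, 0.0]
mic_a = [s + e for s, e in zip(speech, [0.5, -0.5, 0.5, -0.5])]
mic_b = [0.0] + [s + e for s, e in zip(speech, [-0.5, 0.5, -0.5, 0.5])]
enhanced = delay_and_sum([mic_a, mic_b], delays=[0, 1])
```

In this contrived case the noise cancels completely; with real, only partly uncorrelated noise the improvement is smaller, which is one reason such systems add further single-channel noise reduction.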
1983-05-01
The VDE system developed had the capability of recognizing up to 248 separate words in syntactic structures. 4 The two systems described are isolated...size, weight, and power consumption of VDE devices (See Fig. 19). 8. DUU and NATU Advisory Groups on Voice Technology At the present time, two major
Physics of the Cosmos (PCOS) Technology Development Program Overview
NASA Astrophysics Data System (ADS)
Pham, B. Thai; Clampin, M.; Werneth, R. L.
2014-01-01
The Physics of the Cosmos (PCOS) Program Office was established in FY11 and resides at the NASA Goddard Space Flight Center (GSFC). The office serves as the implementation arm for the Astrophysics Division at NASA Headquarters for PCOS Program related matters. We present an overview of the Program’s technology management activities and the Program’s technology development portfolio. We discuss the process for addressing community-provided technology needs and the Technology Management Board (TMB)-vetted prioritization and investment recommendations. This process improves the transparency and relevance of technology investments, provides the community a voice in the process, and leverages the technology investments of external organizations by defining a need and a customer. Goals for the PCOS Program envisioned by the National Research Council’s (NRC) “New Worlds, New Horizons in Astronomy and Astrophysics” (NWNH) Decadal Survey report include science missions and technology development for dark energy, gravitational waves, X-ray, and inflation probe science.
Linking Colleague Support to Employees’ Promotive Voice: A Moderated Mediation Model
2015-01-01
Promotive voice is essential for improving team and organization performance. Yet in the current literature, little is known regarding the psychological reasons why people engage in promotive voice. Through the lens of social exchange, we proposed that employees who received support from colleagues may develop a higher level of felt obligation for constructive change, which leads to promotive voice. Analyses of multi-source data from 51 cross-functional sources (51 team supervisors and 162 employees) showed that employees’ felt obligation for constructive change positively mediates the relationship between colleague support and promotive voice behavior. Moreover, the impact of colleague support on felt obligation for constructive change is stronger when there is a low level of subgroup formation in the team. Theoretical and practical implications of these findings are discussed. PMID:26148194
Enhanced Living by Assessing Voice Pathology Using a Co-Occurrence Matrix
Muhammad, Ghulam; Alhamid, Mohammed F.; Hossain, M. Shamim; Almogren, Ahmad S.; Vasilakos, Athanasios V.
2017-01-01
A large number of the population around the world suffers from various disabilities. Disabilities affect not only children but also adults of different professions. Smart technology can assist the disabled population and lead to a comfortable life in an enhanced living environment (ELE). In this paper, we propose an effective voice pathology assessment system that works in a smart home framework. The proposed system takes input from various sensors, and processes the acquired voice signals and electroglottography (EGG) signals. Co-occurrence matrices in different directions and neighborhoods from the spectrograms of these signals were obtained. Several features such as energy, entropy, contrast, and homogeneity from these matrices were calculated and fed into a Gaussian mixture model-based classifier. Experiments were performed with a publicly available database, namely, the Saarbrucken voice database. The results demonstrate the feasibility of the proposed system in light of its high accuracy and speed. The proposed system can be extended to assess other disabilities in an ELE. PMID:28146069
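The four texture features named in this abstract can be sketched from a single co-occurrence matrix as follows. This is a minimal illustration under assumptions: the input is taken to be a spectrogram already quantised to a small number of integer levels, one direction and neighbourhood is chosen through the hypothetical `dx` and `dy` offsets, and the feature definitions are the standard textbook ones, which may differ in detail from the paper's. The Gaussian mixture model classifier is not shown.

```python
import math


def cooccurrence_features(image, levels, dx=1, dy=0):
    """Energy, entropy, contrast and homogeneity of a co-occurrence matrix.

    image:  2D list of integer levels in range(levels), e.g. a quantised
            spectrogram patch.
    dx, dy: offset to the neighbour that is paired with each pixel.
    """
    counts = [[0] * levels for _ in range(levels)]
    height, width = len(image), len(image[0])
    total = 0
    for y in range(height):
        for x in range(width):
            ny, nx = y + dy, x + dx
            if 0 <= ny < height and 0 <= nx < width:
                counts[image[y][x]][image[ny][nx]] += 1
                total += 1
    if total == 0:
        raise ValueError("offset leaves no pixel pairs inside the image")

    feats = {"energy": 0.0, "entropy": 0.0, "contrast": 0.0, "homogeneity": 0.0}
    for i in range(levels):
        for j in range(levels):
            p = counts[i][j] / total  # joint probability of the pair (i, j)
            if p == 0:
                continue
            feats["energy"] += p * p
            feats["entropy"] -= p * math.log2(p)
            feats["contrast"] += (i - j) ** 2 * p
            feats["homogeneity"] += p / (1 + abs(i - j))
    return feats


# Hypothetical two-level patch, paired with the right-hand neighbour.
patch = [[0, 0, 1, 1],
         [0, 0, 1, 1]]
features = cooccurrence_features(patch, levels=2)
```

Repeating the call with other offsets, for example `dx=0, dy=1`, gives the matrices "in different directions and neighborhoods" that the abstract mentions, and the resulting values can be concatenated into one feature vector.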
Educational Technology and Student Voice: Examining Teacher Candidates' Perceptions
ERIC Educational Resources Information Center
Byker, Erik Jon; Putman, S. Michael; Handler, Laura; Polly, Drew
2017-01-01
Student Voice is a term that honors the participatory roles that students have when they enter learning spaces like classrooms. Student Voice is the recognition of students' choice, creativity, and freedom. Seminal educationists--like Dewey and Montessori--centered the purposes of education in the flourishing and valuing of Student Voice. This…
Expert Voices in Learning Improvisation: Shaping Regulation Processes through Experiential Influence
ERIC Educational Resources Information Center
de Bruin, Leon R.
2017-01-01
Interpersonal and collaborative activity plays an important role in the social aspects of self-regulated learning (SRL) development. Peer, teacher and group interactions facilitate support for self-regulation, co-regulation and socially shared regulatory processes. Situated and experiential interplay facilitates personal, co-constructed and…
Research in speech communication.
Flanagan, J
1995-10-24
Advances in digital speech processing are now supporting application and deployment of a variety of speech technologies for human/machine communication. In fact, new businesses are rapidly forming about these technologies. But these capabilities are of little use unless society can afford them. Happily, explosive advances in microelectronics over the past two decades have assured affordable access to this sophistication as well as to the underlying computing technology. The research challenges in speech processing remain in the traditionally identified areas of recognition, synthesis, and coding. These three areas have typically been addressed individually, often with significant isolation among the efforts. But they are all facets of the same fundamental issue--how to represent and quantify the information in the speech signal. This implies deeper understanding of the physics of speech production, the constraints that the conventions of language impose, and the mechanism for information processing in the auditory system. In ongoing research, therefore, we seek more accurate models of speech generation, better computational formulations of language, and realistic perceptual guides for speech processing--along with ways to coalesce the fundamental issues of recognition, synthesis, and coding. Successful solution will yield the long-sought dictation machine, high-quality synthesis from text, and the ultimate in low bit-rate transmission of speech. It will also open the door to language-translating telephony, where the synthetic foreign translation can be in the voice of the originating talker.
Allen, Summer
2016-01-01
It's tough to imagine anything more frustrating than interacting with a call center. Generally, people don't reach out to call centers when they're happy; they're usually trying to get help with a problem or gearing up to do battle over a billing error. Add in an automatic phone tree, and you have a recipe for annoyance. But what if that robotic voice offering you a smorgasbord of numbered choices could tell that you were frustrated and then funnel you to an actual human being? This type of voice analysis technology exists, and it's just one example of the many ways that computers can use your voice to extract information about your mental and emotional state, including information you may not think of as being accessible through your voice alone.
Evaluation of a voice recognition system for the MOTAS pseudo pilot station function
NASA Technical Reports Server (NTRS)
Houck, J. A.
1982-01-01
The Langley Research Center has undertaken a technology development activity to provide a capability, the mission oriented terminal area simulation (MOTAS), wherein terminal area and aircraft systems studies can be performed. An experiment was conducted to evaluate state-of-the-art voice recognition technology and specifically, the Threshold 600 voice recognition system to serve as an aircraft control input device for the MOTAS pseudo pilot station function. The results of the experiment using ten subjects showed a recognition error of 3.67 percent for a 48-word vocabulary tested against a programmed vocabulary of 103 words. After the ten subjects retrained the Threshold 600 system for the words which were misrecognized or rejected, the recognition error decreased to 1.96 percent. The rejection rates for both cases were less than 0.70 percent. Based on the results of the experiment, voice recognition technology and specifically the Threshold 600 voice recognition system were chosen to fulfill this MOTAS function.
Monitoring and telemedicine support in remote environments and in human space flight.
Cermack, M
2006-07-01
The common features of remote environments are geographical separation, logistic problems with health care delivery and with patient retrieval, extreme natural conditions, an artificial environment, or a combination of these. The exposure can have adverse effects on patients' physiology, on care providers' performance and on hardware functionality. The time to definitive treatment may vary between hours, as in orbital space flight, days for a remote exploratory camp, weeks for polar bases and months to years for interplanetary exploration. The generic system architecture, used in any telematic support, consists of data acquisition, data-processing and storage, telecommunications links, decision-making facilities and the means of command execution. At the present level of technology, a simple data transfer and two-way voice communication could be established from any place on the earth, but the current use of mobile communication technologies for telemedicine applications is still low, either for logistic, economic and political reasons, or because of limited knowledge about the available technology and procedures. Criteria for selection of portable telemedicine terminals in remote terrestrial places, characteristics of currently available mobile telecommunication systems, and the concept of integrated monitoring of physiological and environmental parameters are mentioned in the first section of this paper. The second part describes some aspects of emergency medical support in human orbital spaceflight and the limits of telemedicine support in the near-Earth space environment, and mentions some open issues related to long-term exploratory missions beyond low Earth orbit.
Can You Help Me with My Pitch? Studying a Tool for Real-Time Automated Feedback
ERIC Educational Resources Information Center
Schneider, Jan; Borner, Dirk; van Rosmalen, Peter; Specht, Marcus
2016-01-01
In our pursuit to study effective real-time feedback in Technology Enhanced Learning, we developed the Presentation Trainer, a tool designed to support the practice of nonverbal communication skills for public speaking. The tool tracks the user's voice and body to analyze her performance, and selects the type of real-time feedback to be presented.…
Wireless networks of opportunity in support of secure field operations
NASA Astrophysics Data System (ADS)
Stehle, Roy H.; Lewis, Mark
1997-02-01
Under funding from the Defense Advanced Research Projects Agency (DARPA) for joint military and law enforcement technologies, demonstrations of secure information transfer in support of law enforcement and military operations other than war, using wireless and wired technology, were held in September 1996 at several locations in the United States. In this paper, the network architecture, protocols, and equipment supporting the demonstration's scenarios are presented, together with initial results, including lessons learned and desired system enhancements. Wireless networks of opportunity encompassed in-building (wireless-LAN), campus-wide (Metricom Inc.), metropolitan (AMPS cellular, CDPD), and national (one- and two-way satellite) systems. Evolving DARPA-sponsored packet radio technology was incorporated. All data was encrypted, using multilevel information system security initiative (MISSI) FORTEZZA technology, for carriage over unsecured and unclassified commercial networks. The identification and authentication process inherent in the security system permitted logging for database accesses and provided an audit trail useful in evidence gathering. Wireless and wireline communications support, to and between modeled crisis management centers, was demonstrated. Mechanisms for the guarded transport of data through the secret-high military tactical Internet were included, to support joint law enforcement and crisis management missions. A secure World Wide Web (WWW) browser forms the primary, user-friendly interface for information retrieval and submission. The WWW pages were structured to be sensitive to the bandwidth, error rate, and cost of the communications medium in use (e.g., the use of and resolution for graphical data). Both still and motion compressed video were demonstrated, along with secure voice transmission from laptop computers in the field. Issues of network bandwidth, airtime costs, and deployment status are discussed.
Maynes, Timothy D; Podsakoff, Philip M
2014-01-01
Scholarly interest in employee voice behavior has increased dramatically over the past 15 years. Although this research has produced valuable knowledge, it has focused almost exclusively on voice as a positively intended challenge to the status quo, even though some scholars have argued that it need not challenge the status quo or be well intentioned. Thus, in this paper, we create an expanded view of voice; one that extends beyond voice as a positively intended challenge to the status quo to include voice that supports how things are being done in organizations as well as voice that may not be well intentioned. We construct a framework based on this expanded view that identifies 4 different types of voice behavior (supportive, constructive, defensive, and destructive). We then develop and validate survey measures for each of these. Evidence from 5 studies across 4 samples provides strong support for our new measures in that (a) a 4-factor confirmatory factor analysis model fit the data significantly better than 1-, 2-, or 3-factor models; (b) the voice measures converged with and yet remained distinct from conceptually related comparison constructs; (c) personality predictors exhibited unique patterns of relationships with the different types of voice; (d) variations in actual voice behaviors had a direct causal impact on responses to the survey items; and (e) each type of voice significantly impacted important outcomes for voicing employees (e.g., likelihood of relying on a voicing employee's opinions and evaluations of a voicing employee's overall performance). Implications of our findings are discussed. PsycINFO Database Record (c) 2014 APA, all rights reserved
Including Multiple Voices in Collaboratively Designing a Teacher Education Program
ERIC Educational Resources Information Center
Konecki, Loretta R.; Sturdivant, Robika L.; King, Caryn M.; Melin, Jacquelyn A.; Lancaster, Paula E.
2012-01-01
This narrative case study describes the collaborative processes employed by a midwestern university as it designed and implemented a clinically based, postbaccalaureate teacher preparation program for science, technology, engineering, and mathematics (STEM) graduates committed to teaching in high need secondary schools. The program development…
Selective attention modulates early human evoked potentials during emotional face-voice processing.
Ho, Hao Tam; Schröger, Erich; Kotz, Sonja A
2015-04-01
Recent findings on multisensory integration suggest that selective attention influences cross-sensory interactions from an early processing stage. Yet, in the field of emotional face-voice integration, the hypothesis prevails that facial and vocal emotional information interacts preattentively. Using ERPs, we investigated the influence of selective attention on the perception of congruent versus incongruent combinations of neutral and angry facial and vocal expressions. Attention was manipulated via four tasks that directed participants to (i) the facial expression, (ii) the vocal expression, (iii) the emotional congruence between the face and the voice, and (iv) the synchrony between lip movement and speech onset. Our results revealed early interactions between facial and vocal emotional expressions, manifested as modulations of the auditory N1 and P2 amplitude by incongruent emotional face-voice combinations. Although audiovisual emotional interactions within the N1 time window were affected by the attentional manipulations, interactions within the P2 modulation showed no such attentional influence. Thus, we propose that the N1 and P2 are functionally dissociated in terms of emotional face-voice processing and discuss evidence in support of the notion that the N1 is associated with cross-sensory prediction, whereas the P2 relates to the derivation of an emotional percept. Essentially, our findings put the integration of facial and vocal emotional expressions into a new perspective: one that regards the integration process as a composite of multiple, possibly independent subprocesses, some of which are susceptible to attentional modulation, whereas others may be influenced by additional factors.
17 Ways to Say Yes: Toward Nuanced Tone of Voice in AAC and Speech Technology
Pullin, Graham; Hennig, Shannon
2015-01-01
People with complex communication needs who use speech-generating devices have very little expressive control over their tone of voice. Despite its importance in human interaction, however, the issue of tone of voice remains all but absent from AAC research and development. In this paper, we describe three interdisciplinary projects, past, present and future: The critical design collection Six Speaking Chairs has provoked deeper discussion and inspired a social model of tone of voice; the speculative concept Speech Hedge illustrates challenges and opportunities in designing more expressive user interfaces; the pilot project Tonetable could enable participatory research and seed a research network around tone of voice. We speculate that more radical interactions might expand frontiers of AAC and disrupt speech technology as a whole. PMID:25965913
Schueller, Stephen Matthew
2017-01-01
Background Positive psychological interventions for children have typically focused on direct adaptations of interventions developed for adults. As the community moves toward designing positive computing technologies to support child well-being, it is important to use a more participatory process that directly engages children’s voices. Objective Our objectives were, through a participatory design study, to understand children’s interpretations of positive psychology concepts, as well as their perspectives on technologies that are best suited to enhance their engagement with practice of well-being skills. Methods We addressed these questions through a content analysis of 434 design ideas, 51 sketches, and 8 prototypes and videos, which emerged from a 14-session cooperative inquiry study with 12 child “happiness inventors.” The study was part of a summer learning camp held at the children’s middle school, which focused on teaching the invention process, teaching well-being skills drawn from positive psychology and related areas (gratitude, mindfulness, and problem solving), and iterating design ideas for technologies to support these skills. Results The children’s ideas and prototypes revealed specific facets of how they interpreted gratitude (as thanking, being positive, and doing good things), mindfulness (as externally representing thought and emotions, controlling those thoughts and emotions, getting through unpleasant things, and avoiding forgetting something), and problem solving (as preventing bad decisions, seeking alternative solutions, and not dwelling on unproductive thoughts). This process also revealed that children emphasized particular technologies in their solutions. While desktop or laptop solutions were notably lacking, other ideas were roughly evenly distributed between mobile apps and embodied computing technologies (toys, wearables, etc).
We also report on desired functionalities and approaches to engagement in the children’s ideas, such as a notable emphasis on representing and responding to internal states. Conclusions Our findings point to promising directions for the design of positive computing technologies targeted at children, with particular emphases on the perspectives, technologies, engagement approaches, and functionalities that appealed to the children in our study. The dual focus of the study on teaching skills while designing technologies is a novel methodology in the design of positive computing technologies intended to increase child well-being. PMID:28096066
Qi, Yuan-Hua; Wang, Hui; Zhang, Xiao-Bo; Jin, Yan; Ge, Xiao-Guang; Jing, Zhi-Xian; Wang, Ling; Zhao, Yu-Ping; Guo, Lan-Ping; Huang, Lu-Qi
2017-11-01
In this paper, a data acquisition system based on a mobile terminal, combining GPS, offset correction, automatic speech recognition, and database networking technology, was designed and implemented. The system can quickly locate latitude and elevation information and conveniently take various types of photos (Chinese herbal plants, samples, habitats, and so on). The mobile system automatically associates records with Chinese medicine source information and, through its voice recognition function, records plant characteristics, environmental characteristics, and relevant plant specimen information. The data processing platform, based on the Chinese medicine resources survey data reporting client, can effectively assist in indoor data processing and exports the mobile terminal data to a computer terminal. The established data acquisition system provides strong technical support for the fourth national survey of the Chinese materia medica resources (CMMR). Copyright© by the Chinese Pharmaceutical Association.
NASA Astrophysics Data System (ADS)
Various papers on global telecommunications are presented. The general topics addressed include: multiservice integration with optical fibers, multicompany owned telecommunication networks, softworks quality and reliability, advanced on-board processing, impact of new services and systems on operations and maintenance, analytical studies of protocols for data communication networks, topics in packet radio networking, CCITT No. 7 to support new services, document processing and communication, antenna technology and system aspects in satellite communications. Also considered are: communication systems modelling methodology, experimental integrated local area voice/data nets, spread spectrum communications, motion video at the DS-0 rate, optical and data communications, intelligent work stations, switch performance analysis, novel radio communication systems, wireless local networks, ISDN services, LAN communication protocols, user-system interface, radio propagation and performance, mobile satellite system, software for computer networks, VLSI for ISDN terminals, quality management, man-machine interfaces in switching, and local area network performance.
Lax Vox as a Voice Training Program for Teachers: A Pilot Study.
Mailänder, Eva; Mühre, Lea; Barsties, Ben
2017-03-01
The objective of this study was to explore the effectiveness of a 3-week training program with the voice therapy "Lax Vox" for teachers. Four healthy female teachers participated as volunteers for the study. Several voice measurements of perception, acoustics, aerodynamics, and self-evaluation were investigated. Furthermore, a survey to rate the applicability of Lax Vox was also part of the study. To assess the treatment effects of the Lax Vox training, an effect size analysis (d_unb) was conducted. After 3 weeks of training, medium and large improvements were found in some parameters of perceptual and acoustic voice quality assessments (d_unb > 0.50 and d_unb > 0.80, respectively). Furthermore, medium improvements were revealed in some parameters of self-evaluation (ie, physical and total scale of the Voice Handicap Index) and aerodynamic (ie, maximum phonation time) assessments (all d_unb > 0.50). Additionally, acoustic measures of vocal function showed an expansion in the upper contour of voice range profiles after training. Particularly, the main improvements in the voice range profile were found in the modal and the beginning of the falsetto voice registers. There was an increase of the intensity levels of about 4.6 dB. No changes were revealed in some acoustic measures of the voice range profile, self-evaluation measurements, and the perception of breathy voice quality (all d_unb < 0.20). Finally, the applicability of Lax Vox perceptually showed clear support in training success, learning process, and transfer to the daily routine. Lax Vox training for teachers appears to improve select measures of voice quality, maximum phonation time, vocal function, self-evaluation, and perceived applicability. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
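The unbiased effect size reported in the abstract above can be illustrated with a small calculation. The sketch below is an assumption about one common way to compute it (Cohen's d with a pooled standard deviation, multiplied by the usual small-sample correction factor 1 - 3/(4*df - 1)); the abstract does not state which formula the authors used, and the function names and sample values are hypothetical. The cut-offs 0.20, 0.50, and 0.80 are the ones quoted in the abstract; the "small" label for values in between is the conventional reading, not something the abstract states.

```python
from math import sqrt
from statistics import mean, stdev


def cohens_d(pre, post):
    """Cohen's d using a pooled standard deviation (assumed formula)."""
    n1, n2 = len(pre), len(post)
    s1, s2 = stdev(pre), stdev(post)
    pooled = sqrt(((n1 - 1) * s1 ** 2 + (n2 - 1) * s2 ** 2) / (n1 + n2 - 2))
    return (mean(post) - mean(pre)) / pooled


def unbiased_d(pre, post):
    """Apply the approximate small-sample correction to Cohen's d."""
    df = len(pre) + len(post) - 2
    return cohens_d(pre, post) * (1 - 3 / (4 * df - 1))


def label(d):
    """Map an effect size onto the thresholds quoted in the abstract."""
    size = abs(d)
    if size > 0.80:
        return "large"
    if size > 0.50:
        return "medium"
    if size < 0.20:
        return "negligible"
    return "small"  # conventional label, not taken from the abstract


# Hypothetical pre/post scores for four participants (illustration only).
pre_scores = [10, 12, 14, 16]
post_scores = [12, 14, 16, 18]
effect = unbiased_d(pre_scores, post_scores)
print(round(effect, 3), label(effect))
```

With only four participants the correction factor matters: it shrinks the raw d noticeably, which is why an unbiased estimate is preferred for very small samples.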
Do What I Say! Voice Recognition Makes Major Advances.
ERIC Educational Resources Information Center
Ruley, C. Dorsey
1994-01-01
Explains voice recognition technology applications in the workplace, schools, and libraries. Highlights include a voice-controlled work station using the DragonDictate system that can be used with dyslexic students, converting text to speech, and converting speech to text. (LRW)
Voice Response Systems Technology.
ERIC Educational Resources Information Center
Gerald, Jeanette
1984-01-01
Examines two methods of generating synthetic speech in voice response systems, which allow computers to communicate in human terms (speech), using human interface devices (ears): phoneme and reconstructed voice systems. Considerations prior to implementation, current and potential applications, glossary, directory, and introduction to Input Output…
NASA Technical Reports Server (NTRS)
Lee, Paul U.; Bender, Kim; Pagan, Danielle
2011-01-01
Flexible Airspace Management (FAM) is a mid-term Next Generation Air Transportation System (NextGen) concept that allows dynamic changes to airspace configurations to meet the changes in the traffic demand. A series of human-in-the-loop (HITL) studies have identified procedures and decision support requirements needed to implement FAM. This paper outlines a suggested FAM procedure and associated decision support functionality based on these HITL studies. Both the tools used to support the HITLs and the planned NextGen technologies available in the mid-term are described and compared. The mid-term implementation of several NextGen capabilities, specifically, upgrades to the Traffic Management Unit (TMU), the initial release of an en route automation system, the deployment of a digital data communication system, a more flexible voice communications network, and the introduction of a tool envisioned to manage and coordinate networked ground systems can support the implementation of the FAM concept. Because of the variability in the overall deployment schedule of the mid-term NextGen capabilities, the dependencies among the individual NextGen capabilities are examined to determine their impact on a mid-term implementation of FAM. A cursory review of the different technologies suggests that new functionality slated for the new en route automation system is a critical enabling technology for FAM, as well as the functionality to manage and coordinate networked ground systems. Upgrades to the TMU are less critical but important nonetheless for FAM to be fully realized. Flexible voice communications network and digital data communication system could allow more flexible FAM operations but they are not as essential.
Speech-language pathology students' self-reports on voice training: easier to understand or to do?
Lindhe, Christina; Hartelius, Lena
2009-01-01
The aim of the study was to describe the subjective ratings of the course 'Training of the student's own voice and speech', from a student-centred perspective. A questionnaire was completed after each of the six individual sessions. Six speech and language pathology (SLP) students rated how they perceived the practical exercises in terms of doing and understanding. The results showed that five of the six participants rated the exercises as significantly easier to understand than to do. The exercises were also rated as easier to do over time. Results are interpreted within a theoretical framework of approaches to learning. The findings support the importance of both the physical and reflective aspects of the voice training process.
Smith, Kathleen W.; Balkwill, Laura-Lee; Vartanian, Oshin; Goel, Vinod
2015-01-01
Despite the fact that most real-world reasoning occurs in some emotional context, very little is known about the underlying behavioral and neural implications of such context. To further understand the role of emotional context in logical reasoning we scanned 15 participants with fMRI while they engaged in logical reasoning about neutral syllogisms presented through the auditory channel in a sad, angry, or neutral tone of voice. Exposure to angry voice led to improved reasoning performance compared to exposure to sad and neutral voice. A likely explanation for this effect is that exposure to expressions of anger increases selective attention toward the relevant features of target stimuli, in this case the reasoning task. Supporting this interpretation, reasoning in the context of angry voice was accompanied by activation in the superior frontal gyrus—a region known to be associated with selective attention. Our findings contribute to a greater understanding of the neural processes that underlie reasoning in an emotional context by demonstrating that two emotional contexts, despite being of the same (negative) valence, have different effects on reasoning. PMID:26029089
Real-time reference: the use of chat technology to improve point of need assistance.
Connor, Elizabeth
2002-01-01
Chat reference refers to the use of instant messaging and call center software to support interactive text or voice communication with library patrons. Instant messaging has been integrated into many e-commerce environments, and into the social lives of many teenagers and young adults, affording a level of immediacy and intimacy not possible with e-mail applications. The convergence and interoperability of new and emerging technologies can be used to develop new communities of users that view libraries as being essential to their education, patient care, and research activities.
Information technologies for Marine Corps combat medicine.
Carey, N B; Rattelman, C R; Nguyen, H Q
1998-09-01
Future Marine Corps warfighting concepts will make it more difficult to locate casualties, which will complicate casualty evacuation, lengthen casualty wait times, and require infantrymen or corpsmen to provide more extensive treatment. In these future scenarios, information flow and communications will be critical to medical functions. We asked, for Navy medical support to the Marines, what information will future combat medicine require and what technologies should supply those information needs? Based on analyses of patient data streams, focus groups of Navy medical personnel, and our estimates of the cost and feasibility of communications systems, we recommend the following: (1) increase medical training for some fraction of Marines, especially in hemorrhage control; (2) augment corpsmen's training; (3) furnish data systems for evacuation and supply that would provide in-transit visibility and simplify requests; (4) provide all ground medical personnel with access to treatment information systems and limited voice communications; and (5) exploit e-mail systems to reduce reliance on voice communications. Implementation time frames are discussed.
Mechanics of human voice production and control
Zhang, Zhaoyan
2016-01-01
As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed. PMID:27794319
Mechanics of human voice production and control.
Zhang, Zhaoyan
2016-10-01
As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed.
Inflatable Antennas Support Emergency Communication
NASA Technical Reports Server (NTRS)
2010-01-01
Glenn Research Center awarded Small Business Innovation Research (SBIR) contracts to ManTech SRS Technologies, of Newport Beach, California, to develop thin film inflatable antennas for space communication. With additional funding, SRS modified the concepts for ground-based inflatable antennas. GATR (Ground Antenna Transmit and Receive) Technologies, of Huntsville, Alabama, licensed the technology and refined it to become the world's first inflatable antenna certified by the Federal Communications Commission. Capable of providing Internet access, voice over Internet protocol, e-mail, video teleconferencing, broadcast television, and other high-bandwidth communications, the systems have provided communication during the wildfires in California, after Hurricane Katrina in Mississippi, and following the 2010 Haiti earthquake.
2001-04-01
Information Dominance, Sustainment and Support, and Training. The study concludes: 1) the FCS concept is sound, but senior level attention is required to ensure technologies are ready for 2006 FCS EMD; and 2) Key technologies will significantly improve force projection and combat power. The Information Dominance Panel was tasked to: 1) Assess required sensors at National and Theater level; 2) Assess the technological opportunity to provide necessary bandwidth for data, voice and video requirements; 3) Ascertain the requirements to deny the threat
Construction site Voice Operated Information System (VOIS) test
NASA Astrophysics Data System (ADS)
Lawrence, Debbie J.; Hettchen, William
1991-01-01
The Voice Activated Information System (VAIS), developed by USACERL, allows inspectors to verbally log on-site inspection reports on a hand held tape recorder. The tape is later processed by the VAIS, which enters the information into the system's database and produces a written report. The Voice Operated Information System (VOIS), developed by USACERL and Automated Sciences Group through a USACERL cooperative research and development agreement (CRDA), is an improved voice recognition system based on the concepts and function of the VAIS. To determine the applicability of the VOIS to Corps of Engineers construction projects, Technology Transfer Test Bed (T3B) funds were provided to the Corps of Engineers National Security Agency (NSA) Area Office (Fort Meade) to procure and implement the VOIS, and to train personnel in its use. This report summarizes the NSA application of the VOIS to quality assurance inspection of radio frequency shielding and to progress payment logs, and concludes that the VOIS is an easily implemented system that can offer improvements when applied to repetitive inspection procedures. Use of VOIS can save time during inspection, improve documentation storage, and provide flexible retrieval of stored information.
[Research on Control System of an Exoskeleton Upper-limb Rehabilitation Robot].
Wang, Lulu; Hu, Xin; Hu, Jie; Fang, Youfang; He, Rongrong; Yu, Hongliu
2016-12-01
In order to help patients with upper-limb dysfunction carry out rehabilitation training, this paper proposed an upper-limb exoskeleton rehabilitation robot with four degrees of freedom (DOF), and realized two control schemes, i.e., voice control and electromyography control. The hardware and software design of the voice control system was completed based on RSC-4128 chips, which realized speech recognition for a specific speaker. In addition, this study adopted self-made surface electromyogram (sEMG) signal extraction electrodes to collect sEMG signals and realized pattern recognition by processing the sEMG signals, extracting time-domain features, and applying a fixed-threshold algorithm. The pulse-width modulation (PWM) algorithm was used to realize the speed adjustment of the system. Voice control and electromyography control experiments were then carried out, and the results showed that the mean recognition rates of the voice control and electromyography control reached 93.1% and 90.9%, respectively. The results proved the feasibility of the control system. This study is expected to lay a theoretical foundation for the further improvement of the control system of the upper-limb rehabilitation robot.
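The electromyography control path described in the abstract above (time-domain features, a fixed threshold, and PWM speed adjustment) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say which time-domain features, window length, or threshold were used, so the mean absolute value feature, the window size, the threshold value, and all function names here are hypothetical.

```python
from math import sqrt


def mean_absolute_value(window):
    """Time-domain feature: average rectified amplitude of the window."""
    return sum(abs(sample) for sample in window) / len(window)


def root_mean_square(window):
    """Time-domain feature: RMS amplitude of the window."""
    return sqrt(sum(sample * sample for sample in window) / len(window))


def classify_windows(signal, window_size, threshold):
    """Label each non-overlapping window 'active' or 'rest' by comparing
    its mean absolute value against a fixed threshold (assumed rule)."""
    labels = []
    for start in range(0, len(signal) - window_size + 1, window_size):
        window = signal[start:start + window_size]
        is_active = mean_absolute_value(window) > threshold
        labels.append("active" if is_active else "rest")
    return labels


def pwm_duty_cycle(speed_fraction, min_duty=0.0, max_duty=1.0):
    """Map a requested speed (0.0 to 1.0) onto a PWM duty cycle,
    clamping out-of-range requests."""
    clamped = max(0.0, min(1.0, speed_fraction))
    return min_duty + clamped * (max_duty - min_duty)


# Hypothetical sEMG samples: a quiet window followed by a contraction.
samples = [0.01, -0.02, 0.01, -0.01, 0.5, -0.6, 0.4, -0.5]
print(classify_windows(samples, window_size=4, threshold=0.1))
print(pwm_duty_cycle(0.5))
```

A fixed threshold is simple to run on an embedded controller, but it usually has to be calibrated per user and per electrode placement, which is one plausible reason the reported recognition rate is below 100%.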
ERIC Educational Resources Information Center
Gonzalez, Taucia; Love, Larry D.; Johnson, Mary L.; Picón, Ninoska; Velázquez, Josué
2017-01-01
This Equity Brief focuses on disrupting the dominant narratives around inclusion by asking the question, "Whose Voice Matters" in creating inclusive schools. Ensuring that youth from historically marginalized groups have leadership opportunities and have their voices heard is one way to support and empower them. Supporting students of…
ERIC Educational Resources Information Center
Cetin, Sahin
2013-01-01
The purpose of this research is to study the impact of perceived organizational support and management openness and teacher personality traits on teacher voice. Voice is defined as the discretionary communication of ideas, suggestions or concerns about work-related issues with the intent to improve organizational functioning. Sample of the study…
Selective attention in perceptual adjustments to voice.
Mullennix, J W; Howe, J N
1999-10-01
The effects of perceptual adjustments to voice information on the perception of isolated spoken words were examined. In two experiments, spoken target words were preceded or followed within a trial by a neutral word spoken in the same voice or in a different voice as the target. Overall, words were reproduced more accurately on trials on which the voice of the neutral word matched the voice of the spoken target word, suggesting that perceptual adjustments to voice interfere with word processing. This result, however, was mediated by selective attention to voice. The results provide further evidence of a close processing relationship between perceptual adjustments to voice and spoken word recognition.
Dudley, James; Eames, Catrin; Mulligan, John; Fisher, Naomi
2018-03-01
Developing compassion towards oneself has been linked to improvement in many areas of psychological well-being, including psychosis. Furthermore, developing a non-judgemental, accepting way of relating to voices is associated with lower levels of distress for people who hear voices. These factors have also been associated with secure attachment. This study explores associations between the constructs of mindfulness of voices, self-compassion, and distress from hearing voices and how secure attachment style related to each of these variables. Cross-sectional online. One hundred and twenty-eight people (73% female; M age = 37.5; 87.5% Caucasian) who currently hear voices completed the Self-Compassion Scale, Southampton Mindfulness of Voices Questionnaire, Relationships Questionnaire, and Hamilton Programme for Schizophrenia Voices Questionnaire. Results showed that mindfulness of voices mediated the relationship between self-compassion and severity of voices, and self-compassion mediated the relationship between mindfulness of voices and severity of voices. Self-compassion and mindfulness of voices were significantly positively correlated with each other and negatively correlated with distress and severity of voices. Mindful relation to voices and self-compassion are associated with reduced distress and severity of voices, which supports the proposed potential benefits of mindful relating to voices and self-compassion as therapeutic skills for people experiencing distress by voice hearing. Greater self-compassion and mindfulness of voices were significantly associated with less distress from voices. These findings support theory underlining compassionate mind training. Mindfulness of voices mediated the relationship between self-compassion and distress from voices, indicating a synergistic relationship between the constructs. 
Although the current findings do not give a direction of causation, consideration is given to the potential impact of mindful and compassionate approaches to voices. © 2017 The Authors. British Journal of Clinical Psychology published by John Wiley & Sons Ltd on behalf of British Psychological Society.
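The mediation result reported in the abstract above can be made concrete with a product-of-coefficients sketch. This is an assumption about the general technique only: the abstract does not describe the authors' estimation or significance-testing procedure, and the variable names and data below are made up for illustration. The sketch fits ordinary least squares by hand and reports path a (predictor to mediator), path b (mediator to outcome, controlling for the predictor), the direct effect, and the indirect effect a*b.

```python
from statistics import mean


def _centered(values):
    """Subtract the mean so that intercepts drop out of the regressions."""
    average = mean(values)
    return [value - average for value in values]


def _dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def simple_slope(x, y):
    """OLS slope of y regressed on x alone."""
    xc, yc = _centered(x), _centered(y)
    return _dot(xc, yc) / _dot(xc, xc)


def two_predictor_slopes(x, m, y):
    """OLS slopes of y regressed on x and m together (normal equations)."""
    xc, mc, yc = _centered(x), _centered(m), _centered(y)
    sxx, smm, sxm = _dot(xc, xc), _dot(mc, mc), _dot(xc, mc)
    sxy, smy = _dot(xc, yc), _dot(mc, yc)
    det = sxx * smm - sxm ** 2
    direct = (sxy * smm - smy * sxm) / det
    b = (smy * sxx - sxy * sxm) / det
    return direct, b


def mediation(x, m, y):
    """Product-of-coefficients decomposition: total = direct + a * b."""
    a = simple_slope(x, m)
    direct, b = two_predictor_slopes(x, m, y)
    return {"a": a, "b": b, "direct": direct,
            "indirect": a * b, "total": simple_slope(x, y)}


# Made-up scores: x = predictor, m = mediator, y = outcome (0.5*x + 2*m).
x = [1, 2, 3, 4, 5]
m = [2, 1, 4, 3, 6]
y = [4.5, 3.0, 9.5, 8.0, 14.5]
print(mediation(x, m, y))
```

In practice the indirect effect would be accompanied by a confidence interval (for example from bootstrapping), and, as the abstract itself notes, cross-sectional data cannot establish the direction of causation.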
The Higher Learning and High Technology: Dynamics of Higher Education Policy Formation.
ERIC Educational Resources Information Center
Slaughter, Sheila
An examination of national higher education policy formation focuses on the shift in relationships of the central actors in the policy formation process. It is argued that whereas the federal government, together with the scientific community and foundations, previously had the dominant voice, the corporate community, in partnership with leaders…
Vibrant Student Voices: Exploring Effects of the Use of Clickers in Large College Courses
ERIC Educational Resources Information Center
Hoekstra, Angel
2008-01-01
Teachers have begun using student response systems (SRSs) in an effort to enhance the learning process in higher education courses. Research providing detailed information about how interactive technologies affect students as they learn is crucial for professors who seek to improve teaching quality, attendance rates and student learning. This…
Voice Recognition Software Accuracy with Second Language Speakers of English.
ERIC Educational Resources Information Center
Coniam, D.
1999-01-01
Explores the potential of the use of voice-recognition technology with second-language speakers of English. Involves the analysis of the output produced by a small group of very competent second-language subjects reading a text into the voice recognition software Dragon Systems "Dragon NaturallySpeaking." (Author/VWL)
Cockpit voice recognition program at Princeton University
NASA Technical Reports Server (NTRS)
Huang, C. Y.
1983-01-01
Voice recognition technology (VRT) is applied to aeronautics, particularly to the alleviation of pilot workload. The VRT no longer has to prove its maturity. The feasibility of voice tuning of radio and DME is demonstrated, since it offers immediate advantages to the pilot and can be completed in a reasonable time.
Conde, Tatiana; Gonçalves, Oscar F; Pinheiro, Ana P
2016-01-01
Auditory verbal hallucinations (AVH) are a core symptom of schizophrenia. Like "real" voices, AVH carry a rich amount of linguistic and paralinguistic cues that convey not only speech, but also affect and identity information. Disturbed processing of voice identity, affective, and speech information has been reported in patients with schizophrenia. More recent evidence has suggested a link between voice-processing abnormalities and specific clinical symptoms of schizophrenia, especially AVH. It is still not well understood, however, to what extent these dimensions are impaired and how abnormalities in these processes might contribute to AVH. In this review, we consider behavioral, neuroimaging, and electrophysiological data to investigate the speech, identity, and affective dimensions of voice processing in schizophrenia, and we discuss how abnormalities in these processes might help to elucidate the mechanisms underlying specific phenomenological features of AVH. Schizophrenia patients exhibit behavioral and neural disturbances in the three dimensions of voice processing. Evidence suggesting a role of dysfunctional voice processing in AVH seems to be stronger for the identity and speech dimensions than for the affective domain.
End-to-end QoS bounds for RTP-based service subnetworks
NASA Astrophysics Data System (ADS)
Pitts, Jonathan M.; Schormans, John A.
1999-11-01
With the increasing focus on traffic prioritization to support voice-data integration in corporate intranets, practical methods are needed to dimension and manage cost efficient service partitions. This is particularly important for the provisioning of real time, delay sensitive services such as telephony and voice/video conferencing applications. Typically these can be provided over RTP/UDP/IP or ATM DBR/SBR bearers but, irrespective of the specific networking technology, the switches or routers need to implement some form of virtual buffer management with queue scheduling mechanisms to provide partitioning. The key requirement is for operators of such networks to be able to dimension the partitions and virtual buffer sizes for efficient resource utilization, instead of simply over-dimensioning. This paper draws on recent work at Queen Mary, University of London, supported by the UK Engineering and Physical Sciences Research Council, to investigate approximate analytical methods for assessing end to end delay variation bounds in cell based and packet based networks.
Speech, Voice, and Communication.
Johnson, Julia A
2017-01-01
Communication changes are an important feature of Parkinson's and include both motor and nonmotor features. This chapter will cover briefly the motor features affecting speech production and voice function before focusing on the nonmotor aspects. A description of the difficulties experienced by people with Parkinson's when trying to communicate effectively is presented along with some of the assessment tools and therapists' treatment options. The idea of clinical heterogeneity of PD and subtyping patients with different communication problems is explored and suggestions are made on how this may influence clinicians' treatment methods and choices so as to provide personalized therapy programmes. The importance of encouraging and supporting people to maintain social networks, employment, and leisure activities is stated as the key to achieving sustainability. Finally looking into the future, the emergence of new technologies is seen as providing further possibilities to support therapists in the goal of helping people with Parkinson's to maintain good communication skills throughout the course of the disease. © 2017 Elsevier Inc. All rights reserved.
Technologies for Self-Determination for Youth with Developmental Disabilities
ERIC Educational Resources Information Center
Skouge, James R.; Kelly, Mary L.; Roberts, Kelly D.; Leake, David W.; Stodden, Robert A.
2007-01-01
This paper focuses on "technologies for voice" that are related to the self-determination of youth with developmental disabilities. The authors describe a self-determination model that values family-focused, community-referenced pedagogies employing "new media" to give voice to youth and their families. In line with the adage that a picture is…
ERIC Educational Resources Information Center
Milner, Jacob
2005-01-01
Voice over Internet Protocol (VoIP) is everywhere. The technology lets users make and receive phone calls over the Internet, transporting voice traffic alongside data traffic such as instant messages (IMs) and e-mail. While the number of consumer customers using VoIP increases every week, the technology is finding its way into K-12 education as…
Research in speech communication.
Flanagan, J
1995-01-01
Advances in digital speech processing are now supporting application and deployment of a variety of speech technologies for human/machine communication. In fact, new businesses are rapidly forming about these technologies. But these capabilities are of little use unless society can afford them. Happily, explosive advances in microelectronics over the past two decades have assured affordable access to this sophistication as well as to the underlying computing technology. The research challenges in speech processing remain in the traditionally identified areas of recognition, synthesis, and coding. These three areas have typically been addressed individually, often with significant isolation among the efforts. But they are all facets of the same fundamental issue--how to represent and quantify the information in the speech signal. This implies deeper understanding of the physics of speech production, the constraints that the conventions of language impose, and the mechanism for information processing in the auditory system. In ongoing research, therefore, we seek more accurate models of speech generation, better computational formulations of language, and realistic perceptual guides for speech processing--along with ways to coalesce the fundamental issues of recognition, synthesis, and coding. Successful solution will yield the long-sought dictation machine, high-quality synthesis from text, and the ultimate in low bit-rate transmission of speech. It will also open the door to language-translating telephony, where the synthetic foreign translation can be in the voice of the originating talker. PMID:7479806
Interpersonal Processes and Attachment in Voice-Hearers.
Robson, George; Mason, Oliver
2015-11-01
Studies of both clinical and non-clinical voice hearers suggest that distress is rather inconsistently associated with the perceived relationship between voice and hearer. It is also not clear if their beliefs about voices are relevant. This study investigated the links between attachment anxiety/avoidance, interpersonal aspects of the voice relationship, and distress whilst considering the impact of beliefs about voices and paranoia. Forty-four voice-hearing participants completed a number of self-report measures tapping attachment, interpersonal processes in the voice relationship, beliefs about voices, paranoia, distress and depression. Attachment avoidance was related to voice intrusiveness, hearer distance and distress. Attachment anxiety was related to voice intrusiveness, hearer dependence and distress. A series of simple mediation analyses were conducted that suggest that the relationship between attachment and voice related distress may be mediated by interpersonal dynamics in the voice-hearer relationship, beliefs about voices and paranoia. Beliefs about voices, the hearer's relationship with their voices, and the distress voices sometimes engender appear to be meaningfully related to their attachment style. This may be important to consider in therapeutic work.
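The simple mediation analyses mentioned in the abstract above can be illustrated with a minimal sketch. It assumes ordinary least squares paths and uses synthetic scores invented purely for illustration; the variable names (`attachment`, `voice_intrusiveness`, `distress`) and the helper `simple_mediation` are hypothetical and do not reproduce the study's data, measures, or software.

```python
from statistics import fmean


def cov(xs, ys):
    """Population covariance of two equal-length sequences."""
    mx, my = fmean(xs), fmean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


def simple_mediation(x, m, y):
    """OLS paths for X -> M -> Y with a direct X -> Y path.

    a        : slope of M regressed on X
    b        : slope of M in the regression of Y on X and M
    c_prime  : slope of X in the regression of Y on X and M (direct effect)
    c        : slope of Y regressed on X alone (total effect)
    indirect : a * b (the mediated effect)
    """
    vx, vm = cov(x, x), cov(m, m)
    cxm, cxy, cmy = cov(x, m), cov(x, y), cov(m, y)
    det = vx * vm - cxm ** 2  # zero only if X and M are perfectly collinear
    a = cxm / vx
    b = (cmy * vx - cxm * cxy) / det
    c_prime = (cxy * vm - cxm * cmy) / det
    c = cxy / vx
    return {"a": a, "b": b, "c_prime": c_prime, "c": c, "indirect": a * b}


# Synthetic scores (assumption: invented for illustration, not study data).
attachment = [1, 2, 3, 4, 5, 6]
voice_intrusiveness = [2, 1, 4, 3, 6, 5]
distress = [1, 3, 2, 5, 4, 6]

paths = simple_mediation(attachment, voice_intrusiveness, distress)
print(paths)
```

With OLS, the total effect decomposes exactly into the direct effect plus the indirect effect (c = c' + a·b), which is what makes the indirect path a natural summary of mediation. A real analysis would also need an inferential step for the indirect effect, such as bootstrapping, which this sketch leaves out.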
Wong, Raymond
2013-01-01
Voice is a biometric characteristic that differs from person to person. Because of this uniqueness, voice classification has found useful applications in classifying speakers' gender, mother tongue or ethnicity (accent), and emotional state, as well as in identity verification, verbal command control, and so forth. In this paper, we adopt a new preprocessing method named Statistical Feature Extraction (SFX) for extracting important features in training a classification model, based on piecewise transformation treating an audio waveform as a time-series. Using SFX we can faithfully remodel statistical characteristics of the time-series; together with spectral analysis, a substantial number of features are extracted in combination. An ensemble is utilized to select only the influential features to be used in classification model induction. We focus on comparing the effects of various popular data mining algorithms on multiple datasets. Our experiment consists of classification tests over four typical categories of human voice data, namely, Female and Male, Emotional Speech, Speaker Identification, and Language Recognition. The experiments yield encouraging results supporting the claim that heuristically choosing significant features from both the time and frequency domains produces better performance in voice classification than traditional signal processing techniques alone, such as wavelets and LPC-to-CC. PMID:24288684
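The idea of treating a waveform as a time-series and summarizing it piece by piece can be sketched as follows. This is only a generic illustration under stated assumptions: the function `piecewise_stats` and its choice of statistics (mean, standard deviation, minimum, maximum) are hypothetical and are not the SFX method as defined by the author, which the abstract does not specify.

```python
from statistics import fmean, pstdev


def piecewise_stats(signal, n_segments):
    """Split a waveform into near-equal pieces and summarize each piece.

    Returns a flat feature vector: [mean, std, min, max] per segment.
    """
    n = len(signal)
    if n_segments < 1 or n_segments > n:
        raise ValueError("n_segments must be between 1 and len(signal)")
    features = []
    for i in range(n_segments):
        # Integer arithmetic keeps segments contiguous and non-empty.
        start = i * n // n_segments
        end = (i + 1) * n // n_segments
        piece = signal[start:end]
        features.extend([fmean(piece), pstdev(piece), min(piece), max(piece)])
    return features


# Toy waveform (assumption: invented samples, not real speech).
waveform = [0.0, 1.0, 0.0, -1.0, 0.0, 2.0, 0.0, -2.0]
feats = piecewise_stats(waveform, 2)
print(feats)
```

In a pipeline like the one described, a vector of this kind would be concatenated with spectral features before a feature selection step picks the influential ones.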
Wang, Rosalie H; Korotchenko, Alexandra; Hurd Clarke, Laura; Mortenson, W Ben; Mihailidis, Alex
2013-01-01
Collision avoidance technology has the capacity to facilitate safer mobility among older power mobility users with physical, sensory, and cognitive impairments, thus enabling independence for more users. Little is known about consumers' perceptions of collision avoidance. This article draws on interviews (29 users, 5 caregivers, and 10 prescribers) to examine views on design and utilization of this technology. Data analysis identified three themes: "useful situations or contexts," "technology design issues and real-life application," and "appropriateness of collision avoidance technology for a variety of users." Findings support ongoing development of collision avoidance for older adult users. The majority of participants supported the technology and felt that it might benefit current users and users with visual impairments, but might be unsuitable for people with significant cognitive impairments. Some participants voiced concerns regarding the risk for injury with power mobility use and some identified situations where collision avoidance might be beneficial (driving backward, avoiding dynamic obstacles, negotiating outdoor barriers, and learning power mobility use). Design issues include the need for context awareness, reliability, and user interface specifications. User desire to maintain driving autonomy supports development of collaboratively controlled systems. This research lays the groundwork for future development by illustrating consumer requirements for this technology.
To hear or not to hear: Voice processing under visual load.
Zäske, Romi; Perlich, Marie-Christin; Schweinberger, Stefan R
2016-07-01
Adaptation to female voices causes subsequent voices to be perceived as more male, and vice versa. This contrastive aftereffect disappears under spatial inattention to adaptors, suggesting that voices are not encoded automatically. According to Lavie, Hirst, de Fockert, and Viding (2004), the processing of task-irrelevant stimuli during selective attention depends on perceptual resources and working memory. Possibly due to their social significance, faces may be an exceptional domain: That is, task-irrelevant faces can escape perceptual load effects. Here we tested voice processing, to study whether voice gender aftereffects (VGAEs) depend on low or high perceptual (Exp. 1) or working memory (Exp. 2) load in a relevant visual task. Participants adapted to irrelevant voices while either searching digit displays for a target (Exp. 1) or recognizing studied digits (Exp. 2). We found that the VGAE was unaffected by perceptual load, indicating that task-irrelevant voices, like faces, can also escape perceptual-load effects. Intriguingly, the VGAE was increased under high memory load. Therefore, visual working memory load, but not general perceptual load, determines the processing of task-irrelevant voices.
[An across-scales analysis of the voice self-concept questionnaire (FESS)].
Nusseck, Manfred; Richter, Bernhard; Echternach, Matthias; Spahn, Claudia
2018-04-01
The questionnaire for the assessment of the voice self-concept (FESS) contains three sub-scales describing a person's relation to their own voice. The scales address the relationship with one's own voice, the awareness of the use of one's own voice, and the perception of the connection between voice and emotional changes. A comprehensive approach across the three scales that would support a simplified interpretation of the results was still missing. The FESS questionnaire was used in a sample of 536 German teachers. With a discrimination analysis, commonalities in the scale characteristics were investigated. For a comparative validation with voice health and psychological and physiological wellbeing, the Voice Handicap Index (VHI), the questionnaire for Work-related Behavior and Experience Patterns (AVEM), and the questionnaire for Health-related Quality of Life (SF-12) were additionally collected. The analysis provided four different groups of voice self-concept: group 1 with healthy values in the voice self-concept and wellbeing scales, group 2 with a low voice self-concept and mean wellbeing values, group 3 with a high awareness of the voice use and mean wellbeing values, and group 4 with low values in all scales. The results show that a combined approach across all scales of the questionnaire for the assessment of the voice self-concept enables a more detailed interpretation of the characteristics in the voice self-concept. The presented groups can be applied to support medical diagnoses. © Georg Thieme Verlag KG Stuttgart · New York.
Evolution of crossmodal reorganization of the voice area in cochlear-implanted deaf patients.
Rouger, Julien; Lagleyre, Sébastien; Démonet, Jean-François; Fraysse, Bernard; Deguine, Olivier; Barone, Pascal
2012-08-01
Psychophysical and neuroimaging studies in both animal and human subjects have clearly demonstrated that cortical plasticity following sensory deprivation leads to a brain functional reorganization that favors the spared modalities. In postlingually deaf patients, the use of a cochlear implant (CI) allows a recovery of the auditory function, which will probably counteract the cortical crossmodal reorganization induced by hearing loss. To study the dynamics of such reversed crossmodal plasticity, we designed a longitudinal neuroimaging study involving the follow-up of 10 postlingually deaf adult CI users engaged in a visual speechreading task. While speechreading activates Broca's area in normally hearing subjects (NHS), the activity level elicited in this region in CI patients is abnormally low and increases progressively with post-implantation time. Furthermore, speechreading in CI patients induces abnormal crossmodal activations in right anterior regions of the superior temporal cortex normally devoted to processing human voice stimuli (temporal voice-sensitive areas-TVA). These abnormal activity levels diminish with post-implantation time and tend towards the levels observed in NHS. First, our study revealed that the neuroplasticity after cochlear implantation involves not only auditory but also visual and audiovisual speech processing networks. Second, our results suggest that during deafness, the functional links between cortical regions specialized in face and voice processing are reallocated to support speech-related visual processing through cross-modal reorganization. Such reorganization allows a more efficient audiovisual integration of speech after cochlear implantation. These compensatory sensory strategies are later completed by the progressive restoration of the visuo-audio-motor speech processing loop, including Broca's area. Copyright © 2011 Wiley Periodicals, Inc.
Student Voice in the Mobile Phone Environment: A Grounded Theory Approach
ERIC Educational Resources Information Center
Daher, Wajeeh
2017-01-01
Student voice is recently attracting educational researchers' attention for its influence on various aspects of student lives and futures, as well as social life in general. Mobile technologies are proliferating in social and practical life. This article studies student voice in carrying out outdoor activities with mobile phones. Thirty middle…
ELearning Strategic Planning 2020: The Voice of Future Students as Stakeholders in Higher Education
ERIC Educational Resources Information Center
Finger, Glenn; Smart, Vicky
2013-01-01
Most universities are undertaking information technology (IT) strategic planning. The development of those plans often includes the voices of academics and sometimes engages alumni and current students. However, few engage and acknowledge the voice of future students. This paper is situated within the "Griffith University 2020 Strategic…
ERIC Educational Resources Information Center
Bedard, Catherine; Belin, Pascal
2004-01-01
Voice is the carrier of speech but is also an ''auditory face'' rich in information on the speaker's identity and affective state. Three experiments explored the possibility of a ''voice inversion effect,'' by analogy to the classical ''face inversion effect,'' which could support the hypothesis of a voice-specific module. Experiment 1 consisted…
Wilson-Sands, Cathy; Brahn, Pamela; Graves, Kristal
2015-01-01
Validating participants' ability to correctly perform cardiopulmonary resuscitation (CPR) skills during basic life support courses can be a challenge for nursing professional development specialists. This study compares two methods of basic life support training, instructor-led and computer-based learning with voice-activated manikins, to identify if one method is more effective for performance of CPR skills. The findings suggest that a computer-based learning course with voice-activated manikins is a more effective method of training for improved CPR performance.
Research on oral test modeling based on multi-feature fusion
NASA Astrophysics Data System (ADS)
Shi, Yuliang; Tao, Yiyue; Lei, Jun
2018-04-01
In this paper, the spectrum of the speech signal is taken as the input to feature extraction. The strengths of PCNN in image segmentation and related processing are used to process the speech spectrogram and extract features, exploring a new method that combines speech signal processing with image processing. Alongside the spectrogram features, MFCCs are added as spectral features and fused with the spectrogram features to further improve the accuracy of spoken language recognition. Because the fused input features are more complex and distinguishable, a Support Vector Machine (SVM) is used to construct the classifier; the features extracted from the test voice are then compared with the standard voice features to detect whether the speech meets the spoken standard. Experiments show that extracting features from spectrograms using PCNN is feasible, and that fusing image features with spectral features can improve detection accuracy.
[Pilot plan for a mobile health communication and monitoring system for people with diabetes].
Alcayaga, Claudia; Pérez, Janet Carola; Bustamante, Claudia; Campos, Solange; Lange, Ilta; Zuñiga, Francisca
2014-01-01
mHealth is a practical, useful, and available tool for one-way or two-way communication between health professionals and patients. It is especially promising in countries such as Chile, with widespread and growing mobile telephone coverage that is very well accepted by the public. Our objective is to demonstrate the process for designing a mobile communication and monitoring model, aimed at providing communication between professionals in primary healthcare centers and their patients, to facilitate timely diagnosis and initiation of treatment for type 2 diabetes. This model's characteristics include use of mobile telephones as a communication tool, a one-way method (from health centers to patients), integration into in-person care delivered at health centers, use of different communication strategies (voice and written), and integrated functioning using open-source software. The system includes personalized communication, automated voice communication, and automated written communication using short message service (SMS). We describe the strategies and components of the system. The lessons learned include the contribution from successful implementation of COSMOS (consolidated online modulated operating systems), a technological innovation, to support the health care of people with suspected type 2 diabetes in primary healthcare centers. Working together with teams in the field is essential to this achievement.
Voicing on Virtual and Face to Face Discussion
ERIC Educational Resources Information Center
Yamat, Hamidah
2013-01-01
This paper presents and discusses findings of a study conducted on pre-service teachers' experiences in virtual and face to face discussions. Technology has brought learning nowadays beyond the classroom context or time zone. The learning context and process no longer rely solely on face to face communications in the presence of a teacher.…
Guidi, Andrea; Salvi, Sergio; Ottaviano, Manuel; Gentili, Claudio; Bertschy, Gilles; de Rossi, Danilo; Scilingo, Enzo Pasquale; Vanello, Nicola
2015-11-06
Bipolar disorder is one of the most common mood disorders characterized by large and invalidating mood swings. Several projects focus on the development of decision support systems that monitor and advise patients, as well as clinicians. Voice monitoring and speech signal analysis can be exploited to reach this goal. In this study, an Android application was designed for analyzing running speech using a smartphone device. The application can record audio samples and estimate speech fundamental frequency, F0, and its changes. F0-related features are estimated locally on the smartphone, with some advantages with respect to remote processing approaches in terms of privacy protection and reduced upload costs. The raw features can be sent to a central server and further processed. The quality of the audio recordings, algorithm reliability and performance of the overall system were evaluated in terms of voiced segment detection and features estimation. The results demonstrate that mean F0 from each voiced segment can be reliably estimated, thus describing prosodic features across the speech sample. Instead, features related to F0 variability within each voiced segment performed poorly. A case study performed on a bipolar patient is presented. PMID:26561811
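Estimating F0 for a voiced segment, as described in the abstract above, can be illustrated with a minimal autocorrelation sketch. The abstract does not say which algorithm the application uses, so autocorrelation is a swapped-in, commonly used technique here; the function name `estimate_f0`, the 75 to 400 Hz search range, and the synthetic test tone are all assumptions.

```python
import math


def estimate_f0(frame, sample_rate, f_min=75.0, f_max=400.0):
    """Estimate F0 (Hz) of one voiced frame from its autocorrelation peak.

    The search is limited to lags that correspond to f_min..f_max.
    Returns None if no lag has positive autocorrelation.
    """
    mean = sum(frame) / len(frame)
    x = [s - mean for s in frame]  # remove DC offset
    lag_min = int(sample_rate / f_max)
    lag_max = min(int(sample_rate / f_min), len(x) - 1)
    best_lag, best_score = 0, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Unnormalized ("biased") autocorrelation: fewer terms at longer
        # lags, so the first period is preferred over its multiples.
        score = sum(x[i] * x[i + lag] for i in range(len(x) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag else None


# Synthetic 200 Hz tone sampled at 8 kHz (assumption: stands in for a voiced frame).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 200 * i / sample_rate) for i in range(400)]
f0 = estimate_f0(tone, sample_rate)
print(f0)
```

Running the estimator per voiced segment and averaging gives a mean-F0 feature of the kind the abstract reports as reliable; within-segment variability measures are more sensitive to estimation error, which is consistent with the weaker performance reported for them.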
Yarosh, Svetlana; Schueller, Stephen Matthew
2017-01-17
Positive psychological interventions for children have typically focused on direct adaptations of interventions developed for adults. As the community moves toward designing positive computing technologies to support child well-being, it is important to use a more participatory process that directly engages children's voices. Our objectives were, through a participatory design study, to understand children's interpretations of positive psychology concepts, as well as their perspectives on technologies that are best suited to enhance their engagement with practice of well-being skills. We addressed these questions through a content analysis of 434 design ideas, 51 sketches, and 8 prototype and videos, which emerged from a 14-session cooperative inquiry study with 12 child "happiness inventors." The study was part of a summer learning camp held at the children's middle school, which focused on teaching the invention process, teaching well-being skills drawn from positive psychology and related areas (gratitude, mindfulness, and problem solving), and iterating design ideas for technologies to support these skills. The children's ideas and prototypes revealed specific facets of how they interpreted gratitude (as thanking, being positive, and doing good things), mindfulness (as externally representing thought and emotions, controlling those thoughts and emotions, getting through unpleasant things, and avoiding forgetting something), and problem solving (as preventing bad decisions, seeking alternative solutions, and not dwelling on unproductive thoughts). This process also revealed that children emphasized particular technologies in their solutions. While desktop or laptop solutions were notably lacking, other ideas were roughly evenly distributed between mobile apps and embodied computing technologies (toys, wearables, etc). 
We also report on desired functionalities and approaches to engagement in the children's ideas, such as a notable emphasis on representing and responding to internal states. Our findings point to promising directions for the design of positive computing technologies targeted at children, with particular emphases on the perspectives, technologies, engagement approaches, and functionalities that appealed to the children in our study. The dual focus of the study on teaching skills while designing technologies is a novel methodology in the design of positive computing technologies intended to increase child well-being. ©Svetlana Yarosh, Stephen Matthew Schueller. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 17.01.2017.
Goffi-Fynn, Jeanne C; Carroll, Linda M
2013-05-01
This study was designed as a qualitative case study to demonstrate the process of diagnosis and treatment by a voice team managing a singer diagnosed with muscular tension dysphonia (MTD). The literature traditionally suggests that MTD is challenging to treat, and little of it directly addresses singers with MTD. Data collected included initial medical screening with a laryngologist, referral to a speech-language pathologist (SLP) specializing in voice disorders among singers, and adjunctive voice training with a voice teacher trained in vocology (singing voice specialist or SVS). Initial target goals with the SLP included reducing extrinsic laryngeal tension, using a relaxed laryngeal posture, and effective abdominal-diaphragmatic support for all phonation events. Balance of respiratory forces, laryngeal coordination, and use of optimum filtering of the source signal through resonance and articulatory awareness were emphasized. Further work with the SVS had three main goals: a lowered breathing pattern to aid in decreasing subglottic air pressure, a lowered vertical laryngeal position to allow for a relaxed laryngeal posture, and a top-down singing approach to encourage an easier, more balanced registration and better resonance. Initial results also emphasize the retraining of the subject toward a sensory rather than auditory mode of monitoring. Other areas of consideration include singers' training and vocal use, the psychological effects of MTD, the personalities potentially associated with it, and its relationship with stress. Finally, the results emphasize that a positive rapport with the subject and collaboration between all professionals involved in a singer's care are essential for recovery. Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
ERIC Educational Resources Information Center
Parkham, Shamika; McBroom, Aravis
2015-01-01
In this chapter, two student members of the Student Voice Collaborative (SVC) describe their experiences as "Student Shadows" during the annual Quality Review process, used throughout the New York Department of Education to evaluate how well schools are organized to support student achievement. They chronicle how this experience enhanced…
Valadez, Victor; Ysunza, Antonio; Ocharan-Hernandez, Esther; Garrido-Bustamante, Norma; Sanchez-Valerio, Araceli; Pamplona, Ma C
2012-09-01
Vocal Nodules (VN) are a functional voice disorder associated with voice misuse and abuse in children. There are few reports addressing vocal parameters in children with VN, especially after a period of vocal rehabilitation. The purpose of this study is to describe measurements of vocal parameters including Fundamental Frequency (FF), Shimmer (S), and Jitter (J), videonasolaryngoscopy examination and clinical perceptual assessment, before and after voice therapy in children with VN. Voice therapy was provided using visual support through Speech-Viewer software. Twenty patients with VN were studied. An acoustical analysis of voice was performed and compared with data from subjects from a control group matched by age and gender. Also, clinical perceptual assessment of voice and videonasolaryngoscopy were performed on all patients with VN. After a period of voice therapy, provided with visual support using Speech Viewer-III (SV-III-IBM) software, new acoustical analyses, perceptual assessments and videonasolaryngoscopies were performed. Before the onset of voice therapy, there was a significant difference (p<0.05) in mean FF, S and J, between the patients with VN and subjects from the control group. After the voice therapy period, a significant improvement (p<0.05) was found in all acoustic voice parameters. Moreover, perceptual voice analysis demonstrated improvement in all cases. Finally, videonasolaryngoscopy demonstrated that vocal nodules were no longer discernible on the vocal folds in any of the cases. SV-III software seems to be a safe and reliable method for providing voice therapy in children with VN. Acoustic voice parameters, perceptual data and videonasolaryngoscopy were significantly improved after the speech therapy period was completed. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
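The acoustic parameters named in the abstract above (fundamental frequency, jitter, shimmer) can be made concrete with a short sketch. It uses the common "local" definitions, in which jitter is the mean absolute difference between consecutive glottal periods relative to the mean period and shimmer is the same measure applied to cycle peak amplitudes. Whether the study's software uses exactly these definitions is an assumption, and the function names and sample values below are hypothetical.

```python
from statistics import fmean


def relative_perturbation(values):
    """Mean absolute difference of consecutive values, as a % of their mean."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return 100.0 * fmean(diffs) / fmean(values)


def local_jitter(periods_s):
    """Cycle-to-cycle period variation (%), from glottal periods in seconds."""
    return relative_perturbation(periods_s)


def local_shimmer(peak_amplitudes):
    """Cycle-to-cycle amplitude variation (%), from per-cycle peak amplitudes."""
    return relative_perturbation(peak_amplitudes)


def mean_f0(periods_s):
    """Mean fundamental frequency (Hz) as the reciprocal of the mean period."""
    return 1.0 / fmean(periods_s)


# Invented example cycles (assumption: not measurements from the study).
periods = [0.0040, 0.0041, 0.0039, 0.0040]
amplitudes = [0.80, 0.82, 0.78, 0.80]

print(mean_f0(periods), local_jitter(periods), local_shimmer(amplitudes))
```

Lower jitter and shimmer indicate a more regular voice signal, so comparing these values before and after therapy is one way to quantify the kind of change the abstract describes.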
Kotake, Kumiko; Suzukamo, Yoshimi; Kai, Ichiro; Iwanaga, Kazuyo; Takahashi, Aya
2017-03-01
The objective is to clarify whether social support and acquisition of alternative voice enhance the psychological adjustment of laryngectomized patients and which part of the psychological adjustment structure would be influenced by social support. We contacted 1445 patients enrolled in a patient association using mail surveys and 679 patients agreed to participate in the study. The survey items included age, sex, occupation, post-surgery duration, communication method, psychological adjustment (by the Nottingham Adjustment Scale Japanese Laryngectomy Version: NAS-J-L), and the formal support (by Hospital Patient Satisfaction Questionnaire-25: HPSQ-25). Social support and communication methods were added to the three-tier structural model of psychological adjustment shown in our previous study, and a covariance structure analysis was conducted. Formal/informal supports and acquisition of alternative voice influence only the "recognition of oneself as voluntary agent", the first tier of the three-tier structure of psychological adjustment. The results suggest that social support and acquisition of alternative voice may enhance the recognition of oneself as voluntary agent and promote the psychological adjustment.
Industrial platforms--a unique feature of the European Commission's biotechnology R&D programme.
Aguilar, A; Ingemansson, T; Hogan, S; Magnien, E
1998-09-01
The European Commission's research, technological development and demonstration programmes aim to strengthen European research and technological development, and to increase the competitiveness of European industries. The creation and development of Industrial Platforms play an important role in these processes by improving the transition from research to commercial application. Industrial Platforms are technology-based industrial groupings established by industry with the aims of enabling the exploitation or dissemination of research results, encouraging academic-industrial collaborations and providing their members with a means of voicing their opinion on present and future research policies.
26 CFR 1.509(a)-5 - Special rules of attribution.
Code of Federal Regulations, 2010 CFR
2010-04-01
... have an effective voice in the management of both the supporting and beneficiary organizations. (iii... membership organizations have an effective voice in the management of the supporting and beneficiary... organization meets the not-more-than-one-third support test set forth in section 509(a)(2)(B), amounts received...
26 CFR 1.509(a)-5 - Special rules of attribution.
Code of Federal Regulations, 2011 CFR
2011-04-01
... have an effective voice in the management of both the supporting and beneficiary organizations. (iii... membership organizations have an effective voice in the management of the supporting and beneficiary... organization meets the not-more-than-one-third support test set forth in section 509(a)(2)(B), amounts received...
Financial viability, medical technology, and hospital closures.
Prince, T R; Sullivan, J A
2000-01-01
Informed investments in medical technology and information systems are associated with the financial viability of community hospitals. Financially distressed facilities are 3 to 4 years behind proactive hospitals in supporting high-speed data, voice, and image transmissions to physicians in various locations. Impact of the Balanced Budget Act of 1997, fraud and abuse activities, Y2K issues, and lack of information systems support for physicians will result in 800 hospital closures and mergers of distressed hospitals over the next 60 months. These findings are based on the application of an eight-step framework for classifying information systems in health care entities. This framework is validated by survey instruments, site visits, interviews with senior management in 44 health care entities containing 576 hospitals, and judgments on the financial status of the health care entities.
McKinnon, David H; McLeod, Sharynne; Reilly, Sheena
2007-01-01
The aims of this study were threefold: to report teachers' estimates of the prevalence of speech disorders (specifically, stuttering, voice, and speech-sound disorders); to consider correspondence between the prevalence of speech disorders and gender, grade level, and socioeconomic status; and to describe the level of support provided to schoolchildren with speech disorders. Students with speech disorders were identified from 10,425 students in Australia using a 4-stage process: training in the data collection process, teacher identification, confirmation by a speech-language pathologist, and consultation with district special needs advisors. The prevalence of students with speech disorders was estimated; specifically, 0.33% of students were identified as stuttering, 0.12% as having a voice disorder, and 1.06% as having a speech-sound disorder. There was a higher prevalence of speech disorders in males than in females. As grade level increased, the prevalence of speech disorders decreased. There was no significant difference in the pattern of prevalence across the three speech disorders and four socioeconomic groups; however, students who were identified with a speech disorder were more likely to be in the higher socioeconomic groups. Finally, there was a difference between the perceived and actual level of support that was provided to these students. These prevalence figures are lower than those using initial identification by speech-language pathologists and similar to those using parent report.
Evaluation of Flight Deck-Based Interval Management Crew Procedure Feasibility
NASA Technical Reports Server (NTRS)
Wilson, Sara R.; Murdoch, Jennifer L.; Hubbs, Clay E.; Swieringa, Kurt A.
2013-01-01
Air traffic demand is predicted to increase over the next 20 years, creating a need for new technologies and procedures to support this growth in a safe and efficient manner. The National Aeronautics and Space Administration's (NASA) Air Traffic Management Technology Demonstration - 1 (ATD-1) will operationally demonstrate the feasibility of efficient arrival operations combining ground-based and airborne NASA technologies. The integration of these technologies will increase throughput, reduce delay, conserve fuel, and minimize environmental impacts. The ground-based tools include Traffic Management Advisor with Terminal Metering for precise time-based scheduling and Controller Managed Spacing decision support tools for better managing aircraft delay with speed control. The core airborne technology in ATD-1 is Flight deck-based Interval Management (FIM). FIM tools provide pilots with speed commands calculated using information from Automatic Dependent Surveillance - Broadcast. The precise merging and spacing enabled by FIM avionics and flight crew procedures will reduce excess spacing buffers and result in higher terminal throughput. This paper describes a human-in-the-loop experiment designed to assess the acceptability and feasibility of the ATD-1 procedures used in a voice communications environment. This experiment utilized the ATD-1 integrated system of ground-based and airborne technologies. Pilot participants flew a high-fidelity fixed base simulator equipped with an airborne spacing algorithm and a FIM crew interface. Experiment scenarios involved multiple air traffic flows into the Dallas-Fort Worth Terminal Radar Control airspace. Results indicate that the proposed procedures were feasible for use by flight crews in a voice communications environment. The delivery accuracy at the achieve-by point was within +/- five seconds and the delivery precision was less than five seconds. 
Furthermore, FIM speed commands occurred at a rate of less than one per minute, and pilots found the frequency of the speed commands to be acceptable at all times throughout the experiment scenarios.
Cognitive Behavioural Relating Therapy (CBRT) for voice hearers: a case study.
Paulik, Georgie; Hayward, Mark; Birchwood, Max
2013-10-01
There has been a recent focus on the interpersonal nature of the voice hearing experience, with studies showing that similar patterns of relating exist between voice hearer and voice as between voice hearer and social others. Two recent therapeutic approaches to voices, Cognitive Therapy for Command Hallucinations and Relating Therapy, have been developed to address patterns of relating and power imbalances between voice hearer and voice. This paper presents a novel intervention that combines elements of these two therapies, named Cognitive Behavioural Relating Therapy (CBRT). The application of CBRT is illustrated through a clinical case study. The clinical case study showed changes in patterns of relating, improved self-esteem and reductions in voice-related distress. The outcomes provide preliminary support for the utility of CBRT when working with voice hearers.
Pribuisiene, Ruta; Uloza, Virgilijus; Kardisiene, Vilija
2011-12-01
To determine impact of age, gender, and vocal training on voice characteristics of children aged 6-13 years. Voice acoustic and phonetogram parameters were determined for the group of 44 singing and 31 non-singing children. No impact of gender and/or age on phonetogram, acoustic voice parameters, and maximum phonation time was detected. Voice ranges of all children represented a pre-pubertal soprano type with a voice range of 22 semitones for non-singing and of 26 semitones for singing individuals. The mean maximum voice intensity was 81 dB. Vocal training had a positive impact on voice intensity parameters in girls. The presented data on average voice characteristics may be applicable in the clinical practice and provide relevant support for voice assessment.
Neurobiological correlates of emotional intelligence in voice and face perception networks
Karle, Kathrin N; Ethofer, Thomas; Jacob, Heike; Brück, Carolin; Erb, Michael; Lotze, Martin; Nizielski, Sophia; Schütz, Astrid; Wildgruber, Dirk; Kreifelts, Benjamin
2018-01-01
Facial expressions and voice modulations are among the most important communicational signals to convey emotional information. The ability to correctly interpret this information is highly relevant for successful social interaction and represents an integral component of emotional competencies that have been conceptualized under the term emotional intelligence. Here, we investigated the relationship of emotional intelligence as measured with the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT) with cerebral voice and face processing using functional and structural magnetic resonance imaging. MSCEIT scores were positively correlated with increased voice-sensitivity and gray matter volume of the insula accompanied by voice-sensitivity enhanced connectivity between the insula and the temporal voice area, indicating generally increased salience of voices. Conversely, in the face processing system, higher MSCEIT scores were associated with decreased face-sensitivity and gray matter volume of the fusiform face area. Taken together, these findings point to an alteration in the balance of cerebral voice and face processing systems in the form of an attenuated face-vs-voice bias as one potential factor underpinning emotional intelligence. PMID:29365199
Neurobiological correlates of emotional intelligence in voice and face perception networks.
Karle, Kathrin N; Ethofer, Thomas; Jacob, Heike; Brück, Carolin; Erb, Michael; Lotze, Martin; Nizielski, Sophia; Schütz, Astrid; Wildgruber, Dirk; Kreifelts, Benjamin
2018-02-01
Facial expressions and voice modulations are among the most important communicational signals to convey emotional information. The ability to correctly interpret this information is highly relevant for successful social interaction and represents an integral component of emotional competencies that have been conceptualized under the term emotional intelligence. Here, we investigated the relationship of emotional intelligence as measured with the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT) with cerebral voice and face processing using functional and structural magnetic resonance imaging. MSCEIT scores were positively correlated with increased voice-sensitivity and gray matter volume of the insula accompanied by voice-sensitivity enhanced connectivity between the insula and the temporal voice area, indicating generally increased salience of voices. Conversely, in the face processing system, higher MSCEIT scores were associated with decreased face-sensitivity and gray matter volume of the fusiform face area. Taken together, these findings point to an alteration in the balance of cerebral voice and face processing systems in the form of an attenuated face-vs-voice bias as one potential factor underpinning emotional intelligence.
Advanced distributed simulation technology: Digital Voice Gateway Reference Guide
NASA Astrophysics Data System (ADS)
Vanhook, Dan; Stadler, Ed
1994-01-01
The Digital Voice Gateway (referred to as the 'DVG' in this document) transmits and receives four full-duplex encoded speech channels over Ethernet. The information in this document applies only to DVGs running firmware of the version listed on the title page. This document, previously named Digital Voice Gateway Reference Guide, BBN Systems and Technologies Corporation, Cambridge, MA 02138, was revised for revision 2.00. This new revision changes the network protocol used by the DVG to comply with the SINCGARS radio simulation (for SIMNET 6.6.1). Because of the extensive changes in revision 2.00, a separate document was created rather than supplying change pages.
Borowiak, Kamila; von Kriegstein, Katharina
2016-01-01
The ability to recognise the identity of others is a key requirement for successful communication. Brain regions that respond selectively to voices exist in humans from early infancy on. Currently, it is unclear whether dysfunction of these voice-sensitive regions can explain voice identity recognition impairments. Here, we used two independent functional magnetic resonance imaging studies to investigate voice processing in a population that has been reported to have no voice-sensitive regions: autism spectrum disorder (ASD). Our results refute the earlier report that individuals with ASD have no responses in voice-sensitive regions: Passive listening to vocal, compared to non-vocal, sounds elicited typical responses in voice-sensitive regions in the high-functioning ASD group and controls. In contrast, the ASD group had a dysfunction in voice-sensitive regions during voice identity but not speech recognition in the right posterior superior temporal sulcus/gyrus (STS/STG)—a region implicated in processing complex spectrotemporal voice features and unfamiliar voices. The right anterior STS/STG correlated with voice identity recognition performance in controls but not in the ASD group. The findings suggest that right STS/STG dysfunction is critical for explaining voice recognition impairments in high-functioning ASD and show that ASD is not characterised by a general lack of voice-sensitive responses. PMID:27369067
The Memory Stack: New Technologies Harness Talking for Writing.
ERIC Educational Resources Information Center
Gannon, Maureen T.
In this paper, an elementary school teacher describes her experiences with the Memory Stack--a HyperCard based tool that can accommodate a voice recording, a graphic image, and a written text on the same card--which she designed to help her second and third grade students integrate their oral language fluency into the process of learning how to…
ERIC Educational Resources Information Center
Terwel, Bart W.; Harinck, Fieke; Ellemers, Naomi; Daamen, Dancker D. L.
2010-01-01
The implementation of carbon dioxide capture and storage technology (CCS) is considered an important climate change mitigation strategy, but the viability of this technology will depend on public acceptance of CCS policy decisions. The results of three experiments with students as participants show that whether or not interest groups receive an…
Nembhard, Ingrid M.; Yuan, Christina T.; Shabanova, Veronika; Cleary, Paul D.
2017-01-01
Background: Aspects of the patient care experience, despite being central to quality care, are often problematic. In particular, patients frequently report problems with timeliness of care. As yet, research offers little insight on setting characteristics that contribute to patients’ experience of timely care. Purpose: The aims of this study were to assess the relationship between organizational climate and patients’ reports of timely care in primary care clinics and to broadly examine the link between staff’s work environment and patient care experiences. We test hypotheses about the relationship between voice climate (staff feeling safe to speak up about issues) and reported timeliness of care, consistency in reported voice climate across professions, and how climate differences for various professions relate to timely care. Methodology: We conducted a cross-sectional study of employees (n = 1,121) and patients (n = 8,164) affiliated with 37 clinics participating in a statewide reporting initiative. Employees were surveyed about clinics’ voice climate, and patients were surveyed about the timeliness of care. Hypotheses were tested using analysis of variance and generalized estimating equations. Findings: Clinical and administrative staff (e.g., nurses and office assistants) reported clinics’ climates to be significantly less supportive of voice than did clinical leaders (e.g., physicians). The greater the difference in reported support for voice between professional groups, the less patients reported experiencing timely care in three respects: obtaining an appointment, seeing the doctor within 15 minutes of appointment time, and receiving test results. In clinics where staff reported climates supportive of voice, patients indicated receiving more timely care. Clinical leaders’ reports of voice climate had no relationship to reported timeliness of care.
Practical Implications: Our findings suggest the importance of clinics developing a strong climate for voice, particularly for clinical and administrative staff, to support better service quality for patients. PMID:24589927
Single-channel voice-response-system program documentation volume I : system description
DOT National Transportation Integrated Search
1977-01-01
This report documents the design and implementation of a Voice Response System (VRS) using Adaptive Differential Pulse Code Modulation (ADPCM) voice coding. Implemented on a Digital Equipment Corporation PDP-11/20, this VRS supports a single ...
Big Brother Or Trusted Allies How The Police Can Earn Community Support For Using Unmanned Aircraft
2017-12-01
many law enforcement agencies across the nation to use this technology and enable widespread surveillance of the American people. The thought of...daily activities. American Civil Liberties Union (ACLU) writer Jay Stanley has given voice to his organization’s concerns when he said that law...enforcement agencies and some of the people they serve. In the wake of high-profile and controversial uses of force across the nation, many seemingly
Benavides-Vaello, Sandra; Katz, Janet R; Peterson, Jeffery Chaichana; Allen, Carol B; Paul, Robbie; Charette-Bluff, Andrea Lelana; Morris, Phyllis
2014-04-01
This participatory study used PhotoVoice and qualitative description to (a) mentor baccalaureate nursing and college students in workforce diversity research; (b) explore barriers and facilitators encountered by rural American Indian, Hispanic, and other high school students when attending college and pursuing careers in nursing or the health sciences; and (c) model a process of social action to help existing and future students. Baccalaureate nursing and graduate students participated in all stages of research, including dissemination. Five themes emerged from analysis of PhotoVoice data: (a) being afraid; (b) believing; (c) taking small steps; (d) facing fears; and (e) using support systems. Findings underscore the importance of helping students participate in efforts to increase workforce diversity through research. Increasing nursing and health sciences workforce diversity may require strategies developed within and tailored to specific cultures and communities. Copyright 2014, SLACK Incorporated.
Chipps, Jennifer; Pimmer, Christoph; Brysiewicz, Petra; Walters, Fiona; Linxen, Sebastian; Ndebele, Thandi; Gröhbiel, Urs
2015-12-14
Empirical studies show the value of mobile phones as effective educational tools to support learning in the nursing profession, predominantly in high-income countries. The rapidly increasing prevalence of mobile phone technology in Africa nourishes hopes that these tools could be equally effective in low-resource contexts, specifically in efforts to achieve the health-related Millennium Development Goals. The purpose of this study was to investigate the perception and use of mobile phones as educational and professional tools by nurses in low-resource settings. A quantitative survey using self-administered questionnaires was conducted among rural advanced midwives. Fifty-six (49.6%) of the 113 rural-based midwives attending an advanced midwifery training programme at the University of KwaZulu-Natal, South Africa, filled in a questionnaire. The results showed that, although nurses regarded their technology competences as low and received very little official support from their educational and professional institutions, the majority frequently used mobile functions and applications to support their work and learning processes. They perceived mobile devices with their voice, text, and email functions as important tools for the educational and professional activities of searching for information and engaging with facilitators and peers from work and study contexts. To a lesser extent, the use of social networks, such as WhatsApp and Facebook, was also reported. It is concluded that educational institutions should support the appropriate use of mobile phones more systematically, particularly in relation to the development of mobile network literacy skills.
Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach.
Fang, Shih-Hau; Tsao, Yu; Hsiao, Min-Jing; Chen, Ji-Ying; Lai, Ying-Hui; Lin, Feng-Chuan; Wang, Chi-Te
2018-03-19
Computerized detection of voice disorders has attracted considerable academic and clinical interest in the hope of providing an effective screening method for voice diseases before endoscopic confirmation. This study proposes a deep-learning-based approach to detect pathological voice and examines its performance and utility compared with other automatic classification algorithms. This study retrospectively collected 60 normal voice samples and 402 pathological voice samples of 8 common clinical voice disorders in a voice clinic of a tertiary teaching hospital. We extracted Mel frequency cepstral coefficients from 3-second samples of a sustained vowel. The performances of three machine learning algorithms, namely, deep neural network (DNN), support vector machine, and Gaussian mixture model, were evaluated based on a fivefold cross-validation. Collective cases from the voice disorder database of MEEI (Massachusetts Eye and Ear Infirmary) were used to verify the performance of the classification mechanisms. The experimental results demonstrated that DNN outperforms Gaussian mixture model and support vector machine. Its accuracy in detecting voice pathologies reached 94.26% and 90.52% in male and female subjects, based on three representative Mel frequency cepstral coefficient features. When applied to the MEEI database for validation, the DNN also achieved a higher accuracy (99.32%) than the other two classification algorithms. By stacking several layers of neurons with optimized weights, the proposed DNN algorithm can fully utilize the acoustic features and efficiently differentiate between normal and pathological voice samples. Based on this pilot study, future research may proceed to explore more application of DNN from laboratory and clinical perspectives. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
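The fivefold cross-validation mentioned in the abstract above can be illustrated with a minimal Python sketch. This is not the authors' code: the names `k_fold_indices` and `cross_validate`, the fixed seed, and the unstratified shuffling are assumptions made for illustration, since the abstract does not say how the folds were drawn or how the classifiers were trained.

```python
import random

def k_fold_indices(n_samples, k=5, seed=0):
    """Split sample indices 0..n_samples-1 into k shuffled, near-equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th shuffled index starting at i,
    # so fold sizes differ by at most one sample.
    return [indices[i::k] for i in range(k)]

def cross_validate(n_samples, k, train_and_score):
    """Average train_and_score(train_idx, test_idx) over k held-out folds."""
    folds = k_fold_indices(n_samples, k)
    scores = []
    for i, test_idx in enumerate(folds):
        # Every fold except the i-th one is used for training.
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        scores.append(train_and_score(train_idx, test_idx))
    return sum(scores) / k

# 60 normal + 402 pathological samples, as reported in the abstract.
folds = k_fold_indices(60 + 402, k=5)
```

Here `train_and_score` is a hypothetical callback that would fit a classifier on the training indices and return its accuracy on the held-out indices; each sample is held out exactly once.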
Pinheiro, Ana P; Rezaii, Neguine; Rauber, Andréia; Nestor, Paul G; Spencer, Kevin M; Niznikiewicz, Margaret
2017-09-01
Abnormalities in self-other voice processing have been observed in schizophrenia, and may underlie the experience of hallucinations. More recent studies demonstrated that these impairments are enhanced for speech stimuli with negative content. Nonetheless, few studies probed the temporal dynamics of self versus nonself speech processing in schizophrenia and, particularly, the impact of semantic valence on self-other voice discrimination. In the current study, we examined these questions, and additionally probed whether impairments in these processes are associated with the experience of hallucinations. Fifteen schizophrenia patients and 16 healthy controls listened to 420 prerecorded adjectives differing in voice identity (self-generated [SGS] versus nonself speech [NSS]) and semantic valence (neutral, positive, and negative), while EEG data were recorded. The N1, P2, and late positive potential (LPP) ERP components were analyzed. ERP results revealed group differences in the interaction between voice identity and valence in the P2 and LPP components. Specifically, LPP amplitude was reduced in patients compared with healthy subjects for SGS and NSS with negative content. Further, auditory hallucinations severity was significantly predicted by LPP amplitude: the higher the SAPS "voices conversing" score, the larger the difference in LPP amplitude between negative and positive NSS. The absence of group differences in the N1 suggests that self-other voice processing abnormalities in schizophrenia are not primarily driven by disrupted sensory processing of voice acoustic information. The association between LPP amplitude and hallucination severity suggests that auditory hallucinations are associated with enhanced sustained attention to negative cues conveyed by a nonself voice. © 2017 Society for Psychophysiological Research.
Voice emotion perception and production in cochlear implant users.
Jiam, N T; Caldwell, M; Deroche, M L; Chatterjee, M; Limb, C J
2017-09-01
Voice emotion is a fundamental component of human social interaction and social development. Unfortunately, cochlear implant users are often forced to interface with highly degraded prosodic cues as a result of device constraints in extraction, processing, and transmission. As such, individuals with cochlear implants frequently demonstrate significant difficulty in recognizing voice emotions in comparison to their normal hearing counterparts. Cochlear implant-mediated perception and production of voice emotion is an important but relatively understudied area of research. However, a rich understanding of the voice emotion auditory processing offers opportunities to improve upon CI biomedical design and to develop training programs benefiting CI performance. In this review, we will address the issues, current literature, and future directions for improved voice emotion processing in cochlear implant users. Copyright © 2017 Elsevier B.V. All rights reserved.
Smartphones Offer New Opportunities in Clinical Voice Research.
Manfredi, C; Lebacq, J; Cantarella, G; Schoentgen, J; Orlandi, S; Bandini, A; DeJonckere, P H
2017-01-01
Smartphone technology provides new opportunities for recording standardized voice samples of patients and sending the files by e-mail to the voice laboratory. This drastically improves the collection of baseline data, as used in research on efficiency of voice treatments. However, the basic requirement is the suitability of smartphones for recording and digitizing pathologic voices (mainly characterized by period perturbations and noise) without significant distortion. In this experiment, two smartphones (a very inexpensive one and a high-level one) were tested and compared with direct microphone recordings in a soundproof room. The voice stimuli consisted of synthesized deviant voice samples (median of fundamental frequency: 120 and 200 Hz) with three levels of jitter and three levels of added noise. All voice samples were analyzed using PRAAT software. The results show high correlations between jitter, shimmer, and noise-to-harmonics ratio measured on the recordings via both smartphones, the microphone, and measured directly on the sound files from the synthesizer. Smartphones thus appear adequate for reliable recording and digitizing of pathologic voices. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
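As a rough illustration of the perturbation measures named in the abstract above, the sketch below computes one common "local" definition: the mean absolute difference between consecutive cycles divided by the mean value. Applied to cycle lengths it gives jitter, and applied to cycle peak amplitudes it gives shimmer. The abstract does not state which PRAAT variants were used, and the function name `local_perturbation` and the sample periods are hypothetical.

```python
def local_perturbation(values):
    """Mean absolute difference between consecutive values, divided by the mean value."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return (sum(diffs) / len(diffs)) / (sum(values) / len(values))

# Hypothetical glottal cycle lengths in seconds for a voice near 120 Hz.
periods = [0.00830, 0.00836, 0.00829, 0.00835, 0.00831]

# Local jitter expressed as a percentage of the mean period.
jitter_percent = 100 * local_perturbation(periods)
```

A perfectly periodic signal gives a value of zero, so larger values indicate more cycle-to-cycle irregularity.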
Discriminating male and female voices: differentiating pitch and gender.
Latinus, Marianne; Taylor, Margot J
2012-04-01
Gender is salient, socially critical information obtained from faces and voices, yet the brain processes underlying gender discrimination have not been well studied. We investigated neural correlates of gender processing of voices in two ERP studies. In the first, ERP differences were seen between female and male voices starting at 87 ms, in both spatial-temporal and peak analyses, particularly the fronto-central N1 and P2. As pitch differences may drive gender differences, the second study used normal, high- and low-pitch voices. The results of these studies suggested that differences in pitch produced early effects (27-63 ms). Gender effects were seen on N1 (120 ms) with implicit pitch processing (study 1), but were not seen with manipulations of pitch (study 2), demonstrating that N1 was modulated by attention. P2 (between 170 and 230 ms) discriminated male from female voices, independent of pitch. Thus, these data show that there are two stages in voice gender processing; a very early pitch or frequency discrimination and a later more accurate determination of gender at the P2 latency.
Error-dependent modulation of speech-induced auditory suppression for pitch-shifted voice feedback.
Behroozmand, Roozbeh; Larson, Charles R
2011-06-06
The motor-driven predictions about expected sensory feedback (efference copies) have been proposed to play an important role in recognition of sensory consequences of self-produced motor actions. In the auditory system, this effect was suggested to result in suppression of sensory neural responses to self-produced voices that are predicted by the efference copies during vocal production in comparison with passive listening to the playback of the identical self-vocalizations. In the present study, event-related potentials (ERPs) were recorded in response to upward pitch shift stimuli (PSS) with five different magnitudes (0, +50, +100, +200 and +400 cents) at voice onset during active vocal production and passive listening to the playback. Results indicated that the suppression of the N1 component during vocal production was largest for unaltered voice feedback (PSS: 0 cents), became smaller as the magnitude of PSS increased to 200 cents, and was almost completely eliminated in response to 400 cents stimuli. Findings of the present study suggest that the brain utilizes the motor predictions (efference copies) to determine the source of incoming stimuli and maximally suppresses the auditory responses to unaltered feedback of self-vocalizations. The reduction of suppression for 50, 100 and 200 cents and its elimination for 400 cents pitch-shifted voice auditory feedback support the idea that motor-driven suppression of voice feedback leads to distinctly different sensory neural processing of self vs. non-self vocalizations. This characteristic may enable the audio-vocal system to more effectively detect and correct for unexpected errors in the feedback of self-produced voice pitch compared with externally-generated sounds.
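The pitch-shift magnitudes in the abstract above are given in cents, a logarithmic unit in which 1200 cents equal one octave (a doubling of frequency). A minimal sketch of the conversion follows; the function names and the 220 Hz baseline are assumptions for illustration and do not come from the study.

```python
def cents_to_ratio(cents):
    """Frequency ratio corresponding to a pitch shift in cents (1200 cents = one octave)."""
    return 2.0 ** (cents / 1200.0)

def shift_frequency(f0_hz, cents):
    """Fundamental frequency after shifting f0_hz by the given number of cents."""
    return f0_hz * cents_to_ratio(cents)

# The five upward shift magnitudes listed in the abstract, applied to a
# hypothetical 220 Hz voice fundamental.
shifted = {c: shift_frequency(220.0, c) for c in (0, 50, 100, 200, 400)}
```

On this scale a 100-cent shift is one equal-tempered semitone, so the largest stimulus (400 cents) corresponds to a major third.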
Rumbach, Anna F
2013-11-01
To determine the anatomical and physiological nature of voice problems and their treatment in those group fitness instructors (GFIs) who have sought a medical diagnosis; the impact of voice disorders on quality of life and their contribution to activity limitations and participation restrictions; and the perceived attitudes and level of support from the industry at large in response to instructor's voice disorders and need for treatment. Prospective self-completion questionnaire design. Thirty-eight individuals (3 males and 35 females) currently active in the Australian fitness industry who had been diagnosed with a voice disorder completed an online self-completion questionnaire administered via SurveyMonkey. Laryngeal pathology included vocal fold nodules (N = 24), vocal fold cysts (N = 2), vocal fold hemorrhage (N = 1), and recurrent chronic laryngitis (N = 3). Eight individuals reported vocal strain and muscle tension dysphonia without concurrent vocal fold pathology. Treatment methods were variable, with 73.68% (N = 28) receiving voice therapy alone, 7.89% (N = 3) having voice therapy in combination with surgery, and 10.53% (N = 4) having voice therapy in conjunction with medication. Three individuals (7.89%) received no treatment for their voice disorder. During treatment, 82% of the cohort altered their teaching practices. Half of the cohort reported that their voice problems led to social withdrawal, decreased job satisfaction, and emotional distress. Greater than 65% also reported being dissatisfied with the level of industry and coworker support during the period of voice recovery. This study identifies that GFIs are susceptible to a number of voice disorders that impact their social and professional lives, and there is a need for more proactive training and advice on voice care for instructors, as well as those in management positions within the industry to address mixed approaches and opinions regarding the importance of voice care. 
Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Vocal Responses to Perturbations in Voice Auditory Feedback in Individuals with Parkinson's Disease
Liu, Hanjun; Wang, Emily Q.; Metman, Leo Verhagen; Larson, Charles R.
2012-01-01
Background: One of the most common symptoms of speech deficits in individuals with Parkinson's disease (PD) is significantly reduced vocal loudness and pitch range. The present study investigated whether abnormal vocalizations in individuals with PD are related to sensory processing of voice auditory feedback. Perturbations in loudness or pitch of voice auditory feedback are known to elicit short latency, compensatory responses in voice amplitude or fundamental frequency. Methodology/Principal Findings: Twelve individuals with Parkinson's disease and 13 age- and sex-matched healthy control subjects sustained a vowel sound (/α/) and received unexpected, brief (200 ms) perturbations in voice loudness (±3 or 6 dB) or pitch (±100 cents) auditory feedback. Results showed that, while all subjects produced compensatory responses in their voice amplitude or fundamental frequency, individuals with PD exhibited larger response magnitudes than the control subjects. Furthermore, for loudness-shifted feedback, upward stimuli resulted in shorter response latencies than downward stimuli in the control subjects but not in individuals with PD. Conclusions/Significance: The larger response magnitudes in individuals with PD compared with the control subjects suggest that processing of voice auditory feedback is abnormal in PD. Although the precise mechanisms of the voice feedback processing are unknown, results of this study suggest that abnormal voice control in individuals with PD may be related to dysfunctional mechanisms of error detection or correction in sensory feedback processing. PMID:22448258
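For a sense of scale, the loudness perturbations in the abstract above can be converted from decibels to linear amplitude ratios. The sketch assumes the dB values refer to sound pressure (amplitude) level, so that the ratio is 10 raised to dB/20; the abstract does not state this explicitly, and the function name is hypothetical.

```python
def db_to_amplitude_ratio(db):
    """Linear amplitude ratio for a level change in dB, assuming a 20*log10 amplitude scale."""
    return 10.0 ** (db / 20.0)

# The loudness perturbation sizes mentioned in the abstract, in both directions.
ratios = {db: db_to_amplitude_ratio(db) for db in (-6, -3, 3, 6)}
```

Under this assumption a 6 dB increase roughly doubles the feedback amplitude, and a 6 dB decrease roughly halves it.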
Simultaneous face and voice processing in schizophrenia.
Liu, Taosheng; Pinheiro, Ana P; Zhao, Zhongxin; Nestor, Paul G; McCarley, Robert W; Niznikiewicz, Margaret
2016-05-15
While several studies have consistently demonstrated abnormalities in the unisensory processing of face and voice in schizophrenia (SZ), the extent of abnormalities in the simultaneous processing of both types of information remains unclear. To address this issue, we used event-related potentials (ERP) methodology to probe the multisensory integration of face and non-semantic sounds in schizophrenia. EEG was recorded from 18 schizophrenia patients and 19 healthy control (HC) subjects in three conditions: neutral faces (visual condition-VIS); neutral non-semantic sounds (auditory condition-AUD); neutral faces presented simultaneously with neutral non-semantic sounds (audiovisual condition-AUDVIS). When compared with HC, the schizophrenia group showed less negative N170 to both face and face-voice stimuli; later P270 peak latency in the multimodal condition of face-voice relative to unimodal condition of face (the reverse was true in HC); reduced P400 amplitude and earlier P400 peak latency in the face but not in the voice-face condition. Thus, the analysis of ERP components suggests that deficits in the encoding of facial information extend to multimodal face-voice stimuli and that delays exist in feature extraction from multimodal face-voice stimuli in schizophrenia. In contrast, categorization processes seem to benefit from the presentation of simultaneous face-voice information. Timepoint by timepoint tests of multimodal integration did not suggest impairment in the initial stages of processing in schizophrenia. Published by Elsevier B.V.
Impact of data link technology on railroad dispatching operations
DOT National Transportation Integrated Search
2004-10-01
This study examined data link communication as an alternative channel to voice radio for railroad dispatchers. The goal was to compare how data link affected performance compared to voice radio only communications on measures related to safety, produ...
2011-04-01
Research Institute Technology-Based Training Research Unit Stephen L. Goldberg, Chief April 2011 United States Army...Research Unit Stephen L. Goldberg, Chief U.S. Army Research Institute for the Behavioral and Social Sciences 2511 Jefferson Davis Highway...statements of approval voiced by command elements. Rather, researchers must complete a program of transfer of training studies to show that variations in
Does technology really enhance nurse education?
Goodchild, Tim
2018-07-01
Technology has clearly impacted upon our working lives, and the purpose of this paper is to offer a critical insight into the ubiquitous presence of technology in nurse education. This paper argues that technology enhanced learning is predicated on the promise of potential and purported transformation of teaching and learning. It suggests that there is a lack of critical engagement in the academic field of technology enhanced learning, and adds a critical voice to some of the emerging arguments in this area. There is also a lack of systematic evidence supporting the enhancement offered by technology, and yet the technology enhanced project continues to persist. The discourse surrounding technology enhanced learning has become so dominant, so pervasive, that those of us within it can no longer see alternatives. But there are alternatives, and this paper argues that we need to challenge the dominance of technology enhanced learning, and become aware of its contingent nature. Copyright © 2018 Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Rogerson-Revell, Pamela; Nie, Ming; Armellini, Alejandro
2012-01-01
We researched the incorporation of three learning technologies (voice boards, i.e. voice-based discussion boards, e-book readers, and Second Life virtual world), into the Master's Programme in Applied Linguistics and Teaching English to Speakers of Other Languages offered by distance learning at the University of Leicester. This small-scale study…
Design and performance of mobile terminal for North American MSAT network
NASA Technical Reports Server (NTRS)
Fuji, Tsuyoshi; Tsuchiya, Makio; Isota, Yoji; Aoki, Katsuhiko
1995-01-01
A mobile terminal (MT) has been developed that can be configured for various applications, i.e. land mobile, transportable, fixed site, and maritime use. Medium gain and high gain antennas are available. The MT supports circuit switched voice and data service. Additionally, cellular roaming service, net radio, and Group 3 facsimile services are optionally provided. A Mitsubishi handheld portable phone can be used as a stand-alone portable cellular-only phone, or it can provide MSAT voice service when connected to the MT. The MT, which operates in L-band (1.5 GHz/1.6 GHz), satisfies an equivalent isotropically radiated power (EIRP) of 12.5 dBW minimum and a G/T of -16 dB/K minimum for the medium gain system and -12 dB/K for the high gain system. Excellent transmit phase noise and bit error rate performance is achieved by using new technologies.
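To put the quoted EIRP figure in linear units, a minimal sketch of the standard dBW-to-watts conversion follows. The function name is hypothetical and the code is only an arithmetic illustration, not part of the reported terminal design.

```python
def dbw_to_watts(dbw: float) -> float:
    """Convert a power level in dBW (decibels relative to 1 W) to watts."""
    return 10 ** (dbw / 10)


if __name__ == "__main__":
    eirp_min_dbw = 12.5  # minimum EIRP quoted in the abstract
    print(f"{eirp_min_dbw} dBW = {dbw_to_watts(eirp_min_dbw):.2f} W")
```

By the same power convention, the 4 dB difference between the two quoted G/T values (-16 dB/K and -12 dB/K) corresponds to a factor of about 2.5.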
Research on the method of precise alignment technology of atmospheric laser communication
NASA Astrophysics Data System (ADS)
Chen, Wen-jian; Gao, Wei; Duan, Yuan-yuan; Ma, Shi-wei; Chen, Jian
2016-10-01
Atmospheric laser communication uses a laser as the carrier to transmit voice, data, and image information through the atmosphere. Because of its high reliability, strong anti-interference ability, and ease of installation, it has great potential and room for development in the communications field. In the process of establishing communication, the capture, targeting, and tracking of the communication signal are the key technologies. This paper introduces a method of targeting the signal spot in atmospheric laser communication: analog signals are directly added and subtracted and then normalized to obtain the target azimuth information, which drives the servo system to achieve precise alignment and tracking.
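The add, subtract, and normalize step described above can be illustrated with a minimal sketch. It assumes a four-quadrant photodetector, which the abstract does not name; the quadrant layout, the function name, and the sign conventions are all hypothetical choices made for illustration, and the paper performs the operation on analog signals rather than in software.

```python
def spot_offset(a: float, b: float, c: float, d: float) -> tuple[float, float]:
    """Estimate the normalized spot offset from four quadrant signals.

    Assumed (hypothetical) layout, looking at the detector face:
        B | A
        --+--
        C | D
    Returns (x, y), each in [-1, 1]; (0, 0) means the spot is centered.
    """
    total = a + b + c + d
    if total <= 0:
        raise ValueError("no signal on the detector")
    x = ((a + d) - (b + c)) / total  # right half minus left half
    y = ((a + b) - (c + d)) / total  # top half minus bottom half
    return x, y


if __name__ == "__main__":
    # A spot displaced toward the right-hand quadrants (A and D).
    print(spot_offset(2.0, 1.0, 1.0, 2.0))
```

Dividing by the total signal makes the error estimate insensitive to overall received power, which is one plausible reason for the normalization step in a link whose intensity fluctuates with atmospheric conditions.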
Technology Supported Self-Development for Soldiers Deploying to Afghanistan
2010-06-11
iPhone Cost: $2.99 from iTunes Store; free “Lite” version Easy to use flashcard program Download premade cards from flashcard exchange iPhone...phrases that are included in the program. uTalk Dari can be purchased from the iTunes store and its website is called uTalk–Now on iPhone and iPod...High Learn 275 Dari words on your iPhone Cost: $9.99 from iTunes Store Easy, medium, and hard games in 9 categories Male and Female voice
Literature review of voice recognition and generation technology for Army helicopter applications
NASA Astrophysics Data System (ADS)
Christ, K. A.
1984-08-01
This report is a literature review on the topics of voice recognition and generation. Areas covered are: manual versus vocal data input, vocabulary, stress and workload, noise, protective masks, feedback, and voice warning systems. Results of the studies presented in this report indicate that voice data entry has less of an impact on a pilot's flight performance, during low-level flying and other difficult missions, than manual data entry. However, the stress resulting from such missions may cause the pilot's voice to change, reducing the recognition accuracy of the system. The noise present in helicopter cockpits also causes the recognition accuracy to decrease. Noise-cancelling devices are being developed and improved upon to increase the recognition performance in noisy environments. Future research in the fields of voice recognition and generation should be conducted in the areas of stress and workload, vocabulary, and the types of voice generation best suited for the helicopter cockpit. Also, specific tasks should be studied to determine whether voice recognition and generation can be effectively applied.
Designing interaction, voice, and inclusion in AAC research.
Pullin, Graham; Treviranus, Jutta; Patel, Rupal; Higginbotham, Jeff
2017-09-01
The ISAAC 2016 Research Symposium included a Design Stream that examined timely issues across augmentative and alternative communication (AAC), framed in terms of designing interaction, designing voice, and designing inclusion. Each is a complex term with multiple meanings; together they represent challenging yet important frontiers of AAC research. The Design Stream was conceived by the four authors, researchers who have been exploring AAC and disability-related design throughout their careers, brought together by a shared conviction that designing for communication implies more than ensuring access to words and utterances. Each of these presenters came to AAC from a different background: interaction design, inclusive design, speech science, and social science. The resulting discussion among 24 symposium participants included controversies about the role of technology, tensions about independence and interdependence, and a provocation about taste. The paper concludes by proposing new directions for AAC research: (a) new interdisciplinary research could combine scientific and design research methods, as distant yet complementary as microanalysis and interaction design, (b) new research tools could seed accessible and engaging contextual research into voice within a social model of disability, and (c) new open research networks could support inclusive, international and interdisciplinary research.
Speech and swallowing disorders in Parkinson disease.
Sapir, Shimon; Ramig, Lorraine; Fox, Cynthia
2008-06-01
To review recent research and clinical studies pertaining to the nature, diagnosis, and treatment of speech and swallowing disorders in Parkinson disease. Although some studies indicate improvement in voice and speech with dopamine therapy and deep brain stimulation of the subthalamic nucleus, others show minimal or adverse effects. Repetitive transcranial magnetic stimulation of the mouth motor cortex and injection of collagen in the vocal folds have preliminary data supporting improvement in phonation in people with Parkinson disease. Treatments focusing on vocal loudness, specifically LSVT LOUD (Lee Silverman Voice Treatment), have been effective for the treatment of speech disorders in Parkinson disease. Changes in brain activity due to LSVT LOUD provide preliminary evidence for neural plasticity. Computer-based technology makes the Lee Silverman Voice Treatment available to a large number of users. A rat model for studying neuropharmacologic effects on vocalization in Parkinson disease has been developed. New diagnostic methods of speech and swallowing are also available as the result of recent studies. Speech rehabilitation with the LSVT LOUD is highly efficacious and scientifically tested. There is a need for more studies to improve understanding, diagnosis, prevention, and treatment of speech and swallowing disorders in Parkinson disease.
Systematic analysis of CMOS-micromachined inductors with application to mixer matching circuits
NASA Astrophysics Data System (ADS)
Wu, Jerry Chun-Li
The growing demand for consumer voice and data communication systems and military communication applications has created a need for low-power, low-cost, high-performance radio-frequency (RF) front-ends. To achieve this goal, bringing passive components, especially inductors, to silicon is imperative. On-chip passive components such as inductors and capacitors generally enhance the reliability and efficiency of silicon-integrated RF cells. They can provide circuit solutions with superior performance and contribute to a higher level of integration. With passive components on chip, there is a great opportunity to have transformers, filters, and matching networks on chip. However, inductors on silicon have a low quality factor (Q) due to both substrate and metal loss. This dissertation demonstrates the systematic analysis of inductors fabricated using standard complementary metal-oxide-semiconductor (CMOS) and micro-electro-mechanical systems (MEMS) technologies. We report system-on-chip inductor modeling, simulation, and measurements of effective inductance and quality factors. In this analysis methodology, a number of systematic simulations are performed on regular and micromachined inductors with different parameters such as spiral topology, number of turns, outer diameter, thickness, and percentage of substrate removed by using micromachining technologies. Three different novel support structures of the micromachined spiral inductor are proposed, analyzed, and implemented for larger size suspended inductors. The sensitivity of the structure support and different degrees of substrate etching by post-processing is illustrated. The results provide guidelines for the selection of inductor parameters, post-processing methodologies, and spiral supports to meet the RF design specifications and the stability requirements for mobile communication.
The proposed CMOS-micromachined inductor is used in a low-cost double-balanced Gilbert mixer with an on-chip matching network. The integrated mixer inductor was implemented and tested to prove the concept.
Cerebral Processing of Voice Gender Studied Using a Continuous Carryover fMRI Design
Pernet, Cyril; Latinus, Marianne; Crabbe, Frances; Belin, Pascal
2013-01-01
Normal listeners effortlessly determine a person's gender by voice, but the cerebral mechanisms underlying this ability remain unclear. Here, we demonstrate 2 stages of cerebral processing during voice gender categorization. Using voice morphing along with an adaptation-optimized functional magnetic resonance imaging design, we found that secondary auditory cortex including the anterior part of the temporal voice areas in the right hemisphere responded primarily to acoustical distance with the previously heard stimulus. In contrast, a network of bilateral regions involving inferior prefrontal and anterior and posterior cingulate cortex reflected perceived stimulus ambiguity. These findings suggest that voice gender recognition involves neuronal populations along the auditory ventral stream responsible for auditory feature extraction, functioning in pair with the prefrontal cortex in voice gender perception. PMID:22490550
Approaching Praxis: YPAR as Critical Pedagogical Process in a College Access Program
ERIC Educational Resources Information Center
Scott, Mary Alice; Pyne, Kimberly B.; Means, Darris R.
2015-01-01
To address the persistent failure of schooling to support underserved students, youth participatory action research (YPAR) has emerged as an alternative and critical paradigm for educational practice. YPAR re-centers authority on marginalized voices and understands research as a tool for social change. Grounded in critical pedagogy, such projects…
Advising Doctorate Candidates and Candidates' Views during the Dissertation Process
ERIC Educational Resources Information Center
Hilliard, Ann T.
2013-01-01
In order to provide candidates with effective advisement, it is important for the advisor to continue to practice positive professional relationships and provide relevant academic support to candidates. The advisor should work closely with other faculty members and needs to listen to the voices of candidates to ensure candidates' success. What…
Involvement of the left insula in the ecological validity of the human voice
Tamura, Yuri; Kuriki, Shinji; Nakano, Tamami
2015-01-01
A subtle difference between a real human and an artificial object that resembles a human evokes an impression of a large qualitative difference between them. This suggests the existence of a neural mechanism that processes the sense of humanness. To examine the presence of such a mechanism, we compared the behavioral and brain responses of participants who listened to human and artificial singing voices created from vocal fragments of a real human voice. The behavioral experiment showed that the song sung by human voices more often elicited positive feelings and feelings of humanness than the same song sung by artificial voices, although the lyrics, melody, and rhythm were identical. Functional magnetic resonance imaging revealed significantly higher activation in the left posterior insula in response to human voices than in response to artificial voices. Insular activation was not merely evoked by differences in acoustic features between the voices. Therefore, these results suggest that the left insula participates in the neural processing of the ecological quality of the human voice. PMID:25739519
On the definition and interpretation of voice selective activation in the temporal cortex
Bethmann, Anja; Brechmann, André
2014-01-01
Regions along the superior temporal sulci and in the anterior temporal lobes have been found to be involved in voice processing. It has even been argued that parts of the temporal cortices serve as voice-selective areas. Yet, evidence for voice-selective activation in the strict sense is still missing. The current fMRI study aimed at assessing the degree of voice-specific processing in different parts of the superior and middle temporal cortices. To this end, voices of famous persons were contrasted with widely different categories, which were sounds of animals and musical instruments. The argumentation was that only brain regions with statistically proven absence of activation by the control stimuli may be considered as candidates for voice-selective areas. Neural activity was found to be stronger in response to human voices in all analyzed parts of the temporal lobes except for the middle and posterior STG. More importantly, the activation differences between voices and the other environmental sounds increased continuously from the mid-posterior STG to the anterior MTG. Here, only voices but not the control stimuli excited an increase of the BOLD response above a resting baseline level. The findings are discussed with reference to the function of the anterior temporal lobes in person recognition and the general question on how to define selectivity of brain regions for a specific class of stimuli or tasks. In addition, our results corroborate recent assumptions about the hierarchical organization of auditory processing building on a processing stream from the primary auditory cortices to anterior portions of the temporal lobes. PMID:25071527
Evidence-Based Clinical Voice Assessment: A Systematic Review
ERIC Educational Resources Information Center
Roy, Nelson; Barkmeier-Kraemer, Julie; Eadie, Tanya; Sivasankar, M. Preeti; Mehta, Daryush; Paul, Diane; Hillman, Robert
2013-01-01
Purpose: To determine what research evidence exists to support the use of voice measures in the clinical assessment of patients with voice disorders. Method: The American Speech-Language-Hearing Association (ASHA) National Center for Evidence-Based Practice in Communication Disorders staff searched 29 databases for peer-reviewed English-language…
The use of hypermedia to increase the productivity of software development teams
NASA Technical Reports Server (NTRS)
Coles, L. Stephen
1991-01-01
Rapid progress in low-cost commercial PC-class multimedia workstation technology will potentially have a dramatic impact on the productivity of distributed work groups of 50-100 software developers. Hypermedia/multimedia involves the seamless integration in a graphical user interface (GUI) of a wide variety of data structures, including high-resolution graphics, maps, images, voice, and full-motion video. Hypermedia will normally require the manipulation of large dynamic files for which relational database technology and SQL servers are essential. Basic machine architecture, special-purpose video boards, video equipment, optical memory, software needed for animation, network technology, and the anticipated increase in productivity that will result from the introduction of hypermedia technology are covered. It is suggested that the cost of the hardware and software to support an individual multimedia workstation will be on the order of $10,000.
The Johns Hopkins Medical Institutions' Premise Distribution Plan
Barta, Wendy; Buckholtz, Howard; Johnston, Mark; Lenhard, Raymond; Tolchin, Stephen; Vienne, Donald
1987-01-01
A Premise Distribution Plan is being developed to address the growing voice and data communications needs at Johns Hopkins Medical Institutions. More specifically, the use of a rapidly expanding Ethernet computer network and a new Integrated Services Digital Network (ISDN) Digital Centrex system must be planned to provide easy, reliable and cost-effective data and voice communications services. Existing Premise Distribution Systems are compared along with voice and data technologies which would use them.
Utilization of Internet Protocol-Based Voice Systems in Remote Payload Operations
NASA Technical Reports Server (NTRS)
Best, Susan; Nichols, Kelvin; Bradford, Robert
2003-01-01
This viewgraph presentation provides an overview of a proposed voice communication system for use in remote payload operations performed on the International Space Station. The system, Internet Voice Distribution System (IVoDS), would make use of existing Internet protocols, and offer a number of advantages over the system currently in use. Topics covered include: system description and operation, system software and hardware, system architecture, project status, and technology transfer applications.
ERIC Educational Resources Information Center
Stuart, Shannon
2012-01-01
Schools have struggled for decades to provide expensive augmentative and alternative communication (AAC) resources for autistic students with communication challenges. Clunky voice output devices, often included in students' individualized education plans, cost about $8,000, a difficult expense to cover in hard times. However, mobile technology is…
"Teachers' Voices for School Change": An Introduction to Educative Research.
ERIC Educational Resources Information Center
Jacobs, Mary-Ellen
1993-01-01
Reviews a book, "Teachers' Voices for School Change" by Andrew Gitlin, on educative research and teacher voice, examining the educative research process which grounds reflection in the life histories of teacher researchers, presenting four case studies on educative research, and reflecting on the educative research process itself.…
Iris Cryptography for Security Purpose
NASA Astrophysics Data System (ADS)
Ajith, Srighakollapu; Balaji Ganesh Kumar, M.; Latha, S.; Samiappan, Dhanalakshmi; Muthu, P.
2018-04-01
In today's world, security has become a major concern for everyone. Hacking is a major issue because hackers are everywhere, and although technology has advanced, there are still many cases where it fails to provide security. Engineers and scientists have been developing new security products based on biometric sensing, such as face recognition, pattern recognition, gesture recognition, and voice authentication, but these devices fail to achieve the expected results. In this work, we present an approach to generate a unique secure key using the iris template. The iris templates are processed using well-defined processing techniques and are stored, traversed, and utilized using an encryption and decryption process. From this work, we conclude that iris cryptography gives the expected results for securing data from eavesdroppers.
ERIC Educational Resources Information Center
Ianes, D.; Cappello, S.; Demo, H.
2017-01-01
Student voice has become increasingly important in educational research at an international level. Research in Italy on school integration of students with disabilities has almost entirely left behind student voice. The very few studies based on student voice suggest that there is a mismatch between student and teacher voices when faced with…
Analogue and digital linear modulation techniques for mobile satellite
NASA Technical Reports Server (NTRS)
Whitmarsh, W. J.; Bateman, A.; Mcgeehan, J. P.
1990-01-01
The choice of modulation format for a mobile satellite service is complex. The subjective performance of candidate modulation schemes and voice coder technologies is summarized. It is shown that good performance can be achieved with both analogue and digital voice systems, although the analogue system gives superior performance in fading. The results highlight the need for flexibility in the choice of signaling format. Linear transceiver technology capable of using many forms of narrowband modulation is described.
Treatment for the service member: a description of innovative interventions.
Yancosek, Katie; Daugherty, Stephanie E; Cancio, Leonard
2008-01-01
This article describes advances to the rehabilitation programs at major military medical centers since the onset of operations in Iraq and Afghanistan. The demands on military health care in times of war produce advances in the various rehabilitation professions. This article describes two programs that use new technologies for the care of military patients with devastating injuries to the upper extremity. One project relates to the application of voice-sensitive technology. The other project describes the utilization of virtual reality technology through a Firearm Training System. The article also explains an adaptive sports program and how recreation is part of a robust community reintegration program. Lastly, this article discusses the Center for the Intrepid, which is one of two new amputee care centers built to support the advanced rehabilitation of war-wounded amputees.
Constructing Visually-Based Digital Conversations in EFL with VoiceThread
ERIC Educational Resources Information Center
Kent, David
2017-01-01
VoiceThread holds potential to provide students who rarely speak in class a means to create visually-based digital conversations. In light of this, pedagogical affordances of the tool are considered, along with efficacy behind VoiceThread development within English as a Foreign Language contexts. Instructional strategies, supported by examples,…
Error-dependent modulation of speech-induced auditory suppression for pitch-shifted voice feedback
2011-01-01
Background: The motor-driven predictions about expected sensory feedback (efference copies) have been proposed to play an important role in recognition of sensory consequences of self-produced motor actions. In the auditory system, this effect was suggested to result in suppression of sensory neural responses to self-produced voices that are predicted by the efference copies during vocal production in comparison with passive listening to the playback of the identical self-vocalizations. In the present study, event-related potentials (ERPs) were recorded in response to upward pitch shift stimuli (PSS) with five different magnitudes (0, +50, +100, +200 and +400 cents) at voice onset during active vocal production and passive listening to the playback. Results: The results indicated that the suppression of the N1 component during vocal production was largest for unaltered voice feedback (PSS: 0 cents), became smaller as the magnitude of PSS increased to 200 cents, and was almost completely eliminated in response to 400 cents stimuli. Conclusions: Findings of the present study suggest that the brain utilizes the motor predictions (efference copies) to determine the source of incoming stimuli and maximally suppresses the auditory responses to unaltered feedback of self-vocalizations. The reduction of suppression for 50, 100 and 200 cents and its elimination for 400 cents pitch-shifted voice auditory feedback support the idea that motor-driven suppression of voice feedback leads to distinctly different sensory neural processing of self vs. non-self vocalizations. This characteristic may enable the audio-vocal system to more effectively detect and correct for unexpected errors in the feedback of self-produced voice pitch compared with externally-generated sounds. PMID:21645406
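For readers unfamiliar with the cent scale used for the pitch shift stimuli above, the following minimal sketch converts a shift in cents to a frequency ratio, using the standard definition of 1200 cents per octave. The function name and the 200 Hz example voice are hypothetical and are not taken from the study.

```python
def cents_to_ratio(cents: float) -> float:
    """Convert a pitch shift in cents to a frequency ratio (1200 cents = 1 octave)."""
    return 2 ** (cents / 1200)


if __name__ == "__main__":
    base_f0_hz = 200.0  # hypothetical fundamental frequency, for illustration only
    # Shift magnitudes taken from the abstract.
    for shift in (0, 50, 100, 200, 400):
        shifted = base_f0_hz * cents_to_ratio(shift)
        print(f"+{shift} cents -> {shifted:.1f} Hz")
```

On this scale 100 cents is one equal-tempered semitone, so the largest stimulus (+400 cents) corresponds to a shift of four semitones.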
Analysis and application of intelligence network based on FTTH
NASA Astrophysics Data System (ADS)
Feng, Xiancheng; Yun, Xiang
2008-12-01
With the continued rapid growth of the Internet, new network services emerge in an endless stream, especially network games, video conferencing, video on demand, etc., and bandwidth requirements increase continuously. Network and optical device technologies are developing rapidly. With its enormous bandwidth, FTTH supports all present and future services, including traditional telecommunication, data, and TV services, as well as future digital TV and VOD. Because of this bandwidth, FTTH is regarded as the final solution for broadband networks and the ultimate goal of optical access network development. This paper first introduces the main services that FTTH supports and analyzes key technologies such as FTTH system composition, topological structure, multiplexing, optical cable, and devices. It focuses on two realization methods: PON and P2P technology. It then proposes that an FTTH solution can support comprehensive access (services such as broadband data, voice, video, and narrowband private lines). Finally, it presents engineering applications of FTTH in residential districts and buildings, which bring enormous economic and social benefits.
Stacy, Jane N; Schwartz, Steven M; Ershoff, Daniel; Shreve, Marilyn Standifer
2009-10-01
The current study presents the impact of a behavior change program to increase statin adherence using interactive voice response (IVR) technology. Subjects were affiliated with a large health benefit company, were prescribed a statin (index) and had no lipid-lowering pharmacy claims in the previous 6 months, and were continuously enrolled in the plan for 12 months prior and 6 months post index statin. Potential subjects (1219) were contacted by the IVR system; 497 gave informed consent. Subjects were asked to respond to 15 questions from the IVR that were guided by several behavior change theories. At the conclusion of the questions, subjects were randomly assigned to either a control group (n = 244), who received generic feedback at the conclusion of the call and were then mailed a generic cholesterol guide, or an experimental group (n = 253), who received tailored feedback based on their cholesterol-related knowledge, attitudes, beliefs, and perceived barriers to medication adherence, and were mailed a tailored guide that reinforced similar themes. Subjects in the experimental group had the opportunity to participate in 2 additional tailored IVR support calls. The primary dependent variable was 6-month point prevalence, defined as claims evidence of a statin on days 121-180 post index statin. Subjects in the experimental group had a significantly higher 6-month point prevalence than the controls (70.4% vs. 60.7%, P < 0.05). Results of this study suggest that a behavioral support program using IVR technology can be a cost-effective modality to address the important public health problem of patient nonadherence with statin medication.
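As a rough plausibility check on the reported group difference (70.4% vs. 60.7%, P < 0.05), the sketch below runs a two-sided two-proportion z-test. The abstract does not say which test the authors used, so the choice of test is an assumption, and the success counts (178 of 253 and 148 of 244) are back-calculated from the rounded percentages rather than reported in the source.

```python
import math


def two_proportion_z_test(x1: int, n1: int, x2: int, n2: int) -> tuple[float, float]:
    """Two-sided z-test for equality of two proportions (pooled variance).

    Returns (z, p_value). This is a textbook test chosen for illustration;
    it is not necessarily the analysis used in the study.
    """
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail area
    return z, p_value


if __name__ == "__main__":
    # Counts inferred from the rounded percentages in the abstract (assumption).
    z, p = two_proportion_z_test(178, 253, 148, 244)
    print(f"z = {z:.2f}, two-sided p = {p:.3f}")
```

Under these assumptions the test statistic falls in the range where the two-sided p-value is below 0.05, which is consistent with the significance level stated in the abstract.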
Criteria for Appraising Computer-Based Simulations for Teaching Arabic as a Foreign Language
2005-04-01
activity abroad that most contributed to their increase in fluency was ‘hanging out’ with Russian friends, defined as visiting, eating, and watching...approach is testing that learning has indeed occurred, in that a teacher must evaluate not only linguistic accuracy but also fluency in the proper...written responses, with student input analyzed using voice processing technology. Cultural Proficiency in Arabic Fluency in a foreign language
Developing Student Voices on the Internet.
ERIC Educational Resources Information Center
Dresang, Eliza T.
1997-01-01
Books and online discussion groups encourage youth to develop strong narrative voices. Includes an annotated bibliography of books and Internet sites dealing with discovering the self and others; exploring race, culture, archeology, technology, war, poverty, gender and urban problems; creating and critiquing stories; and publishing industry…
Processing of voices in deafness rehabilitation by auditory brainstem implant.
Coez, Arnaud; Zilbovicius, Monica; Ferrary, Evelyne; Bouccara, Didier; Mosnier, Isabelle; Ambert-Dahan, Emmanuèle; Kalamarides, Michel; Bizaguet, Eric; Syrota, André; Samson, Yves; Sterkers, Olivier
2009-10-01
The superior temporal sulcus (STS) is specifically involved in processing the human voice. Profound acquired deafness by post-meningitis ossified cochlea and by bilateral vestibular schwannoma in neurofibromatosis type 2 patients are two indications for auditory brainstem implantation (ABI). In order to objectively measure the cortical voice processing of a group of ABI patients, we studied the activation of the human temporal voice areas (TVA) by PET H(2)(15)O, performed in a group of implanted deaf adults (n=7) with more than two years of auditory brainstem implant experience, with an intelligibility score average of 17%+/-17 [mean+/-SD]. Relative cerebral blood flow (rCBF) was measured in the three following conditions: during silence, during passive listening to human voice, and during passive listening to non-voice stimuli. Compared to silence, the activations induced by voice and non-voice stimuli were bilaterally located in the superior temporal regions. However, compared to non-voice stimuli, the voice stimuli did not induce specific supplementary activation of the TVA along the STS. The comparison of the ABI group with a normal-hearing control group (n=7) showed that TVA activations were significantly enhanced in the control group. ABI allowed the transmission of sound stimuli to temporal brain regions but failed to transmit the specific cues of the human voice to the TVA. Moreover, between groups, during the silent condition, brain visual regions showed higher rCBF in the ABI group, whereas temporal brain regions had higher rCBF in the control group. ABI patients had consequently developed enhanced visual strategies to keep interacting with their environment.
Voices to reckon with: perceptions of voice identity in clinical and non-clinical voice hearers
Badcock, Johanna C.; Chhabra, Saruchi
2013-01-01
The current review focuses on the perception of voice identity in clinical and non-clinical voice hearers. Identity perception in auditory verbal hallucinations (AVH) is grounded in the mechanisms of human (i.e., real, external) voice perception, and shapes the emotional (distress) and behavioral (help-seeking) response to the experience. Yet, the phenomenological assessment of voice identity is often limited, for example to the gender of the voice, and has failed to take advantage of recent models and evidence on human voice perception. In this paper we aim to synthesize the literature on identity in real and hallucinated voices and begin by providing a comprehensive overview of the features used to judge voice identity in healthy individuals and in people with schizophrenia. The findings suggest some subtle, but possibly systematic biases across different levels of voice identity in clinical hallucinators that are associated with higher levels of distress. Next we provide a critical evaluation of voice processing abilities in clinical and non-clinical voice hearers, including recent data collected in our laboratory. Our studies used diverse methods, assessing recognition and binding of words and voices in memory as well as multidimensional scaling of voice dissimilarity judgments. The findings overall point to significant difficulties recognizing familiar speakers and discriminating between unfamiliar speakers in people with schizophrenia, both with and without AVH. In contrast, these voice processing abilities appear to be generally intact in non-clinical hallucinators. The review highlights some important avenues for future research and treatment of AVH associated with a need for care, and suggests some novel insights into other symptoms of psychosis. PMID:23565088
Cosmic Origins (COR) Technology Development Program Overview
NASA Astrophysics Data System (ADS)
Werneth, Russell; Pham, B.; Clampin, M.
2014-01-01
The Cosmic Origins (COR) Program Office was established in FY11 and resides at the NASA Goddard Space Flight Center (GSFC). The office serves as the implementation arm for the Astrophysics Division at NASA Headquarters for COR Program related matters. We present an overview of the Program’s technology management activities and the Program’s technology development portfolio. We discuss the process for addressing community-provided technology needs and the Technology Management Board (TMB)-vetted prioritization and investment recommendations. This process improves the transparency and relevance of technology investments, provides the community a voice in the process, and leverages the technology investments of external organizations by defining a need and a customer. Goals for the COR Program envisioned by the National Research Council’s (NRC) “New Worlds, New Horizons in Astronomy and Astrophysics” (NWNH) Decadal Survey report include a 4m-class UV/optical telescope that would conduct imaging and spectroscopy as a post-Hubble observatory with significantly improved sensitivity and capability, a near-term investigation of NASA participation in the Japanese Aerospace Exploration Agency/Institute of Space and Astronautical Science (JAXA/ISAS) Space Infrared Telescope for Cosmology and Astrophysics (SPICA) mission, and future Explorers.
Listen to Me! An Exploration of the Students' Voices Regarding Homework
ERIC Educational Resources Information Center
Landing-Corretjer, Gladys
2009-01-01
Research on students' voices and perspectives regarding homework is absent from the literature. This qualitative case study explored the perspectives of 5th and 6th grade students and ten teachers' perceptions regarding homework completion. The literature review revealed 3 trends in homework, including support homework, support against homework,…
Translational Systems Biology and Voice Pathophysiology
Li, Nicole Y. K.; Abbott, Katherine Verdolini; Rosen, Clark; An, Gary; Hebda, Patricia A.; Vodovotz, Yoram
2011-01-01
Objectives/Hypothesis Personalized medicine has been called upon to tailor healthcare to an individual's needs. Evidence-based medicine (EBM) has advocated using randomized clinical trials with large populations to evaluate treatment effects. However, due to large variations across patients, the results are likely not to apply to an individual patient. We suggest that a complementary, systems biology approach using computational modeling may help tackle biological complexity in order to improve ultimate patient care. The purpose of the article is: 1) to review the pros and cons of EBM, and 2) to discuss the alternative systems biology method and present its utility in clinical voice research. Study Design Tutorial Methods Literature review and discussion. Results We propose that translational systems biology can address many of the limitations of EBM pertinent to voice and other health care domains, and thus complement current health research models. In particular, recent work using mathematical modeling suggests that systems biology has the ability to quantify the highly complex biologic processes underlying voice pathophysiology. Recent data support the premise that this approach can be applied specifically in the case of phonotrauma and surgically induced vocal fold trauma, and may have particular power to address personalized medicine. Conclusions We propose that evidence around vocal health and disease be expanded beyond a population-based method to consider more fully issues of complexity and systems interactions, especially in implementing personalized medicine in voice care and beyond. PMID:20025041
Implementation of a tactical voice/data network over FDDI. [Fiber Distributed Data Interface
NASA Technical Reports Server (NTRS)
Bergman, L. A.; Halloran, F.; Martinez, J.
1988-01-01
An asynchronous high-speed fiber-optic local-area network is described that simultaneously supports packet data traffic with synchronous T1 voice traffic over a standard asynchronous FDDI (fiber distributed data interface) token-ring channel. A voice interface module was developed that parses, buffers, and resynchronizes the voice data to the packet network. The technique is general, however, and can be applied to any deterministic class of networks, including multitier backbones. In addition, the higher layer packet data protocols may operate independently of those for the voice, thereby permitting great flexibility in reconfiguring the network. Voice call setup and switching functions are performed external to the network with PABX equipment.
Real-time interactive speech technology at Threshold Technology, Incorporated
NASA Technical Reports Server (NTRS)
Herscher, Marvin B.
1977-01-01
Basic real-time isolated-word recognition techniques are reviewed. Industrial applications of voice technology are described in chronological order of their development. Future research efforts are also discussed.
Multimodal processing of emotional information in 9-month-old infants I: emotional faces and voices.
Otte, R A; Donkers, F C L; Braeken, M A K A; Van den Bergh, B R H
2015-04-01
Making sense of emotions manifesting in human voice is an important social skill which is influenced by emotions in other modalities, such as that of the corresponding face. Although processing emotional information from voices and faces simultaneously has been studied in adults, little is known about the neural mechanisms underlying the development of this ability in infancy. Here we investigated multimodal processing of fearful and happy face/voice pairs using event-related potential (ERP) measures in a group of 84 9-month-olds. Infants were presented with emotional vocalisations (fearful/happy) preceded by the same or a different facial expression (fearful/happy). The ERP data revealed that the processing of emotional information appearing in human voice was modulated by the emotional expression appearing on the corresponding face: Infants responded with larger auditory ERPs after fearful compared to happy facial primes. This finding suggests that infants dedicate more processing capacities to potentially threatening than to non-threatening stimuli. Copyright © 2014 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Hyassat, Mizyed A.
2016-01-01
Since the literature in the field of special education supports the argument that involving parents in the educational process is more likely to positively influence children's educational outcomes, this research aims at exploring the position of Jordanian parents of young children with disabilities in terms of their involvement. A qualitative…
Resilience Theory: Risk and Protective Factors for Novice Special Education Teachers
ERIC Educational Resources Information Center
Benjamin, Thomas L.; Black, Rhonda S.
2012-01-01
This study describes experiences of novice special education teachers in rural areas in Hawaii through a lens of resiliency theory. Two types of support--administrative and collegial--were examined in terms of being risk or protective processes. A case study design was used to give voice to five participants who expressed their satisfaction and…
VoiceThread as a Peer Review and Dissemination Tool for Undergraduate Research
NASA Astrophysics Data System (ADS)
Guertin, L. A.
2012-12-01
VoiceThread has been utilized in an undergraduate research methods course for peer review and final research project dissemination. VoiceThread (http://www.voicethread.com) can be considered a social media tool, as it is a web-based technology with the capacity to enable interactive dialogue. VoiceThread is an application that allows a user to place a media collection online containing images, audio, videos, documents, and/or presentations in an interface that facilitates asynchronous communication. Participants in a VoiceThread can be passive viewers of the online content or engaged commenters via text, audio, video, with slide annotations via a doodle tool. The VoiceThread, which runs across browsers and operating systems, can be public or private for viewing and commenting and can be embedded into any website. Although few university students are aware of the VoiceThread platform (only 10% of the students surveyed by Ng (2012)), the 2009 K-12 edition of The Horizon Report (Johnson et al., 2009) lists VoiceThread as a tool to watch because of the opportunities it provides as a collaborative learning environment. In Fall 2011, eleven students enrolled in an undergraduate research methods course at Penn State Brandywine each conducted their own small-scale research project. Upon conclusion of the projects, students were required to create a poster summarizing their work for peer review. To facilitate the peer review process outside of class, each student-created PowerPoint file was placed in a VoiceThread with private access to only the class members and instructor. Each student was assigned to peer review five different student posters (i.e., VoiceThread images) with the audio and doodle tools to comment on formatting, clarity of content, etc. After the peer reviews were complete, the students were allowed to edit their PowerPoint poster files for a new VoiceThread. 
In the new VoiceThread, students were required to video record themselves describing their research and taking the viewer through their poster in the VoiceThread. This new VoiceThread with their final presentations was open for public viewing but not public commenting. A formal assessment was not conducted on the student impact of using VoiceThread for peer review and final research presentations. From an instructional standpoint, requiring students to use audio for the peer review commenting seemed to result in lengthier and more detailed reviews, connected with specific poster features when the doodle tool was utilized. By recording themselves as a "talking head" for the final product, students were required to be comfortable and confident with presenting their research, similar to what would be expected at a conference presentation. VoiceThread is currently being tested in general education Earth science courses at Penn State Brandywine as a dissemination tool for classroom-based inquiry projects and recruitment tool for Earth & Mineral Science majors.
NASA Astrophysics Data System (ADS)
White, R. W.; Parks, D. L.
1985-07-01
A study was conducted to determine potential commercial aircraft flight deck applications and implementation guidelines for voice recognition and synthesis. At first, a survey of voice recognition and synthesis technology was undertaken to develop a working knowledge base. Then, numerous potential aircraft and simulator flight deck voice applications were identified and each proposed application was rated on a number of criteria in order to achieve an overall payoff rating. The potential voice recognition applications fell into five general categories: programming, interrogation, data entry, switch and mode selection, and continuous/time-critical action control. The ratings of the first three categories showed the most promise of being beneficial to flight deck operations. Possible applications of voice synthesis systems were categorized as automatic or pilot selectable and many were rated as being potentially beneficial. In addition, voice system implementation guidelines and pertinent performance criteria are proposed. Finally, the findings of this study are compared with those made in a recent NASA study of a 1995 transport concept.
NASA Technical Reports Server (NTRS)
White, R. W.; Parks, D. L.
1985-01-01
A study was conducted to determine potential commercial aircraft flight deck applications and implementation guidelines for voice recognition and synthesis. At first, a survey of voice recognition and synthesis technology was undertaken to develop a working knowledge base. Then, numerous potential aircraft and simulator flight deck voice applications were identified and each proposed application was rated on a number of criteria in order to achieve an overall payoff rating. The potential voice recognition applications fell into five general categories: programming, interrogation, data entry, switch and mode selection, and continuous/time-critical action control. The ratings of the first three categories showed the most promise of being beneficial to flight deck operations. Possible applications of voice synthesis systems were categorized as automatic or pilot selectable and many were rated as being potentially beneficial. In addition, voice system implementation guidelines and pertinent performance criteria are proposed. Finally, the findings of this study are compared with those made in a recent NASA study of a 1995 transport concept.
A fiber optic tactical voice/data network based on FDDI
NASA Technical Reports Server (NTRS)
Bergman, L. A.; Hartmayer, R.; Marelid, S.; Wu, W. H.; Edgar, G.; Cassell, P.; Mancini, R.; Kiernicki, J.; Paul, L. J.; Jeng, J.
1988-01-01
An asynchronous high-speed fiber optic local area network is described that supports ordinary data packet traffic simultaneously with synchronous T1 voice traffic over a common FDDI token ring channel. A voice interface module was developed that parses, buffers, and resynchronizes the voice data to the packet network. The technique is general, however, and can be applied to any deterministic class of networks, including multi-tier backbones. A conventional single token access protocol was employed at the lowest layer, with fixed packet sizes for voice and variable for data. In addition, the higher layer packet data protocols are allowed to operate independently of those for the voice, thereby permitting great flexibility in reconfiguring the network. Voice call setup and switching functions were performed external to the network with PABX equipment.
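As a rough illustration of the fixed-size voice packets mentioned in this abstract, the sketch below works through standard T1 framing arithmetic (24 channels of 8-bit samples plus one framing bit, 8000 frames per second). The grouping parameter `frames_per_packet` and the function names are hypothetical; the abstract does not state the packet size the authors actually used.

```python
# Standard T1 (DS1) framing constants.
T1_CHANNELS = 24          # DS0 voice channels per frame
BITS_PER_SAMPLE = 8       # one 8-bit PCM sample per channel per frame
FRAMING_BITS = 1          # single framing bit per frame
FRAMES_PER_SECOND = 8000  # one frame every 125 microseconds


def t1_frame_bits():
    """Bits in one T1 frame, including the framing bit."""
    return T1_CHANNELS * BITS_PER_SAMPLE + FRAMING_BITS


def t1_line_rate_bps():
    """Aggregate T1 line rate in bits per second."""
    return t1_frame_bits() * FRAMES_PER_SECOND


def voice_payload_bytes(frames_per_packet):
    """Voice payload bytes if a packet carries this many T1 frames.

    frames_per_packet is a hypothetical design parameter; the framing
    bit is assumed to be stripped before packetization.
    """
    return frames_per_packet * T1_CHANNELS * BITS_PER_SAMPLE // 8


def voice_packets_per_second(frames_per_packet):
    """Fixed-size voice packets the interface must send each second."""
    return FRAMES_PER_SECOND / frames_per_packet


if __name__ == "__main__":
    print(t1_frame_bits(), t1_line_rate_bps())
    print(voice_payload_bytes(8), voice_packets_per_second(8))
```

Grouping more frames per packet lowers the packet rate on the ring but adds buffering delay of 125 microseconds per frame, which is the kind of trade-off a resynchronizing voice interface would have to balance.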
Abrams, Daniel A.; Chen, Tianwen; Odriozola, Paola; Cheng, Katherine M.; Baker, Amanda E.; Padmanabhan, Aarthi; Ryali, Srikanth; Kochalka, John; Feinstein, Carl; Menon, Vinod
2016-01-01
The human voice is a critical social cue, and listeners are extremely sensitive to the voices in their environment. One of the most salient voices in a child’s life is mother's voice: Infants discriminate their mother’s voice from the first days of life, and this stimulus is associated with guiding emotional and social function during development. Little is known regarding the functional circuits that are selectively engaged in children by biologically salient voices such as mother’s voice or whether this brain activity is related to children’s social communication abilities. We used functional MRI to measure brain activity in 24 healthy children (mean age, 10.2 y) while they attended to brief (<1 s) nonsense words produced by their biological mother and two female control voices and explored relationships between speech-evoked neural activity and social function. Compared to female control voices, mother’s voice elicited greater activity in primary auditory regions in the midbrain and cortex; voice-selective superior temporal sulcus (STS); the amygdala, which is crucial for processing of affect; nucleus accumbens and orbitofrontal cortex of the reward circuit; anterior insula and cingulate of the salience network; and a subregion of fusiform gyrus associated with face perception. The strength of brain connectivity between voice-selective STS and reward, affective, salience, memory, and face-processing regions during mother’s voice perception predicted social communication skills. Our findings provide a novel neurobiological template for investigation of typical social development as well as clinical disorders, such as autism, in which perception of biologically and socially salient voices may be impaired. PMID:27185915
Abrams, Daniel A; Chen, Tianwen; Odriozola, Paola; Cheng, Katherine M; Baker, Amanda E; Padmanabhan, Aarthi; Ryali, Srikanth; Kochalka, John; Feinstein, Carl; Menon, Vinod
2016-05-31
The human voice is a critical social cue, and listeners are extremely sensitive to the voices in their environment. One of the most salient voices in a child's life is mother's voice: Infants discriminate their mother's voice from the first days of life, and this stimulus is associated with guiding emotional and social function during development. Little is known regarding the functional circuits that are selectively engaged in children by biologically salient voices such as mother's voice or whether this brain activity is related to children's social communication abilities. We used functional MRI to measure brain activity in 24 healthy children (mean age, 10.2 y) while they attended to brief (<1 s) nonsense words produced by their biological mother and two female control voices and explored relationships between speech-evoked neural activity and social function. Compared to female control voices, mother's voice elicited greater activity in primary auditory regions in the midbrain and cortex; voice-selective superior temporal sulcus (STS); the amygdala, which is crucial for processing of affect; nucleus accumbens and orbitofrontal cortex of the reward circuit; anterior insula and cingulate of the salience network; and a subregion of fusiform gyrus associated with face perception. The strength of brain connectivity between voice-selective STS and reward, affective, salience, memory, and face-processing regions during mother's voice perception predicted social communication skills. Our findings provide a novel neurobiological template for investigation of typical social development as well as clinical disorders, such as autism, in which perception of biologically and socially salient voices may be impaired.
Pilot study on the feasibility of a computerized speech recognition charting system.
Feldman, C A; Stevens, D
1990-08-01
The objective of this study was to determine the feasibility of developing and using a voice recognition computerized charting system to record dental clinical examination data. More specifically, the study was designed to analyze the time and error differential between the traditional examiner/recorder method (ASSISTANT) and computerized voice recognition method (VOICE). DMFS examinations were performed twice on 20 patients using the traditional ASSISTANT and the VOICE charting system. A statistically significant difference was found when comparing the mean ASSISTANT time of 2.69 min to the VOICE time of 3.72 min (P < 0.001). No statistically significant difference was found when comparing the mean ASSISTANT recording errors of 0.1 to VOICE recording errors of 0.6 (P = 0.059). 90% of the patients indicated they felt comfortable with the dentist talking to a computer and only 5% of the sample indicated they opposed VOICE. Results from this pilot study indicate that a charting system utilizing voice recognition technology could be considered a viable alternative to traditional examiner/recorder methods of clinical charting.
Kuo, Chung-Feng Jeffrey; Chu, Yueng-Hsiang; Wang, Po-Chun; Lai, Chun-Yu; Chu, Wen-Lin; Leu, Yi-Shing; Wang, Hsing-Won
2013-12-01
The human larynx is an important organ for voice production and respiratory mechanisms. The vocal cord is approximated for voice production and open for breathing. The videolaryngoscope is widely used for vocal cord examination. At present, physicians usually diagnose vocal cord diseases by manually selecting the image of the vocal cord opening to the largest extent (abduction), thus maximally exposing the vocal cord lesion. On the other hand, the severity of diseases such as vocal palsy, atrophic vocal cord is largely dependent on the vocal cord closing to the smallest extent (adduction). Therefore, diseases can be assessed by the image of the vocal cord opening to the largest extent, and the seriousness of breathy voice is closely correlated to the gap between vocal cords when closing to the smallest extent. The aim of the study was to design an automatic vocal cord image selection system to improve the conventional selection process by physicians and enhance diagnosis efficiency. Also, due to the unwanted fuzzy images resulting from examination process caused by human factors as well as the non-vocal cord images, texture analysis is added in this study to measure image entropy to establish a screening and elimination system to effectively enhance the accuracy of selecting the image of the vocal cord closing to the smallest extent. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
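The entropy-based screening step described in this abstract can be sketched as follows. This is a minimal illustration, assuming Shannon entropy of the gray-level histogram as the texture measure and a hypothetical cut-off `threshold`; the abstract does not specify which entropy definition or threshold the authors used.

```python
import math
from collections import Counter


def image_entropy(pixels):
    """Shannon entropy (in bits) of the gray-level histogram.

    pixels is a flat sequence of gray levels, e.g. integers 0-255.
    """
    counts = Counter(pixels)
    total = len(pixels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def keep_sharp_frames(frames, threshold):
    """Return indices of frames whose entropy reaches the threshold.

    Assumption: blurred or non-vocal-cord frames have flatter texture
    and therefore lower histogram entropy. threshold is hypothetical.
    """
    return [i for i, frame in enumerate(frames) if image_entropy(frame) >= threshold]


if __name__ == "__main__":
    flat = [0] * 16          # uniform frame, no texture
    textured = [0, 255] * 8  # two equally likely gray levels
    print(image_entropy(flat), image_entropy(textured))
    print(keep_sharp_frames([flat, textured], threshold=0.5))
```

In practice the frames would come from the videolaryngoscope recording and the threshold would need to be tuned on labelled examples.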
Emotional expressions in voice and music: same code, same effect?
Escoffier, Nicolas; Zhong, Jidan; Schirmer, Annett; Qiu, Anqi
2013-08-01
Scholars have documented similarities in the way voice and music convey emotions. By using functional magnetic resonance imaging (fMRI) we explored whether these similarities imply overlapping processing substrates. We asked participants to trace changes in either the emotion or pitch of vocalizations and music using a joystick. Compared to music, vocalizations more strongly activated superior and middle temporal cortex, cuneus, and precuneus. However, despite these differences, overlapping rather than differing regions emerged when comparing emotion with pitch tracing for music and vocalizations, respectively. Relative to pitch tracing, emotion tracing activated medial superior frontal and anterior cingulate cortex regardless of stimulus type. Additionally, we observed emotion specific effects in primary and secondary auditory cortex as well as in medial frontal cortex that were comparable for voice and music. Together these results indicate that similar mechanisms support emotional inferences from vocalizations and music and that these mechanisms tap on a general system involved in social cognition. Copyright © 2011 Wiley Periodicals, Inc.
ERIC Educational Resources Information Center
Skouge, James R.; Kajiyama, Brian
2009-01-01
In this article, the authors relate a story about the transformative power of technologies for voice. They relate Brian Kajiyama's personal odyssey--what might be described as a journey from unvoiced to vocal--in learning to use a DynaWrite, a type-and-talk device that Brian uses as a communication tool.
Weiner, Shoshana; Fink, Jeffery C.
2017-01-01
Chronic kidney disease (CKD) patients have several features conferring upon them a high risk of adverse safety events, which are defined as incidents with unintended harm related to processes of care or medications. These characteristics include impaired renal function, polypharmacy, and frequent health system encounters. The consequences of such events in CKD can include new or prolonged hospitalization, accelerated renal function loss, acute kidney injury, end-stage renal disease and death. Health information technology administered via telemedicine presents opportunities for CKD patients to remotely communicate safety-related findings to providers for the purpose of improving their care. However, many CKD patients have limitations which hinder their use of telemedicine and access to the broad capabilities of health information technology. In this review we summarize previous assessments of the pre-dialysis CKD populations’ proficiency in using telemedicine modalities and describe the use of interactive voice-response system (IVRS) to gauge the safety phenotype of the CKD patient. We discuss the potential for expanded IVRS use in CKD to address the safety threats inherent to this population. PMID:28224940
“The perceptual bases of speaker identity” revisited
NASA Astrophysics Data System (ADS)
Voiers, William D.
2003-10-01
A series of experiments begun 40 years ago [W. D. Voiers, J. Acoust. Soc. Am. 36, 1065-1073 (1964)] was concerned with identifying the perceived voice traits (PVTs) on which human recognition of voices depends. It culminated with the development of a voice taxonomy based on 20 PVTs and a set of highly reliable rating scales for classifying voices with respect to those PVTs. The development of a perceptual voice taxonomy was motivated by the need for a practical method of evaluating speaker recognizability in voice communication systems. The Diagnostic Speaker Recognition Test (DSRT) evaluates the effects of systems on speaker recognizability as reflected in changes in the inter-listener reliability of voice ratings on the 20 PVTs. The DSRT thus provides a qualitative, as well as quantitative, evaluation of the effects of a system on speaker recognizability. A fringe benefit of this project is PVT rating data for a sample of 680 voices. [Work partially supported by USAFRL.]
47 CFR 54.503 - Other supported special services.
Code of Federal Regulations, 2010 CFR
2010-10-01
... telecommunications carriers include voice mail, interconnected voice over Internet protocol (VoIP), text messaging, Internet access, and installation and maintenance of internal connections in addition to all reasonable...
Metropolitan area networks: a corner stone in the broadband era
NASA Astrophysics Data System (ADS)
Ghanem, Adel
1991-02-01
Deployment of Broadband ISDN is being influenced by both a market pull and a technology push. New broadband service opportunities exist in the business and residential sectors of the market place. It is envisioned that some customers will need connections directly to broadband switches because of the high bandwidth needed for their applications. At the same time Metropolitan Area Network (MAN) systems will serve those customers with bandwidth requirements less than or equal to 150 Mbps. A given MAN will have a geographical domain to serve where it will carry out the switching tasks within this domain. While MANs could be designed using different architectural concepts, the set of services expected to be provided by MANs could be equivalent to the list of services that will be supported by the target broadband network. This paper positions MANs as a major building block for Broadband networks. It also examines the evolution process of MANs as a needed step to assure the successful deployment of these new broadband services. 2. BISDN - OVERVIEW Broadband ISDN (BISDN) is being driven into existence by both a market pull as well as a technology push. Opportunities for new value-added services are the prime market pull for future broadband networks. These services opportunities extend beyond simple voice and low speed data applications and cover both the residential and the business sectors of the market. It is noted for instance that business customers have growing needs for sophisticated telecommunication vehicles to support their
Your Voice Counts: Listening to the Voice of High School Students with Autism Spectrum Disorder
ERIC Educational Resources Information Center
Saggers, Beth; Hwang, Yoon-Suk; Mercer, K. Louise
2011-01-01
Supporting students with autism spectrum disorders (ASDs) in inclusive settings presents both opportunities and significant challenges to school communities. This study, which explored the lived experience of nine students with ASD in an inclusive high school in Australia, is based on the belief that by listening to the voices of students, school…
Assuring children's human right to freedom of opinion and expression in education.
Gillett-Swan, Jenna; Sargeant, Jonathon
2018-02-01
The Universal Declaration of Human Rights, elaborated for children through the United Nations Convention on the Rights of the Child, mandates each child's right to participate in all matters affecting them. In particular, Article 19 includes the child's right to freedom of expression and opinion, access to information and communication choice. However, many barriers placed on children's daily lives often restrict or limit the enactment of children's participatory rights in practice, most noticeably in education. It is often the adult who decides what, when and how children can communicate, and the extent to which children's views and opinions are sought, considered or incorporated. This paper explores how children's daily lives are mediated in ways that restrict their expression, voice and communication rights. Children spend a significant proportion of their daily lives in education settings yet the restrictions on children's access to information and communication choices do not reflect contemporary pedagogical thinking. Many school settings perpetuate the key participation barriers of adult attitude and knowledge, pedagogical tradition, organisational structure and technological advancement. Such barriers to engagement stifle the realisation of the child's communication rights, which in turn limits educational enhancement. Supporting children's right to communicate via a range of media enables pedagogy supporting voice-inclusive practice.
Underconnectivity between voice-selective cortex and reward circuitry in children with autism.
Abrams, Daniel A; Lynch, Charles J; Cheng, Katherine M; Phillips, Jennifer; Supekar, Kaustubh; Ryali, Srikanth; Uddin, Lucina Q; Menon, Vinod
2013-07-16
Individuals with autism spectrum disorders (ASDs) often show insensitivity to the human voice, a deficit that is thought to play a key role in communication deficits in this population. The social motivation theory of ASD predicts that impaired function of reward and emotional systems impedes children with ASD from actively engaging with speech. Here we explore this theory by investigating distributed brain systems underlying human voice perception in children with ASD. Using resting-state functional MRI data acquired from 20 children with ASD and 19 age- and intelligence quotient-matched typically developing children, we examined intrinsic functional connectivity of voice-selective bilateral posterior superior temporal sulcus (pSTS). Children with ASD showed a striking pattern of underconnectivity between left-hemisphere pSTS and distributed nodes of the dopaminergic reward pathway, including bilateral ventral tegmental areas and nucleus accumbens, left-hemisphere insula, orbitofrontal cortex, and ventromedial prefrontal cortex. Children with ASD also showed underconnectivity between right-hemisphere pSTS, a region known for processing speech prosody, and the orbitofrontal cortex and amygdala, brain regions critical for emotion-related associative learning. The degree of underconnectivity between voice-selective cortex and reward pathways predicted symptom severity for communication deficits in children with ASD. Our results suggest that weak connectivity of voice-selective cortex and brain structures involved in reward and emotion may impair the ability of children with ASD to experience speech as a pleasurable stimulus, thereby impacting language and social skill development in this population. Our study provides support for the social motivation theory of ASD.
A Review on Radio-Over-Fiber Technology-Based Integrated (Optical/Wireless) Networks
NASA Astrophysics Data System (ADS)
Rajpal, Shivika; Goyal, Rakesh
2017-06-01
This paper presents radio-over-fiber (RoF) technology, which integrates optical and radio networks. With a high transmission capacity, comparatively low cost and low attenuation, optical fiber provides an ideal solution for accomplishing the interconnections. In addition, a radio system enables significant mobility, flexibility and easy access. Therefore, the integrated system can meet the increasing demands of subscribers for voice, data and multimedia services that require the access network to support high data rates at any time and any place inexpensively. RoF has the potential to be the backbone of the wireless access network, and it has gained significant momentum in the last decade as a potential last-mile access scheme. This paper gives a comprehensive review of RoF technology used in communication systems. The concept, applications, advantages and limitations of RoF technology are also discussed.
Telecom Modeling with ChatterBell.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jrad, Ahmad M.; Kelic, Andjelka
This document provides a description and user manual for the ChatterBell voice telecom modeling and simulation capability. The intended audience consists of network planners and practitioners who wish to use the tool to model a particular voice network and analyze its behavior under varying assumptions and possible failure conditions. ChatterBell is built on top of the N-SMART voice simulation and visualization suite that was developed through collaboration between Sandia National Laboratories and Bell Laboratories of Lucent Technologies. The new and improved modeling and simulation tool has been modified and modernized to incorporate the latest developments in the telecom world, including the widespread use of VoIP technology. In addition, ChatterBell provides new commands and modeling capabilities that were not available in the N-SMART application.
Tezcaner, Zahide Çiler; Gökmen, Muhammed Fatih; Yıldırım, Sibel; Dursun, Gürsel
2017-11-06
The aim of this study was to define the clinical features of psychogenic voice disorder (PVD) and explore the treatment efficiency of voice therapy and psychological evaluation. Fifty-eight patients who received treatment following the PVD diagnosis and had no organic or other functional voice disorders were assessed retrospectively based on laryngoscopic examinations and subjective and objective assessments. Epidemiological characteristics, accompanying organic and psychological disorders, preferred methods of treatment, and previous treatment outcomes were examined for each patient. A comparison was made based on voice disorders and responses to treatment between patients who received psychotherapy and patients who did not. Participants in this study comprised 58 patients, 10 male and 48 female. Voice therapy was applied in all patients, 54 (93.1%) of whom had improvement in their voice. Although all patients were advised to undergo psychological assessment, only 60.3% (35/58) of them underwent psychological assessment. No statistically significant difference was found between patients who did receive psychological support concerning their treatment responses and patients who did not. Relapse occurred in 14.7% (5/34) of the patients who applied for psychological assessment and in 50% (10/20) of those who did not. There was a statistically significant difference in relapse rates, which was higher among patients who did not receive psychological support (P < 0.005). Voice therapy is an efficient treatment method for PVD. However, in the long-term follow-up, relapse of the disease is observed to be higher among patients who failed to follow up on the recommendation for psychological assessment. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
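The relapse comparison reported above (5 of 34 versus 10 of 20) can be re-derived from the counts alone. The following is a minimal sketch, assuming a two-sided Fisher exact test on the 2x2 table; the abstract does not say which test the authors used, and the function names are hypothetical.

```python
from math import comb


def relapse_rate(relapsed: int, total: int) -> float:
    """Relapse rate as a percentage."""
    return 100.0 * relapsed / total


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]].

    This is an illustrative choice of test, not necessarily the one
    applied in the study.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(k: int) -> float:
        # Hypergeometric probability of k events in the first row.
        return comb(row1, k) * comb(row2, col1 - k) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is no more likely than the observed one.
    return sum(prob(k) for k in range(lo, hi + 1) if prob(k) <= p_obs * (1 + 1e-9))


# Counts taken from the abstract: relapsed / did not relapse in each group.
rate_assessed = relapse_rate(5, 34)
rate_not_assessed = relapse_rate(10, 20)
p_value = fisher_exact_two_sided(5, 29, 10, 10)
```

The two rates reproduce the percentages quoted in the abstract; the p-value from this sketch need not match the reported threshold exactly, since a different test may have been applied.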
An automatic speech recognition system with speaker-independent identification support
NASA Astrophysics Data System (ADS)
Caranica, Alexandru; Burileanu, Corneliu
2015-02-01
The novelty of this work lies in the application of an open source research software toolkit (CMU Sphinx) to train, build and evaluate a speech recognition system, with speaker-independent support, for voice-controlled hardware applications. Moreover, we propose to use the trained acoustic model to successfully decode offline voice commands on embedded hardware, such as the Raspberry Pi, a low-cost ARMv6 SoC. This type of single-board computer, mainly used for educational and research activities, can serve as a proof-of-concept software and hardware stack for low-cost voice automation systems.
When You've Heard It Before and Still Can't Read.
ERIC Educational Resources Information Center
Gibbs, Colin J.
A study was conducted to determine whether voice support helps children learn to read. Voice support involves reading to children while they follow the text, either informally as in bedtime reading, or formally, as in teacher or audiotaped readings of stories in the classroom. Subjects, 64 children just entering school, were unfamiliar with the 12…
Telemedicine using free voice over internet protocol (VoIP) technology.
Miller, David J; Miljkovic, Nikola; Chiesa, Chad; Callahan, John B; Webb, Brad; Boedeker, Ben H
2011-01-01
Though dedicated videoteleconference (VTC) systems deliver high quality, low-latency audio and video for telemedical applications, they require expensive hardware and extensive infrastructure. The purpose of this study was to investigate free commercially available Voice over Internet Protocol (VoIP) software as a low cost alternative for telemedicine.
A Conjoint Analysis of Voice Over IP Attributes.
ERIC Educational Resources Information Center
Zubey, Michael L.; Wagner, William; Otto, James R.
2002-01-01
Managers need to understand the tradeoffs associated with voice over Internet protocol (VoIP) networks as compared to the Public Switched Telephone Network (PSTN). This article measures the preference structures between IP telephony and PSTN services using conjoint analysis. The purpose is to suggest VoIP technology attributes that best meet…
VoiceThread: A Useful Program Evaluation Tool
ERIC Educational Resources Information Center
Mott, Rebecca
2018-01-01
With today's technology, Extension professionals have a variety of tools available for program evaluation. This article describes an innovative platform called VoiceThread that has been used in many classrooms but also is useful for conducting virtual focus group research. I explain how this tool can be used to collect qualitative participant…
Perceptions of the Design of Voice Output Communication Aids
ERIC Educational Resources Information Center
Judge, Simon; Townend, Gillian
2013-01-01
Background: Voice output communication aids (VOCAs) are a key form of aided communication within the field of augmentative and alternative communication (AAC). In recent years, rapid developments in technology have resulted in an explosion of devices available commercially, yet little research has been conducted into what people who use VOCAs…
Voice-Based Technology for Parent Involvement: Results and Effects.
ERIC Educational Resources Information Center
Bauch, Jerold P.
A study was conducted to implement and evaluate the Transparent School Model for improving parent involvement in nine Indiana schools. The Model uses computer-based voice messaging to exchange information between teachers and parents. Teachers record a brief message for parents that describes what was taught, special learning events, homework…
Southern European Task Force (SETAF)/US Army Africa Science and Technology Overview
2009-06-01
Translator (VRT). Problem: SETAF needs a language translation device for multiple African languages. Discussion: Worked with PM Sequoyah to identify...potential solutions; recommended VRT as best potential solution; provides eyes-free, hands-free, voice-to-voice language translation.
Mupparapu, Muralidhar
2008-03-01
Voice over Internet protocol (VoIP) is a revolutionary new technology that is causing a stir in the telecommunications industry and threatening the existence of traditional telephone service providers. Based on a simple method of converting analog audio signals into digital data before being transmitted over the Internet, VoIP has gained immense popularity among consumers. The technology is now regarded as an alternative to traditional telephone service for the orthodontic office. When the economics are considered, it is cost-effective, especially for a busy orthodontic practice where the call volumes both in and out are always high. VoIP has the potential to reduce costs, break the barriers between local vs long-distance calling, and make life easier for the office staff. However, deploying VoIP requires a cautious and thought-out process. Users should fully understand the risks and benefits before switching from the public switched telephone network. VoIP customers and service providers are vulnerable to many of the same impersonation-based attacks by those who attempt toll fraud, and identity and information theft. In this article, VoIP is introduced to orthodontic practitioners, who might be unfamiliar with this technology. Internet protocol based private branch exchange systems that are currently marketed as open-source technologies are also reviewed. Additionally, VoIP is compared with the traditional public switched telephone network technology and evaluated for its potential applications in an orthodontic office for both increased efficiency and cost savings.
Kitzmiller, Rebecca R; McDaniel, Reuben R; Johnson, Constance M; Lind, E Allan; Anderson, Ruth A
2013-01-01
We examine how interpersonal behavior and social interaction influence team sensemaking and subsequent team actions during a hospital-based health information technology (HIT) implementation project. Over the course of 18 months, we directly observed the interpersonal interactions of HIT implementation teams using a sensemaking lens. We identified three voice-promoting strategies enacted by team leaders that fostered team member voice and sensemaking: communicating a vision; connecting goals to team member values; and seeking team member input. However, infrequent leader expressions of anger quickly undermined team sensemaking, halting dialog essential to problem solving. By seeking team member opinions, team leaders overcame the negative effects of anger. Leaders must enact voice-promoting behaviors and use them throughout a team's engagement. Further, training teams in how to use conflict to achieve greater innovation may improve sensemaking essential to project risk mitigation. Health care work processes are complex; teams involved in implementing improvements must be prepared to deal with conflicting, contentious issues, which will arise during change. Therefore, team conflict training may be essential to sustaining sensemaking. Future research should seek to identify team interactions that foster sensemaking, especially when topics are difficult or unwelcome, then determine the association between staff sensemaking and the impact on HIT implementation outcomes. We are among the first to focus on project teams tasked with HIT implementation. This research extends our understanding of how leaders' behaviors might facilitate or impede speaking up among project teams in health care settings.
RECOGNIZING AND IDENTIFYING PEOPLE: A neuropsychological review
Barton, Jason J S; Corrow, Sherryse L
2016-01-01
Recognizing people is a classic example of a cognitive function that involves multiple processing stages and parallel routes of information. Neuropsychological data have provided important evidence for models of this process, particularly from case reports; however, the quality and extent of the data varies widely between studies. In this review we first discuss the requirements and logical basis of the types of neuropsychological evidence to support conclusions about the modules in this process. We then survey the adequacy of the current body of reports to address two key issues. First is the question of which cognitive operation generates a sense of familiarity: the current debate revolves around whether familiarity arises in modality-specific recognition units or later amodal processes. Key evidence on this point comes from the search for dissociations between familiarity for faces, voices and names. The second question is whether lesions can differentially affect the abilities to link diverse sources of person information (e.g. face, voice, name, biographic data). Dissociations of these linkages may favour a distributed-only model of the organization of semantic knowledge, whereas a ‘person-hub’ model would predict uniform impairments of all linkages. While we conclude that there is reasonable evidence for dissociations in name, voice and face familiarity in regards to the first question, the evidence for or against dissociated linkages between information stores in regards to the second is tenuous at best. We identify deficiencies in the current literature that should motivate and inform the design of future studies. PMID:26773237
Multi-talker background and semantic priming effect
Dekerle, Marie; Boulenger, Véronique; Hoen, Michel; Meunier, Fanny
2014-01-01
The reported studies have aimed to investigate whether informational masking in a multi-talker background relies on semantic interference between the background and target using an adapted semantic priming paradigm. In 3 experiments, participants were required to perform a lexical decision task on a target item embedded in backgrounds composed of 1–4 voices. These voices were Semantically Consistent (SC) voices (i.e., pronouncing words sharing semantic features with the target) or Semantically Inconsistent (SI) voices (i.e., pronouncing words semantically unrelated to each other and to the target). In the first experiment, backgrounds consisted of 1 or 2 SC voices. One and 2 SI voices were added in Experiments 2 and 3, respectively. The results showed a semantic priming effect only in the conditions where the number of SC voices was greater than the number of SI voices, suggesting that semantic priming depended on prime intelligibility and strategic processes. However, even if backgrounds were composed of 3 or 4 voices, reducing intelligibility, participants were able to recognize words from these backgrounds, although no semantic priming effect on the targets was observed. Overall this finding suggests that informational masking can occur at a semantic level if intelligibility is sufficient. Based on the Effortfulness Hypothesis, we also suggest that when there is an increased difficulty in extracting target signals (caused by a relatively high number of voices in the background), more cognitive resources were allocated to formal processes (i.e., acoustic and phonological), leading to a decrease in available resources for deeper semantic processing of background words, therefore preventing semantic priming from occurring. PMID:25400572
NASA's mobile satellite development program
NASA Technical Reports Server (NTRS)
Rafferty, William; Dessouky, Khaled; Sue, Miles
1988-01-01
A Mobile Satellite System (MSS) will provide data and voice communications over a vast geographical area to a large population of mobile users. A technical overview is given of the extensive research and development studies performed under NASA's mobile satellite program (MSAT-X) in support of the introduction of a U.S. MSS. The critical technologies necessary to enable such a system are emphasized: vehicle antennas, modulation and coding, speech coders, networking and propagation characterization. Also proposed is a first- and future-generation MSS architecture based upon realized ground segment equipment and advanced space segment studies.
Automated speech understanding: the next generation
NASA Astrophysics Data System (ADS)
Picone, J.; Ebel, W. J.; Deshmukh, N.
1995-04-01
Modern speech understanding systems merge interdisciplinary technologies from Signal Processing, Pattern Recognition, Natural Language, and Linguistics into a unified statistical framework. These systems, which have applications in a wide range of signal processing problems, represent a revolution in Digital Signal Processing (DSP). In a field once dominated by vector-oriented processors and linear algebra-based mathematics, the current generation of DSP-based systems relies on sophisticated statistical models implemented using a complex software paradigm. Such systems are now capable of understanding continuous speech input for vocabularies of several thousand words in operational environments. The current generation of deployed systems, based on small vocabularies of isolated words, will soon be replaced by a new technology offering natural language access to vast information resources such as the Internet, and providing completely automated voice interfaces for mundane tasks such as travel planning and directory assistance.
Independent Space Operators: Gaining a Voice in Design for Operability
NASA Technical Reports Server (NTRS)
McCleskey, Carey M.; Claybaugh, William R., II
2006-01-01
Affordable and sustainable space exploration remains an elusive goal. We explore the competitive advantages of evolving towards independent operators for space transportation in our economy. We consider the pros and cons of evolving business organizations that operate and maintain space transportation system assets independently from flight system manufacturers and from host spaceports. The case is made that a more competitive business climate for creating inherently operable, dependable, and supportable space transportation systems can evolve out of today's traditional vertical business model-a model within which the voice of the operator is often heard, but rarely acted upon during crucial design commitments and critical design processes. Thus new business models may be required, driven less by hardware consumption and more by space system utilization.
Paperless medical records: measuring success.
Tobey, Mary Ellen
2004-01-01
North Shore Magnetic Imaging Center (NSMIC) underwent a major transformation of the patient process through an 18-month "Reinvention Project." The project began in October 2002, with an assessment of systems in place. A complete review of each stage of the patient process (scheduling, registration, insurance verification, screening, scanning, transcription, and billing) resulted in the discovery that the paperwork for a single patient examination could go through as many as 20 sets of hands. The project was supported by the formation of an internal team comprised of staff members from all departments (support, patient accounts, and technical), radiologists, and members of the center's senior management team. The team had 2 goals: increase the level of patient care, and create a paperless environment. External teams were formed to address specific areas targeted to support the process. The transformation for all involved (patients, staff, radiologists, and referring physicians) has proved to be very positive. The work, however, was not finished. Upon the project's completion, NSMIC recognized the importance of identifying successes and areas for improvement. These included ongoing reviews of the project's impact on all stakeholders and looking for new technologies and programs to enhance the new systems in place. There are plans for a project "sequel." Strategies are being developed for "Reinvention, Phase II." Elements of these strategies include enhancement of the scheduling programs to create more checks and balances for the staff and investigating an online scheduling option for the center's referring physicians. The purchase and implementation of a voice recognition system, tabled during Phase I, is scheduled for the first quarter of 2005.
Tailoring Cognitive Behavioral Therapy to Subtypes of Voice-Hearing
Smailes, David; Alderson-Day, Ben; Fernyhough, Charles; McCarthy-Jones, Simon; Dodgson, Guy
2015-01-01
Cognitive behavioral therapy (CBT) for voice-hearing (i.e., auditory verbal hallucinations; AVH) has, at best, small to moderate effects. One possible reason for this limited efficacy is that current CBT approaches tend to conceptualize voice-hearing as a homogenous experience in terms of the cognitive processes involved in AVH. However, the highly heterogeneous nature of voice-hearing suggests that many different cognitive processes may be involved in the etiology of AVH. These heterogeneous voice-hearing experiences do, however, appear to cluster into a set of subtypes, opening up the possibility of tailoring treatment to the subtype of AVH that a voice-hearer reports. In this paper, we (a) outline our rationale for tailoring CBT to subtypes of voice-hearing, (b) describe CBT for three putative subtypes of AVH (inner speech-based AVH, memory-based AVH, and hypervigilance AVH), and (c) discuss potential limitations and problems with such an approach. We conclude by arguing that tailoring CBT to subtypes of voice-hearing could prove to be a valuable therapeutic development, which may be especially effective when used in early intervention in psychosis services. PMID:26733919
Nurse moral distress: A survey identifying predictors and potential interventions.
Rathert, Cheryl; May, Douglas R; Chung, Hye Sook
2016-01-01
Ethical dilemmas and conflicts are inherent in today's health care organizations and may lead to moral distress, which is often associated with physical and psychological symptoms. Although the existence of moral distress has been observed by scholars for decades, most of the research has been descriptive and has examined what types of health care conflicts lead to distress. This study tested a comprehensive model, underpinned by Social Cognitive Theory, that examined work environment and intrapersonal variables that may influence moral distress. We surveyed nursing staff employed in a U.S. acute care hospital (response rate=45%; n=290). More than half of the respondents reported they experience ethical dilemmas and conflicts from several times a month to daily, and nearly half reported they experience moral distress at least several times a month. Structural equation modeling analysis simultaneously examined the effects of five independent variables on moral distress and moral voice: (a) frequency of ethical dilemmas and conflicts; (b) moral efficacy; (c) ethics communication; (d) ethical environment; and (e) organizational ethics support. Results revealed significant independent effects of the frequency of ethics issues and organizational ethics support on moral distress. Bootstrapping analysis indicated that voice fully mediated the relationship between moral efficacy and moral distress, and partially mediated the relationship between organizational ethics support and distress. Supplemental analysis revealed that organizational ethics support moderated the moral efficacy-voice-moral distress relationship such that when organizational support was low, moral efficacy was negatively related to moral distress via voice. Although it may be impossible to eliminate all ethical dilemmas and conflicts, leaders and organizations may wish to help improve nurses' moral efficacy, which appears to give rise to voice, and reduced moral distress. Increasing organizational ethics support may be a key approach. Copyright © 2015 Elsevier Ltd. All rights reserved.
NASA Johnson Space Center Usability Testing and Analysis Facility (UTAF) Overview
NASA Technical Reports Server (NTRS)
Whitmore, Mihriban; Holden, Kritina L.
2005-01-01
The Usability Testing and Analysis Facility (UTAF) is part of the Space Human Factors Laboratory at the NASA Johnson Space Center in Houston, Texas. The facility performs research for NASA's Human Systems Integration Program, under the Human Systems Research and Technology Division. Specifically, the UTAF provides human factors support for space vehicles, including the International Space Station, the Space Shuttle, and the forthcoming Crew Exploration Vehicle. In addition, there are ongoing collaborative research efforts with external corporations and universities. The UTAF provides human factors analysis, evaluation, and usability testing of crew interfaces for space applications. This includes computer displays and controls, workstation systems, and work environments. The UTAF has a unique mix of capabilities, with a staff experienced in both cognitive human factors and ergonomics. The current areas of focus are: human factors applications in emergency medical care and informatics; control and display technologies for electronic procedures and instructions; voice recognition in noisy environments; crew restraint design for unique microgravity workstations; and refinement of human factors processes and requirements. This presentation will provide an overview of ongoing activities, and will address how the UTAF projects will evolve to meet new space initiatives.
NASA Astrophysics Data System (ADS)
Sridevi, B.; Supriya, T. S.; Rajaram, S.
2013-01-01
The current generation of wireless networks has been designed predominantly to support voice and, more recently, data traffic. WiMAX is currently one of the hottest technologies in wireless. The main goal of mobile technologies is to provide seamless, cost-effective mobility, but this is hampered by authentication cost and handover delay, since on each handoff the Mobile Station (MS) has to undergo all steps of authentication. Pre-authentication is used to reduce the handover delay and increase the speed of the intra-ASN handover. The proposed pre-authentication method is intended to reduce the authentication delay by having the MS pre-authenticated by a central authority called the Pre-Authentication Authority (PAA). The MS requests a Pre-Authentication Certificate (PAC) from the PAA before performing handoff. The PAA verifies the identity of the MS and provides the PAC to the MS and also to the neighboring target Base Stations (tBSs). An MS holding a time-bound PAC can skip the authentication process when recognized by the target BS during handoff. The method also prevents denial-of-service (DoS) and replay attacks, and it avoids wasting resources on unnecessary key exchanges. The proposed work is simulated with an NS2 model and with MATLAB.
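The time-bound certificate idea described above can be illustrated with a short sketch. The abstract does not specify the PAC format or its cryptography, so everything here is an assumption: the names `issue_pac` and `target_bs_accepts`, the HMAC-SHA256 tag, and the key shared between the PAA and the target base stations (standing in for the PAA distributing the certificate to them).

```python
import hashlib
import hmac

# Hypothetical secret shared by the PAA and the target base stations.
PAA_KEY = b"demo-shared-secret"


def issue_pac(ms_id: str, issued_at: int, lifetime_s: int) -> dict:
    """PAA side: issue a time-bound certificate for a mobile station."""
    expires_at = issued_at + lifetime_s
    message = f"{ms_id}|{expires_at}".encode()
    tag = hmac.new(PAA_KEY, message, hashlib.sha256).hexdigest()
    return {"ms_id": ms_id, "expires_at": expires_at, "tag": tag}


def target_bs_accepts(pac: dict, now: int) -> bool:
    """Target BS side: accept only an untampered, unexpired certificate."""
    message = f"{pac['ms_id']}|{pac['expires_at']}".encode()
    expected = hmac.new(PAA_KEY, message, hashlib.sha256).hexdigest()
    untampered = hmac.compare_digest(expected, pac["tag"])
    return untampered and now <= pac["expires_at"]


# The MS obtains a certificate valid for 60 seconds before handoff.
pac = issue_pac("ms-001", issued_at=1000, lifetime_s=60)
accepted_in_time = target_bs_accepts(pac, now=1030)
accepted_late = target_bs_accepts(pac, now=1100)
forged = dict(pac, ms_id="ms-999")
accepted_forged = target_bs_accepts(forged, now=1030)
```

The expiry check is what makes the certificate time bound: a captured certificate is useless once it lapses. A fuller design would need more than this to address replay within the validity window.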
Every Voice Counts: Holding a Shared Leadership Event To Make Decisions Together.
ERIC Educational Resources Information Center
Strauss, Jessica
Shared leadership is a process empowering parents to exercise their civil rights, build leadership skills, and guide the design of policy positions and programs meant to improve their lives and the well-being of their children. Based on principles of family support practice, this book describes how to plan and host a 1- to 3-day shared leadership…
ERIC Educational Resources Information Center
Hills, Kaitlyn E.
2017-01-01
Teacher evaluation is an important but often neglected component of the education system. It exists, but it has often been seen and used as a supervisory tool rather than an opportunity for supported teacher development. As such, both teachers and administrators have dismissed teacher evaluation as a meaningless process when it has the potential…
The development of emotion perception in face and voice during infancy.
Grossmann, Tobias
2010-01-01
Interacting with others by reading their emotional expressions is an essential social skill in humans. How this ability develops during infancy and what brain processes underpin infants' perception of emotion in different modalities are the questions dealt with in this paper. Literature review. The first part provides a systematic review of behavioral findings on infants' developing emotion-reading abilities. The second part presents a set of new electrophysiological studies that provide insights into the brain processes underlying infants' developing abilities. Throughout, evidence from unimodal (face or voice) and multimodal (face and voice) processing of emotion is considered. The implications of the reviewed findings for our understanding of developmental models of emotion processing are discussed. The reviewed infant data suggest that (a) early in development, emotion enhances the sensory processing of faces and voices, (b) infants' ability to allocate increased attentional resources to negative emotional information develops earlier in the vocal domain than in the facial domain, and (c) at least by the age of 7 months, infants reliably match and recognize emotional information across face and voice.
Look, Clarisse; McCabe, Patricia; Heard, Robert; Madill, Catherine J
2018-02-02
Modeling and instruction are frequent components of both traditional and technology-assisted voice therapy. This study investigated the value of video modeling and instruction in the early acquisition and short-term retention of a complex voice task without external feedback. Thirty participants were randomized to two conditions and trained to produce a vocal siren over 40 trials. One group received a model and verbal instructions, the other group received a model only. Sirens were analyzed for phonation time, vocal intensity, cepstral peak prominence, peak-to-peak time, and root-mean-square error at five time points. The model and instruction group showed significant improvement on more outcome measures than the model-only group. There was an interaction effect for vocal intensity, which showed that instructions facilitated greater improvement when they were first introduced. However, neither group reproduced the model's siren performance across all parameters or retained the skill 1 day later. Providing verbal instruction with a model appears more beneficial than providing a model only in the prepractice phase of acquiring a complex voice skill. Improved performance was observed; however, the higher level of performance was not retained after 40 trials in both conditions. Other prepractice variables may need to be considered. Findings have implications for traditional and technology-assisted voice therapy. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Standardization of pitch-range settings in voice acoustic analysis.
Vogel, Adam P; Maruff, Paul; Snyder, Peter J; Mundt, James C
2009-05-01
Voice acoustic analysis is typically a labor-intensive, time-consuming process that requires the application of idiosyncratic parameters tailored to individual aspects of the speech signal. Such processes limit the efficiency and utility of voice analysis in clinical practice as well as in applied research and development. In the present study, we analyzed 1,120 voice files, using standard techniques (case-by-case hand analysis), taking roughly 10 work weeks of personnel time to complete. The results were compared with the analytic output of several automated analysis scripts that made use of preset pitch-range parameters. After pitch windows were selected to appropriately account for sex differences, the automated analysis scripts reduced processing time of the 1,120 speech samples to less than 2.5 h and produced results comparable to those obtained with hand analysis. However, caution should be exercised when applying the suggested preset values to pathological voice populations.
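To show what a preset pitch window does in an automated script, here is a minimal sketch of autocorrelation-based pitch estimation restricted to a sex-specific range. The window values in `PITCH_WINDOWS_HZ` are hypothetical placeholders, since the abstract does not list the presets it recommends, and `estimate_f0` is an illustrative function rather than the authors' script.

```python
import math

# Hypothetical preset pitch windows in Hz (not the values from the study).
PITCH_WINDOWS_HZ = {"male": (70.0, 250.0), "female": (100.0, 300.0)}


def estimate_f0(samples, sample_rate, f_min, f_max):
    """Estimate fundamental frequency by searching autocorrelation lags
    that correspond to the allowed pitch window only."""
    min_lag = math.ceil(sample_rate / f_max)
    max_lag = min(math.floor(sample_rate / f_min), len(samples) - 1)
    best_lag, best_score = min_lag, float("-inf")
    for lag in range(min_lag, max_lag + 1):
        # Unnormalized autocorrelation at this lag.
        score = sum(samples[i] * samples[i + lag] for i in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag


# Synthetic 200 Hz tone, 0.1 s at 8 kHz, analyzed with the "female" preset.
tone = [math.sin(2 * math.pi * 200.0 * n / 8000) for n in range(800)]
low, high = PITCH_WINDOWS_HZ["female"]
f0 = estimate_f0(tone, 8000, low, high)
```

Restricting the lag search is what removes the per-file hand tuning: the script never considers candidates outside the window. A window that is too narrow for a given voice will give wrong estimates, which fits the abstract's caution about pathological voices.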
Acoustics of the trained versus untrained singing voice.
Howard, David M
2009-06-01
Acoustic voice analysis is now widely available on today's multimedia computers and knowledge of the acoustics of the trained and untrained singing voice has advanced dramatically in recent years. New techniques have emerged that are providing clearer representations of aspects of the physiology of voice function and a greater understanding of the differences between the voices of untrained and trained singers. Improvements in endoscope technology are changing understanding of vocal fold function and videokymography provides a new way of interpreting the output; some new and interesting possibilities are emerging. Larynx height variation is a feature of untrained singing and singing in different styles and its measurement has been inaccurate hitherto; perhaps the laryngoaltimeter will provide a solution. Magnetic resonance imaging is now a vital tool for vocal tract shape measurement but a new bio-inspired computing approach is offering a possible alternative. Differences between an untrained and trained singing voice lie in one or more of breathing technique, larynx settings or vocal tract settings. Measurement techniques in each of these areas are important to provide data on the singing voice, and accurate data are essential for natural personalized electronic voice synthesis in the future.
Auditory traits of "own voice".
Kimura, Marino; Yotsumoto, Yuko
2018-01-01
People perceive their recorded voice differently from their actively spoken voice. The uncanny valley theory proposes that as an object approaches humanlike characteristics, there is an increase in the sense of familiarity; however, eventually a point is reached where the object becomes strangely similar and makes us feel uneasy. The feeling of discomfort experienced when people hear their recorded voice may correspond to the floor of the proposed uncanny valley. To overcome the feeling of eeriness of own-voice recordings, previous studies have suggested equalization of the recorded voice with various types of filters, such as step, bandpass, and low-pass, yet the effectiveness of these filters has not been evaluated. To address this, the aim of experiment 1 was to identify what type of voice recording was the most representative of one's own voice. The voice recordings were presented in five different conditions: unadjusted recorded voice, step filtered voice, bandpass filtered voice, low-pass filtered voice, and a voice for which the participants freely adjusted the parameters. We found large individual differences in the most representative own-voice filter. In order to consider roles of sense of agency, experiment 2 investigated if lip-synching would influence the rating of own voice. The result suggested lip-synching did not affect own voice ratings. In experiment 3, based on the assumption that the voices used in previous experiments corresponded to continuous representations of non-own voice to own voice, the existence of an uncanny valley was examined. Familiarity, eeriness, and the sense of own voice were rated. The result did not support the existence of an uncanny valley. Taken together, the experiments led us to the following conclusions: there is no general filter that can represent own voice for everyone, sense of agency has no effect on own voice rating, and the uncanny valley does not exist for own voice, specifically.
Remote-area health care delivery through space technology - STARPAHC
NASA Technical Reports Server (NTRS)
Belasco, N.; Johnston, R. S.; Stonesifer, J. C.; Pool, S. L.
1977-01-01
A joint NASA/HEW project called Space Technology Applied to Rural Papago Advanced Health Care (STARPAHC) has been developed to deliver quality health care to inhabitants of remote geographical areas. The system consists of a hospital-based support control center, a fixed clinic, a mobile clinic, and a referral center with access to specialists via television links to the control center. A strategically located relay station routes television, voice, and data transmissions between system elements. A model system has been installed on the Papago Indian Reservation in Arizona, and is undergoing a 2-year evaluation. The system has been shown to be both effective and cost-efficient, and applications of the concept are planned for future manned spacecraft flights.
A cyber-physical management system for delivering and monitoring surgical instruments in the OR.
Li, Yu-Ting; Jacob, Mithun; Akingba, George; Wachs, Juan P
2013-08-01
The standard practice in the operating room (OR) is having a surgical technician deliver surgical instruments to the surgeon quickly and inexpensively, as required. This human "in the loop" system may result in mistakes (eg, missing information, ambiguity of instructions, and delays). Errors can be reduced or eliminated by integrating information technology (IT) and cybernetics into the OR. Gesture and voice automatic acquisition, processing, and interpretation allow interaction with these new systems without disturbing the normal flow of surgery. This article describes the development of a cyber-physical management system (CPS), including a robotic scrub nurse, to support surgeons by passing surgical instruments during surgery as required and recording counts of surgical instruments into a personal health record (PHR). The robot used responds to hand signals and voice messages detected through sophisticated computer vision and data mining techniques. The CPS was tested during a mock surgery in the OR. The in situ experiment showed that the robot recognized hand gestures reliably (with an accuracy of 97%), it can retrieve instruments as close as 25 mm, and the total delivery time was less than 3 s on average. This online health tool allows the exchange of clinical and surgical information to electronic medical record-based and PHR-based applications among different hospitals, regardless of the style viewer. The CPS has the potential to be adopted in the OR to handle surgical instruments and track them in a safe and accurate manner, releasing the human scrub tech from these tasks.
Interactive Communication: A Few Research Answers for a Technological Explosion.
ERIC Educational Resources Information Center
Chapanis, Alphonse
The techniques, procedures, and principal findings of 15 different experiments in a research program on interactive communication are summarized in this paper. Among the principal findings reported are that: problems are solved faster in communication modes that have a voice channel than in those that do not have a voice channel, modes of…
Amplifying Youth Voices in the Developing World
ERIC Educational Resources Information Center
Fotenos, Saori; Rohatgi, Deepti
2007-01-01
In the past few years, an explosion of user-generated content has flooded the Internet. The dramatic drop in the cost of digital video equipment and the increased accessibility of the Internet create a unique opportunity to allow youth to create meaningful content. Today youth around the world can leverage technological tools to give voice to…
Digital voice recording: An efficient alternative for data collection
Mark A. Rumble; Thomas M. Juntti; Thomas W. Bonnot; Joshua J. Millspaugh
2009-01-01
Study designs are usually constrained by logistical and budgetary considerations that can affect the depth and breadth of the research. Little attention has been paid to increasing the efficiency of data recording. Digital voice recording and translation may offer improved efficiency of field personnel. Using this technology, we increased our data collection by 55...
New Civic Voices & the Emerging Media Literacy Landscape
ERIC Educational Resources Information Center
Mihailidis, Paul
2011-01-01
The recent protests across the Middle East, generally referred to as the largest civic uprising enabled by social media platforms and mobile technologies, are actively reshaping how one thinks about citizenship, community, and participation in the 21st Century. Within these new spaces have emerged voices--largely those of everyday citizens--that…
Image/Music/Voice: Song Dubbing in Hollywood Musicals.
ERIC Educational Resources Information Center
Siefert, Marsha
1995-01-01
Uses the practice of song dubbing in the Hollywood film musical to explore the implications and consequences of the singing voice for imaging practices in the 1930s through 1960s. Discusses the ideological, technological, and socioeconomic basis for song dubbing. Discusses gender, race, and ethnicity patterns of image-sound practices. (SR)
Using Video Technology to Enable Student Voice in Assessment Feedback
ERIC Educational Resources Information Center
Van der Kleij, Fabienne; Adie, Lenore; Cumming, Joy
2017-01-01
Students' voices have been remarkably absent in feedback research, yet research shows that the way students engage with feedback significantly impacts on its effect on learning. Feedback research has mainly focused on aspects of the feedback message between a sender and receiver, with little consideration of the positioning of students in this…
Shaping the Future of Learning Using the Student Voice: We're Listening but Are We Hearing Clearly?
ERIC Educational Resources Information Center
Meadows, Chris; Soper, Kate; Cullen, Rod; Wasiuk, Catherine; McAllister-Gibson, Colin; Danby, Phil
2016-01-01
Student voice data is a key factor as Manchester Metropolitan University strives to continually improve institutional technology enhanced learning (TEL) infrastructure. A bi-annual Institutional Student Survey enables students to communicate their experience of learning, teaching and assessment on programmes and specific units studied. Each cycle…
Epistemological Pluralism: Styles and Voices within the Computer Culture.
ERIC Educational Resources Information Center
Papert, Seymour; Turkle, Sherry
1992-01-01
Describes a study in which the computer has emerged as an important factor in revolution of concrete--privileged medium for the growth of alternative voices in addressing the world of formal systems. Focuses on a concrete way of knowing and concludes that recent technological developments in interfaces, programming philosophy, and artificial…
Exploring AI Language Assistants with Primary EFL Students
ERIC Educational Resources Information Center
Underwood, Joshua
2017-01-01
The main objective of this study was to identify ways to incorporate voice-driven Artificial Intelligence (AI) effectively in classroom language learning. This nine month teacher-led design research study employed technology probes (Amazon's Alexa, Apple's Siri, Google voice search) and co-design methods with a class of primary age English as a…
Transmasculine People's Voice Function: A Review of the Currently Available Evidence.
Azul, David; Nygren, Ulrika; Södersten, Maria; Neuschaefer-Rube, Christiane
2017-03-01
This study aims to evaluate the currently available discursive and empirical data relating to those aspects of transmasculine people's vocal situations that are not primarily gender-related, to identify restrictions to voice function that have been observed in this population, and to make suggestions for future voice research and clinical practice. We conducted a comprehensive review of the voice literature. Publications were identified by searching six electronic databases and bibliographies of relevant articles. Twenty-two publications met inclusion criteria. Discourses and empirical data were analyzed for factors and practices that impact on voice function and for indications of voice function-related problems in transmasculine people. The quality of the evidence was appraised. The extent and quality of studies investigating transmasculine people's voice function was found to be limited. There was mixed evidence to suggest that transmasculine people might experience restrictions to a range of domains of voice function, including vocal power, vocal control/stability, glottal function, pitch range/variability, vocal endurance, and voice quality. More research into the different factors and practices affecting transmasculine people's voice function that takes account of a range of parameters of voice function and considers participants' self-evaluations is needed to establish how functional voice production can be best supported in this population. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Response time effects of alerting tone and semantic context for synthesized voice cockpit warnings
NASA Technical Reports Server (NTRS)
Simpson, C. A.; Williams, D. H.
1980-01-01
Some handbooks and human factors design guides have recommended that a voice warning should be preceded by a tone to attract attention to the warning. As far as can be determined from a search of the literature, no experimental evidence supporting this exists. A fixed-base simulator flown by airline pilots was used to test the hypothesis that the total 'system-time' to respond to a synthesized voice cockpit warning would be longer when the message was preceded by a tone because the voice itself was expected to perform both the alerting and the information transfer functions. The simulation included realistic ATC radio voice communications, synthesized engine noise, cockpit conversation, and realistic flight routes. The effect of a tone before a voice warning was to lengthen response time; that is, responses were slower with an alerting tone. Lengthening the voice warning with another word, however, did not increase response time.
Interactive Voice/Web Response System in clinical research
Ruikar, Vrishabhsagar
2016-01-01
Emerging technologies in the computer and telecommunication industries have eased access to computers through the telephone. An Interactive Voice/Web Response System (IxRS) is one of the user-friendly systems for end users, with complex and tailored programs at its backend. The backend programs are specially tailored for easy understanding by users. The clinical research industry has experienced a revolution in data capture methodologies over time. Different systems have evolved toward modern technologies and tools over the past couple of decades, for example, Electronic Data Capture, IxRS, electronic patient-reported outcomes, etc. PMID:26952178
Predicting mutational change in the speaking voice of boys.
Fuchs, Michael; Fröehlich, Matthias; Hentschel, Bettina; Stuermer, Ingo W; Kruse, Eberhard; Knauft, Daniel
2007-03-01
The authors investigated whether acoustic speaking voice analyses can be used to predict the beginning of mutation in 21 male members of a professional boys' choir. Over a period of 3 years before mutation, children were examined every 3 months by ear, nose, and throat (ENT) and phoniatric specialists. At the same time, the voice was evaluated acoustically using analysis features of the Goettingen Hoarseness Diagram (GHD). Irregularity component and noise component, jitter, shimmer, mean waveform correlation coefficient, and fundamental frequency were determined from recordings of the speaking voice. Significant changes of acoustic features appeared 7 and 5 months before mutation onset, which indicates that vocal function is already restricted 6 months before mutation onset. This acoustic voice analysis is therefore suitable to support the care of the professional singing voice.
Tomicic, Alemka; Martínez, Claudio; Pérez, J Carola; Hollenstein, Tom; Angulo, Salvador; Gerstmann, Adam; Barroux, Isabelle; Krause, Mariane
2015-01-01
This study seeks to provide evidence of the dynamics associated with the configurations of discourse-voice regulatory strategies in patient-therapist interactions in relevant episodes within psychotherapeutic sessions. Its central assumption is that discourses manifest themselves differently in terms of their prosodic characteristics according to their regulatory functions in a system of interactions. The association between discourse and vocal quality in patients and therapists was analyzed in a sample of 153 relevant episodes taken from 164 sessions of five psychotherapies using the state space grid (SSG) method, a graphical tool based on the dynamic systems theory (DST). The results showed eight recurrent and stable discourse-voice regulatory strategies of the patients and three of the therapists. Also, four specific groups of these discourse-voice strategies were identified. The latter were interpreted as regulatory configurations, that is to say, as emergent self-organized groups of discourse-voice regulatory strategies constituting specific interactional systems. Both regulatory strategies and their configurations differed between two types of relevant episodes: Change Episodes and Rupture Episodes. As a whole, these results support the assumption that speaking and listening, as dimensions of the interaction that takes place during therapeutic conversation, occur at different levels. The study not only shows that these dimensions are dependent on each other, but also that they function as a complex and dynamic whole in therapeutic dialog, generating relational offers which allow the patient and the therapist to regulate each other and shape the psychotherapeutic process that characterizes each type of relevant episode.
Temporal Lobe Epilepsy Alters Auditory-motor Integration For Voice Control
Li, Weifeng; Chen, Ziyi; Yan, Nan; Jones, Jeffery A.; Guo, Zhiqiang; Huang, Xiyan; Chen, Shaozhen; Liu, Peng; Liu, Hanjun
2016-01-01
Temporal lobe epilepsy (TLE) is the most common drug-refractory focal epilepsy in adults. Previous research has shown that patients with TLE exhibit decreased performance in listening to speech sounds and deficits in the cortical processing of auditory information. Whether TLE compromises auditory-motor integration for voice control, however, remains largely unknown. To address this question, event-related potentials (ERPs) and vocal responses to vocal pitch errors (1/2 or 2 semitones upward) heard in auditory feedback were compared across 28 patients with TLE and 28 healthy controls. Patients with TLE produced significantly larger vocal responses but smaller P2 responses than healthy controls. Moreover, patients with TLE exhibited a positive correlation between vocal response magnitude and baseline voice variability and a negative correlation between P2 amplitude and disease duration. Graphical network analyses revealed a disrupted neuronal network for patients with TLE with a significant increase of clustering coefficients and path lengths as compared to healthy controls. These findings provide strong evidence that TLE is associated with an atypical integration of the auditory and motor systems for vocal pitch regulation, and that the functional networks that support the auditory-motor processing of pitch feedback errors differ between patients with TLE and healthy controls. PMID:27356768
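The pitch perturbations in the abstract above are given in semitones. As a minimal sketch of what those sizes mean in frequency terms, the snippet below uses the standard equal-tempered relation (a shift of s semitones multiplies frequency by 2^(s/12), and one semitone equals 100 cents). The function names and the 200 Hz baseline are illustrative assumptions, not values taken from the study.

```python
# Sketch only: converts semitone shifts to frequency ratios and cents.
# The 200 Hz baseline below is a hypothetical value, not taken from the study.

def semitones_to_ratio(semitones: float) -> float:
    """Frequency ratio for an equal-tempered shift of the given size."""
    return 2.0 ** (semitones / 12.0)


def semitones_to_cents(semitones: float) -> float:
    """One semitone is defined as 100 cents."""
    return 100.0 * semitones


def shifted_frequency_hz(baseline_hz: float, semitones: float) -> float:
    """Frequency heard in feedback after an upward shift of `semitones`."""
    return baseline_hz * semitones_to_ratio(semitones)


if __name__ == "__main__":
    baseline_hz = 200.0  # hypothetical speaking fundamental frequency
    for shift in (0.5, 2.0):  # the two perturbation sizes named in the abstract
        print(
            f"{shift} semitones = {semitones_to_cents(shift):.0f} cents, "
            f"ratio {semitones_to_ratio(shift):.4f}, "
            f"{baseline_hz:.0f} Hz -> {shifted_frequency_hz(baseline_hz, shift):.1f} Hz"
        )
```

Under these assumptions, the smaller perturbation changes frequency by roughly 3% and the larger one by roughly 12%.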
Spirituality and hearing voices: considering the relation
McCarthy-Jones, Simon; Waegeli, Amanda; Watkins, John
2013-01-01
For millennia, some people have heard voices that others cannot hear. These have been variously understood as medical, psychological and spiritual phenomena. In this article we consider the specific role of spirituality in voice-hearing in two ways. First, we examine how spirituality may help or hinder people who hear voices. Benefits are suggested to include offering an alternative meaning to the experience which can give more control and comfort, enabling the development of specific coping strategies, increasing social support, and encouraging forgiveness. Potential drawbacks are noted to include increased distress and reduced control resulting from placing frightening or coercive constructions on voices, social isolation, the development of dysfunctional beliefs, and missed/delayed opportunities for successful mental health interventions. After examining problems surrounding classifying voices as either spiritual or psychotic, we move beyond an essentialist position to examine how such a classification is likely to be fluid, and how a given voice may move between these designations. We also highlight tensions between modernist and postmodernist approaches to voice-hearing. PMID:24273597
Teacher Voice in Global Conversations around Education Access, Equity, and Quality
ERIC Educational Resources Information Center
Gozali, Charlina; Claassen Thrush, Elizabeth; Soto-Peña, Michelle; Whang, Christine; Luschei, Thomas F.
2017-01-01
Despite public commitments internationally and nationally to include the voices of all stakeholders, the voices of teachers have continued to be marginalized in the literature and in policy-making related to global educational development. The purpose of the current study is to examine the process of invoking teacher voice using a sample of…
ERIC Educational Resources Information Center
Perrachione, Tyler K.; Wong, Patrick C. M.
2007-01-01
Brain imaging studies of voice perception often contrast activation from vocal and verbal tasks to identify regions uniquely involved in processing voice. However, such a strategy precludes detection of the functional relationship between speech and voice perception. In a pair of experiments involving identifying voices from native and foreign…
ERIC Educational Resources Information Center
Dong, Jie; Dong, Yan
2013-01-01
This article explores voicing processes of identity construction among labor immigrants both inside China and in the Dutch Chinese Diaspora. We provide ethnographically grounded data oriented toward a theoretical point: voicing is an essential problem in communication. Whether one is able to achieve his voice--an outcome of a communicative…
Factors associated with patterns of mobile technology use among persons who inject drugs.
Collins, Kelly M; Armenta, Richard F; Cuevas-Mota, Jazmine; Liu, Lin; Strathdee, Steffanie A; Garfein, Richard S
2016-01-01
New and innovative methods of delivering interventions are needed to further reduce risky behaviors and increase overall health among persons who inject drugs (PWID). Mobile health (mHealth) interventions have potential for reaching PWID; however, little is known about mobile technology use (MTU) in this population. In this study, the authors identified patterns of MTU and factors associated with MTU among a cohort of PWID. Data were collected through a longitudinal cohort study examining drug use, risk behaviors, and health status among PWID in San Diego, California. Latent class analysis (LCA) was used to define patterns of MTU (i.e., making voice calls, text messaging, and mobile Internet access). Multinomial logistic regression was then used to identify demographic characteristics, risk behaviors, and health indicators associated with mobile technology use class. In LCA, a 4-class solution fit the data best. Class 1 was defined by low MTU (22%, n = 100); class 2, by PWID who accessed the Internet using a mobile device but did not use voice or text messaging (20%, n = 95); class 3, by primarily voice, text, and connected Internet use (17%, n = 91); and class 4, by high MTU (41%, n = 175). Compared with low MTU, high MTU class members were more likely to be younger, have higher socioeconomic status, sell drugs, and inject methamphetamine daily. The majority of PWID in San Diego use mobile technology for voice, text, and/or Internet access, indicating that rapid uptake of mHealth interventions may be possible in this population. However, low ownership and use of mobile technology among older and/or homeless individuals will need to be considered when implementing mHealth interventions among PWID.
Top 10 "Smart" Technologies for Schools.
ERIC Educational Resources Information Center
Fodeman, Doug; Holzberg, Carol S.; Kennedy, Kristen; McIntire, Todd; McLester, Susan; Ohler, Jason; Parham, Charles; Poftak, Amy; Schrock, Kathy; Warlick, David
2002-01-01
Describes 10 smart technologies for education, including voice to text software; mobile computing; hybrid computing; virtual reality; artificial intelligence; telementoring; assessment methods; digital video production; fingerprint recognition; and brain functions. Lists pertinent Web sites for each technology. (LRW)
Pavlovian Conditioning-Induced Hallucinations Result from Overweighting of Perceptual Priors
Powers, A.R.; Mathys, C.; Corlett, P.R.
2018-01-01
Some people hear voices that others do not, but only some of those people seek treatment. Using a Pavlovian learning task, we induced conditioned hallucinations in four groups of people who differed orthogonally in their voice-hearing and treatment-seeking statuses. People who hear voices were significantly more susceptible to the effect. Using functional neuroimaging and computational modeling of perception, we identified processes that differentiated voice-hearers from non-voice-hearers and treatment-seekers from non-treatment-seekers and characterized a brain circuit that mediated the conditioned hallucinations. These data demonstrate the profound and sometimes pathological impact of top-down cognitive processes on perception and may represent an objective means to discern people with a need for treatment from those without. PMID:28798131
Ethernet-Based Services for Next Generation Networks
NASA Astrophysics Data System (ADS)
Hernandez-Valencia, Enrique
Over the last few years, Ethernet technology and services have emerged as an indispensable component of the broadband networking and telecommunications infrastructure, both for network operators and service providers. As an example, worldwide enterprise customer demand for Ethernet services by itself is expected to hit the 30B US mark by year 2012. Use of Ethernet technology in the feeder networks that support residential applications, such as "triple play" voice, data, and video services, is equally on the rise. As the synergies between packet-aware transport and service-oriented equipment continue to be exploited in the path toward transport convergence, Ethernet technology is expected to play a critical part in the evolution toward converged Optical/Packet Transport networks. Here we discuss the main business motivations, services, and technologies driving the specifications of so-called carrier Ethernet and highlight challenges associated with delivering the expectations for low implementation complexity, ease of use, provisioning and management of networks and network elements embracing this technology.
Föcker, Julia; Best, Anna; Hölig, Cordula; Röder, Brigitte
2012-07-01
Blind people rely much more on voices compared to sighted individuals when identifying other people. Previous research has suggested a faster processing of auditory input in blind individuals than sighted controls and an enhanced activation of temporal cortical regions during voice processing. The present study used event-related potentials (ERPs) to single out the sub-processes of auditory person identification that change and allow for superior voice processing after congenital blindness. A priming paradigm was employed in which two successive voices (S1 and S2) of either the same (50% of the trials) or different actors were presented. Congenitally blind and matched sighted participants made an old-young decision on the S2. During the pre-experimental familiarization with the stimuli, congenitally blind individuals showed faster learning rates than sighted controls. Reaction times were shorter in person-congruent trials than in person-incongruent trials in both groups. ERPs to S2 stimuli in person-incongruent as compared to person-congruent trials were significantly enhanced at early processing stages (100-160 ms) in congenitally blind participants only. A later negative ERP effect (>200 ms) was found in both groups. The scalp topographies of the experimental effects were characterized by a central and parietal distribution in the sighted but a more posterior distribution in the congenitally blind. These results provide evidence for an improvement of early voice processing stages and a reorganization of the person identification system as a neural correlate of compensatory behavioral improvements following congenital blindness. Copyright © 2012 Elsevier Ltd. All rights reserved.
McCormick, Michael; Seta, John J
2012-01-01
An attribute framing effect occurs when positive or negative associations produced by positive or negative frames are mapped onto evaluations resulting in a more favourable evaluation for the positively framed attribute. We used a new voice frequency manipulation to differentially enhance right versus left hemisphere processing. In doing so we found a strong attribute framing effect when a speaker with a low-frequency voice enhanced the contextual processing style of the right hemisphere. However, a framing effect was not obtained when a speaker with a high-frequency voice enhanced the inferential/analytical processing style of the left hemisphere. At the theoretical level our results provide evidence that the contextual processing style of the right hemisphere is especially susceptible to associative implications, such as those found in attribute framing manipulations. At the applied level we provide a simple method for altering the effectiveness of persuasion messages.
NASA Astrophysics Data System (ADS)
Bibac, Ionut
2005-08-01
The UMTS Bearer Independent Core Network program introduced the 3rd Generation Partnership Program Release 4 BICN architecture into the legacy UMTS TDM-switched network. BICN is the application of call server architecture for voice and circuit switched data, enabling the provisioning of traditional circuit-switched services using a packet-switched transport network. Today's business climate has made it essential for service providers to develop a comprehensive networking strategy that means introduction of R4-BICN networks. The R4-BICN solution to the evolution of the Core Network in UMTS will enable operators to significantly reduce the capital and operational costs of delivering both traditional voice services and new multimedia services. To build the optical backbone, which can support the third generation (3G) packetized infrastructure, the operators could choose a fibre connection, or they could retain the benefits of a wireless connectivity by using a FSO - Free Space Optical link, the only wireless technology available that is capable of achieving data rates up to 2.4 Gbit/s. FSO offers viable alternatives for both core transmission networks and for replacing microwave links in NodeB - RNC access networks. The paper and presentation aim to demonstrate the manner in which FSO products and networks are employed into R4-BICN design solutions.
Keus van de Poll, Marijke; Carlsson, Johannes; Marsh, John E; Ljung, Robert; Odelius, Johan; Schlittmeier, Sabine J; Sundin, Gunilla; Sörqvist, Patrik
2015-08-01
Broadband noise is often used as a masking sound to combat the negative consequences of background speech on performance in open-plan offices. As office workers generally dislike broadband noise, it is important to find alternatives that are more appreciated while being at least not less effective. The purpose of experiment 1 was to compare broadband noise with two alternatives (multiple voices and water waves) in the context of a serial short-term memory task. A single voice impaired memory in comparison with silence, but when the single voice was masked with multiple voices, performance was on level with silence. Experiment 2 explored the benefits of multiple-voice masking in more detail (by comparing one voice, three voices, five voices, and seven voices) in the context of word processed writing (arguably a more office-relevant task). Performance (i.e., writing fluency) increased linearly from worst performance in the one-voice condition to best performance in the seven-voice condition. Psychological mechanisms underpinning these effects are discussed.
Voice Disorders in Teacher Students-A Prospective Study and a Randomized Controlled Trial.
Ohlsson, Ann-Christine; Andersson, Eva M; Södersten, Maria; Simberg, Susanna; Claesson, Silwa; Barregård, Lars
2016-11-01
Teachers are at risk of developing voice disorders, but longitudinal studies on voice problems among teachers are lacking. The aim of this randomized trial was to investigate long-term effects of voice education for teacher students with mild voice problems. In addition, vocal health was examined prospectively in a group of students without voice problems. First-semester students answered three questionnaires: one about background factors, one about voice symptoms (Screen6), and the Voice Handicap Index. Students with voice problems according to the questionnaire results were randomized to a voice training group or a control group. At follow-up in the sixth semester, all students answered Screen6 again together with four questions about factors that could have affected vocal health during their teacher education. The training group and the control group also answered the Voice Handicap Index a second time. At follow-up, 400 students remained in the study: 27 in the training group, 54 in the control group, and 319 without voice problems at baseline. Voice problems had decreased somewhat more in the training group than in the control group, but the difference was not statistically significant (P = 0.1). However, subgroup analyses showed significantly larger improvement among the students in the group with complete participation in the training program compared with the group with incomplete participation. Of the 319 students without voice problems at baseline, 14% had developed voice problems. Voice problems often develop in teacher students. Despite extensive dropout, our results support the hypothesis that voice education for teacher students has a preventive effect. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Reflux and Voice Disorders: Have We Established Causality?
Schneider, G. Todd; Vaezi, Michael F.; Francis, David O.
2017-01-01
A trend of attributing abnormal voice changes to reflux has gained momentum among medical professionals over the last few decades. Evidence supporting the connection between reflux and voice and the use of anti-reflux medication in patients with dysphonia is conflicting and deserves careful examination. In the current health care environment, it is important that medical decisions be based on science rather than anecdote and practice patterns. The goal of this review is to investigate the evidence linking reflux and voice changes. Specifically, this association will be examined in the context of the Bradford-Hill criteria to determine what evidence exists for a causal relationship between this exposure (reflux) and outcome (voice change). PMID:28163985
Full Duplex, Spread Spectrum Radio System
NASA Technical Reports Server (NTRS)
Harvey, Bruce A.
2000-01-01
The goal of this project was to support the development of a full duplex, spread spectrum voice communications system. The assembly and testing of a prototype system consisting of a Harris PRISM spread spectrum radio, a TMS320C54x signal processing development board and a Zilog Z80180 microprocessor was underway at the start of this project. The efforts under this project were the development of multiple access schemes, analysis of full duplex voice feedback delays, and the development and analysis of forward error correction (FEC) algorithms. The multiple access analysis involved the selection between code division multiple access (CDMA), frequency division multiple access (FDMA) and time division multiple access (TDMA). Full duplex voice feedback analysis involved the analysis of packet size and delays associated with full loop voice feedback for confirmation of radio system performance. FEC analysis included studies of the performance under the expected burst error scenario with the relatively short packet lengths, and analysis of implementation in the TMS320C54x digital signal processor. When the capabilities and the limitations of the components used were considered, the multiple access scheme chosen was a combination TDMA/FDMA scheme that will provide up to eight users on each of three separate frequencies. Packets to and from each user will consist of 16 samples at a rate of 8,000 samples per second for a total of 2 ms of voice information. The resulting voice feedback delay will therefore be 4 - 6 ms. The most practical FEC algorithm for implementation was a convolutional code with a Viterbi decoder. Interleaving of the bits of each packet will be required to offset the effects of burst errors.
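The figures quoted in this abstract can be checked with a few lines of arithmetic, and the role of interleaving can be illustrated with a toy example. The sketch below is illustrative only: the constants come from the abstract, but the block interleaver and its 4 x 8 dimensions are hypothetical stand-ins, since the abstract does not say which interleaver the project used. One possible reading of the 4 - 6 ms feedback delay is that a full loop spans at least two 2 ms packets; the abstract does not break the figure down.

```python
# Constants quoted in the abstract.
SAMPLES_PER_PACKET = 16
SAMPLE_RATE_HZ = 8_000
USERS_PER_FREQUENCY = 8
FREQUENCIES = 3


def packet_duration_ms(samples=SAMPLES_PER_PACKET, rate_hz=SAMPLE_RATE_HZ):
    """Voice time carried by one packet, in milliseconds."""
    return 1000.0 * samples / rate_hz


def total_users(users_per_freq=USERS_PER_FREQUENCY, freqs=FREQUENCIES):
    """Maximum simultaneous users of the combined TDMA/FDMA scheme."""
    return users_per_freq * freqs


def interleave(bits, rows, cols):
    """Hypothetical block interleaver: write row by row, read column by column."""
    if len(bits) != rows * cols:
        raise ValueError("bits must fill the rows x cols block exactly")
    return [bits[r * cols + c] for c in range(cols) for r in range(rows)]


def deinterleave(bits, rows, cols):
    """Inverse of interleave(): restore the original bit order."""
    if len(bits) != rows * cols:
        raise ValueError("bits must fill the rows x cols block exactly")
    out = [None] * (rows * cols)
    i = 0
    for c in range(cols):
        for r in range(rows):
            out[r * cols + c] = bits[i]
            i += 1
    return out


if __name__ == "__main__":
    print("packet duration (ms):", packet_duration_ms())
    print("total users:", total_users())
    # Label each bit with its original position to see where a burst lands.
    positions = list(range(32))
    sent = interleave(positions, rows=4, cols=8)
    print("original positions hit by a 4-bit burst:", sent[:4])
```

With these assumed dimensions, a burst that corrupts four consecutive transmitted bits touches original positions that are eight bits apart, which is the kind of spreading that lets a convolutional code with a Viterbi decoder treat the errors as isolated.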
Infant face interest is associated with voice information and maternal psychological health.
Taylor, Gemma; Slade, Pauline; Herbert, Jane S
2014-11-01
Early infant interest in their mother's face is driven by an experience based face processing system, and is associated with maternal psychological health, even within a non clinical community sample. The present study examined the role of the voice in eliciting infants' interest in mother and stranger faces and in the association between infant face interest and maternal psychological health. Infants aged 3.5-months were shown photographs of their mother's and a stranger's face paired with an audio recording of their mother's and a stranger's voice that was either matched (e.g., mother's face and voice) or mismatched (e.g., mother's face and stranger's voice). Infants spent more time attending to the stranger's matched face and voice than the mother's matched face and voice and the mismatched faces and voices. Thus, infants demonstrated an earlier preference for a stranger's face when given voice information than when the face is presented alone. In the present sample, maternal psychological health varied with 56.7% of mothers reporting mild mood symptoms (depression, anxiety or stress response to childbirth). Infants of mothers with significant mild maternal mood symptoms looked longer at the faces and voices compared to infants of mothers who did not report mild maternal mood symptoms. In sum, infants' experience based face processing system is sensitive to their mothers' maternal psychological health and the multimodal nature of faces. Copyright © 2014 Elsevier Inc. All rights reserved.
‘Inner voices’: the cerebral representation of emotional voice cues described in literary texts
Kreifelts, Benjamin; Gößling-Arnold, Christina; Wertheimer, Jürgen; Wildgruber, Dirk
2014-01-01
While non-verbal affective voice cues are generally recognized as a crucial behavioral guide in any day-to-day conversation their role as a powerful source of information may extend well beyond close-up personal interactions and include other modes of communication such as written discourse or literature as well. Building on the assumption that similarities between the different ‘modes’ of voice cues may not only be limited to their functional role but may also include cerebral mechanisms engaged in the decoding process, the present functional magnetic resonance imaging study aimed at exploring brain responses associated with processing emotional voice signals described in literary texts. Emphasis was placed on evaluating ‘voice’ sensitive as well as task- and emotion-related modulations of brain activation frequently associated with the decoding of acoustic vocal cues. Obtained findings suggest that several similarities emerge with respect to the perception of acoustic voice signals: results identify the superior temporal, lateral and medial frontal cortex as well as the posterior cingulate cortex and cerebellum to contribute to the decoding process, with similarities to acoustic voice perception reflected in a ‘voice’-cue preference of temporal voice areas as well as an emotion-related modulation of the medial frontal cortex and a task-modulated response of the lateral frontal cortex. PMID:24396008
Thomas, Neil; Hayward, Mark; Peters, Emmanuelle; van der Gaag, Mark; Bentall, Richard P.; Jenner, Jack; Strauss, Clara; Sommer, Iris E.; Johns, Louise C.; Varese, Filippo; García-Montes, José Manuel; Waters, Flavie; Dodgson, Guy; McCarthy-Jones, Simon
2014-01-01
This report from the International Consortium on Hallucinations Research considers the current status and future directions in research on psychological therapies targeting auditory hallucinations (hearing voices). Therapy approaches have evolved from behavioral and coping-focused interventions, through formulation-driven interventions using methods from cognitive therapy, to a number of contemporary developments. Recent developments include the application of acceptance- and mindfulness-based approaches, and consolidation of methods for working with connections between voices and views of self, others, relationships and personal history. In this article, we discuss the development of therapies for voices and review the empirical findings. This review shows that psychological therapies are broadly effective for people with positive symptoms, but that more research is required to understand the specific application of therapies to voices. Six key research directions are identified: (1) moving beyond the focus on overall efficacy to understand specific therapeutic processes targeting voices, (2) better targeting psychological processes associated with voices such as trauma, cognitive mechanisms, and personal recovery, (3) more focused measurement of the intended outcomes of therapy, (4) understanding individual differences among voice hearers, (5) extending beyond a focus on voices and schizophrenia into other populations and sensory modalities, and (6) shaping interventions for service implementation. PMID:24936081
Brain systems mediating voice identity processing in blind humans.
Hölig, Cordula; Föcker, Julia; Best, Anna; Röder, Brigitte; Büchel, Christian
2014-09-01
Blind people rely more on vocal cues when they recognize a person's identity than sighted people. Indeed, a number of studies have reported better voice recognition skills in blind than in sighted adults. The present functional magnetic resonance imaging study investigated changes in the functional organization of neural systems involved in voice identity processing following congenital blindness. A group of congenitally blind individuals and matched sighted control participants were tested in a priming paradigm, in which two voice stimuli (S1, S2) were subsequently presented. The prime (S1) and the target (S2) were either from the same speaker (person-congruent voices) or from two different speakers (person-incongruent voices). Participants had to classify the S2 as either an old or a young person. Person-incongruent voices (S2) compared with person-congruent voices elicited an increased activation in the right anterior fusiform gyrus in congenitally blind individuals but not in matched sighted control participants. In contrast, only matched sighted controls showed a higher activation in response to person-incongruent compared with person-congruent voices (S2) in the right posterior superior temporal sulcus. These results provide evidence for crossmodal plastic changes of the person identification system in the brain after visual deprivation. Copyright © 2014 Wiley Periodicals, Inc.
Bearing Witness: Personal and Poetic Descriptions of Seizure Therapy.
Fink, Max
2016-03-01
Many voices comment on personal experience with induced seizures. The encouraging voices of professionals are contrasted with those in theater and film that stigmatize the treatments. The negative images have done much to support restrictive legislation.
Twenty-Channel Voice Response System
DOT National Transportation Integrated Search
1981-06-01
This report documents the design and implementation of a Voice Response System, which provides Direct-User Access to the FAA's aviation-weather data base. This system supports 20 independent audio channels, and as of this report, speaks three weather...
76 FR 16367 - Implementation of the Truth in Caller ID Act of 2009
Federal Register 2010, 2011, 2012, 2013, 2014
2011-03-23
... in technologies that are successor or replacement technologies to telecommunications services or IP... in technologies that are successor or replacement technologies to telecommunications services or IP... spoofing ``in connection with any telecommunications service or IP-enabled voice service.'' The proposed...
NASA Johnson Space Center Usability Testing and Analysis Facility (UTAF) Overview
NASA Technical Reports Server (NTRS)
Whitmore, M.
2004-01-01
The Usability Testing and Analysis Facility (UTAF) is part of the Space Human Factors Laboratory at the NASA Johnson Space Center in Houston, Texas. The facility provides support to the Office of Biological and Physical Research, the Space Shuttle Program, the International Space Station Program, and other NASA organizations. In addition, there are ongoing collaborative research efforts with external businesses and universities. The UTAF provides human factors analysis, evaluation, and usability testing of crew interfaces for space applications. This includes computer displays and controls, workstation systems, and work environments. The UTAF has a unique mix of capabilities, with a staff experienced in both cognitive human factors and ergonomics. The current areas of focus are: human factors applications in emergency medical care and informatics; control and display technologies for electronic procedures and instructions; voice recognition in noisy environments; crew restraint design for unique microgravity workstations; and refinement of human factors processes. This presentation will provide an overview of ongoing activities, and will address how the projects will evolve to meet new space initiatives.
Emotional voices in context: A neurobiological model of multimodal affective information processing
NASA Astrophysics Data System (ADS)
Brück, Carolin; Kreifelts, Benjamin; Wildgruber, Dirk
2011-12-01
Just as eyes are often considered a gateway to the soul, the human voice offers a window through which we gain access to our fellow human beings' minds - their attitudes, intentions and feelings. Whether in talking or singing, crying or laughing, sighing or screaming, the sheer sound of a voice communicates a wealth of information that, in turn, may serve the observant listener as a valuable guidepost in social interaction. But how do human beings extract information from the tone of a voice? In an attempt to answer this question, the present article reviews empirical evidence detailing the cerebral processes that underlie our ability to decode emotional information from vocal signals. The review will focus primarily on two prominent classes of vocal emotion cues: laughter and speech prosody (i.e. the tone of voice while speaking). Following a brief introduction, behavioral as well as neuroimaging data will be summarized that allow an outline of the cerebral mechanisms associated with the decoding of emotional voice cues, as well as the influence of various context variables (e.g. co-occurring facial and verbal emotional signals, attention focus, person-specific parameters such as gender and personality) on the respective processes. Building on the presented evidence, a cerebral network model will be introduced that proposes a differential contribution of various cortical and subcortical brain structures to the processing of emotional voice signals both in isolation and in the context of accompanying (facial and verbal) emotional cues.
Emotional voices in context: a neurobiological model of multimodal affective information processing.
Brück, Carolin; Kreifelts, Benjamin; Wildgruber, Dirk
2011-12-01
Just as eyes are often considered a gateway to the soul, the human voice offers a window through which we gain access to our fellow human beings' minds - their attitudes, intentions and feelings. Whether in talking or singing, crying or laughing, sighing or screaming, the sheer sound of a voice communicates a wealth of information that, in turn, may serve the observant listener as a valuable guidepost in social interaction. But how do human beings extract information from the tone of a voice? In an attempt to answer this question, the present article reviews empirical evidence detailing the cerebral processes that underlie our ability to decode emotional information from vocal signals. The review will focus primarily on two prominent classes of vocal emotion cues: laughter and speech prosody (i.e. the tone of voice while speaking). Following a brief introduction, behavioral as well as neuroimaging data will be summarized that allow an outline of the cerebral mechanisms associated with the decoding of emotional voice cues, as well as the influence of various context variables (e.g. co-occurring facial and verbal emotional signals, attention focus, person-specific parameters such as gender and personality) on the respective processes. Building on the presented evidence, a cerebral network model will be introduced that proposes a differential contribution of various cortical and subcortical brain structures to the processing of emotional voice signals both in isolation and in the context of accompanying (facial and verbal) emotional cues. Copyright © 2011 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Lancioni, Giulio E.; Singh, Nirbhay N.; O'Reilly, Mark F.; Sigafoos, Jeff; Green, Vanessa; Chiapparino, Claudia; Stasolla, Fabrizio; Oliva, Doretta
2009-01-01
The present study assessed the use of a voice-detecting sensor interfaced with a scanning keyboard emulator to allow two boys with extensive motor disabilities to write. Specifically, the study (a) compared the effects of the voice-detecting sensor with those of a familiar pressure sensor on the boys' writing time, (b) checked which of the sensors…
ERIC Educational Resources Information Center
Millward, Pam; Stephenson, Maxine S.; Rio, Nane; Anderson, Helen
2011-01-01
This paper describes a research project, "Voices from Manukau", that investigated the impact of a joint initiative by a university and an institute of technology in New Zealand. The purpose of the initiative was to increase the participation of students traditionally under represented at tertiary-level study, particularly Maori…
ERIC Educational Resources Information Center
Riemenschneider, Cynthia K.; Manly, Tracy S.; Leonard, Lori N. K.
2016-01-01
Academic integrity continues to be a concern for universities and faculty. Yet practical methods for conveying ethical behavior can be difficult to achieve. This study uses the multidimensional ethics scale to gain insight into three situations involving students. The findings from those scenarios are then framed using the "Giving Voice to…
Luzzi, Simona; Baldinelli, Sara; Ranaldi, Valentina; Fabi, Katia; Cafazzo, Viviana; Fringuelli, Fabio; Silvestrini, Mauro; Provinciali, Leandro; Reverberi, Carlo; Gainotti, Guido
2017-01-08
Famous face and voice recognition is reported to be impaired both in semantic dementia (SD) and in Alzheimer's Disease (AD), although more severely in the former. In AD a coexistence of perceptual impairment in face and voice processing has also been reported and this could contribute to the altered performance in complex semantic tasks. On the other hand, in SD both face and voice recognition disorders could be related to the prevalence of atrophy in the right temporal lobe (RTL). The aim of the present study was twofold: (1) to investigate famous face and voice recognition in SD and AD to verify if the two diseases show a differential pattern of impairment, resulting from disruption of different cognitive mechanisms; (2) to check if face and voice recognition disorders prevail in patients with atrophy mainly affecting the RTL. To avoid the potential influence of primary perceptual problems in face and voice recognition, a pool of patients suffering from early SD and AD were administered a detailed set of tests exploring face and voice perception. Thirteen SD (8 with prevalence of right and 5 with prevalence of left temporal atrophy) and 25 AD patients, who did not show visual and auditory perceptual impairment, were finally selected and were administered an experimental battery exploring famous face and voice recognition and naming. Twelve SD patients underwent cerebral PET imaging and were classified in right and left SD according to the onset modality and to the prevalent decrease in FDG uptake in right or left temporal lobe respectively. Correlation of PET imaging and famous face and voice recognition was performed. Results showed a differential performance profile in the two diseases, because AD patients were significantly impaired in the naming tests, but showed preserved recognition, whereas SD patients were profoundly impaired both in naming and in recognition of famous faces and voices. 
Furthermore, face and voice recognition disorders prevailed in SD patients with RTL atrophy, who also showed a conceptual impairment on the Pyramids and Palm Trees test more important in the pictorial than in the verbal modality. Finally, in 12 SD patients in whom PET was available, a strong correlation between FDG uptake and face-to-name and voice-to-name matching data was found in the right but not in the left temporal lobe. The data support the hypothesis of a different cognitive basis for impairment of face and voice recognition in the two dementias and suggest that the pattern of impairment in SD may be due to a loss of semantic representations, while a defect of semantic control, with impaired naming and preserved recognition, might be hypothesized in AD. Furthermore, the correlation between face and voice recognition disorders and RTL damage is consistent with the hypothesis assuming that in the RTL person-specific knowledge may be mainly based upon non-verbal representations. Copyright © 2016 Elsevier Ltd. All rights reserved.
Utilisation of Wearable Computing for Space Programmes Test Activities Optimisation
NASA Astrophysics Data System (ADS)
Basso, V.; Lazzari, D.; Alemanni, M.
2004-08-01
New technologies are becoming increasingly important in the space business, including in Assembly, Integration and Test (AIT) activities, where they allow process optimisation and capabilities that were unthinkable only a few years ago. This paper describes the experience Alenia Spazio (ALS) has gained with remote interaction techniques as a result of collaborations established both on European Communities (EC) initiatives and with Alenia Aeronautica (ALA) and Politecnico of Torino (POLITO). Increased hardware and software performance and reduced costs, driven by the massive use of home computing (especially the demands of the games business), together with the possibilities offered by network technology (the web, high-speed links and wireless communications), now make it possible to rethink traditional AIT process activities in the light of multimedia data exchange: graphics, voice, video and surely more in the future. The aerospace business confirms its vocation for innovation: in the 1980s it was the cradle of CAD systems, and today it is oriented toward 3D data visualisation/interaction technologies and collaborative remote visualisation/interaction on a much more user-friendly basis (i.e., usable by non-specialists). Fig. 1 collects the extended AIT scenario studied and adopted by ALS in recent years. ALS experimented with two possibilities for remote visualisation/interaction: portable screens [e.g., Fig. 2, Personal Digital Assistant (PDA), wearable] and wall screens (e.g., VR-Lab), both used as 2D/3D visualisation and interaction devices. These could support many types of traditional company-internal AIT applications (mainly based on EGSE and PDM/CAD utilisation/reports): (1) design review support; (2) facility management; (3) storage management; (4) personnel training; (5) integration sequence definition; (6) assembly and test operations follow-up; (7) documentation review; as well as external access to AIT activities for remote operations (e.g., tele-testing).
Mobile telephony through LEO satellites: To OBP or not
NASA Technical Reports Server (NTRS)
Monte, Paul A.; Louie, Ming; Wiedeman, R.
1991-01-01
GLOBALSTAR is a satellite-based mobile communications system that is interoperable with the current and future Public Land Mobile Network (PLMN) and Public Switched Telephone Network (PSTN). The selection of the transponder type, bent-pipe, or onboard processing (OBP), for GLOBALSTAR is based on many criteria, each of which is essential to the commercial and technological feasibility of GLOBALSTAR. The trade study that was done to determine the pros and cons of a bent-pipe transponder or an onboard processing transponder is described. The design of GLOBALSTAR's telecommunications system is a multi-variable cost optimization between the cost and complexity of individual satellites, the number of satellites required to provide coverage to the service areas, the cost of launching the satellites into their selected orbits, the ground segment cost, user equipment cost, satellite voice channel capacity, and other issues. Emphasis is on the cost and complexity of the individual satellites, specifically the transponder type and the impact of the transponder type on satellite and ground segment cost, satellite power and weight, and satellite voice channel capacity.
Mobile telephony through LEO satellites: To OBP or not
NASA Astrophysics Data System (ADS)
Monte, Paul A.; Louie, Ming; Wiedeman, R.
1991-11-01
GLOBALSTAR is a satellite-based mobile communications system that is interoperable with the current and future Public Land Mobile Network (PLMN) and Public Switched Telephone Network (PSTN). The selection of the transponder type, bent-pipe, or onboard processing (OBP), for GLOBALSTAR is based on many criteria, each of which is essential to the commercial and technological feasibility of GLOBALSTAR. The trade study that was done to determine the pros and cons of a bent-pipe transponder or an onboard processing transponder is described. The design of GLOBALSTAR's telecommunications system is a multi-variable cost optimization between the cost and complexity of individual satellites, the number of satellites required to provide coverage to the service areas, the cost of launching the satellites into their selected orbits, the ground segment cost, user equipment cost, satellite voice channel capacity, and other issues. Emphasis is on the cost and complexity of the individual satellites, specifically the transponder type and the impact of the transponder type on satellite and ground segment cost, satellite power and weight, and satellite voice channel capacity.
Implicit prosody mining based on the human eye image capture technology
NASA Astrophysics Data System (ADS)
Gao, Pei-pei; Liu, Feng
2013-08-01
Eye tracking has become one of the main methods for analyzing recognition issues in human-computer interaction, and capturing images of the human eye is its key problem. Building on this work, a new human-computer interaction method is introduced to enrich the forms of speech synthesis. We propose a method of Implicit Prosody mining based on human eye image capture technology: parameters are extracted from images of the reader's eyes during reading and used to control and drive prosody generation in speech synthesis, establishing a prosodic model with high simulation accuracy. The duration model is a key issue in prosody generation. For the duration model, this paper puts forward a new idea: obtaining the gaze duration of the eyes during reading from eye image capture, and synchronously controlling this duration and the pronunciation duration in speech synthesis. The movement of the eyes during reading is a comprehensive, multi-factor interactive process involving gaze, saccades and regressions. Therefore, how to extract the appropriate information from eye images needs to be considered, and the regularity of eye gaze needs to be obtained as a reference for modeling. Based on an analysis of three current eye movement control models and the characteristics of Implicit Prosody reading, the relative independence between the speech processing system for text and the eye movement control system is discussed. It was demonstrated that, for text of the same familiarity, gaze duration during reading and internal voice pronunciation duration are synchronous. An eye gaze duration model based on the prosodic structure levels of the Chinese language is presented as an alternative to previous machine learning and probability forecasting methods, in order to obtain readers' real internal reading rhythm and to synthesize voice with personalized rhythm. 
This research will enrich the forms of human-computer interaction and has practical significance and application prospects for assistive speech interaction for people with disabilities. Experiments show that Implicit Prosody mining based on human eye image capture technology gives the synthesized speech more flexible expression.
Cost-sensitive learning for emotion robust speaker recognition.
Li, Dongdong; Yang, Yingchun; Dai, Weihui
2014-01-01
In the field of information security, voice is one of the most important biometric modalities. In particular, with the development of voice communication over the Internet and telephone systems, huge voice data resources have become accessible. In speaker recognition, a voiceprint can serve as a unique password with which users prove their identity. However, speech with various emotions can cause an unacceptably high error rate and degrade the performance of a speaker recognition system. This paper addresses this problem by introducing a cost-sensitive learning technique that reweights the probability of affective test utterances at the pitch envelope level, which effectively enhances robustness in emotion-dependent speaker recognition. Based on that technique, a new recognition system architecture and its components are proposed. An experiment conducted on the Mandarin Affective Speech Corpus shows an improvement of 8% in identification rate over traditional speaker recognition.
Cost-Sensitive Learning for Emotion Robust Speaker Recognition
Li, Dongdong; Yang, Yingchun
2014-01-01
In the field of information security, voice is one of the most important biometric modalities. In particular, with the development of voice communication over the Internet and telephone systems, huge voice data resources have become accessible. In speaker recognition, a voiceprint can serve as a unique password with which users prove their identity. However, speech with various emotions can cause an unacceptably high error rate and degrade the performance of a speaker recognition system. This paper addresses this problem by introducing a cost-sensitive learning technique that reweights the probability of affective test utterances at the pitch envelope level, which effectively enhances robustness in emotion-dependent speaker recognition. Based on that technique, a new recognition system architecture and its components are proposed. An experiment conducted on the Mandarin Affective Speech Corpus shows an improvement of 8% in identification rate over traditional speaker recognition. PMID:24999492
Learning Media Application Based On Microcontroller Chip Technology In Early Age
NASA Astrophysics Data System (ADS)
Ika Hidayati, Permata
2018-04-01
In early childhood, the development of cognitive intelligence requires the right learning media to help it progress quickly. The purpose of this study was to design a learning medium in the form of a doll that can be used to introduce human anatomy during early childhood. This educational doll uses voice recognition technology from the EasyVR module to receive commands from the user and introduce the body parts on the doll, with an LED used as an indicator. In addition to introducing human anatomy, the doll lets a user give a voice command to play back sound previously stored in the sound recorder module. The results show that the educational doll can detect more than one voice and that spoken commands can be detected in random order. The doll can detect sound at a distance of up to 2.5 meters.
McMullan, E; Gupta, A; Collins, S C
2018-04-01
WHAT IS KNOWN ON THE SUBJECT?: Community mental health staff and their service users have reported mixed views on the importance of talking about the content of voices. Community staff have reported feeling that they do not have the skills to explore voice content and worry about making things worse. Voice hearers experiencing extreme distress due to the content of their voices can access support through acute inpatient mental health services. No previous studies have focused on the experiences of staff who nurse voice hearers at a time of acute distress. WHAT DOES THIS STUDY ADD TO EXISTING KNOWLEDGE?: MHNs and HSWs working with voice hearers in acute distress report feeling powerless and helpless, as they feel that they cannot lessen the distress experienced by the voice hearer. Despite these difficult feelings, staff report finding ways of coping, including using structured tools to help make sense of their service users' voice-hearing experiences and accessing reflective practice forums. WHAT ARE THE IMPLICATIONS FOR PRACTICE?: Due to the current context of increased acuity and limited resources in acute services, there may be a need to further protect time for staff to access reflective practice groups and supervision forums to help them manage the difficult feelings arising from their work with voice hearers. Introduction Mental health nursing (MHN) staff in acute settings work with voice hearers at times of crises when they experience high levels of distress. Previous research has focused on community mental health staff's experiences and their service users views on exploring the content of voices. No studies have explored this from an acute mental health service perspective. Aim This study therefore sought to explore the experiences of staff working with voice hearers in an acute mental health service. Method Due to the exploratory nature of the research, a qualitative design was chosen. 
Three MHNs and five healthcare support workers (HSWs) were interviewed. The data were analysed using Interpretative Phenomenological Analysis. Results A group analysis elicited three master themes: "It's quite scary really, not unlike a horror movie;" "I can only influence what's in front of me;" and "Just chipping away". Discussion Staff working with voice hearers in acute settings experience feelings of powerlessness and helplessness, as they feel unable to reduce the distress experienced by voice hearers in their care. Staff employ coping strategies to help manage these difficult feelings, including using structured tools in their work with voice hearers and attending reflective practice forums. Implications for Practice Acute mental health services may need to protect time for staff to access regular reflective practice and other supervision forums to help manage their feelings of powerlessness and helplessness arising from their work with voice hearers. © 2017 John Wiley & Sons Ltd.
Neural correlates of the encoding of multimodal contextual features
Gottlieb, Lauren J.; Wong, Jenny; de Chastelaine, Marianne; Rugg, Michael D.
2012-01-01
Functional magnetic resonance imaging (fMRI) was employed to identify neural regions engaged during the encoding of contextual features belonging to different modalities. Subjects studied objects that were presented to the left or right of fixation. Each object was paired with its name, spoken in either a male or a female voice. The test requirement was to discriminate studied from unstudied pictures and, for each picture judged old, to retrieve its study location and the gender of the voice that spoke its name. Study trials associated with accurate rather than inaccurate location memory demonstrated enhanced activity in the fusiform and parahippocampal cortex and the hippocampus and reduced activity (a negative subsequent memory effect) in the medial occipital cortex. Successful encoding of voice information was associated with enhanced study activity in the right middle superior temporal sulcus and activity reduction in the right superior frontal cortex. These findings support the proposal that encoding of a contextual feature is associated with enhanced activity in regions engaged during its online processing. In addition, they indicate that negative subsequent memory effects can also demonstrate feature-selectivity. Relative to other classes of study trials, trials for which both contextual features were later retrieved demonstrated enhanced activity in the lateral occipital complex and reduced activity in the temporo-parietal junction. These findings suggest that multifeatural encoding was facilitated when the study item was processed efficiently and study processing was not interrupted by redirection of attention toward extraneous events. PMID:23166292
A model for treating voice disorders in school-age children within a video gaming environment.
King, Suzanne N; Davis, Larry; Lehman, Jeffrey J; Ruddy, Bari Hoffman
2012-09-01
Clinicians use a variety of approaches to motivate children with hyperfunctional voice disorders to comply with voice therapy in a therapeutic session and improve the motivation of children to practice home-based exercises. Utilization of current entertainment technology in such approaches may improve participation and motivation in voice therapy. The purpose of this study is to test the feasibility of using an entertainment video game as a therapy device. Prospective cohort and case-control study. Three levels of game testing were conducted on an existing entertainment video game for use as a voice therapy protocol. The game was tested by two computer programmers and five normal participants. The third level of testing was a case study with a child diagnosed with a hyperfunctional voice disorder. Modifications to the game were made after each feasibility test. Errors with the video game performance were corrected, and a time stamp directory and game controller were added. Resonance voice exercises were modified to accommodate the gaming environment and unique competitive situation, including speech rate, acoustic parameters, game speed, and point allocations. The development of video games for voice therapeutic purposes attempts to replicate the high levels of engagement and motivation attained with entertainment video games, stimulating a more productive means of learning while doing. This case study found that a purely entertainment video game can be implemented as a voice therapeutic protocol based on information obtained from the case study. Copyright © 2012 The Voice Foundation. All rights reserved.
Electrolaryngographically derived voice source changes of child and adolescent singers.
Barlow, Christopher; Howard, David M
2005-01-01
Children are the most likely demographic group to undertake regular singing or singing training, but to date there has been little quantitative research into the voice production of children. The authors used closed quotient (CQ) measurements to analyse the singing voices of over 200 male and female, trained and untrained singers aged 8-18 years for differences in voice source according to sex, vocal training and age. Results indicated that the voice source production of subjects could be clearly divided into groups according to age, sex and the level of vocal training received. It was concluded that the process of training a young voice has a quantifiable effect upon the voice source. It was also concluded that sex differences result in significant differences in the voice source of child and adolescent singers.
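The closed quotient mentioned in the abstract above is, in general terms, the fraction of each vocal fold vibration cycle during which the folds are in contact. The following is only a minimal sketch of one common way to estimate it from a single cycle of an electrolaryngograph (Lx) signal, assuming that larger sample values mean more contact and that a fixed criterion level of 50% of the peak-to-peak amplitude marks the closed phase. The function name, the criterion value, and the sample data are hypothetical and are not taken from the study.

```python
def closed_quotient(lx_cycle, criterion=0.5):
    """Estimate CQ for one cycle of an Lx waveform.

    Assumption: higher sample values correspond to greater vocal fold
    contact, and samples above `criterion` of the peak-to-peak range
    are counted as the closed phase.
    """
    lowest, highest = min(lx_cycle), max(lx_cycle)
    if highest == lowest:
        raise ValueError("cycle has no amplitude variation")
    threshold = lowest + criterion * (highest - lowest)
    closed_samples = sum(1 for sample in lx_cycle if sample > threshold)
    return closed_samples / len(lx_cycle)


# Hypothetical, idealized cycle: 6 open-phase samples, 4 closed-phase samples.
example_cycle = [0.0] * 6 + [1.0] * 4
example_cq = closed_quotient(example_cycle)
print(f"CQ = {example_cq:.2f}")
```

In practice the result depends on how cycles are segmented and on the chosen criterion level, so values from this sketch would not be directly comparable with those reported in the study.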
Effects of Early Smoking Habits on Young Adult Female Voices in Greece.
Tafiadis, Dionysios; Toki, Eugenia I; Miller, Kevin J; Ziavra, Nausica
2017-11-01
Cigarette use is a preventable cause of mortality and diseases. The World Health Organization states that Europe and especially Greece has the highest occurrence of smoking among adults. The prevalence of smoking among women in Greece was estimated to be over 30% in 2012. Smoking is a risk factor for many diseases. Studies have demonstrated the association between smoking and laryngeal pathologies as well as changes in voice characteristics. The purpose of this study was to estimate the effect of early smoking habit on young adult female voices and if they perceive any vocal changes using two assessment methods. The Voice Handicap Index and the acoustic analyses of voice measurements were used, with both serving as mini-assessment protocols. Two hundred and ten young females (110 smokers and 100 nonsmokers) attending the Technological Educational Institute of Epirus in the School of Health and Welfare were included. Statistically significant increases for physical and total scores of the Voice Handicap Index were found in the smokers group (P < 0.05). Significant changes were observed for the acoustic parameters between smoker and nonsmoker groups. The results of this study indicated observable signs of change in the voice acoustic characteristics of young adults with early smoking habits. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Voice processing in monkey and human brains.
Scott, Sophie K
2008-09-01
Studies in humans have indicated that the anterior superior temporal sulcus has an important role in the processing of information about human voices, especially the identification of talkers from their voice. A new study using functional magnetic resonance imaging (fMRI) with macaques provides strong evidence that anterior auditory fields, part of the auditory 'what' pathway, preferentially respond to changes in the identity of conspecifics, rather than specific vocalizations from the same individual.
Perceived control and voice handicap in patients with voice disorders.
Frazier, Patricia; Merians, Addie; Misono, Stephanie
2017-11-01
The purpose of the study was to replicate and extend previous research on the relation between perceived present control and voice handicap and to further examine the psychometric properties of a present control scale adapted for patients with voice disorders (Misono, Meredith, Peterson, & Frazier, 2016). Sample 1 consisted of 1,129 patients recruited from a voice disorder clinic who completed measures of perceived present control, distress, and voice handicap in the clinic. Sample 2 consisted of 62 patients from the same clinic who completed measures of present control, distress, voice handicap, and general control beliefs online at baseline and measures of present control and voice handicap again 3 weeks later (n = 59). With regard to the psychometric properties of the voice-adapted present control scale, alpha coefficients were above .80 and the 3-week test-retest reliability coefficient was .69. There was mixed support for the hypothesized 1-factor structure of the scale. In Sample 1, present control was more strongly associated with lower voice handicap than was distress and accounted for significant variance in voice handicap controlling for distress. In Sample 2, present control at baseline predicted later voice handicap, controlling for general control beliefs and distress. Present control appears to be a promising target for adjunctive interventions for patients with voice disorders. An evidence-based online present control intervention (Hintz, Frazier, & Meredith, 2015) is being adapted for this patient population. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
ERIC Educational Resources Information Center
Kubiak, John
2017-01-01
There is a growing awareness of the value of using pupils' voices in educational research. At primary and second level, the principle of pupil voice has gained in profile over the last decade. However, in higher education, the use of voice in research collaborations remains under-theorised and under-utilised. This paper reports on an inclusive…
Cannito, Michael P; Chorna, Lesya B; Kahane, Joel C; Dworkin, James P
2014-05-01
This study evaluated the hypothesis that sentence production by speakers with adductor (AD) and abductor (AB) spasmodic dysphonia (SD) may be differentially influenced by consonant voicing and manner features, in comparison with healthy, matched, nondysphonic controls. This was a prospective, single-blind study, using a between-groups, repeated measures design for the independent variables of perceived voice quality and sentence duration. Sixteen subjects with ADSD and 10 subjects with ABSD, as well as 26 matched healthy controls, produced four short, simple sentences that were systematically loaded with voiced or voiceless consonants of either obstruent or continuant manner categories. Experienced voice clinicians, who were "blind" as to speakers' group affiliations, used visual analog scaling to judge the overall voice quality of each sentence. Acoustic sentence durations were also measured. Speakers with ABSD or ADSD demonstrated significantly poorer than normal voice quality on all sentences. Speakers with ABSD exhibited longer than normal duration for voiceless consonant sentences. Speakers with ADSD had poorer voice quality for voiced than for voiceless consonant sentences. Speakers with ABSD had longer durations for voiceless than for voiced consonant sentences. The two subtypes of SD exhibit differential performance on the basis of consonant voicing in short, simple sentences; however, each subgroup manifested voicing-related differences on a different variable (voice quality vs sentence duration). Findings suggest different underlying pathophysiological mechanisms for ABSD and ADSD. Findings also support inclusion of short, simple sentences containing voiced or voiceless consonants as part of the diagnostic protocol for SD, with measurement of sentence duration in addition to judgments of voice quality severity. Copyright © 2014 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Neural effects of environmental advertising: An fMRI analysis of voice age and temporal framing.
Casado-Aranda, Luis-Alberto; Martínez-Fiestas, Myriam; Sánchez-Fernández, Juan
2018-01-15
Ecological information offered to society through advertising enhances awareness of environmental issues, encourages development of sustainable attitudes and intentions, and can even alter behavior. This paper, by means of functional Magnetic Resonance Imaging (fMRI) and self-reports, explores the underlying mechanisms of processing ecological messages. The study specifically examines brain and behavioral responses to persuasive ecological messages that differ in temporal framing and in the age of the voice pronouncing them. The findings reveal that attitudes are more positive toward future-framed messages presented by young voices. The whole-brain analysis reveals that future-framed (FF) ecological messages trigger activation in brain areas related to imagery, prospective memories and episodic events, thus reflecting the involvement of past behaviors in future ecological actions. Past-framed (PF) messages, in turn, elicit brain activations within the episodic system. Young voices (YV), in addition to triggering stronger activation in areas involved with the processing of high-timbre, high-pitched and high-intensity voices, are perceived as more emotional and motivational than old voices (OV), as indicated by activations in the anterior cingulate cortex and amygdala. Messages expressed by older voices, in turn, exhibit stronger activation in areas formerly linked to low-pitched voices and voice gender perception. Interestingly, a link is identified between neural and self-report responses, indicating that certain brain activations in response to future-framed messages and young voices predicted more positive attitudes toward future-framed and young voice advertisements, respectively. The results of this study provide invaluable insight into the unconscious origin of attitudes toward environmental messages and indicate which voice and temporal frame of a message generate the greatest subconscious value. Copyright © 2017 Elsevier Ltd. All rights reserved.
Auditory and visual modulation of temporal lobe neurons in voice-sensitive and association cortices.
Perrodin, Catherine; Kayser, Christoph; Logothetis, Nikos K; Petkov, Christopher I
2014-02-12
Effective interactions between conspecific individuals can depend upon the receiver forming a coherent multisensory representation of communication signals, such as merging voice and face content. Neuroimaging studies have identified face- or voice-sensitive areas (Belin et al., 2000; Petkov et al., 2008; Tsao et al., 2008), some of which have been proposed as candidate regions for face and voice integration (von Kriegstein et al., 2005). However, it was unclear how multisensory influences occur at the neuronal level within voice- or face-sensitive regions, especially compared with classically defined multisensory regions in temporal association cortex (Stein and Stanford, 2008). Here, we characterize auditory (voice) and visual (face) influences on neuronal responses in a right-hemisphere voice-sensitive region in the anterior supratemporal plane (STP) of Rhesus macaques. These results were compared with those in the neighboring superior temporal sulcus (STS). Within the STP, our results show auditory sensitivity to several vocal features, which was not evident in STS units. We also newly identify a functionally distinct neuronal subpopulation in the STP that appears to carry the area's sensitivity to voice identity related features. Audiovisual interactions were prominent in both the STP and STS. However, visual influences modulated the responses of STS neurons with greater specificity and were more often associated with congruent voice-face stimulus pairings than STP neurons. Together, the results reveal the neuronal processes subserving voice-sensitive fMRI activity patterns in primates, generate hypotheses for testing in the visual modality, and clarify the position of voice-sensitive areas within the unisensory and multisensory processing hierarchies.
Auditory and Visual Modulation of Temporal Lobe Neurons in Voice-Sensitive and Association Cortices
Perrodin, Catherine; Kayser, Christoph; Logothetis, Nikos K.
2014-01-01
Effective interactions between conspecific individuals can depend upon the receiver forming a coherent multisensory representation of communication signals, such as merging voice and face content. Neuroimaging studies have identified face- or voice-sensitive areas (Belin et al., 2000; Petkov et al., 2008; Tsao et al., 2008), some of which have been proposed as candidate regions for face and voice integration (von Kriegstein et al., 2005). However, it was unclear how multisensory influences occur at the neuronal level within voice- or face-sensitive regions, especially compared with classically defined multisensory regions in temporal association cortex (Stein and Stanford, 2008). Here, we characterize auditory (voice) and visual (face) influences on neuronal responses in a right-hemisphere voice-sensitive region in the anterior supratemporal plane (STP) of Rhesus macaques. These results were compared with those in the neighboring superior temporal sulcus (STS). Within the STP, our results show auditory sensitivity to several vocal features, which was not evident in STS units. We also newly identify a functionally distinct neuronal subpopulation in the STP that appears to carry the area's sensitivity to voice identity related features. Audiovisual interactions were prominent in both the STP and STS. However, visual influences modulated the responses of STS neurons with greater specificity and were more often associated with congruent voice-face stimulus pairings than STP neurons. Together, the results reveal the neuronal processes subserving voice-sensitive fMRI activity patterns in primates, generate hypotheses for testing in the visual modality, and clarify the position of voice-sensitive areas within the unisensory and multisensory processing hierarchies. PMID:24523543
Communications dashboard (control rooms, take a cue from Facebook® !) Chapter 1
NASA Astrophysics Data System (ADS)
Scott, David W.
Papers published via IEEE and AIAA conferences have presented an overview of how social media could benefit NASA working environments in general [1] and proposed three specific social applications to benefit space flight control operations [2]. One of them, Communications Dashboard, would help a real time flight controller keep up with both the “big picture” and significant details of operations via a cohesive interface similar to those of social networking services (SNS). Instead of recreational social features, “CommDash” would support functions like console logging, categorized and threaded text chat streams with enhanced accountability and graphics display features, high-level status displays driven by telemetry or other events, and an on-screen hailing function for requesting voice or text stream conversation. Moving certain voice conversations to text streams would reduce confusion and stress in two ways. Within text conversations, there would be far less repetition of content since text conversations have visual persistence and are reviewable instantly, e.g., there's no need to brief new participants to a discussion - they just read what's already there. Remaining voice traffic would stand out more clearly, and quieter voice loops means fewer “say again” calls and less distraction from visual and mental tasks, thus less stress. (Most flight controllers monitor 4 or 5 voice loops at once.) Links could be created from console log entries to chat selections so that underlying details are readily available yet unobtrusive. This would reduce the confusion that rises from having multiple and sometimes divergent copies of the same information due to cut/copy and paste operations, attachments, and asynchronous editing. This concept could apply to a plethora of real time control environments and to other settings with lots of information juggling. 
This paper explores the dashboard concept in further detail and chronicles the first phase of a NASA IT Labs (Information Technology) project that could lead to a working system.
Communications Dashboard (Control Rooms Take a Cue from Facebook), Chapter 1
NASA Technical Reports Server (NTRS)
Scott, David W.
2013-01-01
Papers published via IEEE and AIAA conferences have presented an overview of how social media could benefit NASA working environments in general and proposed three specific social applications to benefit space flight control operations. One of them, Communications Dashboard, would help a real time flight controller keep up with both the "big picture" and significant details of operations via a cohesive interface similar to those of social networking services (SNS). Instead of recreational social features, "CommDash" would support functions like console logging, categorized and threaded text chat streams with enhanced accountability and graphics display features, high-level status displays driven by telemetry or other events, and an on-screen hailing function for requesting voice or text stream conversation. Moving certain voice conversations to text streams would reduce confusion and stress in two ways. Within text conversations, there would be far less repetition of content since text conversations have visual persistence and are reviewable instantly, e.g., there's no need to brief new participants to a discussion -- they just read what's already there. Remaining voice traffic would stand out more clearly, and quieter voice loops means fewer "say again" calls and less distraction from visual and mental tasks, thus less stress. (Most flight controllers monitor 4 or 5 voice loops at once.) Links could be created from console log entries to chat selections so that underlying details are readily available yet unobtrusive. This would reduce the confusion that rises from having multiple and sometimes divergent copies of the same information due to cut/copy and paste operations, attachments, and asynchronous editing. This concept could apply to a plethora of real time control environments and to other settings with lots of information juggling. 
This paper explores the dashboard concept in further detail and chronicles the first phase of a NASA IT Labs (Information Technology) project that could lead to a working system.
Testing and reference model analysis of FTTH system
NASA Astrophysics Data System (ADS)
Feng, Xiancheng; Cui, Wanlong; Chen, Ying
2009-08-01
With the rapid development of the Internet and broadband access networks, technologies such as xDSL, FTTx+LAN, and WLAN are finding more applications, and new network services keep emerging, especially online gaming, video conferencing, and video on demand. FTTH supports all present and future services with enormous bandwidth, including traditional telecommunication, data, and TV services as well as future digital TV and VOD. With its huge bandwidth, FTTH is regarded as the ultimate solution for broadband networks and the final goal in the development of optical access networks. Fiber to the Home (FTTH) will be the goal of telecommunications cable broadband access. In line with the development trend of telecommunication services, and in order to enhance the capacity of the integrated access network and achieve triple play (voice, data, image), the optical fiber of existing Fiber to the Curb (FTTC), Fiber to the Zone (FTTZ), and Fiber to the Building (FTTB) user optical cable networks can be extended to the end user as an FTTH system by using EPON technology. The article first introduces the basic components of an FTTH system and then explains the reference model and reference points for testing the FTTH system. Finally, using the test connection diagram, the testing process, and the expected results, it analyzes the testing of the FTTH system, including SNI interface testing, PON interface testing, Ethernet performance testing, UNI interface testing, Ethernet functional testing, PON functional testing, equipment functional testing, telephone functional testing, and operational support capability testing. ...
Technology for Work, Home, and Leisure. Tech Use Guide: Using Computer Technology.
ERIC Educational Resources Information Center
Williams, John M.
This guide provides a brief introduction to several types of technological devices useful to individuals with disabilities and illustrates how some individuals are applying technology in the workplace and at home. Devices described include communication aids, low-vision products, voice-activated systems, environmental controls, and aids for…
VOT and the perception of voicing
NASA Astrophysics Data System (ADS)
Remez, Robert E.
2004-05-01
In explaining the ability to distinguish phonemes, linguists have described the dimension of voicing. Acoustic analyses have identified many correlates of the voicing contrast in initial, medial, and final consonants within syllables, and these in turn have motivated studies of the perceptual resolution of voicing. The framing conceptualization articulated by Lisker and Abramson 40 years ago in physiological, phonetic, and perceptual studies has been widely influential, and research on voicing now adopts their perspective without reservation. Their original survey included languages with two voicing categories (Dutch, Puerto Rican Spanish, Hungarian, Tamil, Cantonese, English), three voicing categories (Eastern Armenian, Thai, Korean), and four voicing categories (Hindi, Marathi). Perceptual studies inspired by this work have also ranged widely, including tests with different languages and with listeners of several species. The profound value of the analyses of Lisker and Abramson is evident in the empirical traction provided by the concept of VOT in research on every important perceptual question about speech and language in our era. Some of these classic perceptual investigations will be reviewed. [Research supported by NIH (DC00308).]
Voice input/output capabilities at Perception Technology Corporation
NASA Technical Reports Server (NTRS)
Ferber, Leon A.
1977-01-01
Condensed resumes of key company personnel at the Perception Technology Corporation are presented. The staff possesses expertise in recognition, speech synthesis, speaker authentication, and language identification. Hardware and software engineers' capabilities are included.
2016-07-25
CAPE-2: Cajun Advanced Picosatellite Experiment – ELaNa IV. CAPE-2 was developed by students from the University of Louisiana at Lafayette to engage, inspire and educate K-12 students and to encourage them to pursue STEM careers. The secondary focus is the technology demonstration of deployed solar panels to support the following payloads: text to speech, voice repeater, tweeting, email, file transfer and data collection from buoys. It was launched by NASA’s CubeSat Launch Initiative on the ELaNa IV mission as an auxiliary payload aboard the U.S. Air Force-led Operationally Responsive Space (ORS-3) Mission on November 19, 2013.
Bauer, S M; Lane, J P; Stone, V I; Unnikrishnan, N
1998-01-01
The Rehabilitation Engineering Research Center on Technology Evaluation and Transfer is exploring how the end users of assistive technology devices define the ideal device. This work is called the Consumer Ideal Product program. In this work, end users identify and establish the importance of a broad range of product design features, along with the related product support and service provided by manufacturers and vendors. This paper describes a method for systematically transforming end-user defined requirements into a form that is useful and accessible to product designers, manufacturers, and vendors. In particular, product requirements, importance weightings, and metrics are developed from the Consumer Ideal Product battery charger outcomes. Six battery chargers are benchmarked against these product requirements using the metrics developed. The results suggest improvements for each product's design, service, and support. Overall, the six chargers meet roughly 45-75% of the ideal product's requirements. Many of the suggested improvements are low-cost changes that, if adopted, could provide companies a competitive advantage in the marketplace.
The stability of locus equation slopes across stop consonant voicing/aspiration
NASA Astrophysics Data System (ADS)
Sussman, Harvey M.; Modarresi, Golnaz
2004-05-01
The consistency of locus equation slopes as phonetic descriptors of stop place in CV sequences across voiced and voiceless aspirated stops was explored in the speech of five male speakers of American English and two male speakers of Persian. Using traditional locus equation measurement sites for F2 onsets, voiceless labial and coronal stops had significantly lower locus equation slopes relative to their voiced counterparts, whereas velars failed to show voicing differences. When locus equations were derived using F2 onsets for voiced stops that were measured closer to the stop release burst, comparable to the protocol for measuring voiceless aspirated stops, no significant effects of voicing/aspiration on locus equation slopes were observed. This methodological factor, rather than an underlying phonetic-based explanation, provides a reasonable account for the observed flatter locus equation slopes of voiceless labial and coronal stops relative to voiced cognates reported in previous studies [Molis et al., J. Acoust. Soc. Am. 95, 2925 (1994); O. Engstrand and B. Lindblom, PHONUM 4, 101-104]. [Work supported by NIH.]
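A locus equation is a straight-line regression of the F2 frequency at vowel onset on the F2 frequency at the vowel midpoint, fitted across CV tokens that share a stop consonant; its slope is the quantity compared in the abstract above. The following is a minimal sketch of that fit using ordinary least squares. The function name and the formant values are hypothetical illustrations, not measurements from the study.

```python
def fit_locus_equation(f2_vowel_hz, f2_onset_hz):
    """Least-squares fit of F2_onset = slope * F2_vowel + intercept."""
    n = len(f2_vowel_hz)
    if n < 2 or n != len(f2_onset_hz):
        raise ValueError("need two or more paired measurements")
    mean_vowel = sum(f2_vowel_hz) / n
    mean_onset = sum(f2_onset_hz) / n
    # Sum of squared deviations of the predictor (vowel-midpoint F2).
    sxx = sum((v - mean_vowel) ** 2 for v in f2_vowel_hz)
    if sxx == 0:
        raise ValueError("vowel F2 values must not all be equal")
    # Sum of cross-products between predictor and response.
    sxy = sum((v - mean_vowel) * (o - mean_onset)
              for v, o in zip(f2_vowel_hz, f2_onset_hz))
    slope = sxy / sxx
    intercept = mean_onset - slope * mean_vowel
    return slope, intercept


# Hypothetical tokens constructed to lie exactly on a line (slope 0.5, intercept 900 Hz).
vowel_f2 = [800.0, 1200.0, 1600.0, 2000.0, 2400.0]
onset_f2 = [0.5 * v + 900.0 for v in vowel_f2]
slope, intercept = fit_locus_equation(vowel_f2, onset_f2)
print(f"slope = {slope:.3f}, intercept = {intercept:.1f} Hz")
```

Because the slope depends on where the F2 onset is measured, moving the measurement point closer to the release burst, as the abstract describes for voiced stops, changes the response values fed to this fit and therefore the resulting slope.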
Thomas, Neil; Hayward, Mark; Peters, Emmanuelle; van der Gaag, Mark; Bentall, Richard P; Jenner, Jack; Strauss, Clara; Sommer, Iris E; Johns, Louise C; Varese, Filippo; García-Montes, José Manuel; Waters, Flavie; Dodgson, Guy; McCarthy-Jones, Simon
2014-07-01
This report from the International Consortium on Hallucinations Research considers the current status and future directions in research on psychological therapies targeting auditory hallucinations (hearing voices). Therapy approaches have evolved from behavioral and coping-focused interventions, through formulation-driven interventions using methods from cognitive therapy, to a number of contemporary developments. Recent developments include the application of acceptance- and mindfulness-based approaches, and consolidation of methods for working with connections between voices and views of self, others, relationships and personal history. In this article, we discuss the development of therapies for voices and review the empirical findings. This review shows that psychological therapies are broadly effective for people with positive symptoms, but that more research is required to understand the specific application of therapies to voices. Six key research directions are identified: (1) moving beyond the focus on overall efficacy to understand specific therapeutic processes targeting voices, (2) better targeting psychological processes associated with voices such as trauma, cognitive mechanisms, and personal recovery, (3) more focused measurement of the intended outcomes of therapy, (4) understanding individual differences among voice hearers, (5) extending beyond a focus on voices and schizophrenia into other populations and sensory modalities, and (6) shaping interventions for service implementation. © The Author 2014. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center.
DOE Office of Scientific and Technical Information (OSTI.GOV)
None
2011-02-25
There are many voices calling for a future of abundant clean energy. The choices are difficult and the challenges daunting. How will we get there? The National Renewable Energy Laboratory integrates the entire spectrum of innovation including fundamental science, market relevant research, systems integration, testing and validation, commercialization and deployment. The innovation process at NREL is interdependent and iterative. Many scientific breakthroughs begin in our own laboratories, but new ideas and technologies come to NREL at any point along the innovation spectrum to be validated and refined for commercial use.
None
2018-05-11
There are many voices calling for a future of abundant clean energy. The choices are difficult and the challenges daunting. How will we get there? The National Renewable Energy Laboratory integrates the entire spectrum of innovation including fundamental science, market relevant research, systems integration, testing and validation, commercialization and deployment. The innovation process at NREL is interdependent and iterative. Many scientific breakthroughs begin in our own laboratories, but new ideas and technologies come to NREL at any point along the innovation spectrum to be validated and refined for commercial use.
Fu, Qian-Jie; Chinchilla, Sherol; Galvin, John J
2004-09-01
The present study investigated the relative importance of temporal and spectral cues in voice gender discrimination and vowel recognition by normal-hearing subjects listening to an acoustic simulation of cochlear implant speech processing and by cochlear implant users. In the simulation, the number of speech processing channels ranged from 4 to 32, thereby varying the spectral resolution; the cutoff frequencies of the channels' envelope filters ranged from 20 to 320 Hz, thereby manipulating the available temporal cues. For normal-hearing subjects, results showed that both voice gender discrimination and vowel recognition scores improved as the number of spectral channels was increased. When only 4 spectral channels were available, voice gender discrimination significantly improved as the envelope filter cutoff frequency was increased from 20 to 320 Hz. For all spectral conditions, increasing the amount of temporal information had no significant effect on vowel recognition. Both voice gender discrimination and vowel recognition scores were highly variable among implant users. The performance of cochlear implant listeners was similar to that of normal-hearing subjects listening to comparable speech processing (4-8 spectral channels). The results suggest that both spectral and temporal cues contribute to voice gender discrimination and that temporal cues are especially important for cochlear implant users to identify the voice gender when there is reduced spectral resolution.
Developmental trends in the interaction between auditory and linguistic processing.
Jerger, S; Pirozzolo, F; Jerger, J; Elizondo, R; Desai, S; Wright, E; Reynosa, R
1993-09-01
The developmental course of multidimensional speech processing was examined in 80 children between 3 and 6 years of age and in 60 adults between 20 and 86 years of age. Processing interactions were assessed with a speeded classification task (Garner, 1974a), which required the subjects to attend selectively to the voice dimension while ignoring the linguistic dimension, and vice versa. The children and adults exhibited both similarities and differences in the patterns of processing dependencies. For all ages, performance for each dimension was slower in the presence of variation in the irrelevant dimension; irrelevant variation in the voice dimension disrupted performance more than irrelevant variation in the linguistic dimension. Trends in the degree of interference, on the other hand, showed significant differences between dimensions as a function of age. Whereas the degree of interference for the voice-dimension-relevant did not show significant age-related change, the degree of interference for the word-dimension-relevant declined significantly with age in a linear as well as a quadratic manner. A major age-related change in the relation between dimensions was that word processing, relative to voice-gender processing, required significantly more time in the children than in the adults. Overall, the developmental course characterizing multidimensional speech processing evidenced more pronounced change when the linguistic dimension, rather than the voice dimension, was relevant.
ERIC Educational Resources Information Center
Chen, Charlie C.; Vannoy, Sandra
2013-01-01
Voice over Internet Protocol- (VoIP) enabled online learning service providers are struggling with high attrition rates and low customer loyalty despite VoIP's high degree of system fit for online global learning applications. Effective solutions to this prevalent problem rely on the understanding of system quality, information quality, and…
A Discussion Guide for UnCommon Knowledge: The "Voices of Girls" Documentary. [Videotape].
ERIC Educational Resources Information Center
AEL, Inc., Charleston, WV.
Rural and Urban Images: Voices of Girls in Science, Mathematics, and Technology was a 3-year project that began in fall 1995 with a group of sixth-grade girls and followed the same girls through eighth grade. The project took place in two West Virginia counties, but this videotaped documentary features its implementation in rural McDowell County…
Learners' Perceptions of the Benefits of Voice Tool-Based Tasks on Their Spoken Performance
ERIC Educational Resources Information Center
Wilches, Astrid
2014-01-01
The purpose of this study is to investigate learners' perceptions of the benefits of tasks using voice tools to reinforce their oral skills. Additionally, this study seeks to determine what aspects of task design affected the students' perceptions. Beginner learners aged 18 to 36 with little or no experience in the use of technological tools for…
ERIC Educational Resources Information Center
Janky, James M.; And Others
The diligent use of two-way voice links via satellites substantially improves the quality and the availability of health care and educational services in remote areas. This improvement was demonstrated in several experiments that were sponsored by the Department of Health, Education, and Welfare and the National Aeronautics and Space…
Conceptual Sound System Design for Clifford Odets' "GOLDEN BOY"
NASA Astrophysics Data System (ADS)
Yang, Yen Chun
There are two different aspects in the process of sound design, "Arts" and "Science". In my opinion, the sound design should engage both aspects strongly and in interaction with each other. I started the process of designing the sound for GOLDEN BOY by building the city soundscape of New York City in 1937. The scenic design for this piece is in the round, putting the audience all around the stage; this gave me a great opportunity to use surround and spatialization techniques to transform the space into a different sonic world. My spatialization design is composed of two subsystems: one is a center cluster of four (4) speakers diffusing towards the four (4) sections of audience, and the other is four (4) speakers in the four (4) corners of the theatre. The outside ring provides rich sound source localization and the inside ring provides more support for control of the spatialization details. In my design four (4) lavalier microphones are hung under the center iron cage from the four (4) corners of the stage. Each microphone is ten (10) feet above the stage. The signal for each microphone is sent to the two (2) center speakers in the cluster diagonally opposite the microphone. With the appropriate level adjustment of the microphones, the audience will not notice the amplification of the voices; however, through my spatialization system, the presence and location of the voices of all actors are clearly preserved for the whole audience. With such vocal reinforcement provided by the microphones, I no longer need to worry about the underscoring overwhelming the dialogue on stage. A successful sound system design should not only provide a functional system, but also take the responsibility of bringing actors' voices to the audience and engaging the audience with the world that we create on stage.
By designing a system which reinforces the actors' voices while at the same time providing control over localization of movement of sound effects, I was able not only to make the text present and clear for the audiences, but also to support the storyline strongly through my composed music, environmental soundscapes, and underscoring.
The singer's voice range profile: female professional opera soloists.
Lamarche, Anick; Ternström, Sten; Pabon, Peter
2010-07-01
This work concerns the collection of 30 voice range profiles (VRPs) of female operatic voice. We address the questions: Is there a need for a singer's protocol in VRP acquisition? Are physiological measurements sufficient or should the measurement of performance capabilities also be included? Can we address the female singing voice in general or is there a case for categorizing voices when studying phonetographic data? Subjects performed a series of structured tasks involving both standard speech voice protocols and additional singing tasks. Singers also completed an extensive questionnaire. Physiological VRPs differ from performance VRPs. Two new VRP metrics, the voice area above a defined level threshold and the dynamic range independent from the fundamental frequency (F(0)), were found to be useful in the analysis of singer VRPs. Task design had no effect on performance VRP outcomes. Voice category differences were mainly attributable to phonation frequency-based information. Results support the clinical importance of addressing the vocal instrument as it is used in performance. Equally important is the elaboration of a protocol suitable for the singing voice. The given context and instructions can be more important than task design for performance VRPs. Yet, for physiological VRP recordings, task design remains critical. Both types of VRPs are suggested for a singer's voice evaluation. Copyright (c) 2010 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
Self, Voices and Embodiment: A Phenomenological Analysis
Rosen, C; Jones, N; Chase, KA; Grossman, LS; Gin, H; Sharma, RP
2016-01-01
Objective: The primary aim of this study was to examine first-person phenomenological descriptions of the relationship between the self and Auditory Verbal Hallucinations (AVHs). Complex AVHs are frequently described as entities with clear interpersonal characteristics. Strikingly, investigations of first-person (subjective) descriptions of the phenomenology of the relationship are virtually absent from the literature. Method: Twenty participants with psychosis and actively experiencing AVHs were recruited from the University of Illinois at Chicago. A mixed-methods design involving qualitative and quantitative components was utilized. Following a priority-sequence model of complementarity, quantitative analyses were used to test elements of emergent qualitative themes. Results: The qualitative analysis identified three foundational constructs in the relationship between self and voices: ‘understanding of origin,’ ‘distinct interpersonal identities,’ and ‘locus of control.’ Quantitative analyses further supported identified links of these constructs. Subjects who experienced their AVHs as having identities distinct from self and who actively engaged with their AVHs experienced a greater sense of autonomy and control over AVHs. Discussion: Given the clinical importance of AVHs and emerging strategies targeting the relationship between the hearer and voices, our findings highlight the importance of these relational constructs in improvement and innovation of clinical interventions. Our analyses also underscore the value of detailed voice assessments, such as those provided by the Maastricht Interview, in the evaluation process. Subjects' narratives show that the relational phenomena between hearer and AVH(s) are dynamic, and can be influenced and changed through the hearers’ engagement, conversation, and negotiation with their voices. PMID:27099869
Tomicic, Alemka; Martínez, Claudio; Pérez, J. Carola; Hollenstein, Tom; Angulo, Salvador; Gerstmann, Adam; Barroux, Isabelle; Krause, Mariane
2015-01-01
This study seeks to provide evidence of the dynamics associated with the configurations of discourse-voice regulatory strategies in patient–therapist interactions in relevant episodes within psychotherapeutic sessions. Its central assumption is that discourses manifest themselves differently in terms of their prosodic characteristics according to their regulatory functions in a system of interactions. The association between discourse and vocal quality in patients and therapists was analyzed in a sample of 153 relevant episodes taken from 164 sessions of five psychotherapies using the state space grid (SSG) method, a graphical tool based on the dynamic systems theory (DST). The results showed eight recurrent and stable discourse-voice regulatory strategies of the patients and three of the therapists. Also, four specific groups of these discourse-voice strategies were identified. The latter were interpreted as regulatory configurations, that is to say, as emergent self-organized groups of discourse-voice regulatory strategies constituting specific interactional systems. Both regulatory strategies and their configurations differed between two types of relevant episodes: Change Episodes and Rupture Episodes. As a whole, these results support the assumption that speaking and listening, as dimensions of the interaction that takes place during therapeutic conversation, occur at different levels. The study not only shows that these dimensions are dependent on each other, but also that they function as a complex and dynamic whole in therapeutic dialog, generating relational offers which allow the patient and the therapist to regulate each other and shape the psychotherapeutic process that characterizes each type of relevant episode. PMID:25932014
Satellite switched FDMA advanced communication technology satellite program
NASA Technical Reports Server (NTRS)
Atwood, S.; Higton, G. H.; Wood, K.; Kline, A.; Furiga, A.; Rausch, M.; Jan, Y.
1982-01-01
The satellite switched frequency division multiple access system provided a detailed system architecture that supports a point-to-point communication system for long haul voice, video and data traffic between small Earth terminals at Ka band frequencies at 30/20 GHz. A detailed system design is presented for the space segment, small terminal/trunking segment and network control segment for domestic traffic model A or B, each totaling 3.8 Gb/s of small terminal traffic and 6.2 Gb/s of trunk traffic. The small terminal traffic (3.8 Gb/s) is emphasized for the satellite router portion of the system design, which is a composite of thousands of Earth stations with digital traffic ranging from a single 32 kb/s CVSD voice channel to thousands of channels containing voice, video and data with a data rate as high as 33 Mb/s. The system design concept presented effectively optimizes a unique frequency and channelization plan for both traffic models A and B with minimum reorganization of the satellite payload transponder subsystem hardware design. The unique zoning concept allows multiple beam antennas while maximizing multiple carrier frequency reuse. Detailed hardware design estimates for an FDMA router (part of the satellite transponder subsystem) indicate a weight and dc power budget of 353 lbs, 195 watts for traffic model A and 498 lbs, 244 watts for traffic model B.
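As a rough worked check of the traffic figures quoted in this abstract, the short sketch below totals the two traffic classes and expresses the 3.8 Gb/s small terminal load in units of the smallest and largest carriers mentioned. It assumes decimal prefixes (1 Gb/s = 10^9 bit/s) and ignores framing, guard bands and overhead, none of which are stated in the abstract; the constant and function names are illustrative only.

```python
# Illustrative arithmetic only. Decimal prefixes are an assumption, and
# protocol overhead and guard bands are ignored.
SMALL_TERMINAL_BPS = 3_800_000_000  # 3.8 Gb/s small terminal traffic
TRUNK_BPS = 6_200_000_000           # 6.2 Gb/s trunk traffic
CVSD_VOICE_BPS = 32_000             # one 32 kb/s CVSD voice channel
MAX_CARRIER_BPS = 33_000_000        # highest quoted data rate, 33 Mb/s


def traffic_summary():
    """Return total traffic and equivalent carrier counts for the small terminal load."""
    return {
        "total_bps": SMALL_TERMINAL_BPS + TRUNK_BPS,
        # Number of single voice channels that would fill the small terminal load.
        "voice_channel_equivalents": SMALL_TERMINAL_BPS // CVSD_VOICE_BPS,
        # Number of whole maximum-rate carriers that fit in the same load.
        "max_rate_carriers": SMALL_TERMINAL_BPS // MAX_CARRIER_BPS,
    }


if __name__ == "__main__":
    for name, value in traffic_summary().items():
        print(f"{name}: {value}")
```

Under these assumptions each traffic model carries 10 Gb/s in total, which gives a sense of scale for the router weight and power budgets quoted above.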
Evitts, Paul M; Starmer, Heather; Teets, Kristine; Montgomery, Christen; Calhoun, Lauren; Schulze, Allison; MacKenzie, Jenna; Adams, Lauren
2016-11-01
There is currently minimal information on the impact of dysphonia secondary to phonotrauma on listeners. Considering the high incidence of voice disorders among professional voice users, it is important to understand the impact of a dysphonic voice on their audiences. Ninety-one healthy listeners (39 men, 52 women; mean age = 23.62 years) were presented with speech stimuli from 5 healthy speakers and 5 speakers diagnosed with dysphonia secondary to phonotrauma. Dependent variables included processing speed (reaction time [RT] ratio), speech intelligibility, and listener comprehension. Voice quality ratings were also obtained for all speakers by 3 expert listeners. Statistical results showed significant differences in RT ratio and number of speech intelligibility errors between healthy and dysphonic voices. There was not a significant difference in listener comprehension errors. Multiple regression analyses showed that voice quality ratings from the Consensus Auditory-Perceptual Evaluation of Voice (Kempster, Gerratt, Verdolini Abbott, Barkmeier-Kraemer, & Hillman, 2009) were able to predict RT ratio and speech intelligibility but not listener comprehension. Results of the study suggest that although listeners require more time to process and have more intelligibility errors when presented with speech stimuli from speakers with dysphonia secondary to phonotrauma, listener comprehension may not be affected.
Community perceptions of carbon sequestration: insights from California
NASA Astrophysics Data System (ADS)
Wong-Parodi, Gabrielle; Ray, Isha
2009-07-01
Over the last decade, many energy experts have supported carbon sequestration as a viable technological response to climate change. Given the potential importance of sequestration in US energy policy, what might explain the views of communities that may be directly impacted by the siting of this technology? To answer this question, we conducted focus groups in two communities who were potentially pilot project sites for California's DOE-funded West Coast Regional Partnership (WESTCARB). We find that communities want a voice in defining the risks to be mitigated as well as the justice of the procedures by which the technology is implemented. We argue that a community's sense of empowerment is key to understanding its range of carbon sequestration opinions, where 'empowerment' includes the ability to mitigate community-defined risks of the technology. This sense of empowerment protects the community against the downside risk of government or corporate neglect, a risk that is rarely identified in risk assessments but that should be factored into assessment and communication strategies.
Identification and human condition analysis based on the human voice analysis
NASA Astrophysics Data System (ADS)
Mieshkov, Oleksandr Yu.; Novikov, Oleksandr O.; Novikov, Vsevolod O.; Fainzilberg, Leonid S.; Kotyra, Andrzej; Smailova, Saule; Kozbekova, Ainur; Imanbek, Baglan
2017-08-01
The paper presents a two-stage biotechnical system for human condition analysis that is based on analysis of human voice signal. At the initial stage, the voice signal is pre-processed and its characteristics in time domain are determined. At the first stage, the developed system is capable of identifying the person in the database on the basis of the extracted characteristics. At the second stage, the model of a human voice is built on the basis of the real voice signals after clustering the whole database.
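The abstract does not say which time-domain characteristics are extracted or how the database lookup works, so the following is only a minimal sketch of the general idea under stated assumptions: two common time-domain features (mean energy and zero-crossing rate) are computed for a frame of samples, and a speaker is identified by nearest Euclidean distance to stored feature vectors. The function names, the feature choice and the matching rule are all hypothetical, not the authors' method.

```python
import math


def frame_features(frame):
    """Return (mean energy, zero-crossing rate) for one frame of samples.

    These two features are an assumed stand-in for the unspecified
    time-domain characteristics mentioned in the abstract.
    """
    n = len(frame)
    energy = sum(x * x for x in frame) / n
    # Count sign changes between consecutive samples.
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return energy, crossings / (n - 1)


def identify(features, database):
    """Return the database key whose stored feature vector is nearest.

    `database` maps a hypothetical speaker label to a feature tuple.
    """
    return min(database, key=lambda name: math.dist(features, database[name]))


if __name__ == "__main__":
    db = {"speaker_a": (1.0, 1.0), "speaker_b": (0.25, 0.0)}
    probe = frame_features([0.9, -1.0, 1.1, -1.0])
    print(identify(probe, db))
```

A real system would average features over many frames and normalize them before comparison; the sketch keeps a single frame to stay short.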
ERIC Educational Resources Information Center
Nellen, Ted; Hogue, Dawn; Patterson, Nancy G.; Schulze, Patricia
2004-01-01
Four experienced teachers replaced traditional classroom dynamics and an environment of micromanaged routines with a CyberEnglish classroom structure that supports scholars' choices and voices. The lack of voice in students' writing was one of the reasons for the changes in the classrooms, and exciting results were achieved with different…
Wang, Rosalie H; Korotchenko, Alexandra; Clarke, Laura Hurd; Ben Mortenson, W; Mihailidis, Alex
2017-01-01
Collision avoidance technology has the capacity to facilitate safer mobility among older power mobility users with physical, sensory and cognitive impairments, thus enabling independence for more potential users. However, little is known about consumers’ perceptions of collision avoidance. This article draws on interviews with 29 users, five caregivers, and 10 prescribers to examine views on the design and utilization of this technology. Data analysis identified three themes: “useful situations or contexts”, “technology design issues and real life application”, and “appropriateness of collision avoidance technology for a variety of users”. Findings support the ongoing development of collision avoidance for older adult users. The majority of participants were supportive of the technology, and felt that it might benefit current power mobility users and users with visual impairments, but might be unsuitable for people with significant cognitive impairments. Some participants voiced concerns regarding the risk for injury with power mobility use and some identified situations where collision avoidance might be beneficial (driving backwards, avoiding dynamic obstacles, negotiating outdoor barriers, and learning power mobility use). Design issues include the need for context awareness, reliability, and user interface specifications. Furthermore, user desire to maintain driving autonomy indicates the need to develop collaboratively-controlled systems. This research lays the groundwork for future development by identifying and illustrating consumer needs for this technology. PMID:24458968
ERIC Educational Resources Information Center
Carey, Roderick L.
2016-01-01
This article highlights the voices of two 11th grade boys of color (one Black and one Latino), attending one U.S. urban charter school, as they describe the various ways their families supported their college going efforts. This study finds that these participants were equipped with what the author conceptualizes as "college going familial…
Mobilizing Science, Evidence and Technology for the Sendai Framework
NASA Astrophysics Data System (ADS)
Calkins, J. A.
2015-12-01
In March 2015, UN member states adopted the Sendai Framework for Disaster Risk Reduction: 2015-2030. The Sendai Framework recognises the cross-cutting nature of DRR policy and calls on a range of stakeholders to help governments. The Sendai Framework sets the aim of achieving "the substantial reduction of disaster risk and losses in lives, livelihoods and health and in the economic, physical, social, cultural and environmental assets of persons, businesses, communities and countries" (para 16). The international science community is acknowledged as a key stakeholder in supporting countries to implement the Sendai Framework. With this call to action and the gravity of disaster risk escalating around the globe, it is now vital that scientific knowledge and research resources are shared and become accessible in a form that can directly support coordinated application. Recent work is presented on the DRR gaps voiced by governments and scientists across a range of science and technology related needs, including through the drafting process for the Sendai Framework. Across regions and development levels, countries are seeking to address specific gaps they face in scientific capacities and information. Considering the many existing programmes, research initiatives and resources already seeking to generate evidence on DRR at all scales, how can science and technology improve delivery? Models and case studies prompt a useful discussion on what does and does not work. We provide an example of recent work in the UK disasters research community to assess scientific and technical capacity and collaborative effort to fulfil the commitment of the Sendai Framework. While there is no one-size-fits-all, any implementation approach needs to take into account the extraordinary, dynamic and localised nature of disasters and needs to be able to deliver relevant information to decision-makers at national and local levels, in a timely manner.
NASA Astrophysics Data System (ADS)
Meiyanti, R.; Subandi, A.; Fuqara, N.; Budiman, M. A.; Siahaan, A. P. U.
2018-03-01
A singer does not just recite the lyrics of a song but also uses particular vocal techniques to make it more beautiful. In singing technique, women have more diverse voice registers than men. The human voice has many registers, but those used while singing include Chest Voice, Head Voice, Falsetto, and Vocal Fry. This research on speech recognition based on female voice registers in singing technique was built using Borland Delphi 7.0. Speech recognition is performed on recorded voice samples and also in real time. The voice input yields weighted energy values calculated using the Hankel transformation method and Macdonald functions. The results showed that the accuracy of the system depends on the accuracy of the vocal technique that is trained and tested; the average rate of successful recognition of voice registers reached 48.75 percent for recorded samples and 57 percent in real time.
Boll, Daniel T; Rubin, Geoffrey D; Heye, Tobias; Pierce, Laura J
2017-04-01
The objective of this study is to analyze implementation of the voice-of-the-customer method to assess the current state of image postprocessing and reporting delivered by a radiology department and to plan improvements on the basis of referring physicians' preferences. The voice-of-the-customer method consisted of discovery, analysis, and optimization phases. Fifty referring physicians were invited to be interviewed. Interviews addressed the topics of structure, process, outcome, and support. Interviews were dissected into individual statements categorized as fact or feeling. Statements were grouped to find collective voices. Improvements were compiled from affinity charts and were processed by identifying insights. Ninety-four percent (47/50) of physicians participated, generating 352 statements (81 facts and 271 feelings) that subsequently underwent affinity chart clustering. The resultant affinity charts covered distinct themes: "we need you to know us better," "we need you to consider our workflow," "we need more from your services," "we want to review your data in certain ways," and "we want to do more with you." As a result of the insights gained, the following optimizations were implemented: a software application that improves study requesting, performance tracking, study prioritization, and longitudinal data archiving; six prototype reports containing tabulated data and annotated images; two prototype longitudinal reporting templates assessing aneurysm evolution and treatment-induced changes in organ size over time; and a teaching curriculum for trainees. This study has shown the clinical feasibility to assess the current state of image postprocessing and reporting and to implement improvements of and investments in image postprocessing and reporting infrastructure on the basis of referring physicians' preferences using the voice-of-the-customer method.
Voice: challenging the stigma of addiction; a nursing perspective.
Paivinen, Helena; Bade, Sherrie
2008-06-01
Voice is a collection of art, poetry and narratives created by women living with a history of substance use and addiction. The intent of this collection is to explore women's understanding of harm reduction, to challenge the effects of stigmatization and to explore the experiences of those who have historically been silenced or devalued. Voice was conceived by a group of Kamloops nurses who came together and used their knowledge of mainstream systems, aesthetic knowing, feminism and substance use to guide the development and implementation of this project. During weekly gatherings, women with histories of substance use and addiction worked alongside a nurse in the co-creation of artistic expressions. Gender sensitivity, trust, equality and respect were vital to the success of this process. A selection of the women's art was presented at several venues, including an International Conference on Drug Related Harm, a Nursing Conference and a local art gallery. The positive community response to the women's work contributed to feelings of great pride and enhanced the women's confidence in their ability to express themselves. Throughout this process, women had the opportunity to develop social networks and to become aware of the value that their creative knowledge has to the community in which they live. Gender sensitive programming that is inclusive, participative and promotes women's health is required to fully understand women's experience of substance use and addiction in relation to harm reduction. Participation in projects such as Voice supports and encourages women to make sense of the world they live in and encourages health-promoting activities. The promising outcomes of this project might well be developed by nurses in other settings to further promote the health of women who have traditionally been stigmatized.
Singh, Gaurav; Manjunatha, Narayana; Rao, Sabina; Shashidhara, H N; Moirangthem, Sydney; Madegowda, Rajendra K; Binukumar, B; Varghese, Mathew
2017-01-01
Mobile phone technology is being used worldwide to improve follow-ups in health care. The aim of this study is to evaluate whether the use of mobile technology improves the follow-up of Indian patients from a community mental health center. Patients or caregivers who had mobile phones and consented to the study were enrolled, and sociodemographic and clinical details of patients were recorded. Participants were randomized into two groups (short message service [SMS] vs. non-SMS group). At the first intervention level, an SMS was sent to the SMS group (but not to the non-SMS group) 1 day before their appointment. At the second intervention level (voice call level), patients from both groups who missed their first appointment were given a voice call requesting them to come for follow-up, and the reasons for first missed appointments (MA) were also elicited. The effect of these two intervention levels (first SMS for the SMS group and then voice calls for both groups) on follow-up was evaluated. A total of 214 patients were enrolled in the study. At the first (SMS) intervention level, 62.26% of participants in the SMS group (n = 106) reached appointment-on-time (RA), while in the non-SMS (as usual) group (n = 108), 45.37% of patients RA. The difference between these groups is statistically significant. At the second intervention level (voice call), 66 of 88 patients (another 15 could not be contacted) came for follow-up consultation within 2 days of MA. Distance and diagnosis of alcohol dependence were significantly associated with MA. Social reasons were the most common reasons for first MA. The use of mobile phone technology in an outpatient community psychiatric clinic improved follow-up significantly.
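The abstract reports the two on-time attendance rates and states that they differ significantly, but does not name the test used. As a hedged illustration only, the sketch below recovers the counts implied by the reported percentages and applies a pooled two-proportion z-test, which is an assumed choice rather than the authors' stated method; the function name is hypothetical.

```python
import math


def two_proportion_z(x1, n1, x2, n2):
    """Pooled two-proportion z-test; returns (z, two-sided p-value).

    This test is an assumption for illustration; the abstract does not
    say which test the authors used.
    """
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Counts implied by the reported percentages (62.26% of 106, 45.37% of 108).
sms_on_time = round(0.6226 * 106)
usual_on_time = round(0.4537 * 108)

if __name__ == "__main__":
    z, p = two_proportion_z(sms_on_time, 106, usual_on_time, 108)
    print(f"z = {z:.2f}, two-sided p = {p:.4f}")
```

Under this assumed test the two-sided p-value falls below 0.05, which is consistent with the significance claim in the abstract.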
Multipath for Agricultural and Rural Information Services in China
NASA Astrophysics Data System (ADS)
Ge, Ningning; Zang, Zhiyuan; Gao, Lingwang; Shi, Qiang; Li, Jie; Xing, Chunlin; Shen, Zuorui
The internet cannot yet provide adequate information services for farmers in rural regions of China, because these farmers can hardly access it. However, the wide coverage of mobile signal, telephone lines, and television networks offers a chance to solve the problem. An integrated pest management platform for northern fruit trees was developed based on integrated technology that combines the internet, mobile and fixed-line telephone networks, and the television network to provide integrated pest management (IPM) information services for farmers in rural regions by e-mail, telephone voice, short message, voice mail, videoconference, or other formats, delivered alternatively to users' telephones, cell phones, personal computers, personal digital assistants (PDAs), televisions, etc. The architecture and the functions of the system are introduced in the paper. The system can manage the field monitoring data of agricultural pests; handle enquiries from farmers who access the system's interactive voice response (IVR), with experts on-line or off-line, to provide the necessary information; and issue early warnings about fruit tree pests when necessary, based on analysis of the monitoring data, in a variety of ways including SMS, fax, voice, and intersystem e-mail. The system provides a platform and a new pattern for agricultural technology extension with a high coverage rate in rural regions, and it can solve the 'last kilometer' problem of agricultural information services in China. The effectiveness of the system was verified.
Technology considerations in EHF Satcom systems
NASA Astrophysics Data System (ADS)
Cuccia, C. L.
The history of mm-wave communications is reviewed briefly and technological requirements for future implementation of mm-wave communications satellites for military and commercial applications are surveyed. The driving force for expanding mm-wave usage is an impending saturation of the GEO arc over North America with C- and Ku-band Satcoms. For military purposes, 44 GHz operations would provide antijamming capabilities and on-board processing. Necessary developments for the mm-wave Satcoms include scanning and multiple beam antennas, low-noise amplifiers, filters which channelize the frequency band, frequency hopping synthesizers, QPSK and MSK modulation systems and improvements in GaAs and indium phosphide ICs. Finally, digital systems are being explored for commercial integrated global data, voice and video systems.
Perceptual Adaptation of Voice Gender Discrimination with Spectrally Shifted Vowels
ERIC Educational Resources Information Center
Li, Tianhao; Fu, Qian-Jie
2011-01-01
Purpose: To determine whether perceptual adaptation improves voice gender discrimination of spectrally shifted vowels and, if so, which acoustic cues contribute to the improvement. Method: Voice gender discrimination was measured for 10 normal-hearing subjects, during 5 days of adaptation to spectrally shifted vowels, produced by processing the…
Hearing Story Characters' Voices: Auditory Imagery during Reading
ERIC Educational Resources Information Center
Gunraj, Danielle N.; Klin, Celia M.
2012-01-01
Despite the longstanding belief in an inner voice, there is surprisingly little known about the perceptual features of that voice during text processing. This article asked whether readers infer nonlinguistic phonological features, such as speech rate, associated with a character's speech. Previous evidence for this type of auditory imagery has…
ERIC Educational Resources Information Center
Future Teacher, 1995
1995-01-01
Prominent Americans were asked to reflect on the diversity challenge facing America's teacher workforce. The following leaders from several fields voiced their support of teachers and their belief that America needs more diverse and culturally responsive teachers: (1) Mary Hatwood Futrell, President of Education International; (2) Carol Moseley-Braun,…
Technology in the Public Library: Results from the 1992 PLDS Survey of Technology.
ERIC Educational Resources Information Center
Fidler, Linda M.; Johnson, Debra Wilcox
1994-01-01
Discusses and compares the incorporation of technology by larger public libraries in Canada and the United States. Technology mentioned includes online public access catalogs; remote and local online database searching; microcomputers and software for public use; and fax, voice mail, and Telecommunication Devices for the Deaf and Teletype writer…
Intentional Voice Command Detection for Trigger-Free Speech Interface
NASA Astrophysics Data System (ADS)
Obuchi, Yasunari; Sumiyoshi, Takashi
In this paper we introduce a new framework of audio processing, which is essential to achieve a trigger-free speech interface for home appliances. If the speech interface works continually in real environments, it must extract occasional voice commands and reject everything else. It is extremely important to reduce the number of false alarms because the number of irrelevant inputs is much larger than the number of voice commands even for heavy users of appliances. The framework, called Intentional Voice Command Detection, is based on voice activity detection, but enhanced by various speech/audio processing techniques such as emotion recognition. The effectiveness of the proposed framework is evaluated using a newly-collected large-scale corpus. The advantages of combining various features were tested and confirmed, and the simple LDA-based classifier demonstrated acceptable performance. The effectiveness of various methods of user adaptation is also discussed.
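The abstract says the framework builds on voice activity detection but gives no algorithmic detail. The sketch below shows only the simplest form of that building block, a frame-energy threshold; the function names, frame length, and threshold are hypothetical, and the emotion-recognition and LDA stages described by the authors are not modeled.

```python
def frame_energies(samples, frame_len):
    """Mean squared amplitude of each non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def detect_voice(samples, frame_len=4, threshold=0.01):
    """Flag frames whose energy exceeds a fixed threshold (assumed value)."""
    return [e > threshold for e in frame_energies(samples, frame_len)]


# Toy signal: silence, a short burst, silence.
signal = [0.0] * 4 + [0.5, -0.5, 0.5, -0.5] + [0.0] * 4
print(detect_voice(signal))
```

A detector this simple accepts any loud sound, which is exactly the false-alarm problem the abstract sets out to reduce with additional features.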
Dragon Stream Cipher for Secure Blackbox Cockpit Voice Recorder
NASA Astrophysics Data System (ADS)
Akmal, Fadira; Michrandi Nasution, Surya; Azmi, Fairuz
2017-11-01
An aircraft blackbox is a device used to record all aircraft information; it consists of a Flight Data Recorder (FDR) and a Cockpit Voice Recorder (CVR). The Cockpit Voice Recorder contains the conversations in the aircraft during the flight. Investigations of aircraft crashes usually take a long time because it is difficult to find the blackbox, so the blackbox should be able to send its information elsewhere. To do so it must have a data security system, since data security is a very important part of the information exchange process. The system in this research performs encryption and decryption of Cockpit Voice Recorder data, restricted to authorized people, using the Dragon stream cipher algorithm. The tests performed measure data encryption and decryption time and the avalanche effect. The results show encryption and decryption times of 0.85 seconds and 1.84 seconds for 30 seconds of Cockpit Voice Recorder data, with an avalanche effect of 48.67%.
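The avalanche effect is commonly reported as the percentage of output bits that change when a single input or key bit is flipped, with values near 50% regarded as desirable. The sketch below computes that percentage for two equal-length ciphertexts; it does not implement the Dragon cipher, and the function name `avalanche_percent` and the sample bytes are hypothetical.

```python
def avalanche_percent(a: bytes, b: bytes) -> float:
    """Percentage of bit positions that differ between two equal-length outputs."""
    if len(a) != len(b) or not a:
        raise ValueError("inputs must be non-empty and of equal length")
    # XOR each byte pair and count the set bits in the result.
    differing = sum(bin(x ^ y).count("1") for x, y in zip(a, b))
    return 100.0 * differing / (8 * len(a))


# Hypothetical ciphertexts produced from inputs that differ in one bit.
c1 = bytes([0x0F, 0x0F])
c2 = bytes([0x00, 0x00])
print(avalanche_percent(c1, c2))
```

In practice the two ciphertexts would come from encrypting the same recording under keys (or plaintexts) that differ in exactly one bit.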
Voice tracking and spoken word recognition in the presence of other voices
NASA Astrophysics Data System (ADS)
Litong-Palima, Marisciel; Violanda, Renante; Saloma, Caesar
2004-12-01
We study the human hearing process by modeling the hair cell as a thresholded Hopf bifurcator and compare our calculations with experimental results involving human subjects in two different multi-source listening tasks: voice tracking and spoken-word recognition. In the model, we observed noise suppression by destructive interference between noise sources, which weakens the effective noise strength acting on the hair cell. Different success rate characteristics were observed for the two tasks. Hair cell performance at low threshold levels agrees well with results from voice-tracking experiments, while the results of word-recognition experiments are consistent with a linear model of the hearing process. The ability of humans to track a target voice is robust against cross-talk interference, unlike word-recognition performance, which deteriorates quickly with the number of uncorrelated noise sources in the environment, a response behavior that is associated with linear systems.
Mecke, Ann-Christine; Sundberg, Johan; Richter, Bernhard
2010-10-01
In this investigation the voice source from trained boy singers was processed with a transfer function that contained the singer's formant cluster of a bass, a baritone, or a tenor. The modified voices were evaluated by a panel of highly specialized experts. The experts were asked 1) to assess how similar the examples sounded to the voice of the last castrato Alessandro Moreschi, and 2) to rate how similar they thought the examples were to their imagination of an 18th-century castrato voice. For both questions, the voices with tenor formants produced significantly higher ratings than the other voice types. However, the mean ratings for the second question were generally lower than those for the first.
Bogon, Johanna; Eisenbarth, Hedwig; Landgraf, Steffen; Dreisbach, Gesine
2017-09-01
Vocal events offer not only semantic-linguistic content but also information about the identity and the emotional-motivational state of the speaker. Furthermore, most vocal events have implications for our actions and therefore include action-related features. But the relevance and irrelevance of vocal features varies from task to task. The present study investigates binding processes for perceptual and action-related features of spoken words and their modulation by the task representation of the listener. Participants reacted with two response keys to eight different words spoken by a male or a female voice (Experiment 1) or spoken by an angry or neutral male voice (Experiment 2). There were two instruction conditions: half of participants learned eight stimulus-response mappings by rote (SR), and half of participants applied a binary task rule (TR). In both experiments, SR instructed participants showed clear evidence for binding processes between voice and response features indicated by an interaction between the irrelevant voice feature and the response. By contrast, as indicated by a three-way interaction with instruction, no such binding was found in the TR instructed group. These results are suggestive of binding and shielding as two adaptive mechanisms that ensure successful communication and action in a dynamic social environment.
Coelho, Ana Cristina; Brasolotto, Alcione Ghedini; Bevilacqua, Maria Cecília
2015-06-01
To compare some perceptual and acoustic characteristics of the voices of children who use the advanced combination encoder (ACE) or fine structure processing (FSP) speech coding strategies, and to investigate whether these characteristics differ from children with normal hearing. Acoustic analysis of the sustained vowel /a/ was performed using the multi-dimensional voice program (MDVP). Analyses of sequential and spontaneous speech were performed using the real time pitch. Perceptual analyses of these samples were performed using visual-analogic scales of pre-selected parameters. Seventy-six children from three years to five years and 11 months of age participated. Twenty-eight were users of ACE, 23 were users of FSP, and 25 were children with normal hearing. Although both groups with CI presented with some deviated vocal features, the users of ACE presented with voice quality more like children with normal hearing than the users of FSP. Sound processing of ACE appeared to provide better conditions for auditory monitoring of the voice, and consequently, for better control of the voice production. However, these findings need to be further investigated due to the lack of comparative studies published to understand exactly which attributes of sound processing are responsible for differences in performance.
Wall, Michael A; Jenney, Angelique; Walsh, Margaret
2018-01-20
Research and program evaluation processes that engage children and youth are becoming much more common due to influences from children's rights and the acknowledgement that children have the capacity to contribute to research, both as participants and co-researchers (Roberts, 2017). Recent technological advances in the form of tablet and internet-based applications have provided researchers with additional methodological tools to better capture the voices and experiences of children and their caregivers (Livingstone & Blum-Ross, 2017). However, little has been written on the ways in which these new technological advances can improve research experiences for children who have been exposed to intimate partner and family violence, as well as other forms of traumatic experiences. This paper provides a review of current literature and a case study example of how one children's mental health agency has implemented tablet-based data collection procedures. Copyright © 2018. Published by Elsevier Ltd.
Zeldin, Shepherd; Krauss, Steven Eric; Kim, Taehan; Collura, Jessica; Abdullah, Haslinda
2016-08-01
After-school programs are prevalent across the world, but there is a paucity of research that examines quality within the "black box" of programs at the point of service. Grounded in current theory, this research examined hypothesized pathways between the experience of youth-adult partnership (youth voice in decision-making; supportive adult relationships), the mediators of program safety and engagement, and the developmental outcomes of youth empowerment (leadership competence, policy control) and community connectedness (community connections, school attachment). Surveys were administered to 207 ethnically diverse (47.3 % female; 63.3 % Malay) youth, age 15-16, attending after-school co-curricular programs in Kuala Lumpur, Malaysia. Results showed that youth voice in program decision-making predicted both indicators of youth empowerment. Neither youth voice nor supportive adult relationships was directly associated with community connectedness, however. Program engagement mediated the associations between youth-adult partnership and empowerment. In contrast, program safety mediated the associations between youth-adult partnership and community connectedness. The findings indicate that the two core components of youth-adult partnership-youth voice and supportive adult relationships-may operate through different, yet complementary, pathways of program quality to predict developmental outcomes. Implications for future research are highlighted. For reasons of youth development and youth rights, the immediate challenge is to create opportunities for youth to speak on issues of program concern and to elevate those adults who are able and willing to help youth exercise their voice.
Impact of VoIP and QoS on Open and Distance Learning
ERIC Educational Resources Information Center
Saxena, P. C.; Jasola, Sanjay; Sharma, Ramesh C.
2006-01-01
Voice over Internet Protocol (VoIP) is becoming a reality in many organizations. The potential for mobility in voice over Wi-Fi networks will drive demand for the technology. Wireless VoIP is poised to rival VoIP as an alternative telephony tool. The Internet has been used to transport data in the form of packets. In the past, the Internet did not support…
Lorenz, Klara; Freddolino, Paul P; Comas-Herrera, Adelina; Knapp, Martin; Damant, Jacqueline
2017-01-01
The extent to which technology may be able to support people with dementia and their carers along the care pathway and in different care settings is of interest to policy makers and governments. In this paper we provide an overview of the role of technology in dementia care, treatment and support by mapping existing technologies - by function, target user and disease progression. Technologies identified are classified into seven functions: memory support, treatment, safety and security, training, care delivery, social interaction and other. Different groups of potential users are distinguished: people with mild cognitive impairment and early stages of dementia, people with moderate to severe dementia and unpaid carers and health- and social care professionals. We also identified the care settings, in which the technologies are used (or for which the technologies are developed): at home in the community and in institutional care settings. The evidence has been drawn from a rapid review of the literature, expert interviews and web and social media searches. The largest number of technologies identified aim to enhance the safety and security of people with dementia living in the community. These devices are often passive monitors, such as smoke detectors. Other safety interventions, such as panic buttons, require active intervention. The second largest number of interventions aims to enhance people's memory and includes global positioning systems devices and voice prompts. These technologies mostly target people in the early stages of dementia. A third group focusing on treatment and care delivery emerged from the literature. These interventions focus on technology-aided reminiscence or therapeutic aspects of care for people with dementia and their carers. While the review found a range of technologies available for people with dementia and carers there is very little evidence of widespread practical application. 
Instead, it appears that stakeholders frequently rely on everyday technologies re-purposed to meet their needs.
Temporal signatures of processing voiceness and emotion in sound
Gunter, Thomas C.
2017-01-01
This study explored the temporal course of vocal and emotional sound processing. Participants detected rare repetitions in a stimulus stream comprising neutral and surprised non-verbal exclamations and spectrally rotated control sounds. Spectral rotation preserved some acoustic and emotional properties of the vocal originals. Event-related potentials elicited to unrepeated sounds revealed effects of voiceness and emotion. Relative to non-vocal sounds, vocal sounds elicited a larger centro-parietally distributed N1. This effect was followed by greater positivity to vocal relative to non-vocal sounds beginning with the P2 and extending throughout the recording epoch (N4, late positive potential) with larger amplitudes in female than in male listeners. Emotion effects overlapped with the voiceness effects but were smaller and differed topographically. Voiceness and emotion interacted only for the late positive potential, which was greater for vocal-emotional as compared with all other sounds. Taken together, these results point to a multi-stage process in which voiceness and emotionality are represented independently before being integrated in a manner that biases responses to stimuli with socio-emotional relevance. PMID:28338796
Temporal signatures of processing voiceness and emotion in sound.
Schirmer, Annett; Gunter, Thomas C
2017-06-01
This study explored the temporal course of vocal and emotional sound processing. Participants detected rare repetitions in a stimulus stream comprising neutral and surprised non-verbal exclamations and spectrally rotated control sounds. Spectral rotation preserved some acoustic and emotional properties of the vocal originals. Event-related potentials elicited to unrepeated sounds revealed effects of voiceness and emotion. Relative to non-vocal sounds, vocal sounds elicited a larger centro-parietally distributed N1. This effect was followed by greater positivity to vocal relative to non-vocal sounds beginning with the P2 and extending throughout the recording epoch (N4, late positive potential) with larger amplitudes in female than in male listeners. Emotion effects overlapped with the voiceness effects but were smaller and differed topographically. Voiceness and emotion interacted only for the late positive potential, which was greater for vocal-emotional as compared with all other sounds. Taken together, these results point to a multi-stage process in which voiceness and emotionality are represented independently before being integrated in a manner that biases responses to stimuli with socio-emotional relevance. © The Author (2017). Published by Oxford University Press.
Ratajczak, Jan; Wójtowicz, Piotr; Krzeski, Antoni
2014-01-01
In recent years the number of cancer cases, including cancer of the larynx, has been increasing. The choice of treatment should be dictated primarily by the complete elimination of the cancer, but from the patient's point of view an important factor to keep in mind is the quality of the voice that remains at the end of the therapeutic process. The aim of this study was to evaluate the voice quality of patients after partial surgery of the larynx with vocal fold reconstruction using a pedunculated sterno-thyroid muscle flap. The study included 30 men aged 53-72 years who were treated at the Clinic of Otorhinolaryngology Department of Medical-Dental Medical University of Warsaw for cancer of the larynx classified as T1 or T2 according to the TNM classification. The radical removal of the cancer involved resection of one vocal fold, the laryngeal pouches and the ventricular fold. In 15 patients, assigned to group I, a reconstruction of the "vocal fold" with a pedunculated sterno-thyroid muscle flap was performed simultaneously at the end of the oncological phase of surgery. Group II consisted of 15 patients who underwent surgery that removed only the cancerous lesions. The impact of postoperative voice disorders on quality of life was assessed using a voice self-assessment test (the Voice Handicap Index in the Pruszewicz modification). The nature of the resulting voice was studied using the GRBAS scale. All patients underwent laryngostroboscopic examination. The voice was recorded with the "IRIS" program, prepared by a team at Wrocław University of Technology, and then subjected to acoustic analysis. In addition, noise level and maximum phonation time were measured. The results indicate that the patients of group I gained a better voice, as confirmed by the values of the objective acoustic analysis. 
On the GRBAS scale, patients in whom the defect left after tumour removal was reconstructed had much less hoarseness, showed no hyperkinesis, and had no breathy voice character. They had better voice flow, and the phonation time of the vowel "a" was much longer. Quality of life as assessed with the VHI was rated higher in group I than in group II. The laryngostroboscopic examination confirmed good phonatory occlusion and a nearly regular, healthy vocal fold vibration in patients who had received the muscle flap. Evaluation of voice quality after removal of a tumour in the glottis with simultaneous reconstruction of the "vocal fold" confirmed that this is a very good surgical technique, which allows radical cure of the cancer while preserving a good voice. The operation limits the indications for radiotherapy chosen out of a desire to preserve the phonatory function of the larynx, and thus avoids the negative effects of radiation. Copyright © 2014 Polish Otorhinolaryngology - Head and Neck Surgery Society. Published by Elsevier Urban & Partner Sp. z.o.o. All rights reserved.
Friendly, Rayna H.; Rendall, Drew; Trainor, Laurel J.
2013-01-01
Differentiating individuals by their voice is an important social skill for infants to acquire. In a previous study, we demonstrated that the ability to discriminate individuals by voice follows a pattern of perceptual narrowing (Friendly et al., 2013). Specifically, we found that the ability to discriminate between two foreign-species (rhesus monkey) voices decreased significantly between 6 and 12 months of age. Also during this period, there was a trend for the ability to discriminate human voices to increase. Here we investigate the extent to which plasticity remains at 12 months, after perceptual narrowing has occurred. We found that 12-month-olds who received 2 weeks of monkey-voice training were significantly better at discriminating between rhesus monkey voices than untrained 12-month-olds. Furthermore, discrimination was reinstated to a level slightly better than that of untrained 6-month-olds, suggesting that voice-processing abilities remain considerably plastic at the end of the first year. PMID:24130540
Effects of Vocal Function Exercises: A Systematic Review.
Angadi, Vrushali; Croake, Daniel; Stemple, Joseph
2017-11-03
The purpose of the present review was to systematically analyze the evidence for the effectiveness of vocal function exercises (VFEs) in improving voice production. A systematic literature search was performed by two independent reviewers using PubMed and EBSCOHost to access relevant databases and to locate outcome studies that used VFEs as an intervention. Articles that met inclusion criteria were appraised based on the American Speech-Language-Hearing Association's levels of evidence. Effect sizes for outcomes were calculated using Hedges' g. Voice outcomes were categorized according to the five domains of voice assessment: visual perceptual analysis, acoustic analysis, aerodynamic analysis, auditory-perceptual analysis, and patient self-report measures. Twenty-one articles were included for the final appraisal. All studies demonstrated positive effects of VFEs as demonstrated by effect sizes across selected voice parameters. Effect sizes across parameters ranged from -0.59 to 1.55. None of the included studies reported adverse voice outcomes as a result of VFEs. Outcome studies demonstrate that VFEs are efficacious in enhancing vocal function in individuals with normal and disordered voices, presbylaryngeus, and professional voice users. The available research suggests moderate to strong evidence to support the use of VFEs for a variety of voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
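The review names Hedges' g as its effect-size measure without showing the calculation. As a minimal sketch, the code below uses the common form: Cohen's d on the pooled standard deviation, multiplied by the small-sample correction J = 1 - 3 / (4(n1 + n2) - 9). The function name and the example numbers are hypothetical and are not taken from any study in the review.

```python
import math


def hedges_g(mean1, sd1, n1, mean2, sd2, n2):
    """Standardized mean difference with the usual small-sample correction."""
    pooled_sd = math.sqrt(
        ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    )
    d = (mean1 - mean2) / pooled_sd          # Cohen's d
    correction = 1 - 3 / (4 * (n1 + n2) - 9)  # approximate bias correction J
    return d * correction


# Hypothetical post- vs pre-treatment maximum phonation times (seconds).
g = hedges_g(mean1=12.0, sd1=2.0, n1=10, mean2=10.0, sd2=2.0, n2=10)
print(f"Hedges' g = {g:.3f}")
```

The correction matters mainly for small samples; with large groups g and d are nearly identical.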
Perceptual, auditory and acoustic vocal analysis of speech and singing in choir conductors.
Rehder, Maria Inês Beltrati Cornacchioni; Behlau, Mara
2008-01-01
Background: The voice of choir conductors. Aim: To evaluate the vocal quality of choir conductors based on the production of a sustained vowel during singing and when speaking in order to observe auditory and acoustic differences. Method: Participants of this study were 100 choir conductors, with an equal distribution between genders. Participants were asked to produce the sustained vowel "é" using a singing and a speaking voice. Speech samples were analyzed based on auditory-perceptive and acoustic parameters. The auditory-perceptive analysis was carried out by two speech-language pathologists, specialists in this field of knowledge. The acoustic analysis was carried out with the support of the computer software Doctor Speech (Tiger Electronics, SRD, USA, version 4.0), using the Real Analysis module. Results: The auditory-perceptive analysis of the vocal quality indicated that most conductors have adapted voices, presenting more alterations in their speaking voice. The acoustic analysis indicated different values between genders and between the different production modalities. The fundamental frequency was higher in the singing voice, as were the values for the first formant; the second formant presented lower values in the singing voice, with statistically significant results only for women. Conclusion: The voice of choir conductors is adapted, presenting fewer deviations in the singing voice when compared to the speaking voice. Productions differ based on the voice modality, singing or speaking.
Yiu, Edwin M-L; Murdoch, Bruce; Hird, Kathryn; Lau, Polly; Ho, Elaine Mandy
2008-01-01
Perceptual voice evaluation is a common clinical tool. However, there is as yet no consensus as to which common quality should be measured. Some available evidence shows that voice quality is a language-specific property that may differ across languages. Familiarity with a language may affect the perception of voice quality and the reliability of rating it. The present study set out to investigate the effects of listeners' cultural and language backgrounds on the perception of voice qualities. Forty speech pathology students from Australia and Hong Kong were asked to rate the breathy and rough qualities of synthesized voice signals in Cantonese and English. Results showed that the English stimulus sets as a whole were rated less severely than the Cantonese stimuli by both groups of listeners. In addition, the male Cantonese and English breathy stimuli were rated differently by the Australian and Hong Kong listeners. These results provided some evidence to support the claim that the cultural and language backgrounds of listeners affect the perception of some voice quality types. Thus, the cultural and language backgrounds of judges should be taken into consideration in clinical voice evaluation. Copyright © 2008 S. Karger AG, Basel.
Hayward, Mark; Fuller, Ella
2010-01-01
Current psychological models of voice hearing emphasise the personal meaning that individuals attribute to the voice hearing experience. Recent developments in theory and research have highlighted the importance of the relationship between the hearer and the voice. This study aims to contribute to this area of research, by exploring the experience and usefulness of a new form of 'Relating Therapy' that aims to modify distressing relationships with voices. Semi-structured interviews were conducted with ten participants and explored the experience and usefulness of a pilot of Relating Therapy: three therapists, three voice hearers, two relatives and two referrers. Interviews were transcribed and analysed using Interpretative Phenomenological Analysis. Three themes that emerged from the analysis are presented for discussion: engaging with the therapeutic model; developing a new relating style; and how change is described and defined by participants. This study is consistent with the growing body of theory and research that highlights the interpersonal nature of the voice hearing experience. It also offers tentative support for a therapeutic framework that aims to modify distressing relationships with voices as a means of bringing about positive change. Clinical implications and areas for future research are outlined. © 2009 John Wiley & Sons, Ltd.
Apollo experience report: Voice communications techniques and performance
NASA Technical Reports Server (NTRS)
Dabbs, J. H.; Schmidt, O. L.
1972-01-01
The primary performance requirement of the spaceborne Apollo voice communications system is percent word intelligibility, which is related to other link/channel parameters. The effect of percent word intelligibility on voice channel design and a description of the verification procedures are included. Development and testing performance problems and the techniques used to solve the problems are also discussed. Voice communications performance requirements should be comprehensive and verified easily; the total system must be considered in component design, and the necessity of voice processing and the associated effect on noise, distortion, and cross talk should be examined carefully.
Pavlikhin, O G; Romanenko, S G; Krasnikova, D I; Lesogorova, E V; Yakovlev, V S
The objective of the present study was to evaluate the clinical and functional condition of the voice apparatus in elderly patients and to elaborate recommendations for the prevention of disturbances of the vocal function in professional voice users. This comprehensive study involved 95 patients, including active professional voice users (n=48) and 45 non-occupational voice users, aged 61 to 82 years, with employment histories varying from 32 to 51 years. The study was designed to obtain voice characteristics by means of subjective auditory assessment, microlaryngoscopy, video laryngostroboscopy, determination of maximum phonation time (MPT), and computer-assisted acoustic analysis of the voice using the MDVP Kay Pentax system. The patients' level of anxiety was estimated from the results of the HADS questionnaire. It is concluded that the majority of the disturbances of the vocal function in professional voice users are functional in nature. It is also concluded that neuro-muscular electrophonopedic stimulation (NMEPS) of the laryngeal muscles is the method of choice for the diagnostics of the vocal function of voice users in late adulthood. It is recommended that the professional vocal load for such subjects should not exceed 12-14 hours per week. Rational psychotherapy must constitute an important component of the system of measures intended to support the working capacity of voice users in this age group.
Assistive Devices for People with Hearing, Voice, Speech, or Language Disorders
... sleepy. What research is being conducted on assistive technology? The National Institute on Deafness and Other Communication ... NIDCD) funds research into several areas of assistive technology, such as those described below. Improved devices for ...
Vogel, Adam P; Block, Susan; Kefalianos, Elaina; Onslow, Mark; Eadie, Patricia; Barth, Ben; Conway, Laura; Mundt, James C; Reilly, Sheena
2015-04-01
To investigate the feasibility of adopting automated interactive voice response (IVR) technology for remotely capturing standardized speech samples from stuttering children. Participants were 10 6-year-old stuttering children. Their parents called a toll-free number from their homes and were prompted to elicit speech from their children using a standard protocol involving conversation, picture description and games. The automated IVR system was implemented using an off-the-shelf telephony software program and delivered by a standard desktop computer. The software infrastructure utilizes voice over internet protocol. Speech samples were automatically recorded during the calls. Video recordings were simultaneously acquired in the home at the time of the call to evaluate the fidelity of the telephone collected samples. Key outcome measures included syllables spoken, percentage of syllables stuttered and an overall rating of stuttering severity using a 10-point scale. Data revealed a high level of relative reliability in terms of intra-class correlation between the video and telephone acquired samples on all outcome measures during the conversation task. Findings were less consistent for speech samples during picture description and games. Results suggest that IVR technology can be used successfully to automate remote capture of child speech samples.
[Information technology in learning sign language].
Hernández, Cesar; Pulido, Jose L; Arias, Jorge E
2015-01-01
To develop a technological tool that improves the initial learning of sign language in hearing impaired children. The research was conducted in three phases: requirements gathering, design and development of the proposed device, and validation and evaluation of the device. Through the use of information technology and with the advice of special education professionals, we were able to develop an electronic device that facilitates the learning of sign language in deaf children. The device consists mainly of a graphic touch screen, a voice synthesizer, and a voice recognition system. Validation was performed with deaf children at the Filadelfia School in the city of Bogotá. A learning methodology was established that improves learning times through a small, portable, lightweight, and educational technological prototype. Tests showed the effectiveness of this prototype, achieving a 32% reduction in the initial learning time for sign language in deaf children.
Optical gesture sensing and depth mapping technologies for head-mounted displays: an overview
NASA Astrophysics Data System (ADS)
Kress, Bernard; Lee, Johnny
2013-05-01
Head Mounted Displays (HMDs), and especially see-through HMDs, have gained renewed interest in recent times, and for the first time outside the traditional military and defense realm, as several high-profile consumer electronics companies have presented products about to hit the market. Consumer electronics HMDs have quite different requirements and constraints than their military counterparts. Voice commands are the de facto interface for such devices, but when voice recognition does not work (no connection to the cloud, for example), trackpad and gesture sensing technologies have to be used to communicate information to the device. We review in this paper the various technologies developed today that integrate optical gesture sensing in a small footprint, as well as the various related 3D depth mapping sensors.
Eye Movements Reveal Fast, Voice-Specific Priming
Papesh, Megan H.; Goldinger, Stephen D.; Hout, Michael C.
2015-01-01
In spoken word perception, voice specificity effects are well-documented: When people hear repeated words in some task, performance is generally better when repeated items are presented in their originally heard voices, relative to changed voices. A key theoretical question about voice specificity effects concerns their time-course: Some studies suggest that episodic traces exert their influence late in lexical processing (the time-course hypothesis; McLennan & Luce, 2005), whereas others suggest that episodic traces influence immediate, online processing. We report two eye-tracking studies investigating the time-course of voice-specific priming within and across cognitive tasks. In Experiment 1, participants performed modified lexical decision or semantic classification to words spoken by four speakers. The tasks required participants to click a red “×” or a blue “+” located randomly within separate visual half-fields, necessitating trial-by-trial visual search with consistent half-field response mapping. After a break, participants completed a second block with new and repeated items, half spoken in changed voices. Voice effects were robust very early, appearing in saccade initiation times. Experiment 2 replicated this pattern while changing tasks across blocks, ruling out a response priming account. In the General Discussion, we address the time-course hypothesis, focusing on the challenge it presents for empirical disconfirmation, and highlighting the broad importance of indexical effects, beyond studies of priming. PMID:26726911
Processing voiceless vowels in Japanese: Effects of language-specific phonological knowledge
NASA Astrophysics Data System (ADS)
Ogasawara, Naomi
2005-04-01
There has been little research on processing allophonic variation in the field of psycholinguistics. This study focuses on processing the voiced/voiceless allophonic alternation of high vowels in Japanese. Three perception experiments were conducted to explore how listeners parse out vowels with the voicing alternation from other segments in the speech stream and how the different voicing statuses of the vowel affect listeners' word recognition process. The results from the three experiments show that listeners use phonological knowledge of their native language for phoneme processing and for word recognition. However, interactions of the phonological and acoustic effects are observed to be different in each process. The facilitatory phonological effect and the inhibitory acoustic effect cancel out one another in phoneme processing; while in word recognition, the facilitatory phonological effect overrides the inhibitory acoustic effect.
Poetic Voices: Writing, Reading, and Responding to Poetry
ERIC Educational Resources Information Center
Bandre, Patricia E.
2012-01-01
"Poetic Voices: Writing, Reading, and Responding to Poetry" was the title of the 2011 Master Class in Children's Literature. Woven into this session were the insights of poets Joyce Sidman and Pat Mora who shared their creative processes and the voices that inspire their poetry. In addition, Barbara Kiefer provided advice regarding how to connect…
The Limits of Children's Voices: From Authenticity to Critical, Reflexive Representation
ERIC Educational Resources Information Center
Spyrou, Spyros
2011-01-01
This article provides a critique of the preoccupation with children's voices in child-centred research by exploring their limits and problematizing their use in research. The article argues that critical, reflexive researchers need to reflect on the processes which produce children's voices in research, the power imbalances that shape them and the…
Federal Register 2010, 2011, 2012, 2013, 2014
2011-10-27
... areas of hearing and balance; smell and taste; and voice, speech, and language. The Strategic Plan... research training in the normal and disordered processes of hearing, balance, smell, taste, voice, speech... into three program areas: Hearing and balance; smell and taste; and voice, speech, and language. The...
Freedom to Grow: Children's Perspectives of Student Voice
ERIC Educational Resources Information Center
Quinn, Sarah; Owen, Susanne
2014-01-01
This article explores the power of student voice, in recognition of the child's right to be treated as a capable, competent social actor involved in the education process. In this study, student voice is considered in the light of improving students' engagement and personal and social development at the primary school level. It emphasizes the…
Gender in Voice Perception in Autism
ERIC Educational Resources Information Center
Groen, Wouter B.; van Orsouw, Linda; Zwiers, Marcel; Swinkels, Sophie; van der Gaag, Rutger Jan; Buitelaar, Jan K.
2008-01-01
Deficits in the perception of social stimuli may contribute to the characteristic impairments in social interaction in high functioning autism (HFA). Although the cortical processing of voice is abnormal in HFA, it is unclear whether this gives rise to impairments in the perception of voice gender. About 20 children with HFA and 20 matched…
Perception of initial obstruent voicing is influenced by gestural organization
Best, Catherine T.; Hallé, Pierre A.
2009-01-01
Cross-language differences in phonetic settings for phonological contrasts of stop voicing have posed a challenge for attempts to relate specific phonological features to specific phonetic details. We probe the phonetic-phonological relationship for voicing contrasts more broadly, analyzing in particular their relevance to nonnative speech perception, from two theoretical perspectives: feature geometry and articulatory phonology. Because these perspectives differ in assumptions about temporal/phasing relationships among features/gestures within syllable onsets, we undertook a cross-language investigation on perception of obstruent (stop, fricative) voicing contrasts in three nonnative onsets that use a common set of features/gestures but with differing time-coupling. Listeners of English and French, which differ in their phonetic settings for word-initial stop voicing distinctions, were tested on perception of three onset types, all nonnative to both English and French, that differ in how initial obstruent voicing is coordinated with a lateral feature/gesture and additional obstruent features/gestures. The targets, listed from least complex to most complex onsets, were: a lateral fricative voicing distinction (Zulu /ɬ/-/ɮ/), a laterally-released affricate voicing distinction (Tlingit /tɬ/-/dɮ/), and a coronal stop voicing distinction in stop+/l/ clusters (Hebrew /tl/-/dl/). English and French listeners' performance reflected the differences in their native languages' stop voicing distinctions, compatible with prior perceptual studies on singleton consonant onsets. However, both groups' abilities to perceive voicing as a separable parameter also varied systematically with the structure of the target onsets, supporting the notion that the gestural organization of syllable onsets systematically affects perception of initial voicing distinctions. PMID:20228878
Akın Şenkal, Özgül; Özer, Cem
2015-09-01
Hoarseness in school-aged children disrupts the educational process because it affects children's social progress, communication skills, and self-esteem. Besides otorhinolaryngological examination, voice therapy is the first treatment option when hoarseness occurs. The aim of the study was to determine, through parental interviews, the factors that increase hoarseness in school-aged children and to identify the preferable voice therapy for school-aged children within the framework of the International Classification of Functioning (ICF). The study was a retrospective analysis of data gathered from patient files. A total of 75 children (56 boys and 19 girls) were examined retrospectively. The children were aged 7-14 years (mean 10.86 ± 2.51). A detailed history was taken from the parents of the children involved in this study. Information about the children's vocal habits was gathered within the framework of the ICF, and voice therapy was then started through appointments scheduled by an experienced speech-language pathologist. Comparing values before and after voice therapy according to the therapy method applied, statistically significant differences were found in maximum phonation time and s/z ratio. A relationship of moderate significance was found between the s/z ratio and the number of physiological voice therapy sessions. According to ICF labels, most voice complaints correspond to "body functions" and "activity and limitations." Appropriate voice therapy methods for hoarseness in school-aged children must be chosen and applied by speech-language therapists. The detailed history obtained from the family during the examination, within the framework of the ICF, positively affects the choice and application of the voice therapy method. The child's family is very important for successful management. Copyright © 2015 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
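For readers unfamiliar with the s/z measure mentioned above, it is commonly computed as the longest sustained /s/ duration divided by the longest sustained /z/ duration. The sketch below assumes that common definition; the function name and the durations are hypothetical and do not come from the study's patient files.

```python
def sz_ratio(s_durations, z_durations):
    """Divide the longest sustained /s/ by the longest sustained /z/ (seconds)."""
    longest_s = max(s_durations)
    longest_z = max(z_durations)
    if longest_z <= 0:
        raise ValueError("the longest /z/ duration must be positive")
    return longest_s / longest_z


# Hypothetical trial durations in seconds for one child (illustrative only).
before_therapy = sz_ratio(s_durations=[14.0, 15.0], z_durations=[10.0, 9.5])
after_therapy = sz_ratio(s_durations=[15.0, 14.5], z_durations=[12.0, 12.5])
change = after_therapy - before_therapy
```

Taking the maximum over repeated trials is one convention among several; a clinic that averages trials instead would swap `max` for a mean.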
Generation of surgical pathology report using a 5,000-word speech recognizer.
Tischler, A S; Martin, M R
1989-10-01
Pressures to decrease both turnaround time and operating costs simultaneously have placed conflicting demands on traditional forms of medical transcription. The new technology of voice recognition extends the promise of enabling the pathologist or other medical professional to dictate a correct report and have it printed and/or transmitted to a database immediately. The usefulness of voice recognition systems depends on several factors, including ease of use, reliability, speed, and accuracy. These in turn depend on the general underlying design of the systems and inclusion in the systems of a specific knowledge base appropriate for each application. Development of a good knowledge base requires close collaboration between a domain expert and a knowledge engineer with expertise in voice recognition. The authors have recently completed a knowledge base for surgical pathology using the Kurzweil VoiceReport 5,000-word system.
Using Technology to Improve Student Learning. NCREL Viewpoints, Volume 12
ERIC Educational Resources Information Center
Gahala, Jan, Ed.
2004-01-01
"Viewpoints" is a multimedia package containing two audio CDs and a short, informative booklet. This volume of "Viewpoints" focuses on how technology can help improve student learning. The audio CDs provide the voices, or viewpoints, of various leaders from the education field who work closely with technology issues. Their…
Lin, Szu-Han Joanna; Johnson, Russell E
2015-09-01
One way that employees contribute to organizational effectiveness is by expressing voice. They may offer suggestions for how to improve the organization (promotive voice behavior), or express concerns to prevent harmful events from occurring (prohibitive voice behavior). Although promotive and prohibitive voices are thought to be distinct types of behavior, very little is known about their unique antecedents and consequences. In this study we draw on regulatory focus and ego depletion theories to derive a theoretical model that outlines a dynamic process of the antecedents and consequences of voice behavior. Results from 2 multiwave field studies revealed that promotion and prevention foci have unique ties to promotive and prohibitive voice, respectively. Promotive and prohibitive voice, in turn, were associated with decreases and increases, respectively, in depletion. Consistent with the dynamic nature of self-control, depletion was associated with reductions in employees' subsequent voice behavior, regardless of the type of voice (promotive or prohibitive). Results were consistent across 2 studies and remained even after controlling for other established antecedents of voice and alternative mediating mechanisms besides depletion. (c) 2015 APA, all rights reserved.
ERIC Educational Resources Information Center
Safir, Abla
2015-01-01
With the adoption of sweeping changes in the governance of university admission, namely the implementation of the university entrance exam (UEE) in July 2014, and with broader reforms supported by the World Bank. The UEE reform improves governance in only one step, if a key one, in the long process to access and complete higher education: this…
The role of voice input for human-machine communication.
Cohen, P R; Oviatt, S L
1995-01-01
Optimism is growing that the near future will witness rapid growth in human-computer interaction using voice. System prototypes have recently been built that demonstrate speaker-independent real-time speech recognition, and understanding of naturally spoken utterances with vocabularies of 1000 to 2000 words, and larger. Already, computer manufacturers are building speech recognition subsystems into their new product lines. However, before this technology can be broadly useful, a substantial knowledge base is needed about human spoken language and performance during computer-based spoken interaction. This paper reviews application areas in which spoken interaction can play a significant role, assesses potential benefits of spoken interaction with machines, and compares voice with other modalities of human-computer interaction. It also discusses information that will be needed to build a firm empirical foundation for the design of future spoken and multimodal interfaces. Finally, it argues for a more systematic and scientific approach to investigating spoken input and performance with future language technology. PMID:7479803
Effects of audio compression in automatic detection of voice pathologies.
Sáenz-Lechón, Nicolás; Osma-Ruiz, Víctor; Godino-Llorente, Juan I; Blanco-Velasco, Manuel; Cruz-Roldán, Fernando; Arias-Londoño, Julián D
2008-12-01
This paper investigates the performance of an automatic system for voice pathology detection when the voice samples have been compressed in MP3 format at different bit rates (160, 96, 64, 48, 24, and 8 kb/s). The detectors employ cepstral and noise measurements, along with their derivatives, to characterize the voice signals. The classification is performed using Gaussian mixture models and support vector machines. The results of the different proposed detectors are compared by means of detection error tradeoff (DET) and receiver operating characteristic (ROC) curves, concluding that there are no significant differences in the performance of the detector when the bit rates of the compressed data are above 64 kb/s. This has useful applications in telemedicine, reducing the storage space of voice recordings or transmitting them over narrow-band communications channels.
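To make the comparison method above concrete, the following sketch shows how the operating points behind a DET curve can be derived from detector scores: each threshold yields a false-alarm rate and a miss rate, and the ROC curve plots 1 minus the miss rate against the false-alarm rate. The function names and scores are hypothetical, and this is not the authors' implementation.

```python
def det_points(pathological_scores, normal_scores):
    """Return (threshold, false_alarm_rate, miss_rate) per candidate threshold.

    A recording is flagged as pathological when its score >= threshold.
    """
    thresholds = sorted(set(pathological_scores) | set(normal_scores))
    points = []
    for t in thresholds:
        misses = sum(1 for s in pathological_scores if s < t)
        false_alarms = sum(1 for s in normal_scores if s >= t)
        points.append((t,
                       false_alarms / len(normal_scores),
                       misses / len(pathological_scores)))
    return points


def approx_equal_error_rate(points):
    """Average the two error rates at the point where they are closest."""
    _, false_alarm, miss = min(points, key=lambda p: abs(p[1] - p[2]))
    return (false_alarm + miss) / 2


# Hypothetical detector scores (illustrative only, not data from the paper).
pathological = [0.9, 0.8, 0.7, 0.4]
normal = [0.6, 0.3, 0.2, 0.1]
points = det_points(pathological, normal)
eer = approx_equal_error_rate(points)
```

Repeating this for scores obtained at each bit rate would give one curve per compression level, which is roughly how curves from different conditions can be overlaid and compared.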
Differential Neural Contributions to Native- and Foreign-Language Talker Identification
ERIC Educational Resources Information Center
Perrachione, Tyler K.; Pierrehumbert, Janet B.; Wong, Patrick C. M.
2009-01-01
Humans are remarkably adept at identifying individuals by the sound of their voice, a behavior supported by the nervous system's ability to integrate information from voice and speech perception. Talker-identification abilities are significantly impaired when listeners are unfamiliar with the language being spoken. Recent behavioral studies…
Joint Communications Support Element: The Voice Heard Round the World
2013-01-01
Initial Entry Package (IEP), Early Entry Package (EEP), and Joint Mobility Package provide secure and nonsecure voice, video, and data to small mobile...teams operating worldwide. The IEP and EEP can be rapidly scaled to meet force surge requirements from small dismounted teams up to an advance
Growing Student Voice in Curriculum Decisions at the University
ERIC Educational Resources Information Center
Rojas Pernia, Susana; Haya Salmón, Ignacio; Susinos Rada, Teresa
2016-01-01
This paper is a result of the development of the R+D Project "Schools that are moving towards inclusive education: working with the local community, the student voice and educational support for promoting change" in conjunction with the Innovation Project "Building Bridges. An Educational Innovation Project in the European Higher…
[A New Challenge of Working Together: Psychiatry and Hearing Voices].
Laval, Christian
The recovery model, like the hearing voices movement, promotes users' discourse among health workers. This intervention of users in the expertise concerning the meaning of their voices confronts the vision of health workers who act as advocates, particularly the psychiatrists supporting the psychiatric users movement. In the field, this association seems challenging. At the experimental site in Marseille, professionals, through various tensions and practical difficulties, tend to reconsider their power/knowledge. This new challenge of working together leads them to adopt both a more humble position towards their clinical expertise and a more ambitious position regarding their participation in the construction of public controversies highlighted by the speaking experience of the hearing voices movement.
Cottam, S; Paul, S N; Doughty, O J; Carpenter, L; Al-Mousawi, A; Karvounis, S; Done, D J
2011-09-01
Introduction. Hearing voices occurs in people without psychosis. Why hearing voices is such a key pathological feature of psychosis whilst remaining a manageable experience in nonpsychotic people is yet to be understood. We hypothesised that religious voice hearers would interpret voices in accordance with their beliefs and therefore experience less distress. Methods. Three voice hearing groups participated: 20 mentally healthy Christians, 15 Christian patients with psychosis, and 14 nonreligious patients with psychosis. All completed (1) questionnaires with rating scales measuring the perceptual and emotional aspects of hallucinated voices, and (2) a semistructured interview to explore whether religious belief is used to make sense of the voice hearing experience. Results. The three groups had perceptually similar experiences when hearing the voices. Mentally healthy Christians appeared to assimilate the experience with their religious beliefs (schematic processing), resulting in positive interpretations. Christian patients tended not to assimilate the experience with their religious beliefs, frequently reporting nonreligious interpretations that were predominantly negative. Nearly all participants experienced voices as powerful, but mentally healthy Christians reported the power of voices positively. Conclusion. Religious belief appeared to have a profound, beneficial influence on the mentally healthy Christians' interpretation of hearing voices, but had little or no influence in the case of Christian patients.
Assent in research: the voices of adolescents
Grady, Christine; Wiener, Lori; Abdoler, Emily; Trauernicht, Emily; Zadeh, Sima; Diekema, Douglas S.; Wilfond, Benjamin S.; Wendler, David
2014-01-01
Purpose: Adolescents join clinical research after investigators obtain their positive agreement, or ‘assent.’ Although intended to respect adolescents, little is known about the views of adolescents or their parents regarding assent or research enrollment decisions. This study aimed to better understand perspectives of adolescent research participants and their parents about assent and parental permission. Methods: Structured interviews were conducted with 13 through 17 year old teens, enrolled in clinical research at NIH or Seattle Children’s Hospital, and separately with one parent. Results: 177 adolescent-parent pairs were interviewed. Teens were well distributed by age and gender, represented a wide variety of research and illnesses ranging in severity from mild to life-threatening; 20% were healthy volunteers. Teens and parents were generally satisfied with the assent/permission process. Normally, they made the enrollment decision together and teens wanted parents’ input and support. About 25% of teens reported pressure to enroll, from parents or relatives, but also from doctors/nurses/research teams. Only 2% of teens preferred not to sign a consent form. Conclusions: Despite some differing views about how decisions should be made, the current assent/permission process is perceived as satisfactorily respectful by most teens in research. Many teens want to sign consent forms and teens’ signatures should generally be sought. Flexible guidance allows research teams and IRBs to customize the assent process for teens in particular studies in order to facilitate an appropriate balance between giving teens a voice reflective of their emerging independence and enabling supportive collaboration with parents. PMID:24630932
Possible cross-cultural differences in the perception of impact of voice disorders.
Yiu, Edwin M-L; Ho, Elaine M; Ma, Estella P-M; Verdolini Abbott, Katherine; Branski, Ryan; Richardson, Katherine; Li, Nicole Y-K
2011-05-01
Perception of the impact of voice disorders may differ across different cultural backgrounds. This study investigated the difference in the perception of the impact of voice disorders between the American (Pittsburgh) and Chinese (Hong Kong) cultures. Sixty dysphonic subjects from Hong Kong, China, and 60 dysphonic subjects from Pittsburgh, USA, were recruited to complete the Chinese and English versions of the Voice Activity and Participation Profile (VAPP), respectively. Data analyses using independent t tests were conducted on (a) the total profile scores; (b) total activity limitation score (ALS) and total participation restriction score (PRS); (c) section scores of job, daily communication, and social communication; and (d) section ALS and section PRS. Hong Kong subjects showed significantly higher scores than the Pittsburgh subjects in total profile and other subsections except in the job PRS. Results support the possible influence of individualist and collectivist cultures on the perceived impact of voice disorders on activity and participation. Copyright © 2011 The Voice Foundation. Published by Mosby, Inc. All rights reserved.
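The group comparison described above rests on independent t tests. The abstract does not say which variant was used, so the sketch below assumes the classic pooled-variance (Student) form; the function name and the scores are hypothetical and are not VAPP data from the study.

```python
from math import sqrt
from statistics import mean, variance


def independent_t(group_a, group_b):
    """Return (t statistic, degrees of freedom) for a pooled-variance t test."""
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    # Pool the two sample variances, weighting each by its degrees of freedom.
    pooled = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / df
    standard_error = sqrt(pooled * (1 / n_a + 1 / n_b))
    return (mean(group_a) - mean(group_b)) / standard_error, df


# Hypothetical total profile scores for two small groups (illustrative only).
group_one = [4.0, 5.0, 6.0]
group_two = [1.0, 2.0, 3.0]
t_stat, dof = independent_t(group_one, group_two)
```

The t statistic would then be compared against a t distribution with `dof` degrees of freedom to obtain a p value; that step is left out here to keep the sketch dependency-free.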
Global Youth Voices: engaging Bedouin youth in health promotion in the Middle East.
Bader, Russell; Wanono, Revital; Hamden, Sami; Skinner, Harvey A
2007-01-01
This study evaluated the six-phase Global Youth Voices model (EIPARS) for engaging youth in community health promotion in the Middle East: 1) engagement, 2) issue identification, 3) planning, 4) action, 5) the Rs (research, reflect, reward), and 6) sustainability. Participants were 20 Grade 9 students (10 boys, 10 girls) from two Bedouin communities: 10 from Tuba-Zangaria in North Israel and 10 from Segev Shalom in the Negev, South Israel. Using low-end (photo-voice, photography) and high-end internet-based technology, youth identified and documented the strengths and weaknesses of their communities, and then undertook a community action project focusing on one important issue. Workbooks including process guides and tools for each step of the EIPARS model are available online at: http://www.globalyouthvoices.org/resources.html. Similar issues were identified in each community: e.g., smoking, injuries, friendships, Bedouin culture. Unique issues identified were suicide at Tuba and industrial pollution at Segev Shalom. Students at Tuba selected Suicide Prevention for a community project and prepared a PowerPoint presentation for educating their peers, teachers and parents. Youth from Segev Shalom created a video and photo exhibit on Violence Prevention. Photo-essays and their action projects were uploaded to the Global Youth Voices website for virtual discussion and sharing with youth globally (www.globalyouthvoices.org/middle-east/greetings-en.html). This project demonstrated that the EIPARS model can be used successfully for engaging youth and creating youth-driven community action in the Middle East. This public health intervention provides a positive approach for building cooperation in conflicted regions.