Science.gov

Sample records for audio feature space

  1. Audio feature extraction using probability distribution function

    NASA Astrophysics Data System (ADS)

    Suhaib, A.; Wan, Khairunizam; Aziz, Azri A.; Hazry, D.; Razlan, Zuradzman M.; Shahriman A., B.

    2015-05-01

    Voice recognition has been one of the popular applications in robotic field. It is also known to be recently used for biometric and multimedia information retrieval system. This technology is attained from successive research on audio feature extraction analysis. Probability Distribution Function (PDF) is a statistical method which is usually used as one of the processes in complex feature extraction methods such as GMM and PCA. In this paper, a new method for audio feature extraction is proposed which is by using only PDF as a feature extraction method itself for speech analysis purpose. Certain pre-processing techniques are performed in prior to the proposed feature extraction method. Subsequently, the PDF result values for each frame of sampled voice signals obtained from certain numbers of individuals are plotted. From the experimental results obtained, it can be seen visually from the plotted data that each individuals' voice has comparable PDF values and shapes.

  2. Survey of compressed domain audio features and their expressiveness

    NASA Astrophysics Data System (ADS)

    Pfeiffer, Silvia; Vincent, Thomas

    2003-01-01

    We give an overview of existing audio analysis approaches in the compressed domain and incorporate them into a coherent formal structure. After examining the kinds of information accessible in an MPEG-1 compressed audio stream, we describe a coherent approach to determine features from them and report on a number of applications they enable. Most of them aim at creating an index to the audio stream by segmenting the stream into temporally coherent regions, which may be classified into pre-specified types of sounds such as music, speech, speakers, animal sounds, sound effects, or silence. Other applications centre around sound recognition such as gender, beat or speech recognition.

  3. Simple Solutions for Space Station Audio Problems

    NASA Technical Reports Server (NTRS)

    Wood, Eric

    2016-01-01

    Throughout this summer, a number of different projects were supported relating to various NASA programs, including the International Space Station (ISS) and Orion. The primary project that was worked on was designing and testing an acoustic diverter which could be used on the ISS to increase sound pressure levels in Node 1, a module that does not have any Audio Terminal Units (ATUs) inside it. This acoustic diverter is not intended to be a permanent solution to providing audio to Node 1; it is simply intended to improve conditions while more permanent solutions are under development. One of the most exciting aspects of this project is that the acoustic diverter is designed to be 3D printed on the ISS, using the 3D printer that was set up earlier this year. Because of this, no new hardware needs to be sent up to the station, and no extensive hardware testing needs to be performed on the ground before sending it to the station. Instead, the 3D part file can simply be uploaded to the station's 3D printer, where the diverter will be made.

  4. Space Shuttle Orbiter audio subsystem. [to communication and tracking system

    NASA Technical Reports Server (NTRS)

    Stewart, C. H.

    1978-01-01

    The selection of the audio multiplex control configuration for the Space Shuttle Orbiter audio subsystem is discussed and special attention is given to the evaluation criteria of cost, weight and complexity. The specifications and design of the subsystem are described and detail is given to configurations of the audio terminal and audio central control unit (ATU, ACCU). The audio input from the ACCU, at a signal level of -12.2 to 14.8 dBV, nominal range, at 1 kHz, was found to have balanced source impedance and a balanced local impedance of 6000 + or - 600 ohms at 1 kHz, dc isolated. The Lyndon B. Johnson Space Center (JSC) electroacoustic test laboratory, an audio engineering facility consisting of a collection of acoustic test chambers, analyzed problems of speaker and headset performance, multiplexed control data coupled with audio channels, and the Orbiter cabin acoustic effects on the operational performance of voice communications. This system allows technical management and project engineering to address key constraining issues, such as identifying design deficiencies of the headset interface unit and the assessment of the Orbiter cabin performance of voice communications, which affect the subsystem development.

  5. Audio-visual synchrony and feature-selective attention co-amplify early visual processing.

    PubMed

    Keitel, Christian; Müller, Matthias M

    2016-05-01

    Our brain relies on neural mechanisms of selective attention and converging sensory processing to efficiently cope with rich and unceasing multisensory inputs. One prominent assumption holds that audio-visual synchrony can act as a strong attractor for spatial attention. Here, we tested for a similar effect of audio-visual synchrony on feature-selective attention. We presented two superimposed Gabor patches that differed in colour and orientation. On each trial, participants were cued to selectively attend to one of the two patches. Over time, spatial frequencies of both patches varied sinusoidally at distinct rates (3.14 and 3.63 Hz), giving rise to pulse-like percepts. A simultaneously presented pure tone carried a frequency modulation at the pulse rate of one of the two visual stimuli to introduce audio-visual synchrony. Pulsed stimulation elicited distinct time-locked oscillatory electrophysiological brain responses. These steady-state responses were quantified in the spectral domain to examine individual stimulus processing under conditions of synchronous versus asynchronous tone presentation and when respective stimuli were attended versus unattended. We found that both, attending to the colour of a stimulus and its synchrony with the tone, enhanced its processing. Moreover, both gain effects combined linearly for attended in-sync stimuli. Our results suggest that audio-visual synchrony can attract attention to specific stimulus features when stimuli overlap in space.

  6. [Speaker gender identification based on audio fractal dimension and pitch feature].

    PubMed

    Wang, Zhenhua; Yang, Cuirong; Wu, Wei; Fan, Yingle

    2008-08-01

    Automatic speaker gender identification based on voice feature is an important task in voice processing and analysis fields. In this paper non-linear parameters such as fractal dimension are applied to be one part of feature space for improving the ability of describing speaker gender feature through conventional linear parameters method. Pitch is picked using lifting scheme, and audio fractal dimension is extracted. Then based on Takens theory, the time delay method is used to reconstruct the phase space of fractal dimension sequence. And fractal dimension complexity is obtained by calculating Approximate Entropy. Three dimension feature vectors, including the pitch, the fractal dimension and the fractal dimension complexity, are applied to speaker gender identification. Experiment results show that through adding non-linear parameters, compared with the linear parameter using one dimension only such as pitch, the proposed method is more accurate and robust, and thus provides a new way for speaker gender identification.

  7. Electronystagmography and audio potentials in space flight

    NASA Technical Reports Server (NTRS)

    Thornton, William E.; Biggers, W. P.; Pool, Sam L.; Thomas, W. G.; Thagard, Norman E.

    1985-01-01

    Beginning with the fourth flight of the Space Transport System (STS-4), objective measurements of inner ear function were conducted in near-zero G conditions in earth orbit. The problem of space motion sickness (SMS) was approached much like any disequilibrium problem encountered clinically. However, objective testing techniques had built-in limitations superimposed by the strict parameters inherent in each mission. An attempt was made to objectively characterize SMS, and to first ascertain whether the objective measurements indicated that this disorder was of peripheral or central origin. Electronystagmography and auditory brain stem response recordings were the primary investigative tools. One of the authors (W.E.T.) was a mission specialist on board the orbiter Challenger on the eighth shuttle mission (STS-8) and had the opportunity to make direct and personal observations regarding SMS, an opportunity which has added immeasurably to our understanding of this disorder. Except for two abnormal ENG records, which remain to be explained, the remaining ENG records and all the ABR records made in the weightless environment of space were normal.

  8. Electronystagmography and audio potentials in space flight

    NASA Technical Reports Server (NTRS)

    Thornton, William E.; Biggers, W. P.; Pool, Sam L.; Thomas, W. G.; Thagard, Norman E.

    1985-01-01

    Beginning with the fourth flight of the Space Transport System (STS-4), objective measurements of inner ear function were conducted in near-zero G conditions in earth orbit. The problem of space motion sickness (SMS) was approached much like any disequilibrium problem encountered clinically. However, objective testing techniques had built-in limitations superimposed by the strict parameters inherent in each mission. An attempt was made to objectively characterize SMS, and to first ascertain whether the objective measurements indicated that this disorder was of peripheral or central origin. Electronystagmography and auditory brain stem response recordings were the primary investigative tools. One of the authors (W.E.T.) was a mission specialist on board the orbiter Challenger on the eighth shuttle mission (STS-8) and had the opportunity to make direct and personal observations regarding SMS, an opportunity which has added immeasurably to our understanding of this disorder. Except for two abnormal ENG records, which remain to be explained, the remaining ENG records and all the ABR records made in the weightless environment of space were normal.

  9. Audio-visual interaction and perceptual assessment of water features used over road traffic noise.

    PubMed

    Galbrun, Laurent; Calarco, Francesca M A

    2014-11-01

    This paper examines the audio-visual interaction and perception of water features used over road traffic noise, including their semantic aural properties, as well as their categorization and evocation properties. The research focused on a wide range of small to medium sized water features that can be used in gardens and parks to promote peacefulness and relaxation. Paired comparisons highlighted the inter-dependence between uni-modal (audio-only or visual-only) and bi-modal (audio-visual) perception, indicating that equal attention should be given to the design of both stimuli. In general, natural looking features tended to increase preference scores (compared to audio-only paired comparison scores), while manmade looking features decreased them. Semantic descriptors showed significant correlations with preferences and were found to be more reliable design criteria than physical parameters. A principal component analysis identified three components within the nine semantic attributes tested: "emotional assessment," "sound quality," and "envelopment and temporal variation." The first two showed significant correlations with audio-only preferences, "emotional assessment" being the most important predictor of preferences, and its attributes naturalness, relaxation, and freshness also being significantly correlated with preferences. Categorization results indicated that natural stream sounds are easily identifiable (unlike waterfalls and fountains), while evocation results showed no unique relationship with preferences.

  10. Expansion Techniques of Embedding Audio Watermark Data Rate for Constructing Ubiquitous Acoustic Spaces

    NASA Astrophysics Data System (ADS)

    Modegi, Toshio

    We are proposing “Ubiquitous Acoustic Spaces”, where each sound source can emit some address information with audio signals and make us automatically access to its related cyber space, using handheld devices such as cellphones. In order to realize this concept, we have considered three types of extraction methods, which were an acoustic modulation, an audio fingerprint, and an audio watermark technique. Then we have proposed a novel audio watermarking technique, which enables contactless asynchronous detection of embedded audio watermarks through speaker and microphone devices. However its embedding data rate was around 10 [bps], which was not sufficient for embedding generally used URL address texts. Therefore, we have extended the embedding frequency range and proposed a duplicated embedding algorithm, which uses both previously proposed frequency division method and temporal division method together. By these improvements, possible embedding data rate could be extended to 61.5 [bps], and we could extract watermarks through public telephone networks, even from a cell phone sound source. In this paper, we describe abstracts of our improved watermark embedding and extracting algorithms, and experimental results of watermark extraction precision on several audio signal capturing conditions.

  11. Audio-Visual Space Reorganization Study. RDU-75-05.

    ERIC Educational Resources Information Center

    Baker, Martha

    Space layout and work flow patterns in the Audiovisual Center at Purdue University were studied with respect to effective space utilization and the need for planning space requirements in relationship to the activities being performed. Space and work areas were reorganized to facilitate the flow of work and materials between areas, and equipment…

  12. 47 CFR 25.214 - Technical requirements for space stations in the Satellite Digital Audio Radio Service and...

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... 47 Telecommunication 2 2013-10-01 2013-10-01 false Technical requirements for space stations in the Satellite Digital Audio Radio Service and associated terrestrial repeaters. 25.214 Section 25.214... Technical Standards § 25.214 Technical requirements for space stations in the Satellite Digital Audio Radio...

  13. 47 CFR 25.214 - Technical requirements for space stations in the Satellite Digital Audio Radio Service and...

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 47 Telecommunication 2 2014-10-01 2014-10-01 false Technical requirements for space stations in the Satellite Digital Audio Radio Service and associated terrestrial repeaters. 25.214 Section 25.214... Technical Standards § 25.214 Technical requirements for space stations in the Satellite Digital Audio Radio...

  14. Reconsidering the Role of Recorded Audio as a Rich, Flexible and Engaging Learning Space

    ERIC Educational Resources Information Center

    Middleton, Andrew

    2016-01-01

    Audio needs to be recognised as an integral medium capable of extending education's formal and informal, virtual and physical learning spaces. This paper reconsiders the value of educational podcasting through a review of literature and a module case study. It argues that a pedagogical understanding is needed and challenges technology-centred or…

  15. Reconsidering the Role of Recorded Audio as a Rich, Flexible and Engaging Learning Space

    ERIC Educational Resources Information Center

    Middleton, Andrew

    2016-01-01

    Audio needs to be recognised as an integral medium capable of extending education's formal and informal, virtual and physical learning spaces. This paper reconsiders the value of educational podcasting through a review of literature and a module case study. It argues that a pedagogical understanding is needed and challenges technology-centred or…

  16. Feature space analysis of MRI

    NASA Astrophysics Data System (ADS)

    Soltanian-Zadeh, Hamid; Windham, Joe P.; Peck, Donald J.

    1997-04-01

    This paper presents development and performance evaluation of an MRI feature space method. The method is useful for: identification of tissue types; segmentation of tissues; and quantitative measurements on tissues, to obtain information that can be used in decision making (diagnosis, treatment planning, and evaluation of treatment). The steps of the work accomplished are as follows: (1) Four T2-weighted and two T1-weighted images (before and after injection of Gadolinium) were acquired for ten tumor patients. (2) Images were analyed by two image analysts according to the following algorithm. The intracranial brain tissues were segmented from the scalp and background. The additive noise was suppressed using a multi-dimensional non-linear edge- preserving filter which preserves partial volume information on average. Image nonuniformities were corrected using a modified lowpass filtering approach. The resulting images were used to generate and visualize an optimal feature space. Cluster centers were identified on the feature space. Then images were segmented into normal tissues and different zones of the tumor. (3) Biopsy samples were extracted from each patient and were subsequently analyzed by the pathology laboratory. (4) Image analysis results were compared to each other and to the biopsy results. Pre- and post-surgery feature spaces were also compared. The proposed algorithm made it possible to visualize the MRI feature space and to segment the image. In all cases, the operators were able to find clusters for normal and abnormal tissues. Also, clusters for different zones of the tumor were found. Based on the clusters marked for each zone, the method successfully segmented the image into normal tissues (white matter, gray matter, and CSF) and different zones of the lesion (tumor, cyst, edema, radiation necrosis, necrotic core, and infiltrated tumor). The results agreed with those obtained from the biopsy samples. Comparison of pre- to post-surgery and radiation

  17. 47 CFR 25.214 - Technical requirements for space stations in the satellite digital audio radio service and...

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... the satellite digital audio radio service and associated terrestrial repeaters. 25.214 Section 25.214 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES SATELLITE COMMUNICATIONS Technical Standards § 25.214 Technical requirements for space stations in the satellite digital audio radio...

  18. Feature Selection in the Tensor Product Feature Space

    PubMed Central

    Smalter, Aaron; Huan, Jun; Lushington, Gerald

    2014-01-01

    Classifying objects that are sampled jointly from two or more domains has many applications. The tensor product feature space is useful for modeling interactions between feature sets in different domains but feature selection in the tensor product feature space is challenging. Conventional feature selection methods ignore the structure of the feature space and may not provide the optimal results. In this paper we propose methods for selecting features in the original feature spaces of different domains. We obtained sparsity through two approaches, one using integer quadratic programming and another using L1-norm regularization. Experimental studies on biological data sets validate our approach. PMID:24632658

  19. Importance of perceptive adaptation of sound features in audio content processing

    NASA Astrophysics Data System (ADS)

    Pfeiffer, Silvia

    1998-12-01

    The importance of perceptive modeling for calculation of sound features is well known. Use of simple perception-based adaptations of physically measured stimuli, such as the dB- scale or loudness, is a minimal requirement. Exactly how much value can be gained by more complex perceptive modeling, has not been investigated in detail. The paper examines this question for loudness measures, using well- known psychoacoustic knowledge for their calculation. Profiles of these measures are calculated on audio data of movie material, deliberately using 'natural' sound, instead of reverting to artificial sounds in the laboratory. Ultimately, the quality of a sound feature can only be judged by comparison to human estimates. Therefore, test people were asked to express their perception of loudness by continuous classification into five classes (called pp, p, mf, f, and ff). The results were used to evaluate two loudness measures: the sound pressure level, and an integral loudness measure, developed in the discussed research. The correlation of the human loudness estimates to the integral loudness measure, is about 10 percent higher than to the sound pressure level. In addition, the integral loudness results in a significantly better approximation of the curve of human loudness estimates.

  20. Unique features of space reactors

    SciTech Connect

    Buden, D.

    1990-01-01

    Space reactors are designed to meet a unique set of requirements; they must be sufficiently compact to be launched in a rocket to their operational location, operate for many years without maintenance and servicing, operate in extreme environments, and reject heat by radiation to space. To meet these restrictions, operating temperatures are much greater than in terrestrial power plants, and the reactors tend to have a fast neutron spectrum. Currently, a new generation of space reactor power plants is being developed. The major effort is in the SP-100 program, where the power plant is being designed for seven years of full power, and no maintenance operation at a reactor outlet operating temperature of 1350 K. 8 refs., 3 figs., 1 tab.

  1. Feature integration across space, time, and orientation

    PubMed Central

    Otto, Thomas U.; Öğmen, Haluk; Herzog, Michael H.

    2012-01-01

    The perception of a visual target can be strongly influenced by flanking stimuli. In static displays, performance on the target improves when the distance to the flanking elements increases- proposedly because feature pooling and integration vanishes with distance. Here, we studied feature integration with dynamic stimuli. We show that features of single elements presented within a continuous motion stream are integrated largely independent of spatial distance (and orientation). Hence, space based models of feature integration cannot be extended to dynamic stimuli. We suggest that feature integration is guided by perceptual grouping operations that maintain the identity of perceptual objects over space and time. PMID:19968428

  2. Detecting paralinguistic events in audio stream using context in features and probabilistic decisions☆

    PubMed Central

    Gupta, Rahul; Audhkhasi, Kartik; Lee, Sungbok; Narayanan, Shrikanth

    2017-01-01

    Non-verbal communication involves encoding, transmission and decoding of non-lexical cues and is realized using vocal (e.g. prosody) or visual (e.g. gaze, body language) channels during conversation. These cues perform the function of maintaining conversational flow, expressing emotions, and marking personality and interpersonal attitude. In particular, non-verbal cues in speech such as paralanguage and non-verbal vocal events (e.g. laughters, sighs, cries) are used to nuance meaning and convey emotions, mood and attitude. For instance, laughters are associated with affective expressions while fillers (e.g. um, ah, um) are used to hold floor during a conversation. In this paper we present an automatic non-verbal vocal events detection system focusing on the detect of laughter and fillers. We extend our system presented during Interspeech 2013 Social Signals Sub-challenge (that was the winning entry in the challenge) for frame-wise event detection and test several schemes for incorporating local context during detection. Specifically, we incorporate context at two separate levels in our system: (i) the raw frame-wise features and, (ii) the output decisions. Furthermore, our system processes the output probabilities based on a few heuristic rules in order to reduce erroneous frame-based predictions. Our overall system achieves an Area Under the Receiver Operating Characteristics curve of 95.3% for detecting laughters and 90.4% for fillers on the test set drawn from the data specifications of the Interspeech 2013 Social Signals Sub-challenge. We perform further analysis to understand the interrelation between the features and obtained results. Specifically, we conduct a feature sensitivity analysis and correlate it with each feature's stand alone performance. The observations suggest that the trained system is more sensitive to a feature carrying higher discriminability with implications towards a better system design. PMID:28713197

  3. Audio 2008: Audio Fixation

    ERIC Educational Resources Information Center

    Kaye, Alan L.

    2008-01-01

    Take a look around the bus or subway and see just how many people are bumping along to an iPod or an MP3 player. What they are listening to is their secret, but the many signature earbuds in sight should give one a real sense of just how pervasive digital audio has become. This article describes how that popularity is mirrored in library audio…

  4. Audio 2008: Audio Fixation

    ERIC Educational Resources Information Center

    Kaye, Alan L.

    2008-01-01

    Take a look around the bus or subway and see just how many people are bumping along to an iPod or an MP3 player. What they are listening to is their secret, but the many signature earbuds in sight should give one a real sense of just how pervasive digital audio has become. This article describes how that popularity is mirrored in library audio…

  5. Design specifications of audio-guidance systems for the blind in public spaces.

    PubMed

    Iwamiya, Shin-ichiro; Yamauchi, Katsuya; Shiraishi, Kousuke; Takada, Masayuki; Sato, Masaru

    2004-11-01

    The government of Fukuoka City conducted a survey to determine the effectiveness of an audio-guidance system for the blind. The blind participants confirmed the usefulness of the audio-guidance. In addition, the blind participants and the walking instructors also provided various comments and suggestions for the better utilization of audio-guidance systems for smoother transportation. In order for the participants to be able to recognize auditory signals, it was important to be able to hear them at their peak volumes. To understand the actual meaning of announcements, however, the average volumes of the signals were more important than their peak volumes. The blind participants suggested that auditory signals and announcements should provide short and simple messages. The walking instructors provided comments regarding the placement of loudspeakers to enhance auditory localization. They recommended hanging the loudspeakers from ceilings located in front of passengers. Furthermore, the necessity of controlling excess reverberations was indicated in order to better enable blind citizens to recognize and localize the auditory signals. It was suggested that using different auditory signals for different purposes and places was effective for smoother transportation.

  6. Nonnegative matrix factorization in polynomial feature space.

    PubMed

    Buciu, Ioan; Nikolaidis, Nikos; Pitas, Ioannis

    2008-06-01

    Plenty of methods have been proposed in order to discover latent variables (features) in data sets. Such approaches include the principal component analysis (PCA), independent component analysis (ICA), factor analysis (FA), etc., to mention only a few. A recently investigated approach to decompose a data set with a given dimensionality into a lower dimensional space is the so-called nonnegative matrix factorization (NMF). Its only requirement is that both decomposition factors are nonnegative. To approximate the original data, the minimization of the NMF objective function is performed in the Euclidean space, where the difference between the original data and the factors can be minimized by employing L(2)-norm. In this paper, we propose a generalization of the NMF algorithm by translating the objective function into a Hilbert space (also called feature space) under nonnegativity constraints. With the help of kernel functions, we developed an approach that allows high-order dependencies between the basis images while keeping the nonnegativity constraints on both basis images and coefficients. Two practical applications, namely, facial expression and face recognition, show the potential of the proposed approach.

  7. Audio Restoration

    NASA Astrophysics Data System (ADS)

    Esquef, Paulo A. A.

    The first reproducible recording of human voice was made in 1877 on a tinfoil cylinder phonograph devised by Thomas A. Edison. Since then, much effort has been expended to find better ways to record and reproduce sounds. By the mid-1920s, the first electrical recordings appeared and gradually took over purely acoustic recordings. The development of electronic computers, in conjunction with the ability to record data onto magnetic or optical media, culminated in the standardization of compact disc format in 1980. Nowadays, digital technology is applied to several audio applications, not only to improve the quality of modern and old recording/reproduction techniques, but also to trade off sound quality for less storage space and less taxing transmission capacity requirements.

  8. Feature Detection in Linked Derived Spaces

    NASA Technical Reports Server (NTRS)

    Henze, Chris; Gearld-Yamasaki, Michael (Technical Monitor)

    1998-01-01

    This paper describes by example a strategy for plotting and interacting with data in multiple metric spaces. The example system was designed for use with time-varying computational fluid dynamics (CFD) datasets, but the methodology is directly applicable to other types of field data. The central objects embodied by the tool are {\\em portraits}, which show the data in various coordinate systems, while preserving their spatial connectivity and temporal variability. The coordinates are derived in various ways from the field data, and an important feature is that new and derived portraits can be created interactively. The primary operations supported by the tool are brushing and linking: the user can select a subset of a given portrait, and this subset is highlighted in all portraits. The user can combine highlighted subsets from an arbitrary number of portraits with the usual logical operators, thereby indicating where an arbitrarily complex set of conditions holds. The system is useful for exploratory visualization and feature detection in multivariate data.

  9. Audio-visual imposture

    NASA Astrophysics Data System (ADS)

    Karam, Walid; Mokbel, Chafic; Greige, Hanna; Chollet, Gerard

    2006-05-01

    A GMM based audio visual speaker verification system is described and an Active Appearance Model with a linear speaker transformation system is used to evaluate the robustness of the verification. An Active Appearance Model (AAM) is used to automatically locate and track a speaker's face in a video recording. A Gaussian Mixture Model (GMM) based classifier (BECARS) is used for face verification. GMM training and testing is accomplished on DCT based extracted features of the detected faces. On the audio side, speech features are extracted and used for speaker verification with the GMM based classifier. Fusion of both audio and video modalities for audio visual speaker verification is compared with face verification and speaker verification systems. To improve the robustness of the multimodal biometric identity verification system, an audio visual imposture system is envisioned. It consists of an automatic voice transformation technique that an impostor may use to assume the identity of an authorized client. Features of the transformed voice are then combined with the corresponding appearance features and fed into the GMM based system BECARS for training. An attempt is made to increase the acceptance rate of the impostor and to analyzing the robustness of the verification system. Experiments are being conducted on the BANCA database, with a prospect of experimenting on the newly developed PDAtabase developed within the scope of the SecurePhone project.

  10. Feature Integration across Space, Time, and Orientation

    ERIC Educational Resources Information Center

    Otto, Thomas U.; Ogmen, Haluk; Herzog, Michael H.

    2009-01-01

    The perception of a visual target can be strongly influenced by flanking stimuli. In static displays, performance on the target improves when the distance to the flanking elements increases--presumably because feature pooling and integration vanishes with distance. Here, we studied feature integration with dynamic stimuli. We show that features of…

  11. Real World Audio

    NASA Technical Reports Server (NTRS)

    1998-01-01

    Crystal River Engineering was originally featured in Spinoff 1992 with the Convolvotron, a high speed digital audio processing system that delivers three-dimensional sound over headphones. The Convolvotron was developed for Ames' research on virtual acoustic displays. Crystal River is a now a subsidiary of Aureal Semiconductor, Inc. and they together develop and market the technology, which is a 3-D (three dimensional) audio technology known commercially today as Aureal 3D (A-3D). The technology has been incorporated into video games, surround sound systems, and sound cards.

  12. Efficient audio signal processing for embedded systems

    NASA Astrophysics Data System (ADS)

    Chiu, Leung Kin

    As mobile platforms continue to pack on more computational power, electronics manufacturers start to differentiate their products by enhancing the audio features. However, consumers also demand smaller devices that could operate for longer time, hence imposing design constraints. In this research, we investigate two design strategies that would allow us to efficiently process audio signals on embedded systems such as mobile phones and portable electronics. In the first strategy, we exploit properties of the human auditory system to process audio signals. We designed a sound enhancement algorithm to make piezoelectric loudspeakers sound ”richer" and "fuller." Piezoelectric speakers have a small form factor but exhibit poor response in the low-frequency region. In the algorithm, we combine psychoacoustic bass extension and dynamic range compression to improve the perceived bass coming out from the tiny speakers. We also developed an audio energy reduction algorithm for loudspeaker power management. The perceptually transparent algorithm extends the battery life of mobile devices and prevents thermal damage in speakers. This method is similar to audio compression algorithms, which encode audio signals in such a ways that the compression artifacts are not easily perceivable. Instead of reducing the storage space, however, we suppress the audio contents that are below the hearing threshold, therefore reducing the signal energy. In the second strategy, we use low-power analog circuits to process the signal before digitizing it. We designed an analog front-end for sound detection and implemented it on a field programmable analog array (FPAA). The system is an example of an analog-to-information converter. The sound classifier front-end can be used in a wide range of applications because programmable floating-gate transistors are employed to store classifier weights. Moreover, we incorporated a feature selection algorithm to simplify the analog front-end. A machine

  13. Mining recurring concepts in a dynamic feature space.

    PubMed

    Gomes, João Bártolo; Gaber, Mohamed Medhat; Sousa, Pedro A C; Menasalvas, Ernestina

    2014-01-01

    Most data stream classification techniques assume that the underlying feature space is static. However, in real-world applications the set of features and their relevance to the target concept may change over time. In addition, when the underlying concepts reappear, reusing previously learnt models can enhance the learning process in terms of accuracy and processing time at the expense of manageable memory consumption. In this paper, we propose mining recurring concepts in a dynamic feature space (MReC-DFS), a data stream classification system to address the challenges of learning recurring concepts in a dynamic feature space while simultaneously reducing the memory cost associated with storing past models. MReC-DFS is able to detect and adapt to concept changes using the performance of the learning process and contextual information. To handle recurring concepts, stored models are combined in a dynamically weighted ensemble. Incremental feature selection is performed to reduce the combined feature space. This contribution allows MReC-DFS to store only the features most relevant to the learnt concepts, which in turn increases the memory efficiency of the technique. In addition, an incremental feature selection method is proposed that dynamically determines the threshold between relevant and irrelevant features. Experimental results demonstrating the high accuracy of MReC-DFS compared with state-of-the-art techniques on a variety of real datasets are presented. The results also show the superior memory efficiency of MReC-DFS.

  14. The Space Shuttle Main Engine and its maintenance features

    NASA Technical Reports Server (NTRS)

    Wheelock, V. J.

    1973-01-01

    The Space Shuttle Main Engine (SSME) is a reusable, high-performance rocket engine being developed to satisfy the performance, life, reliability, and operational requirements of the Space Shuttle Orbiter. The design includes simple, low-cost maintenance features resulting from a viable maintainability program dedicated to minimizing engine cost per flight.

  15. pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis

    PubMed Central

    Giannakopoulos, Theodoros

    2015-01-01

    Audio information plays a rather important role in the increasing digital content that is available today, resulting in a need for methodologies that automatically analyze such content: audio event recognition for home automations and surveillance systems, speech recognition, music information retrieval, multimodal analysis (e.g. audio-visual analysis of online videos for content-based recommendation), etc. This paper presents pyAudioAnalysis, an open-source Python library that provides a wide range of audio analysis procedures including: feature extraction, classification of audio signals, supervised and unsupervised segmentation and content visualization. pyAudioAnalysis is licensed under the Apache License and is available at GitHub (https://github.com/tyiannak/pyAudioAnalysis/). Here we present the theoretical background behind the wide range of the implemented methodologies, along with evaluation metrics for some of the methods. pyAudioAnalysis has been already used in several audio analysis research applications: smart-home functionalities through audio event detection, speech emotion recognition, depression classification based on audio-visual features, music segmentation, multimodal content-based movie recommendation and health applications (e.g. monitoring eating habits). The feedback provided from all these particular audio applications has led to practical enhancement of the library. PMID:26656189

  16. pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis.

    PubMed

    Giannakopoulos, Theodoros

    2015-01-01

    Audio information plays a rather important role in the increasing digital content that is available today, resulting in a need for methodologies that automatically analyze such content: audio event recognition for home automations and surveillance systems, speech recognition, music information retrieval, multimodal analysis (e.g. audio-visual analysis of online videos for content-based recommendation), etc. This paper presents pyAudioAnalysis, an open-source Python library that provides a wide range of audio analysis procedures including: feature extraction, classification of audio signals, supervised and unsupervised segmentation and content visualization. pyAudioAnalysis is licensed under the Apache License and is available at GitHub (https://github.com/tyiannak/pyAudioAnalysis/). Here we present the theoretical background behind the wide range of the implemented methodologies, along with evaluation metrics for some of the methods. pyAudioAnalysis has been already used in several audio analysis research applications: smart-home functionalities through audio event detection, speech emotion recognition, depression classification based on audio-visual features, music segmentation, multimodal content-based movie recommendation and health applications (e.g. monitoring eating habits). The feedback provided from all these particular audio applications has led to practical enhancement of the library.

  17. Multiclass Bayes error estimation by a feature space sampling technique

    NASA Technical Reports Server (NTRS)

    Mobasseri, B. G.; Mcgillem, C. D.

    1979-01-01

    A general Gaussian M-class N-feature classification problem is defined. An algorithm is developed that requires the class statistics as its only input and computes the minimum probability of error through use of a combined analytical and numerical integration over a sequence simplifying transformations of the feature space. The results are compared with those obtained by conventional techniques applied to a 2-class 4-feature discrimination problem with results previously reported and 4-class 4-feature multispectral scanner Landsat data classified by training and testing of the available data.

  18. Multiclass Bayes error estimation by a feature space sampling technique

    NASA Technical Reports Server (NTRS)

    Mobasseri, B. G.; Mcgillem, C. D.

    1979-01-01

    A general Gaussian M-class N-feature classification problem is defined. An algorithm is developed that requires the class statistics as its only input and computes the minimum probability of error through use of a combined analytical and numerical integration over a sequence simplifying transformations of the feature space. The results are compared with those obtained by conventional techniques applied to a 2-class 4-feature discrimination problem with results previously reported and 4-class 4-feature multispectral scanner Landsat data classified by training and testing of the available data.

  19. Space Station services and design features for users

    NASA Technical Reports Server (NTRS)

    Kurzhals, Peter R.; Mckinney, Royce L.

    1987-01-01

    The operational design features and services planned for the NASA Space Station will furnish, in addition to novel opportunities and facilities, lower costs through interface standardization and automation and faster access by means of computer-aided integration and control processes. By furnishing a basis for large-scale space exploitation, the Space Station will possess industrial production and operational services capabilities that may be used by the private sector for commercial ventures; it could also ultimately support lunar and planetary exploration spacecraft assembly and launch facilities.

  20. Space Station services and design features for users

    NASA Technical Reports Server (NTRS)

    Kurzhals, Peter R.; Mckinney, Royce L.

    1987-01-01

    The operational design features and services planned for the NASA Space Station will furnish, in addition to novel opportunities and facilities, lower costs through interface standardization and automation and faster access by means of computer-aided integration and control processes. By furnishing a basis for large-scale space exploitation, the Space Station will possess industrial production and operational services capabilities that may be used by the private sector for commercial ventures; it could also ultimately support lunar and planetary exploration spacecraft assembly and launch facilities.

  1. Exploration of complex visual feature spaces for object perception

    PubMed Central

    Leeds, Daniel D.; Pyles, John A.; Tarr, Michael J.

    2014-01-01

    The mid- and high-level visual properties supporting object perception in the ventral visual pathway are poorly understood. In the absence of well-specified theory, many groups have adopted a data-driven approach in which they progressively interrogate neural units to establish each unit's selectivity. Such methods are challenging in that they require search through a wide space of feature models and stimuli using a limited number of samples. To more rapidly identify higher-level features underlying human cortical object perception, we implemented a novel functional magnetic resonance imaging method in which visual stimuli are selected in real-time based on BOLD responses to recently shown stimuli. This work was inspired by earlier primate physiology work, in which neural selectivity for mid-level features in IT was characterized using a simple parametric approach (Hung et al., 2012). To extend such work to human neuroimaging, we used natural and synthetic object stimuli embedded in feature spaces constructed on the basis of the complex visual properties of the objects themselves. During fMRI scanning, we employed a real-time search method to control continuous stimulus selection within each image space. This search was designed to maximize neural responses across a pre-determined 1 cm3 brain region within ventral cortex. To assess the value of this method for understanding object encoding, we examined both the behavior of the method itself and the complex visual properties the method identified as reliably activating selected brain regions. We observed: (1) Regions selective for both holistic and component object features and for a variety of surface properties; (2) Object stimulus pairs near one another in feature space that produce responses at the opposite extremes of the measured activity range. Together, these results suggest that real-time fMRI methods may yield more widely informative measures of selectivity within the broad classes of visual features

  2. Exploration of complex visual feature spaces for object perception.

    PubMed

    Leeds, Daniel D; Pyles, John A; Tarr, Michael J

    2014-01-01

    The mid- and high-level visual properties supporting object perception in the ventral visual pathway are poorly understood. In the absence of well-specified theory, many groups have adopted a data-driven approach in which they progressively interrogate neural units to establish each unit's selectivity. Such methods are challenging in that they require search through a wide space of feature models and stimuli using a limited number of samples. To more rapidly identify higher-level features underlying human cortical object perception, we implemented a novel functional magnetic resonance imaging method in which visual stimuli are selected in real-time based on BOLD responses to recently shown stimuli. This work was inspired by earlier primate physiology work, in which neural selectivity for mid-level features in IT was characterized using a simple parametric approach (Hung et al., 2012). To extend such work to human neuroimaging, we used natural and synthetic object stimuli embedded in feature spaces constructed on the basis of the complex visual properties of the objects themselves. During fMRI scanning, we employed a real-time search method to control continuous stimulus selection within each image space. This search was designed to maximize neural responses across a pre-determined 1 cm(3) brain region within ventral cortex. To assess the value of this method for understanding object encoding, we examined both the behavior of the method itself and the complex visual properties the method identified as reliably activating selected brain regions. We observed: (1) Regions selective for both holistic and component object features and for a variety of surface properties; (2) Object stimulus pairs near one another in feature space that produce responses at the opposite extremes of the measured activity range. Together, these results suggest that real-time fMRI methods may yield more widely informative measures of selectivity within the broad classes of visual features

  3. Re-Identification in the Function Space of Feature Warps.

    PubMed

    Martinel, Niki; Das, Abir; Micheloni, Christian; Roy-Chowdhury, Amit K

    2015-08-01

    Person re-identification in a non-overlapping multicamera scenario is an open challenge in computer vision because of the large changes in appearances caused by variations in viewing angle, lighting, background clutter, and occlusion over multiple cameras. As a result of these variations, features describing the same person get transformed between cameras. To model the transformation of features, the feature space is nonlinearly warped to get the "warp functions". The warp functions between two instances of the same target form the set of feasible warp functions while those between instances of different targets form the set of infeasible warp functions. In this work, we build upon the observation that feature transformations between cameras lie in a nonlinear function space of all possible feature transformations. The space consisting of all the feasible and infeasible warp functions is the warp function space (WFS). We propose to learn a discriminating surface separating these two sets of warp functions in the WFS and to re-identify persons by classifying a test warp function as feasible or infeasible. Towards this objective, a Random Forest (RF) classifier is employed which effectively chooses the warp function components according to their importance in separating the feasible and the infeasible warp functions in the WFS. Extensive experiments on five datasets are carried out to show the superior performance of the proposed approach over state-of-the-art person re-identification methods. We show that our approach outperforms all other methods when large illumination variations are considered. At the same time it has been shown that our method reaches the best average performance over multiple combinations of the datasets, thus, showing that our method is not designed only to address a specific challenge posed by a particular dataset.

  4. Fuzzy Logic-Based Audio Pattern Recognition

    NASA Astrophysics Data System (ADS)

    Malcangi, M.

    2008-11-01

    Audio and audio-pattern recognition is becoming one of the most important technologies to automatically control embedded systems. Fuzzy logic may be the most important enabling methodology due to its ability to rapidly and economically model such application. An audio and audio-pattern recognition engine based on fuzzy logic has been developed for use in very low-cost and deeply embedded systems to automate human-to-machine and machine-to-machine interaction. This engine consists of simple digital signal-processing algorithms for feature extraction and normalization, and a set of pattern-recognition rules manually tuned or automatically tuned by a self-learning process.

  5. A face feature space in the macaque temporal lobe.

    PubMed

    Freiwald, Winrich A; Tsao, Doris Y; Livingstone, Margaret S

    2009-09-01

    The ability of primates to effortlessly recognize faces has been attributed to the existence of specialized face areas. One such area, the macaque middle face patch, consists almost entirely of cells that are selective for faces, but the principles by which these cells analyze faces are unknown. We found that middle face patch neurons detect and differentiate faces using a strategy that is both part based and holistic. Cells detected distinct constellations of face parts. Furthermore, cells were tuned to the geometry of facial features. Tuning was most often ramp-shaped, with a one-to-one mapping of feature magnitude to firing rate. Tuning amplitude depended on the presence of a whole, upright face and features were interpreted according to their position in a whole, upright face. Thus, cells in the middle face patch encode axes of a face space specialized for whole, upright faces.

  6. Audio Indexing for Efficiency

    ERIC Educational Resources Information Center

    Rahnlom, Harold F.; Pedrick, Lillian

    1978-01-01

    This article describes Zimdex, an audio indexing system developed to solve the problem of indexing audio materials for individual instruction in the content area of the mathematics of life insurance. (Author)

  7. Feature extraction on local jet space for texture classification

    NASA Astrophysics Data System (ADS)

    Oliveira, Marcos William da Silva; da Silva, Núbia Rosa; Manzanera, Antoine; Bruno, Odemir Martinez

    2015-12-01

    The proposal of this study is to analyze the texture pattern recognition over the local jet space looking forward to improve the texture characterization. Local jets decompose the image based on partial derivatives allowing the texture feature extraction be exploited in different levels of geometrical structures. Each local jet component evidences a different local pattern, such as, flat regions, directional variations and concavity or convexity. Subsequently, a texture descriptor is used to extract features from 0th, 1st and 2nd-derivative components. Four well-known databases (Brodatz, Vistex, Usptex and Outex) and four texture descriptors (Fourier descriptors, Gabor filters, Local Binary Pattern and Local Binary Pattern Variance) were used to validate the idea, showing in most cases an increase of the success rates.

  8. Feature tracking with temporal-scale space modeling

    NASA Astrophysics Data System (ADS)

    Zafiropoulos, Pany; Schenk, Toni F.

    1998-12-01

    This study investigates the extension of scale space techniques for manipulating image sequences for extracting and classifying useful information for further vision processes toward image understanding and sequentially facilitating the acquisition phase of Geographic Information Systems. More precisely, the basic concepts, axioms and the implementational issues of multi-resolution approaches will be discussed for treating continuous monotonic scale changes in images taken in a dynamic environment as in the case when a mobile mapping system captures images at constant time intervals. Moreover, discussion will be devoted to the scale variations and resolution issues, in a stereo-pair and within a sequence, and consequently to the use of an operator with adaptable size for tracking and interpreting spatial scale changes with respect to the registered object space. The analysis of a set of images representing the same object space and by using scale space tracing techniques supported with scale changes in temporal basis results to the characterization of features in which their precise localization is extracted by applying a multiple image matching framework, satisfying a set of criteria and taking into account the geometric and radiometric dissimilarities for accommodating the scale differences. Finally, the above framework can be applied to the localization and to the archiving of traffic signs, reaching high levels of automation in real time in a controlled environment, to the registration of the road centerline and to the post-processing of sequences for producing DEMs and Orthoimagery.

  9. Detecting double compression of audio signal

    NASA Astrophysics Data System (ADS)

    Yang, Rui; Shi, Yun Q.; Huang, Jiwu

    2010-01-01

    MP3 is the most popular audio format nowadays in our daily life, for example music downloaded from the Internet and file saved in the digital recorder are often in MP3 format. However, low bitrate MP3s are often transcoded to high bitrate since high bitrate ones are of high commercial value. Also audio recording in digital recorder can be doctored easily by pervasive audio editing software. This paper presents two methods for the detection of double MP3 compression. The methods are essential for finding out fake-quality MP3 and audio forensics. The proposed methods use support vector machine classifiers with feature vectors formed by the distributions of the first digits of the quantized MDCT (modified discrete cosine transform) coefficients. Extensive experiments demonstrate the effectiveness of the proposed methods. To the best of our knowledge, this piece of work is the first one to detect double compression of audio signal.

  10. Automated Solar Feature Detection for Space Weather Applications

    NASA Astrophysics Data System (ADS)

    Pérez-Suárez, David; Higgins, Paul A.; Bloomfield, D. Shaun; McAteer, R. T. James; Krista, Larisza D.; Byrne, Jason P.; Gallagher, Peter. T.

    2011-03-01

    The solar surface and atmosphere are highly dynamic plasma environments, which evolve over a wide range of temporal and spatial scales. Large-scale eruptions, such as coronal mass ejections, can be accelerated to millions of kilometres per hour in a matter of minutes, making their automated detection and characterisation challenging. Additionally, there are numerous faint solar features, such as coronal holes and coronal dimmings, which are important for space weather monitoring and forecasting, but their low intensity and sometimes transient nature makes them problematic to detect using traditional image processing techniques. These difficulties are compounded by advances in ground- and space- based instrumentation, which have increased the volume of data that solar physicists are confronted with on a minute-by-minute basis; NASA's Solar Dynamics Observatory for example is returning many thousands of images per hour (~1.5 TB/day). This chapter reviews recent advances in the application of images processing techniques to the automated detection of active regions, coronal holes, filaments, CMEs, and coronal dimmings for the purposes of space weather monitoring and prediction.

  11. Enhancing Manual Scan Registration Using Audio Cues

    NASA Astrophysics Data System (ADS)

    Ntsoko, T.; Sithole, G.

    2014-04-01

    Indoor mapping and modelling requires that acquired data be processed by editing, fusing, formatting the data, amongst other operations. Currently the manual interaction the user has with the point cloud (data) while processing it is visual. Visual interaction does have limitations, however. One way of dealing with these limitations is to augment audio in point cloud processing. Audio augmentation entails associating points of interest in the point cloud with audio objects. In coarse scan registration, reverberation, intensity and frequency audio cues were exploited to help the user estimate depth and occupancy of space of points of interest. Depth estimations were made reliably well when intensity and frequency were both used as depth cues. Coarse changes of depth could be estimated in this manner. The depth between surfaces can therefore be estimated with the aid of the audio objects. Sound reflections of an audio object provided reliable information of the object surroundings in some instances. For a point/area of interest in the point cloud, these reflections can be used to determine the unseen events around that point/area of interest. Other processing techniques could benefit from this while other information is estimated using other audio cues like binaural cues and Head Related Transfer Functions. These other cues could be used in position estimations of audio objects to aid in problems such as indoor navigation problems.

  12. Advances in audio source seperation and multisource audio content retrieval

    NASA Astrophysics Data System (ADS)

    Vincent, Emmanuel

    2012-06-01

    Audio source separation aims to extract the signals of individual sound sources from a given recording. In this paper, we review three recent advances which improve the robustness of source separation in real-world challenging scenarios and enable its use for multisource content retrieval tasks, such as automatic speech recognition (ASR) or acoustic event detection (AED) in noisy environments. We present a Flexible Audio Source Separation Toolkit (FASST) and discuss its advantages compared to earlier approaches such as independent component analysis (ICA) and sparse component analysis (SCA). We explain how cues as diverse as harmonicity, spectral envelope, temporal fine structure or spatial location can be jointly exploited by this toolkit. We subsequently present the uncertainty decoding (UD) framework for the integration of audio source separation and audio content retrieval. We show how the uncertainty about the separated source signals can be accurately estimated and propagated to the features. Finally, we explain how this uncertainty can be efficiently exploited by a classifier, both at the training and the decoding stage. We illustrate the resulting performance improvements in terms of speech separation quality and speaker recognition accuracy.

  13. Web Audio/Video Streaming Tool

    NASA Technical Reports Server (NTRS)

    Guruvadoo, Eranna K.

    2003-01-01

    In order to promote NASA-wide educational outreach program to educate and inform the public of space exploration, NASA, at Kennedy Space Center, is seeking efficient ways to add more contents to the web by streaming audio/video files. This project proposes a high level overview of a framework for the creation, management, and scheduling of audio/video assets over the web. To support short-term goals, the prototype of a web-based tool is designed and demonstrated to automate the process of streaming audio/video files. The tool provides web-enabled users interfaces to manage video assets, create publishable schedules of video assets for streaming, and schedule the streaming events. These operations are performed on user-defined and system-derived metadata of audio/video assets stored in a relational database while the assets reside on separate repository. The prototype tool is designed using ColdFusion 5.0.

  14. Audio and Electroacoustics

    NASA Astrophysics Data System (ADS)

    Davis, Mark

    This chapter surveys devices and systems associated with audio and electroacoustics: the acquisition, transmission, storage, and reproduction of audio. The chapter provides an historical overview of the field since before the days of Edison and Bell to the present day, and analyzes performance of audio transducers, components and systems from basic psychoacoustic principles, to arrive at an assessment of the perceptual performance of such elements and an indication of possible directions for future progress.

  15. Audio and Electroacoustics

    NASA Astrophysics Data System (ADS)

    Davis, Mark F.

    This chapter surveys devices and systems associated with audio and electroacoustics: The acquisition, transmission, storage, and reproduction of audio. The chapter provides an historical overview of the field since before the days of Edison and Bell to the present day, and analyzes performance of audio transducers, components and systems from basic psychoacoustic principles, to arrive at an assessment of the perceptual performance of such elements and an indication of possible directions for future progress.

  16. Audio fingerprint extraction for content identification

    NASA Astrophysics Data System (ADS)

    Shiu, Yu; Yeh, Chia-Hung; Kuo, C. C. J.

    2003-11-01

    In this work, we present an audio content identification system that identifies some unknown audio material by comparing its fingerprint with those extracted off-line and saved in the music database. We will describe in detail the procedure to extract audio fingerprints and demonstrate that they are robust to noise and content-preserving manipulations. The main feature in the proposed system is the zero-crossing rate extracted with the octave-band filter bank. The zero-crossing rate can be used to describe the dominant frequency in each subband with a very low computational cost. The size of audio fingerprint is small and can be efficiently stored along with the compressed files in the database. It is also robust to many modifications such as tempo change and time-alignment distortion. Besides, the octave-band filter bank is used to enhance the robustness to distortion, especially those localized on some frequency regions.

  17. Trajectory analysis via a geometric feature space approach

    SciTech Connect

    Rintoul, Mark D.; Wilson, Andrew T.

    2015-10-05

    This study aimed to organize a body of trajectories in order to identify, search for and classify both common and uncommon behaviors among objects such as aircraft and ships. Existing comparison functions such as the Fréchet distance are computationally expensive and yield counterintuitive results in some cases. We propose an approach using feature vectors whose components represent succinctly the salient information in trajectories. These features incorporate basic information such as the total distance traveled and the distance between start/stop points as well as geometric features related to the properties of the convex hull, trajectory curvature and general distance geometry. Additionally, these features can generally be mapped easily to behaviors of interest to humans who are searching large databases. Most of these geometric features are invariant under rigid transformation. Furthermore, we demonstrate the use of different subsets of these features to identify trajectories similar to an exemplar, cluster a database of several hundred thousand trajectories and identify outliers.

  18. Interactions between space-based and feature-based attention

    PubMed Central

    Leonard, Carly J.; Balestreri, Angela; Luck, Steven J.

    2014-01-01

    Although early research suggested that attention to nonspatial features (i.e., red) was confined to stimuli appearing at an attended spatial location, more recent research has emphasized the global nature of feature-based attention. For example, a distractor sharing a target feature may capture attention even if it occurs at a task-irrelevant location. Such findings have been used to argue that feature-based attention operates independently of spatial attention. However, feature-based attention may nonetheless interact with spatial attention, yielding larger feature-based effects at attended locations than at unattended locations. The present study tested this possibility. In two experiments, participants viewed a rapid serial visual presentation (RSVP) stream and identified a target letter defined by its color. Target-colored distractors were presented at various task-irrelevant locations during the RSVP stream. We found that feature-driven attentional capture effects were largest when the target-colored distractor was closer to the attended location. These results demonstrate that spatial attention modulates the strength of feature-based attention capture, calling into question the prior evidence that feature-based attention operates in a global manner that is independent of spatial attention. PMID:25285472

  19. MedlinePlus FAQ: Is audio description available for videos on MedlinePlus?

    MedlinePlus

    ... audiodescription.html Question: Is audio description available for videos on MedlinePlus? To use the sharing features on ... page, please enable JavaScript. Answer: Audio description of videos helps make the content of videos accessible to ...

  20. Aeronautical audio broadcasting via satellite

    NASA Technical Reports Server (NTRS)

    Tzeng, Forrest F.

    1993-01-01

    A system design for aeronautical audio broadcasting, with C-band uplink and L-band downlink, via Inmarsat space segments is presented. Near-transparent-quality compression of 5-kHz bandwidth audio at 20.5 kbit/s is achieved based on a hybrid technique employing linear predictive modeling and transform-domain residual quantization. Concatenated Reed-Solomon/convolutional codes with quadrature phase shift keying are selected for bandwidth and power efficiency. RF bandwidth at 25 kHz per channel, and a decoded bit error rate at 10(exp -6) with E(sub b)/N(sub o) at 3.75 dB are obtained. An interleaver, scrambler, modem synchronization, and frame format were designed, and frequency-division multiple access was selected over code-division multiple access. A link budget computation based on a worst-case scenario indicates sufficient system power margins. Transponder occupancy analysis for 72 audio channels demonstrates ample remaining capacity to accommodate emerging aeronautical services.

  1. Audio signal processor

    NASA Technical Reports Server (NTRS)

    Hymer, R. L.

    1970-01-01

    System provides automatic volume control for an audio amplifier or a voice communication system without introducing noise surges during pauses in the input, and without losing the initial signal when the input resumes.

  2. Forensic audio watermark detection

    NASA Astrophysics Data System (ADS)

    Steinebach, Martin; Zmudzinski, Sascha; Petrautzki, Dirk

    2012-03-01

    Digital audio watermarking detection is often computational complex and requires at least as much audio information as required to embed a complete watermark. In some applications, especially real-time monitoring, this is an important drawback. The reason for this is the usage of sync sequences at the beginning of the watermark, allowing a decision about the presence only if at least the sync has been found and retrieved. We propose an alternative method for detecting the presence of a watermark. Based on the knowledge of the secret key used for embedding, we create a mark for all potential marking stages and then use a sliding window to test a given audio file on the presence of statistical characteristics caused by embedding. In this way we can detect a watermark in less than 1 second of audio.

  3. Steganalysis for Audio Data

    DTIC Science & Technology

    2006-03-31

    evaluation system which provides the user the functionality of different watermarking algorithms (embedding and detecting) with a large database of audio...with a large database of audio signals (test material). The system also provides both single and profile attacks which can be used to evaluate the...in this case, music/ hiphop and sounds/computergen) the attack suite used performs very well compared to the overall set. In these cases the

  4. 366-AAA_audio

    NASA Image and Video Library

    1969-11-17

    Apollo 12 Public Affairs Officer (PAO) Mission Commentary, November 17, 1969. This is an hour of audio covering communications occurring between 64 hours, 38 minutes into the mission, through 79 hours, 2 minutes which was on November 17, 1969, from 0300-17:09 CST. Transcript of attached audio is available at http://www.jsc.nasa.gov/history/mission_trans/AS12_PAO.PDF, on pages 207-224 of the 979-page document.

  5. Detecting Image Splicing Using Merged Features in Chroma Space

    PubMed Central

    Liu, Guangjie; Dai, Yuewei

    2014-01-01

    Image splicing is an image editing method to copy a part of an image and paste it onto another image, and it is commonly followed by postprocessing such as local/global blurring, compression, and resizing. To detect this kind of forgery, the image rich models, a feature set successfully used in the steganalysis is evaluated on the splicing image dataset at first, and the dominant submodel is selected as the first kind of feature. The selected feature and the DCT Markov features are used together to detect splicing forgery in the chroma channel, which is convinced effective in splicing detection. The experimental results indicate that the proposed method can detect splicing forgeries with lower error rate compared to the previous literature. PMID:24574877

  6. Detecting image splicing using merged features in chroma space.

    PubMed

    Xu, Bo; Liu, Guangjie; Dai, Yuewei

    2014-01-01

    Image splicing is an image editing method to copy a part of an image and paste it onto another image, and it is commonly followed by postprocessing such as local/global blurring, compression, and resizing. To detect this kind of forgery, the image rich models, a feature set successfully used in the steganalysis is evaluated on the splicing image dataset at first, and the dominant submodel is selected as the first kind of feature. The selected feature and the DCT Markov features are used together to detect splicing forgery in the chroma channel, which is convinced effective in splicing detection. The experimental results indicate that the proposed method can detect splicing forgeries with lower error rate compared to the previous literature.

  7. Trajectory analysis via a geometric feature space approach

    DOE PAGES

    Rintoul, Mark D.; Wilson, Andrew T.

    2015-10-05

    This study aimed to organize a body of trajectories in order to identify, search for and classify both common and uncommon behaviors among objects such as aircraft and ships. Existing comparison functions such as the Fréchet distance are computationally expensive and yield counterintuitive results in some cases. We propose an approach using feature vectors whose components represent succinctly the salient information in trajectories. These features incorporate basic information such as the total distance traveled and the distance between start/stop points as well as geometric features related to the properties of the convex hull, trajectory curvature and general distance geometry. Additionally,more » these features can generally be mapped easily to behaviors of interest to humans who are searching large databases. Most of these geometric features are invariant under rigid transformation. Furthermore, we demonstrate the use of different subsets of these features to identify trajectories similar to an exemplar, cluster a database of several hundred thousand trajectories and identify outliers.« less

  8. Modified Dendrogram of High-dimensional Feature Space for Transfer Function Design

    PubMed Central

    Wang, Lei; Zhao, Xin; Kaufman, Arie

    2010-01-01

    We introduce a modified dendrogram (MD) (with sub-trees to represent the feature space clusters) and display it in continuous space for multi-dimensional transfer function (TF) design and modification. Such a TF for direct volume rendering often employs a multi-dimensional feature space. In an n-dimensional (nD) feature space, each voxel is described using n attributes and represented by a vector of n values. The MD reveals the hierarchical structure information of the high-dimensional feature space clusters. Using the MD user interface (UI), the user can design and modify the TF in 2D in an intuitive and informative manner instead of designing it directly in multi-dimensional space where it is complicated and harder to understand the relationship of the feature space vectors. In addition, we provide the capability to interactively change the granularity of the MD. The coarse-grained MD shows primarily the global information of the feature space while the fine-grained MD reveals the finer details, and the separation ability of the high-dimensional feature space is completely preserved in the finest granularity. With the so called multi-grained method, the user can efficiently create a TF using the coarse-grained MD, then fine tune it with the finer-grained MDs to improve the quality of the volume rendering. Furthermore, we propose a fast interactive hierarchical clustering (FIHC) algorithm for accelerating the MD computation and supporting the interactive multi-grained TF design. In the FIHC, the finest-grained MD is established by linking the feature space vectors, then the feature space vectors being the leaves of this tree are clustered using a hierarchical leaf clustering (HLC) algorithm forming a leaf vector hierarchical tree (LVHT). The granularity of the MD can be changed by setting the precision of the LVHT. Our method is independent on the type of the attributes and supports arbitrary-dimension feature space. PMID:26279612

  9. Limitations in 4-Year-Old Children's Sensitivity to the Spacing among Facial Features

    ERIC Educational Resources Information Center

    Mondloch, Catherine J.; Thomson, Kendra

    2008-01-01

    Four-year-olds' sensitivity to differences among faces in the spacing of features was tested under 4 task conditions: judging distinctiveness when the external contour was visible and when it was occluded, simultaneous match-to-sample, and recognizing the face of a friend. In each task, the foil differed only in the spacing of features, and…

  10. Audio Design: Creating Multi-sensory Images for the Mind.

    ERIC Educational Resources Information Center

    Ferrington, Gary

    1994-01-01

    Explores the concept of "theater of the mind" and discusses design factors in creating audio works that effectively stimulate mental pictures, including: narrative format in audio scripting; qualities of voice; use of concrete language; music; noise versus silence; and the creation of the illusion of space using monaural, stereophonic,…

  11. Audio Design: Creating Multi-sensory Images for the Mind.

    ERIC Educational Resources Information Center

    Ferrington, Gary

    1994-01-01

    Explores the concept of "theater of the mind" and discusses design factors in creating audio works that effectively stimulate mental pictures, including: narrative format in audio scripting; qualities of voice; use of concrete language; music; noise versus silence; and the creation of the illusion of space using monaural, stereophonic,…

  12. Space-based RF signal classification using adaptive wavelet features

    SciTech Connect

    Caffrey, M.; Briles, S.

    1995-04-01

    RF signals are dispersed in frequency as they propagate through the ionosphere. For wide-band signals, this results in nonlinearly- chirped-frequency, transient signals in the VHF portion of the spectrum. This ionospheric dispersion provide a means of discriminating wide-band transients from other signals (e.g., continuous-wave carriers, burst communications, chirped-radar signals, etc.). The transient nature of these dispersed signals makes them candidates for wavelet feature selection. Rather than choosing a wavelet ad hoc, we adaptively compute an optimal mother wavelet via a neural network. Gaussian weighted, linear frequency modulate (GLFM) wavelets are linearly combined by the network to generate our application specific mother wavelet, which is optimized for its capacity to select features that discriminate between the dispersed signals and clutter (e.g., multiple continuous-wave carriers), not for its ability to represent the dispersed signal. The resulting mother wavelet is then used to extract features for a neutral network classifier. The performance of the adaptive wavelet classifier is the compared to an FFT based neural network classifier.

  13. Audio-Tutorial Instruction; An Expanded Approach.

    ERIC Educational Resources Information Center

    Herrick, Merlyn C.

    The University of Missouri-Columbia School of Medicine is developing an audio-tutorial system with several unique features. A Didactor, a device which provides most of the capabilities of computer-assisted instruction but at a fraction of the cost, is the center of the system. The Didactor is combined with tape recordings and slides to present a…

  14. Reverse engineering the face space: Discovering the critical features for face identification.

    PubMed

    Abudarham, Naphtali; Yovel, Galit

    2016-01-01

    How do we identify people? What are the critical facial features that define an identity and determine whether two faces belong to the same person or different people? To answer these questions, we applied the face space framework, according to which faces are represented as points in a multidimensional feature space, such that face space distances are correlated with perceptual similarities between faces. In particular, we developed a novel method that allowed us to reveal the critical dimensions (i.e., critical features) of the face space. To that end, we constructed a concrete face space, which included 20 facial features of natural face images, and asked human observers to evaluate feature values (e.g., how thick are the lips). Next, we systematically and quantitatively changed facial features, and measured the perceptual effects of these manipulations. We found that critical features were those for which participants have high perceptual sensitivity (PS) for detecting differences across identities (e.g., which of two faces has thicker lips). Furthermore, these high PS features vary minimally across different views of the same identity, suggesting high PS features support face recognition across different images of the same face. The methods described here set an infrastructure for discovering the critical features of other face categories not studied here (e.g., Asians, familiar) as well as other aspects of face processing, such as attractiveness or trait inferences.

  15. Agency Video, Audio and Imagery Library

    NASA Technical Reports Server (NTRS)

    Grubbs, Rodney

    2015-01-01

    The purpose of this presentation was to inform the ISS International Partners of the new NASA Agency Video, Audio and Imagery Library (AVAIL) website. AVAIL is a new resource for the public to search for and download NASA-related imagery, and is not intended to replace the current process by which the International Partners receive their Space Station imagery products.

  16. Layered indexing of home video based on audio signals

    NASA Astrophysics Data System (ADS)

    Ogawa, Tomomi; Aizawa, Kiyoharu

    2003-12-01

    In this paper, we propose a home video indexing using an audio information to detect an event both a rules-based method and a GMM-based method. Although exclusive audio segmentation and classification was usually used, various sounds overlap in practice, in which case an audio in which various sound overlapped is expressed by a labeling layered index. With the rules-based method, low-level audio features are used to determine indexes, which are classied such as speech, silence, music, and EVN(Environment Noise). The GMM-based method which uses the same features as the rule based method also classifies an audio into the four classes. Smoothing is applied in order to determine the index. We show experiments in a few home video data.

  17. Enhancing Navigation Skills through Audio Gaming.

    PubMed

    Sánchez, Jaime; Sáenz, Mauricio; Pascual-Leone, Alvaro; Merabet, Lotfi

    2010-01-01

    We present the design, development and initial cognitive evaluation of an Audio-based Environment Simulator (AbES). This software allows a blind user to navigate through a virtual representation of a real space for the purposes of training orientation and mobility skills. Our findings indicate that users feel satisfied and self-confident when interacting with the audio-based interface, and the embedded sounds allow them to correctly orient themselves and navigate within the virtual world. Furthermore, users are able to transfer spatial information acquired through virtual interactions into real world navigation and problem solving tasks.

  18. Enhancing Navigation Skills through Audio Gaming

    PubMed Central

    Sánchez, Jaime; Sáenz, Mauricio; Pascual-Leone, Alvaro; Merabet, Lotfi

    2014-01-01

    We present the design, development and initial cognitive evaluation of an Audio-based Environment Simulator (AbES). This software allows a blind user to navigate through a virtual representation of a real space for the purposes of training orientation and mobility skills. Our findings indicate that users feel satisfied and self-confident when interacting with the audio-based interface, and the embedded sounds allow them to correctly orient themselves and navigate within the virtual world. Furthermore, users are able to transfer spatial information acquired through virtual interactions into real world navigation and problem solving tasks. PMID:25505796

  19. 3D Audio System

    NASA Technical Reports Server (NTRS)

    1992-01-01

    Ames Research Center research into virtual reality led to the development of the Convolvotron, a high speed digital audio processing system that delivers three-dimensional sound over headphones. It consists of a two-card set designed for use with a personal computer. The Convolvotron's primary application is presentation of 3D audio signals over headphones. Four independent sound sources are filtered with large time-varying filters that compensate for motion. The perceived location of the sound remains constant. Possible applications are in air traffic control towers or airplane cockpits, hearing and perception research and virtual reality development.

  20. The method of narrow-band audio classification based on universal noise background model

    NASA Astrophysics Data System (ADS)

    Rui, Rui; Bao, Chang-chun

    2013-03-01

    Audio classification is the basis of content-based audio analysis and retrieval. The conventional classification methods mainly depend on feature extraction of audio clip, which certainly increase the time requirement for classification. An approach for classifying the narrow-band audio stream based on feature extraction of audio frame-level is presented in this paper. The audio signals are divided into speech, instrumental music, song with accompaniment and noise using the Gaussian mixture model (GMM). In order to satisfy the demand of actual environment changing, a universal noise background model (UNBM) for white noise, street noise, factory noise and car interior noise is built. In addition, three feature schemes are considered to optimize feature selection. The experimental results show that the proposed algorithm achieves a high accuracy for audio classification, especially under each noise background we used and keep the classification time less than one second.

  1. Towards semantically sensitive text clustering: a feature space modeling technology based on dimension extension.

    PubMed

    Liu, Yuanchao; Liu, Ming; Wang, Xin

    2015-01-01

    The objective of text clustering is to divide document collections into clusters based on the similarity between documents. In this paper, an extension-based feature modeling approach towards semantically sensitive text clustering is proposed along with the corresponding feature space construction and similarity computation method. By combining the similarity in traditional feature space and that in extension space, the adverse effects of the complexity and diversity of natural language can be addressed and clustering semantic sensitivity can be improved correspondingly. The generated clusters can be organized using different granularities. The experimental evaluations on well-known clustering algorithms and datasets have verified the effectiveness of our approach.

  2. Towards Semantically Sensitive Text Clustering: A Feature Space Modeling Technology Based on Dimension Extension

    PubMed Central

    Liu, Yuanchao; Liu, Ming; Wang, Xin

    2015-01-01

    The objective of text clustering is to divide document collections into clusters based on the similarity between documents. In this paper, an extension-based feature modeling approach towards semantically sensitive text clustering is proposed along with the corresponding feature space construction and similarity computation method. By combining the similarity in traditional feature space and that in extension space, the adverse effects of the complexity and diversity of natural language can be addressed and clustering semantic sensitivity can be improved correspondingly. The generated clusters can be organized using different granularities. The experimental evaluations on well-known clustering algorithms and datasets have verified the effectiveness of our approach. PMID:25794172

  3. Digital Multicasting of Multiple Audio Streams

    NASA Technical Reports Server (NTRS)

    Macha, Mitchell; Bullock, John

    2007-01-01

    at the MCC. In the other access-control provision, the program verifies that the user is authorized to have access to the audio streams. Once both access-control checks are completed, the audio software presents a graphical display that includes audiostream-selection buttons and volume-control sliders. The user can select all or any subset of the available audio streams and can adjust the volume of each stream independently of that of the other streams. The audio-player program spawns a "read" process for the selected stream(s). The spawned process sends, to the router(s), a "multicast-join" request for the selected streams. The router(s) responds to the request by sending the encrypted multicast packets to the spawned process. The spawned process receives the encrypted multicast packets and sends a decryption packet to audio-driver software. As the volume or muting features are changed by the user, interrupts are sent to the spawned process to change the corresponding attributes sent to the audio-driver software. The total latency of this system - that is, the total time from the origination of the audio signals to generation of sound at a listener s computer - lies between four and six seconds.

  4. Audio Feedback -- Better Feedback?

    ERIC Educational Resources Information Center

    Voelkel, Susanne; Mello, Luciane V.

    2014-01-01

    National Student Survey (NSS) results show that many students are dissatisfied with the amount and quality of feedback they get for their work. This study reports on two case studies in which we tried to address these issues by introducing audio feedback to one undergraduate (UG) and one postgraduate (PG) class, respectively. In case study one…

  5. COMINT Audio Interface

    DTIC Science & Technology

    1999-07-01

    application in the military linguist’s work environment . Demonstrations conducted under this effort concluded that 3D audio localization techniques on their...own have not been developed to the point where they achieve the fidelity necessary for the military work environment . Recommended areas for additional

  6. Audio Feedback -- Better Feedback?

    ERIC Educational Resources Information Center

    Voelkel, Susanne; Mello, Luciane V.

    2014-01-01

    National Student Survey (NSS) results show that many students are dissatisfied with the amount and quality of feedback they get for their work. This study reports on two case studies in which we tried to address these issues by introducing audio feedback to one undergraduate (UG) and one postgraduate (PG) class, respectively. In case study one…

  7. An analysis of possible advanced space strategies featuring the role of space resource utilization

    NASA Astrophysics Data System (ADS)

    Cordell, Bruce; Steinbronn, Otto

    A major weakness of space planning in the U.S.A. has been the lack of clearly defined, major space goals within a coherent, politically palatable, long-range national space strategy. Unresolved issues include the Space Station's role, the most profitable space exploration strategies, and space resource use. We present an analysis of these factors with special emphasis on space resource utilization. Our performance modeling reveals that lunar oxygen is useful on or near the Moon and—if lunar hydrogen is available—lunar oxygen is also economical in LEO. Use of volatile materials from Phobos/Deimos is preferred or attractive in LEO, low lunar orbit, and—if lunar hydrogen is unavailable—on the Moon. Thus it appears that resource synergisms between operations in the Mars system and in Earth-Moon space could become commercially important.

  8. ENERGY STAR Certified Audio Video

    EPA Pesticide Factsheets

    Certified models meet all ENERGY STAR requirements as listed in the Version 3.0 ENERGY STAR Program Requirements for Audio Video Equipment that are effective as of May 1, 2013. A detailed listing of key efficiency criteria are available at http://www.energystar.gov/index.cfm?c=audio_dvd.pr_crit_audio_dvd

  9. Neurophysiological Correlates of Featural and Spacing Processing for Face and Non-face Stimuli

    PubMed Central

    Negrini, Marcello; Brkić, Diandra; Pizzamiglio, Sara; Premoli, Isabella; Rivolta, Davide

    2017-01-01

    The peculiar ability of humans to recognize hundreds of faces at a glance has been attributed to face-specific perceptual mechanisms known as holistic processing. Holistic processing includes the ability to discriminate individual facial features (i.e., featural processing) and their spatial relationships (i.e., spacing processing). Here, we aimed to characterize the spatio-temporal dynamics of featural- and spacing-processing of faces and objects. Nineteen healthy volunteers completed a newly created perceptual discrimination task for faces and objects (i.e., the “University of East London Face Task”) while their brain activity was recorded with a high-density (128 electrodes) electroencephalogram. Our results showed that early event related potentials at around 100 ms post-stimulus onset (i.e., P100) are sensitive to both facial features and spacing between the features. Spacing and features discriminability for objects occurred at circa 200 ms post-stimulus onset (P200). These findings indicate the existence of neurophysiological correlates of spacing vs. features processing in both face and objects, and demonstrate faster brain processing for faces. PMID:28348535

  10. Representing the Meanings of Object and Action Words: The Featural and Unitary Semantic Space Hypothesis

    ERIC Educational Resources Information Center

    Vigliocco, Gabriella; Vinson, David P.; Lewis, William; Garrett, Merrill F.

    2004-01-01

    This paper presents the Featural and Unitary Semantic Space (FUSS) hypothesis of the meanings of object and action words. The hypothesis, implemented in a statistical model, is based on the following assumptions: First, it is assumed that the meanings of words are grounded in conceptual featural representations, some of which are organized…

  11. Audio Spectral Processing

    DTIC Science & Technology

    2010-05-01

    briefing on the center system and the Northeast Center was included. 15 Received a request from Shawn McCormick, NYS Attorney General’s Office Sex ...clarify a recorded conversation, stored in MP3 format, contained on a compact disc. The NLECTC-NE/LEAF utilized FBI supplied equipment to collect and...conversation, stored in MP3 format, contained on a compact disc. The test audio data to be enhanced was a two-party conversation between a male

  12. Logistic Localized Modeling of the Sample Space for Feature Selection and Classification.

    PubMed

    Armanfard, Narges; Reilly, James P; Komeili, Majid

    2017-03-21

    Conventional feature selection algorithms assign a single common feature set to all regions of the sample space. In contrast, this paper proposes a novel algorithm for localized feature selection for which each region of the sample space is characterized by its individual distinct feature subset that may vary in size and membership. This approach can therefore select an optimal feature subset that adapts to local variations of the sample space, and hence offer the potential for improved performance. Feature subsets are computed by choosing an optimal coordinate space so that, within a localized region, within-class distances and between-class distances are, respectively, minimized and maximized. Distances are measured using a logistic function metric within the corresponding region. This enables the optimization process to focus on a localized region within the sample space. A local classification approach is utilized for measuring the similarity of a new input data point to each class. The proposed logistic localized feature selection (lLFS) algorithm is invariant to the underlying probability distribution of the data; hence, it is appropriate when the data are distributed on a nonlinear or disjoint manifold. lLFS is efficiently formulated as a joint convex/increasing quasi-convex optimization problem with a unique global optimum point. The method is most applicable when the number of available training samples is small. The performance of the proposed localized method is successfully demonstrated on a large variety of data sets. We demonstrate that the number of features selected by the lLFS method saturates at the number of available discriminative features. In addition, we have shown that the Vapnik-Chervonenkis dimension of the localized classifier is finite. Both these factors suggest that the lLFS method is insensitive to the overfitting issue, relative to other methods.

  13. Overall Features of EAST Operation Space by Using Simple Core-SOL-Divertor Model

    NASA Astrophysics Data System (ADS)

    Hiwatari, R.; Hatayama, A.; Zhu, Sizheng; Takizuka, T.; Tomita, Y.

    2006-01-01

    A simple Core-SOL-Divertor (C-S-D) model has been developed to investigate qualitatively the overall features of the operational space for the integrated core and edge plasma. To construct the simple C-S-D model, a simple core plasma model of ITER physics guidelines and a two-point SOL-divertor model are used. The simple C-S-D model is applied to the study of the EAST operational space with lower hybrid current drive experiments under various kinds of trade-off for the basic plasma parameters. Effective methods for extending the operational space are also presented. From this study for the EAST operational space, it is evident that the C-S-D model is a useful tool for understanding qualitatively the overall features of the plasma operational space.

  14. Effect of Microgravity on Interface Structural Features in Space Crystal Growth

    NASA Astrophysics Data System (ADS)

    Chen, Wan-chun; W, Chen C.; Mai, Zhen-hong; Z, Mai H.; K, Kato

    1998-08-01

    The distribution of Ca2+ at boundary layer of α-LiIO3 crystal grown both in space and on earth was studied by the synchrotron x-ray fluorescence microanalysis. It was found that: (1) the Ca2+ concentration in the space-crystal is higher than that in the earth-crystal, (2) a sharp discontinuity of Ca2+ concentration is found at boundary layers of space-crystals alone. This finding provides an evidence of microgravity effect on interface structural features in space crystal growth.

  15. Structuring feature space: a non-parametric method for volumetric transfer function generation.

    PubMed

    Maciejewski, Ross; Woo, Insoo; Chen, Wei; Ebert, David S

    2009-01-01

    The use of multi-dimensional transfer functions for direct volume rendering has been shown to be an effective means of extracting materials and their boundaries for both scalar and multivariate data. The most common multi-dimensional transfer function consists of a two-dimensional (2D) histogram with axes representing a subset of the feature space (e.g., value vs. value gradient magnitude), with each entry in the 2D histogram being the number of voxels at a given feature space pair. Users then assign color and opacity to the voxel distributions within the given feature space through the use of interactive widgets (e.g., box, circular, triangular selection). Unfortunately, such tools lead users through a trial-and-error approach as they assess which data values within the feature space map to a given area of interest within the volumetric space. In this work, we propose the addition of non-parametric clustering within the transfer function feature space in order to extract patterns and guide transfer function generation. We apply a non-parametric kernel density estimation to group voxels of similar features within the 2D histogram. These groups are then binned and colored based on their estimated density, and the user may interactively grow and shrink the binned regions to explore feature boundaries and extract regions of interest. We also extend this scheme to temporal volumetric data in which time steps of 2D histograms are composited into a histogram volume. A three-dimensional (3D) density estimation is then applied, and users can explore regions within the feature space across time without adjusting the transfer function at each time step. Our work enables users to effectively explore the structures found within a feature space of the volume and provide a context in which the user can understand how these structures relate to their volumetric data. We provide tools for enhanced exploration and manipulation of the transfer function, and we show that the initial

  16. Teledesic Global Wireless Broadband Network: Space Infrastructure Architecture, Design Features and Technologies

    NASA Technical Reports Server (NTRS)

    Stuart, James R.

    1995-01-01

    The Teledesic satellites are a new class of small satellites which demonstrate the important commercial benefits of using technologies developed for other purposes by U.S. National Laboratories. The Teledesic satellite architecture, subsystem design features, and new technologies are described. The new Teledesic satellite manufacturing, integration, and test approaches which use modern high volume production techniques and result in surprisingly low space segment costs are discussed. The constellation control and management features and attendant software architecture features are addressed. After briefly discussing the economic and technological impact on the USA commercial space industries of the space communications revolution and such large constellation projects, the paper concludes with observations on the trend toward future system architectures using networked groups of much smaller satellites.

  17. High-Resolution Infrared Space Observatory Spectroscopy of the Unidentified 21 Micron Feature

    NASA Technical Reports Server (NTRS)

    Volk, Kevin; Kwok, Sun; Hrivnak, Bruce

    1999-01-01

    We present Infrared Space Observatory SWS06 mode observations of the 21 micron feature in eight sources, including a first "detection of the feature in IRAS Z02229+6208. The observed feature peak-to-continuum ratios range from 0.13 in IRAS Z02229+6208 to 1.30 in IRAS 07134+1005. The normalized spectra, obtained by the removal of the underlying continua and by scaling the features to the same peak flux value. show that all features have the same intrinsic profile and peak wavelength. There is no evidence for any discrete substructure due to molecular bands in the observed spectra, suggesting that the 21 micron feature is due to either a solid substance or a mixture of many similarly structured large molecules.

  18. High-Resolution Infrared Space Observatory Spectroscopy of the Unidentified 21 Micron Feature

    NASA Technical Reports Server (NTRS)

    Volk, Kevin; Kwok, Sun; Hrivnak, Bruce

    1999-01-01

    We present Infrared Space Observatory SWS06 mode observations of the 21 micron feature in eight sources, including a first "detection of the feature in IRAS Z02229+6208. The observed feature peak-to-continuum ratios range from 0.13 in IRAS Z02229+6208 to 1.30 in IRAS 07134+1005. The normalized spectra, obtained by the removal of the underlying continua and by scaling the features to the same peak flux value. show that all features have the same intrinsic profile and peak wavelength. There is no evidence for any discrete substructure due to molecular bands in the observed spectra, suggesting that the 21 micron feature is due to either a solid substance or a mixture of many similarly structured large molecules.

  19. A scale space feature based registration technique for fusion of satellite imagery

    NASA Technical Reports Server (NTRS)

    Raghavan, Srini; Cromp, Robert F.; Campbell, William C.

    1997-01-01

    Feature based registration is one of the most reliable methods to register multi-sensor images (both active and passive imagery) since features are often more reliable than intensity or radiometric values. The only situation where a feature based approach will fail is when the scene is completely homogenous or densely textural in which case a combination of feature and intensity based methods may yield better results. In this paper, we present some preliminary results of testing our scale space feature based registration technique, a modified version of feature based method developed earlier for classification of multi-sensor imagery. The proposed approach removes the sensitivity in parameter selection experienced in the earlier version as explained later.

  20. Audio distribution and Monitoring Circuit

    NASA Technical Reports Server (NTRS)

    Kirkland, J. M.

    1983-01-01

    Versatile circuit accepts and distributes TV audio signals. Three-meter audio distribution and monitoring circuit provides flexibility in monitoring, mixing, and distributing audio inputs and outputs at various signal and impedance levels. Program material is simultaneously monitored on three channels, or single-channel version built to monitor transmitted or received signal levels, drive speakers, interface to building communications, and drive long-line circuits.

  1. Robust face identification scheme: KL expansion of an invariant feature space

    NASA Astrophysics Data System (ADS)

    Akamatsu, Shigeru; Sasaki, Tsutomu; Fukamachi, Hideo; Suenaga, Yasuhito

    1992-02-01

    This paper proposes a new approach for extracting features from face images that offer robust face identification against image variations. We combine the K-L expansion technique with two new operations that transform the face pattern into an invariant feature space. The two operations are the affine transformation which yields a standard face view from the input face image, and its transformation into the Fourier spectrum domain, which develops the property of shift-invariance. Although the basic idea of applying the K-L expansion to extract features for face recognition originates from the eigenface approach proposed by Turk and Pentland our scheme offers superior performance due to the transformation into the invariant feature space. The performance of the two schemes for face identification against various imaging conditions is compared.

  2. Audio-Vision: Audio-Visual Interaction in Desktop Multimedia.

    ERIC Educational Resources Information Center

    Daniels, Lee

    Although sophisticated multimedia authoring applications are now available to amateur programmers, the use of audio in of these programs has been inadequate. Due to the lack of research in the use of audio in instruction, there are few resources to assist the multimedia producer in using sound effectively and efficiently. This paper addresses the…

  3. A systematic exploration of the micro-blog feature space for teens stress detection.

    PubMed

    Zhao, Liang; Li, Qi; Xue, Yuanyuan; Jia, Jia; Feng, Ling

    2016-01-01

    In the modern stressful society, growing teenagers experience severe stress from different aspects from school to friends, from self-cognition to inter-personal relationship, which negatively influences their smooth and healthy development. Being timely and accurately aware of teenagers psychological stress and providing effective measures to help immature teenagers to cope with stress are highly valuable to both teenagers and human society. Previous work demonstrates the feasibility to sense teenagers' stress from their tweeting contents and context on the open social media platform-micro-blog. However, a tweet is still too short for teens to express their stressful status in a comprehensive way. Considering the topic continuity from the tweeting content to the follow-up comments and responses between the teenager and his/her friends, we combine the content of comments and responses under the tweet to supplement the tweet content. Also, such friends' caring comments like "what happened?", "Don't worry!", "Cheer up!", etc. provide hints to teenager's stressful status. Hence, in this paper, we propose to systematically explore the micro-blog feature space, comprised of four kinds of features [tweeting content features (FW), posting features (FP), interaction features (FI), and comment-response features (FC) between teenagers and friends] for teenager' stress category and stress level detection. We extract and analyze these feature values and their impacts on teens stress detection. We evaluate the framework through a real user study of 36 high school students aged 17. Different classifiers are employed to detect potential stress categories and corresponding stress levels. Experimental results show that all the features in the feature space positively affect stress detection, and linguistic negative emotion, proportion of negative sentences, friends' caring comments and teen's reply rate play more significant roles than the rest features. Micro-blog platform provides

  4. Multidetector CT and histological features of benign mesenchymoma of the infratemporal space: a rare case report

    PubMed Central

    Kelkar, CA; Desai, RS; Kambadakone, A; Shetty, SJ

    2013-01-01

    Benign mesenchymoma is a soft tissue neoplasm composed of an admixture of two or more benign mesenchymal components in addition to fibrous tissue. A rare case of benign mesenchymoma of the infratemporal space in a 14-year-old boy is presented. In this case report we discuss the salient imaging and histopathological features of this rare entity. PMID:22282510

  5. Applications of electric propulsion in scientific space projects: Benefits, features, and implementation examples

    NASA Astrophysics Data System (ADS)

    Martynov, M. B.; Petukhov, V. G.

    2012-12-01

    Benefits and features of electric propulsion systems (EPSs) are analyzed in the context of interplanetary travel in the Solar System. A methodological approach to assessing the performance of EPSs is formulated and illustrated by the example of a space mission to Venus.

  6. The Lowdown on Audio Downloads

    ERIC Educational Resources Information Center

    Farrell, Beth

    2010-01-01

    First offered to public libraries in 2004, downloadable audiobooks have grown by leaps and bounds. According to the Audio Publishers Association, their sales today account for 21% of the spoken-word audio market. It hasn't been easy, however. WMA. DRM. MP3. AAC. File extensions small on letters but very big on consequences for librarians,…

  7. The Lowdown on Audio Downloads

    ERIC Educational Resources Information Center

    Farrell, Beth

    2010-01-01

    First offered to public libraries in 2004, downloadable audiobooks have grown by leaps and bounds. According to the Audio Publishers Association, their sales today account for 21% of the spoken-word audio market. It hasn't been easy, however. WMA. DRM. MP3. AAC. File extensions small on letters but very big on consequences for librarians,…

  8. Discriminative genre-independent audio-visual scene change detection

    NASA Astrophysics Data System (ADS)

    Wilson, Kevin W.; Divakaran, Ajay

    2009-01-01

    We present a technique for genre-independent scene-change detection using audio and video features in a discriminative support vector machine (SVM) framework. This work builds on our previous work by adding a video feature based on the MPEG-7 "scalable color" descriptor. Adding this feature improves our detection rate over all genres by 5% to 15% for a fixed false positive rate of 10%. We also find that the genres that benefit the most are those with which the previous audio-only was least effective.

  9. Digital audio and video broadcasting by satellite

    NASA Astrophysics Data System (ADS)

    Yoshino, Takehiko

    In parallel with the progress of the practical use of satellite broadcasting and Hi-Vision or high-definition television technologies, research activities are also in progress to replace the conventional analog broadcasting services with a digital version. What we call 'digitalization' is not a mere technical matter but an important subject which will help promote multichannel or multimedia applications and, accordingly, can change the old concept of mass media, such as television or radio. NHK Science and Technical Research Laboratories has promoted studies of digital bandwidth compression, transmission, and application techniques. The following topics are covered: the trend of digital broadcasting; features of Integrated Services Digital Broadcasting (ISDB); compression encoding and transmission; transmission bit rate in 12 GHz band; number of digital TV transmission channels; multichannel pulse code modulation (PCM) audio broadcasting system via communication satellite; digital Hi-Vision broadcasting; and development of digital audio broadcasting (DAB) for mobile reception in Japan.

  10. Audio-visual biofeedback for respiratory-gated radiotherapy: Impact of audio instruction and audio-visual biofeedback on respiratory-gated radiotherapy

    SciTech Connect

    George, Rohini; Chung, Theodore D.; Vedam, Sastry S.; Ramakrishnan, Viswanathan; Mohan, Radhe; Weiss, Elisabeth; Keall, Paul J. . E-mail: pjkeall@vcu.edu

    2006-07-01

    Purpose: Respiratory gating is a commercially available technology for reducing the deleterious effects of motion during imaging and treatment. The efficacy of gating is dependent on the reproducibility within and between respiratory cycles during imaging and treatment. The aim of this study was to determine whether audio-visual biofeedback can improve respiratory reproducibility by decreasing residual motion and therefore increasing the accuracy of gated radiotherapy. Methods and Materials: A total of 331 respiratory traces were collected from 24 lung cancer patients. The protocol consisted of five breathing training sessions spaced about a week apart. Within each session the patients initially breathed without any instruction (free breathing), with audio instructions and with audio-visual biofeedback. Residual motion was quantified by the standard deviation of the respiratory signal within the gating window. Results: Audio-visual biofeedback significantly reduced residual motion compared with free breathing and audio instruction. Displacement-based gating has lower residual motion than phase-based gating. Little reduction in residual motion was found for duty cycles less than 30%; for duty cycles above 50% there was a sharp increase in residual motion. Conclusions: The efficiency and reproducibility of gating can be improved by: incorporating audio-visual biofeedback, using a 30-50% duty cycle, gating during exhalation, and using displacement-based gating.

  11. Computerized audio processor

    NASA Astrophysics Data System (ADS)

    Weiss, M. R.; Aschkenasy, E.

    1983-05-01

    The Computerized Audio Processor (CAP) is a computer synthesized electronic filter that removes interference from received or recorded speech signals. The CAP automatically detects and attenuates impulse sounds and tones (e.g., ignition noise, switching transients, whistles, chirps, hum, buzzes, FSK telegraphy, etc). It also attenuates wideband random noise. All operations of the CAP are fully automatic. Input signals are processed in real time, with a maximum lag of 340 msec. The CAP implements three proven signal processing techniques. One of these (IMP) virtually eliminates most loud impulse noises. A second technique (DSS) automatically detects tones and attenuates them by up to 46 dB. The third technique (INTEL) provides up to 18 dB attenuation of wideband random noise.

  12. Metrological digital audio reconstruction

    DOEpatents

    Fadeyev; Vitaliy , Haber; Carl

    2004-02-19

    Audio information stored in the undulations of grooves in a medium such as a phonograph record may be reconstructed, with little or no contact, by measuring the groove shape using precision metrology methods coupled with digital image processing and numerical analysis. The effects of damage, wear, and contamination may be compensated, in many cases, through image processing and analysis methods. The speed and data handling capacity of available computing hardware make this approach practical. Two examples used a general purpose optical metrology system to study a 50 year old 78 r.p.m. phonograph record and a commercial confocal scanning probe to study a 1920's celluloid Edison cylinder. Comparisons are presented with stylus playback of the samples and with a digitally re-mastered version of an original magnetic recording. There is also a more extensive implementation of this approach, with dedicated hardware and software.

  13. Direct broadcast satellite-audio, portable and mobile reception tradeoffs

    NASA Technical Reports Server (NTRS)

    Golshan, Nasser

    1992-01-01

    This paper reports on the findings of a systems tradeoffs study on direct broadcast satellite-radio (DBS-R). Based on emerging advanced subband and transform audio coding systems, four ranges of bit rates: 16-32 kbps, 48-64 kbps, 96-128 kbps and 196-256 kbps are identified for DBS-R. The corresponding grades of audio quality will be subjectively comparable to AM broadcasting, monophonic FM, stereophonic FM, and CD quality audio, respectively. The satellite EIRP's needed for mobile DBS-R reception in suburban areas are sufficient for portable reception in most single family houses when allowance is made for the higher G/T of portable table-top receivers. As an example, the variation of the space segment cost as a function of frequency, audio quality, coverage capacity, and beam size is explored for a typical DBS-R system.

  14. The Influences of Landscape Features on Visitation of Hospital Green Spaces-A Choice Experiment Approach.

    PubMed

    Chang, Kaowen Grace; Chien, Hungju

    2017-07-05

    Studies have suggested that visiting and viewing landscaping at hospitals accelerates patient's recovery from surgery and help staff's recovery from mental fatigue. To plan and construct such landscapes, we need to unravel landscape features desirable to different groups so that the space can benefit a wide range of hospital users. Using discrete choice modeling, we developed experimental choice sets to investigate how landscape features influence the visitations of different users in a large regional hospital in Taiwan. The empirical survey provides quantitative estimates of the influence of each landscape feature on four user groups, including patients, caregivers, staff, and neighborhood residents. Our findings suggest that different types of features promote visits from specific user groups. Landscape features facilitating physical activities effectively encourage visits across user groups especially for caregivers and staff. Patients in this study specify a strong need for contact with nature. The nearby community favors the features designed for children's play and family activities. People across user groups value the features that provide a mitigated microclimate of comfort, such as a shelter. Study implications and limitations are also discussed. Our study provides information essential for creating a better healing environment in a hospital setting.

  15. Perception of facial features and face-to-face communications in space.

    PubMed

    Cohen, M M

    2000-09-01

    This paper examines how human face-to-face communications may be compromised by exposure to the microgravity environment of space. Although efficient face-to-face communications are acknowledged to be essential for astronauts to work safely and effectively, the space environment causes facial edema, and allows speakers and listeners to confront one another in multiple orientations that can potentially interfere with many important non-linguistic communication cues. In addition, high background noises in space shuttles and stations can mask auditory messages. Currently, the effects of spaceflight on the ability to identify facial features, to read lips, to integrate visual and auditory sensory information, and to interpret spoken messages in noisy environments while the speakers are at various orientations with respect to the listeners, are virtually unknown. Ground-based experiments conducted in our laboratory with computer-displayed facial images presented at various orientations have suggested how non-linguistic cues may be degraded by changes in the retinal orientations of facial images. The results of our ground-based studies illustrating impaired recognition of facial features in non-erect images are described, their implications for effective face-to-face communications among astronauts in space are discussed, and two experiments that can quantify the effects of microgravity on face-to-face communications in space are outlined.

  16. Medical X-ray Image Hierarchical Classification Using a Merging and Splitting Scheme in Feature Space.

    PubMed

    Fesharaki, Nooshin Jafari; Pourghassem, Hossein

    2013-07-01

    Due to the daily mass production and the widespread variation of medical X-ray images, it is necessary to classify these for searching and retrieving proposes, especially for content-based medical image retrieval systems. In this paper, a medical X-ray image hierarchical classification structure based on a novel merging and splitting scheme and using shape and texture features is proposed. In the first level of the proposed structure, to improve the classification performance, similar classes with regard to shape contents are grouped based on merging measures and shape features into the general overlapped classes. In the next levels of this structure, the overlapped classes split in smaller classes based on the classification performance of combination of shape and texture features or texture features only. Ultimately, in the last levels, this procedure is also continued forming all the classes, separately. Moreover, to optimize the feature vector in the proposed structure, we use orthogonal forward selection algorithm according to Mahalanobis class separability measure as a feature selection and reduction algorithm. In other words, according to the complexity and inter-class distance of each class, a sub-space of the feature space is selected in each level and then a supervised merging and splitting scheme is applied to form the hierarchical classification. The proposed structure is evaluated on a database consisting of 2158 medical X-ray images of 18 classes (IMAGECLEF 2005 database) and accuracy rate of 93.6% in the last level of the hierarchical structure for an 18-class classification problem is obtained.

  17. Medical X-ray Image Hierarchical Classification Using a Merging and Splitting Scheme in Feature Space

    PubMed Central

    Fesharaki, Nooshin Jafari; Pourghassem, Hossein

    2013-01-01

    Due to the daily mass production and the widespread variation of medical X-ray images, it is necessary to classify these for searching and retrieving proposes, especially for content-based medical image retrieval systems. In this paper, a medical X-ray image hierarchical classification structure based on a novel merging and splitting scheme and using shape and texture features is proposed. In the first level of the proposed structure, to improve the classification performance, similar classes with regard to shape contents are grouped based on merging measures and shape features into the general overlapped classes. In the next levels of this structure, the overlapped classes split in smaller classes based on the classification performance of combination of shape and texture features or texture features only. Ultimately, in the last levels, this procedure is also continued forming all the classes, separately. Moreover, to optimize the feature vector in the proposed structure, we use orthogonal forward selection algorithm according to Mahalanobis class separability measure as a feature selection and reduction algorithm. In other words, according to the complexity and inter-class distance of each class, a sub-space of the feature space is selected in each level and then a supervised merging and splitting scheme is applied to form the hierarchical classification. The proposed structure is evaluated on a database consisting of 2158 medical X-ray images of 18 classes (IMAGECLEF 2005 database) and accuracy rate of 93.6% in the last level of the hierarchical structure for an 18-class classification problem is obtained. PMID:24672763

  18. Study on identifying deciduous forest by the method of feature space transformation

    NASA Astrophysics Data System (ADS)

    Zhang, Xuexia; Wu, Pengfei

    2009-10-01

    The thematic remotely sensed information extraction is always one of puzzling nuts which the remote sensing science faces, so many remote sensing scientists devotes diligently to this domain research. The methods of thematic information extraction include two kinds of the visual interpretation and the computer interpretation, the developing direction of which is intellectualization and comprehensive modularization. The paper tries to develop the intelligent extraction method of feature space transformation for the deciduous forest thematic information extraction in Changping district of Beijing city. The whole Chinese-Brazil resources satellite images received in 2005 are used to extract the deciduous forest coverage area by feature space transformation method and linear spectral decomposing method, and the result from remote sensing is similar to woodland resource census data by Chinese forestry bureau in 2004.

  19. System requirements and design features of Space Station Remote Manipulator System mechanisms

    NASA Technical Reports Server (NTRS)

    Kumar, Rajnish; Hayes, Robert

    1991-01-01

    The Space Station Remote Manipulator System (SSRMS) is a long robotic arm for handling large objects/payloads on the International Space Station Freedom. The mechanical components of the SSRMS include seven joints, two latching end effectors (LEEs), and two boom assemblies. The joints and LEEs are complex aerospace mechanisms. The system requirements and design features of these mechanisms are presented. All seven joints of the SSRMS have identical functional performance. The two LEES are identical. This feature allows either end of the SSRMS to be used as tip or base. As compared to the end effector of the Shuttle Remote Manipulator System, the LEE has a latch and umbilical mechanism in addition to the snare and rigidize mechanisms. The latches increase the interface preload and allow large payloads (up to 116,000 Kg) to be handled. The umbilical connectors provide power, data, and video signal transfer capability to/from the SSRMS.

  20. Halftone information hiding technology based on phase feature of space filling curves

    NASA Astrophysics Data System (ADS)

    Hu, Jianhua; Cao, Peng; Dong, Zhihong; Cao, Xiaohe

    2017-08-01

    To solve the problems of the production of interference fringes (namely moiré in printing) and improve the image quality in printing process of halftone screening for information hiding, a halftone screening security technique based on the phase feature of space filling curves is studied in this paper. This method effectively solves the problem of moire and optimizes the quality of the screening, so that the images presented after screening have achieved good visual effect. The pseudo-random scrambling encryption of the plaintext information and the halftone screening technique based on the phase feature of the space filling curves are carried out when screening,which not only eliminates the common moire in the screening but also improves the image quality and the security of information.

  1. Audio Spectrogram Representations for Processing with Convolutional Neural Networks

    NASA Astrophysics Data System (ADS)

    Wyse, L.

    2017-05-01

    One of the decisions that arise when designing a neural network for any application is how the data should be represented in order to be presented to, and possibly generated by, a neural network. For audio, the choice is less obvious than it seems to be for visual images, and a variety of representations have been used for different applications including the raw digitized sample stream, hand-crafted features, machine discovered features, MFCCs and variants that include deltas, and a variety of spectral representations. This paper reviews some of these representations and issues that arise, focusing particularly on spectrograms for generating audio using neural networks for style transfer.

  2. Radiometric responsivity determination for Feature Identification and Location Experiment (FILE) flown on space shuttle mission

    NASA Technical Reports Server (NTRS)

    Wilson, R. G.; Davis, R. E.; Wright, R. E., Jr.; Sivertson, W. E., Jr.; Bullock, G. F.

    1986-01-01

    A procedure was developed to obtain the radiometric (radiance) responsivity of the Feature Identification and Local Experiment (FILE) instrument in preparation for its flight on Space Shuttle Mission 41-G (November 1984). This instrument was designed to obtain Earth feature radiance data in spectral bands centered at 0.65 and 0.85 microns, along with corroborative color and color-infrared photographs, and to collect data to evaluate a technique for in-orbit autonomous classification of the Earth's primary features. The calibration process incorporated both solar radiance measurements and radiative transfer model predictions in estimating expected radiance inputs to the FILE on the Shuttle. The measured data are compared with the model predictions, and the differences observed are discussed. Application of the calibration procedure to the FILE over an 18-month period indicated a constant responsivity characteristic. This report documents the calibration procedure and the associated radiometric measurements and predictions that were part of the instrument preparation for flight.

  3. A centralized audio presentation manager

    SciTech Connect

    Papp, A.L. III; Blattner, M.M.

    1994-05-16

    The centralized audio presentation manager addresses the problems which occur when multiple programs running simultaneously attempt to use the audio output of a computer system. Time dependence of sound means that certain auditory messages must be scheduled simultaneously, which can lead to perceptual problems due to psychoacoustic phenomena. Furthermore, the combination of speech and nonspeech audio is examined; each presents its own problems of perceptibility in an acoustic environment composed of multiple auditory streams. The centralized audio presentation manager receives abstract parameterized message requests from the currently running programs, and attempts to create and present a sonic representation in the most perceptible manner through the use of a theoretically and empirically designed rule set.

  4. The Audio-Tutorial System

    ERIC Educational Resources Information Center

    Postlethwait, S. N.

    1970-01-01

    Describes the audio-tutorial program in Botany at Purdue University. Advantages include adaptability to individual stduent needs, integration of laboratory activities and information giving, aid flexibility in use of media and means of presentation. (EB)

  5. Detection of Coronal Mass Ejections Using Multiple Features and Space-Time Continuity

    NASA Astrophysics Data System (ADS)

    Zhang, Ling; Yin, Jian-qin; Lin, Jia-ben; Feng, Zhi-quan; Zhou, Jin

    2017-07-01

    Coronal Mass Ejections (CMEs) release tremendous amounts of energy in the solar system, which has an impact on satellites, power facilities and wireless transmission. To effectively detect a CME in Large Angle Spectrometric Coronagraph (LASCO) C2 images, we propose a novel algorithm to locate the suspected CME regions, using the Extreme Learning Machine (ELM) method and taking into account the features of the grayscale and the texture. Furthermore, space-time continuity is used in the detection algorithm to exclude the false CME regions. The algorithm includes three steps: i) define the feature vector which contains textural and grayscale features of a running difference image; ii) design the detection algorithm based on the ELM method according to the feature vector; iii) improve the detection accuracy rate by using the decision rule of the space-time continuum. Experimental results show the efficiency and the superiority of the proposed algorithm in the detection of CMEs compared with other traditional methods. In addition, our algorithm is insensitive to most noise.

  6. The nature and origin of periodically spaced tectonic features on Mars

    NASA Technical Reports Server (NTRS)

    Watters, Thomas R.

    1991-01-01

    The final report on the nature and origin of periodically spaced tectonic features on Mars is presented. The focus of this investigation was to characterize and model the periodically spaced wrinkle ridges observed in ridged plains material on Mars. The investigation centered on the wrinkle ridges in ridged plains material on the Tharsis Plateau. Wrinkle ridges are interpreted to be structural in origin, resulting from buckling followed by reverse or thrust faulting of the ridged plains material. The study extended beyond Tharsis to other ridged plains units, particularly those in Hesperia Planum. As a corollary, an analysis of the spacing of the anticlinal ridges of the Yakima Fold Belt of the Columbia Plateau in the NW United States was undertaken.

  7. Non-RF wireless helmet-mounted display and two-way audio connectivity using covert free-space optical communications

    NASA Astrophysics Data System (ADS)

    Strauss, M.; Volfson, L.

    2011-06-01

    Providing the warfighter with Head or Helmet Mounted Displays (HMDs) while in tracked vehicles provides a means to visually maintain access to systems information while in a high vibration environment. The high vibration and unique environment of military tracked and turreted vehicles impact the ability to distinctly see certain information on an HMD, especially small font size or graphics and information that requires long fixation (staring), rather than a brief or peripheral glance. The military and commercial use of HMDs was compiled from market research, market trends, and user feedback. Lessons learned from previous military and commercial use of HMD products were derived to determine the feasibility of HMDs use in the high vibration and the unique environments of tracked vehicles. The results are summarized into factors that determine HMD features which must be specified for successful implementation.

  8. Audio Steganography with Embedded Text

    NASA Astrophysics Data System (ADS)

    Teck Jian, Chua; Chai Wen, Chuah; Rahman, Nurul Hidayah Binti Ab.; Hamid, Isredza Rahmi Binti A.

    2017-08-01

    Audio steganography is about hiding the secret message into the audio. It is a technique uses to secure the transmission of secret information or hide their existence. It also may provide confidentiality to secret message if the message is encrypted. To date most of the steganography software such as Mp3Stego and DeepSound use block cipher such as Advanced Encryption Standard or Data Encryption Standard to encrypt the secret message. It is a good practice for security. However, the encrypted message may become too long to embed in audio and cause distortion of cover audio if the secret message is too long. Hence, there is a need to encrypt the message with stream cipher before embedding the message into the audio. This is because stream cipher provides bit by bit encryption meanwhile block cipher provide a fixed length of bits encryption which result a longer output compare to stream cipher. Hence, an audio steganography with embedding text with Rivest Cipher 4 encryption cipher is design, develop and test in this project.

  9. Audio-Visual Temporal Recalibration Can be Constrained by Content Cues Regardless of Spatial Overlap

    PubMed Central

    Roseboom, Warrick; Kawabe, Takahiro; Nishida, Shin’Ya

    2013-01-01

    It has now been well established that the point of subjective synchrony for audio and visual events can be shifted following exposure to asynchronous audio-visual presentations, an effect often referred to as temporal recalibration. Recently it was further demonstrated that it is possible to concurrently maintain two such recalibrated estimates of audio-visual temporal synchrony. However, it remains unclear precisely what defines a given audio-visual pair such that it is possible to maintain a temporal relationship distinct from other pairs. It has been suggested that spatial separation of the different audio-visual pairs is necessary to achieve multiple distinct audio-visual synchrony estimates. Here we investigated if this is necessarily true. Specifically, we examined whether it is possible to obtain two distinct temporal recalibrations for stimuli that differed only in featural content. Using both complex (audio visual speech; see Experiment 1) and simple stimuli (high and low pitch audio matched with either vertically or horizontally oriented Gabors; see Experiment 2) we found concurrent, and opposite, recalibrations despite there being no spatial difference in presentation location at any point throughout the experiment. This result supports the notion that the content of an audio-visual pair alone can be used to constrain distinct audio-visual synchrony estimates regardless of spatial overlap. PMID:23658549

  10. Audio-Visual Temporal Recalibration Can be Constrained by Content Cues Regardless of Spatial Overlap.

    PubMed

    Roseboom, Warrick; Kawabe, Takahiro; Nishida, Shin'ya

    2013-01-01

    It has now been well established that the point of subjective synchrony for audio and visual events can be shifted following exposure to asynchronous audio-visual presentations, an effect often referred to as temporal recalibration. Recently it was further demonstrated that it is possible to concurrently maintain two such recalibrated estimates of audio-visual temporal synchrony. However, it remains unclear precisely what defines a given audio-visual pair such that it is possible to maintain a temporal relationship distinct from other pairs. It has been suggested that spatial separation of the different audio-visual pairs is necessary to achieve multiple distinct audio-visual synchrony estimates. Here we investigated if this is necessarily true. Specifically, we examined whether it is possible to obtain two distinct temporal recalibrations for stimuli that differed only in featural content. Using both complex (audio visual speech; see Experiment 1) and simple stimuli (high and low pitch audio matched with either vertically or horizontally oriented Gabors; see Experiment 2) we found concurrent, and opposite, recalibrations despite there being no spatial difference in presentation location at any point throughout the experiment. This result supports the notion that the content of an audio-visual pair alone can be used to constrain distinct audio-visual synchrony estimates regardless of spatial overlap.

  11. Prediction of protein structure classes using hybrid space of multi-profile Bayes and bi-gram probability feature spaces.

    PubMed

    Hayat, Maqsood; Tahir, Muhammad; Khan, Sher Afzal

    2014-04-07

    Proteins are the executants of biological functions in living organisms. Comprehension of protein structure is a challenging problem in the era of proteomics, computational biology, and bioinformatics because of its pivotal role in protein folding patterns. Owing to the large exploration of protein sequences in protein databanks and intricacy of protein structures, experimental and theoretical methods are insufficient for prediction of protein structure classes. Therefore, it is highly desirable to develop an accurate, reliable, and high throughput computational model to predict protein structure classes correctly from polygenetic sequences. In this regard, we propose a promising model employing hybrid descriptor space in conjunction with optimized evidence-theoretic K-nearest neighbor algorithm. Hybrid space is the composition of two descriptor spaces including Multi-profile Bayes and bi-gram probability. In order to enhance the generalization power of the classifier, we have selected high discriminative descriptors from the hybrid space using particle swarm optimization, a well-known evolutionary feature selection technique. Performance evaluation of the proposed model is performed using the jackknife test on three low similarity benchmark datasets including 25PDB, 1189, and 640. The success rates of the proposed model are 87.0%, 86.6%, and 88.4%, respectively on the three benchmark datasets. The comparative analysis exhibits that our proposed model has yielded promising results compared to the existing methods in the literature. In addition, our proposed prediction system might be helpful in future research particularly in cases where the major focus of research is on low similarity datasets.

  12. Nanoscale Analysis of Space-Weathering Features in Soils from Itokawa

    NASA Technical Reports Server (NTRS)

    Thompson, M. S.; Christoffersen, R.; Zega, T. J.; Keller, L. P.

    2014-01-01

    Space weathering alters the spectral properties of airless body surface materials by redden-ing and darkening their spectra and attenuating characteristic absorption bands, making it challenging to characterize them remotely [1,2]. It also causes a discrepency between laboratory analysis of meteorites and remotely sensed spectra from asteroids, making it difficult to associate meteorites with their parent bodies. The mechanisms driving space weathering include mi-crometeorite impacts and the interaction of surface materials with solar energetic ions, particularly the solar wind. These processes continuously alter the microchemical and structural characteristics of exposed grains on airless bodies. The change of these properties is caused predominantly by the vapor deposition of reduced Fe and FeS nanoparticles (npFe(sup 0) and npFeS respectively) onto the rims of surface grains [3]. Sample-based analysis of space weathering has tra-ditionally been limited to lunar soils and select asteroidal and lunar regolith breccias [3-5]. With the return of samples from the Hayabusa mission to asteroid Itoka-wa [6], for the first time we are able to compare space-weathering features on returned surface soils from a known asteroidal body. Analysis of these samples will contribute to a more comprehensive model for how space weathering varies across the inner solar system. Here we report detailed microchemical and microstructal analysis of surface grains from Itokawa.

  13. Do features of public open spaces vary according to neighbourhood socio-economic status?

    PubMed

    Crawford, David; Timperio, Anna; Giles-Corti, Billie; Ball, Kylie; Hume, Clare; Roberts, Rebecca; Andrianopoulos, Nick; Salmon, Jo

    2008-12-01

    This study examined the relations between neighbourhood socio-economic status and features of public open spaces (POS) hypothesised to influence children's physical activity. Data were from the first follow-up of the Children Living in Active Neighbourhoods (CLAN) Study, which involved 540 families of 5-6 and 10-12-year-old children in Melbourne, Australia. The Socio-Economic Index for Areas Index (SEIFA) of Relative Socio-economic Advantage/Disadvantage was used to assign a socioeconomic index score to each child's neighbourhood, based on postcode. Participant addresses were geocoded using a Geographic Information System. The Open Space 2002 spatial data set was used to identify all POS within an 800 m radius of each participant's home. The features of each of these POS (1497) were audited. Variability of POS features was examined across quintiles of neighbourhood SEIFA. Compared with POS in lower socioeconomic neighbourhoods, POS in the highest socioeconomic neighbourhoods had more amenities (e.g. picnic tables and drink fountains) and were more likely to have trees that provided shade, a water feature (e.g. pond, creek), walking and cycling paths, lighting, signage regarding dog access and signage restricting other activities. There were no differences across neighbourhoods in the number of playgrounds or the number of recreation facilities (e.g. number of sports catered for on courts and ovals, the presence of other facilities such as athletics tracks, skateboarding facility and swimming pool). This study suggests that POS in high socioeconomic neighbourhoods possess more features that are likely to promote physical activity amongst children.

  14. Assessing efficiency of spatial sampling using combined coverage analysis in geographical and feature spaces

    NASA Astrophysics Data System (ADS)

    Hengl, Tomislav

    2015-04-01

    Efficiency of spatial sampling largely determines success of model building. This is especially important for geostatistical mapping where an initial sampling plan should provide a good representation or coverage of both geographical (defined by the study area mask map) and feature space (defined by the multi-dimensional covariates). Otherwise the model will need to extrapolate and, hence, the overall uncertainty of the predictions will be high. In many cases, geostatisticians use point data sets which are produced using unknown or inconsistent sampling algorithms. Many point data sets in environmental sciences suffer from spatial clustering and systematic omission of feature space. But how to quantify these 'representation' problems and how to incorporate this knowledge into model building? The author has developed a generic function called 'spsample.prob' (Global Soil Information Facilities package for R) and which simultaneously determines (effective) inclusion probabilities as an average between the kernel density estimation (geographical spreading of points; analysed using the spatstat package in R) and MaxEnt analysis (feature space spreading of points; analysed using the MaxEnt software used primarily for species distribution modelling). The output 'iprob' map indicates whether the sampling plan has systematically missed some important locations and/or features, and can also be used as an input for geostatistical modelling e.g. as a weight map for geostatistical model fitting. The spsample.prob function can also be used in combination with the accessibility analysis (cost of field survey are usually function of distance from the road network, slope and land cover) to allow for simultaneous maximization of average inclusion probabilities and minimization of total survey costs. The author postulates that, by estimating effective inclusion probabilities using combined geographical and feature space analysis, and by comparing survey costs to representation

  15. EEMD Independent Extraction for Mixing Features of Rotating Machinery Reconstructed in Phase Space

    PubMed Central

    Ma, Zaichao; Wen, Guangrui; Jiang, Cheng

    2015-01-01

    Empirical Mode Decomposition (EMD), due to its adaptive decomposition property for the non-linear and non-stationary signals, has been widely used in vibration analyses for rotating machinery. However, EMD suffers from mode mixing, which is difficult to extract features independently. Although the improved EMD, well known as the ensemble EMD (EEMD), has been proposed, mode mixing is alleviated only to a certain degree. Moreover, EEMD needs to determine the amplitude of added noise. In this paper, we propose Phase Space Ensemble Empirical Mode Decomposition (PSEEMD) integrating Phase Space Reconstruction (PSR) and Manifold Learning (ML) for modifying EEMD. We also provide the principle and detailed procedure of PSEEMD, and the analyses on a simulation signal and an actual vibration signal derived from a rubbing rotor are performed. The results show that PSEEMD is more efficient and convenient than EEMD in extracting the mixing features from the investigated signal and in optimizing the amplitude of the necessary added noise. Additionally PSEEMD can extract the weak features interfered with a certain amount of noise. PMID:25871723

  16. A linear feature space for simultaneous learning of spatio-spectral filters in BCI.

    PubMed

    Farquhar, J

    2009-11-01

    It is shown how two of the most common types of feature mapping used for classification of single trial Electroencephalography (EEG), i.e. spatial and frequency filtering, can be equivalently performed as linear operations in the space of frequency-specific detector covariance tensors. Thus by first mapping the data to this space, a simple linear classifier can directly learn optimal spatial + frequency filters. Significantly, if the classifier's loss function is convex, learning these filters is a convex minimisation problem. It is also shown how to pre-process the data such that the resulting decision function is robust to the biases inherent in EEG data. Further, based upon ideas from Max Margin Matrix Factorisation, it is shown how the trace norm can be used to select solutions which have low rank. Low rank solutions are preferred as they reflect prior information about the types of EEG signals we expect to see, i.e. that the classifiable information is contained in only a few spatio/spectral pairs. They are also easier to interpret. This feature-space transformation is compared with the Common-Spatial-Patterns on simulated and real Imagined Movement Brain Computer Interface (BCI) data and shown to give state-of-the-art performance.

  17. Optimizing view/illumination geometry for terrestrial features using Space Shuttle and aerial polarimetry

    NASA Technical Reports Server (NTRS)

    Israel, Steven A.; Holly, Mark H.; Whitehead, Victor S.

    1992-01-01

    This paper describes to relationship of polarimetric observations from orbital and aerial platforms and the determination optimum sun-target-sensor geometry. Polarimetric observations were evaluated for feature discrimination. The Space Shuttle experiment was performed using two boresighted Hasselblad 70 mm cameras with identical settings with linear polarizing filters aligned orthogonally about the optic axis. The aerial experiment was performed using a single 35 mm Nikon FE2 and rotating the linear polarizing filter 90 deg to acquire both minimum and maximum photographs. Characteristic curves were created by covertype and waveband for both aerial and Space Shuttle imagery. Though significant differences existed between the two datasets, the observed polarimetric signatures were unique and separable.

  18. Specific features of space-time variations of ozone during the development of intensive tropical disturbances

    NASA Technical Reports Server (NTRS)

    Nerushev, Alexander F.; Vasiliev, Victor I.

    1994-01-01

    An analysis of specific features of space-time variations of ozone in the tropical areas which has been performed on the basis of processing of the results of special expedition studies in the Atlantic and Pacific in 1987-1990 and the data of observations at the stations of the world ozonometric network over the 25-year period. The existence of a cause-and-effect relation has been revealed between the processes determining tropical cyclone (TC) development, and specific features of variations of the total content of ozone (TCO) and the vertical distribution of ozone (VDO) in the regions of TC action. Characteristic features of day-to-day and daily variations of TCO during TC development have been found. On the periphery of a developing TC, 1-4 days before it reaches the stage of storm, TCO increases, on average, by 5-8 percent, and a substantial increase in the concentration of ozone occurs in the middle and upper troposphere. The most probable physical mechanisms relating the observed specific features of ozone variations to TC evolution have been suggested. A hypothesis of the possibility of using ozone as an indicator for early prediction of TC development has been substantiated.

  19. Visualizing in vivo brain neural structures using volume rendered feature spaces.

    PubMed

    Nakao, Megumi; Kurebayashi, Kosuke; Sugiura, Tadao; Sato, Tetsuo; Sawada, Kazuaki; Kawakami, Ryosuke; Nemoto, Tomomi; Minato, Kotaro; Matsuda, Tetsuya

    2014-10-01

    Dendrites of cortical neurons are widely spread across several layers of the cortex. Recently developed two-photon microscopy systems are capable of visualizing the morphology of neurons within deeper layers of the brain and generate large amounts of volumetric imaging data from living tissue. For visual exploration of the three-dimensional (3D) structure of dendrites and the connectivity among neurons in the brain, we propose a visualization software and interface for 3D images based on a new transfer function design using volume rendered feature spaces. This software enables the visualization of multidimensional descriptors of shape and texture extracted from imaging data to characterize tissue. It also allows the efficient analysis and visualization of large data sets. We apply and demonstrate the software to two-photon microscopy images of a living mouse brain. By applying the developed visualization software and algorithms to two-photon microscope images of the mouse brain, we identified a set of feature values that distinguish characteristic structures such as soma, dendrites and apical dendrites in mouse brain. Also, the visualization interface was compared to conventional 1D/2D transfer function system. We have developed a visualization tool and interface that can represent 3D feature values as textures and shapes. This visualization system allows the analysis and characterization of the higher-dimensional feature values of living tissues at the micron level and will contribute to new discoveries in basic biology and clinical medicine. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. [Feature extraction of motor imagery electroencephalography based on time-frequency-space domains].

    PubMed

    Wang, Yueru; Li, Xin; Li, Honghong; Shao, Chengcheng; Ying, Lijuan; Wu, Shuicai

    2014-10-01

    The purpose of using brain-computer interface (BCI) is to build a bridge between brain and computer for the disable persons, in order to help them to communicate with the outside world. Electroencephalography (EEG) has low signal to noise ratio (SNR), and there exist some problems in the traditional methods for the feature extraction of EEG, such as low classification accuracy, lack of spatial information and huge amounts of features. To solve these problems, we proposed a new method based on time domain, frequency domain and space domain. In this study, independent component analysis (ICA) and wavelet transform were used to extract the temporal, spectral and spatial features from the original EEG signals, and then the extracted features were classified with the method combined support vector machine (SVM) with genetic algorithm (GA). The proposed method displayed a better classification performance, and made the mean accuracy of the Graz datasets in the BCI Competitions of 2003 reach 96%. The classification results showed that the proposed method with the three domains could effectively overcome the drawbacks of the traditional methods based solely on time-frequency domain when the EEG signals were used to describe the characteristics of the brain electrical signals.

  1. New features of electron phase space holes observed by the THEMIS mission.

    PubMed

    Andersson, L; Ergun, R E; Tao, J; Roux, A; Lecontel, O; Angelopoulos, V; Bonnell, J; McFadden, J P; Larson, D E; Eriksson, S; Johansson, T; Cully, C M; Newman, D L; Newman, D N; Goldman, M V; Glassmeier, K-H; Baumjohann, W

    2009-06-05

    Observations of electron phase-space holes (EHs) in Earth's plasma sheet by the THEMIS satellites include the first detection of a magnetic perturbation (deltaB_{ parallel}) parallel to the ambient magnetic field (B0). EHs with a detectable deltaB_{ parallel} have several distinguishing features including large electric field amplitudes, a magnetic perturbation perpendicular to B0, high speeds ( approximately 0.3c) along B0, and sizes along B0 of tens of Debye lengths. These EHs have a significant center potential (Phi approximately k_{B}T_{e}/e), suggesting strongly nonlinear behavior nearby such as double layers or magnetic reconnection.

  2. Features of the solar array drive mechanism for the space telescope

    NASA Technical Reports Server (NTRS)

    Hostenkamp, R. G.

    1985-01-01

    The solar array drive mechanism for the Space Telescope embodies several features not customarily found on solar array drives. Power and signal transfer is achieved by means of a flexible wire harness for which the chosen solution, consisting of 168 standard wires, is described. The torque performance data of the harness over its temperature range are presented. The off load system which protects the bearings from the launch loads is released by a trigger made from Nitinol, the memory alloy. The benefits of memory alloy and the caveats for the design are briefly discussed. The design of the off load system is described and test experience is reported.

  3. Supervised pixel classification using a feature space derived from an artificial visual system

    NASA Technical Reports Server (NTRS)

    Baxter, Lisa C.; Coggins, James M.

    1991-01-01

    Image segmentation involves labelling pixels according to their membership in image regions. This requires the understanding of what a region is. Using supervised pixel classification, the paper investigates how groups of pixels labelled manually according to perceived image semantics map onto the feature space created by an Artificial Visual System. Multiscale structure of regions are investigated and it is shown that pixels form clusters based on their geometric roles in the image intensity function, not by image semantics. A tentative abstract definition of a 'region' is proposed based on this behavior.

  4. Audio-Visual Training in Children with Reading Disabilities

    ERIC Educational Resources Information Center

    Magnan, Annie; Ecalle, Jean

    2006-01-01

    This study tested the effectiveness of audio-visual training in the discrimination of the phonetic feature of voicing on the recognition of written words by young children deemed to at risk of dyslexia (experiment 1) as well as on dyslexic children's phonological skills (experiment 2). In addition, the third experiment studied the effectiveness of…

  5. Astronaut Garneau working with Audio Control System panel

    NASA Image and Video Library

    1996-06-05

    STS077-392-007 (19-29 May 1996) --- Inside the Spacehab Module onboard the Earth-orbiting Space Shuttle Endeavour, Canadian astronaut Marc Garneau, mission specialist, joins astronaut Curtis L. Brown, Jr., pilot, in checking out the audio control system for Spacehab. The two joined four other NASA astronauts for nine days of research and experimentation in Earth-orbit.

  6. Toward the development of a feature-space representation for a complex natural category domain.

    PubMed

    Nosofsky, Robert M; Sanders, Craig A; Meagher, Brian J; Douglas, Bruce J

    2017-04-07

    This article reports data sets aimed at the development of a detailed feature-space representation for a complex natural category domain, namely 30 common subtypes of the categories of igneous, metamorphic, and sedimentary rocks. We conducted web searches to develop a library of 12 tokens each of the 30 subtypes, for a total of 360 rock pictures. In one study, subjects provided ratings along a set of 18 hypothesized primary dimensions involving visual characteristics of the rocks. In other studies, subjects provided similarity judgments among pairs of the rock tokens. Analyses are reported to validate the regularity and information value of the dimension ratings. In addition, analyses are reported that derive psychological scaling solutions from the similarity-ratings data and that interrelate the derived dimensions of the scaling solutions with the directly rated dimensions of the rocks. The stimulus set and various forms of ratings data, as well as the psychological scaling solutions, are made available on an online website (https://osf.io/w64fv/) associated with the article. The study provides a fundamental data set that should be of value for a wide variety of research purposes, including: (1) probing the statistical and psychological structure of a complex natural category domain, (2) testing models of similarity judgment, and (3) developing a feature-space representation that can be used in combination with formal models of category learning to predict classification performance in this complex natural category domain.

  7. Planning Schools for Use of Audio-Visual Materials. No. 3: The Audio-Visual Materials Center.

    ERIC Educational Resources Information Center

    National Education Association, Washington, DC. Dept. of Audiovisual Instruction.

    This manual discusses the role, organizational patterns, expected services, and space and housing needs of the audio-visual instructional materials center. In considering the housing of basic functions, photographs, floor layouts, diagrams, and specifications of equipment are presented. An appendix includes a 77-item bibliography, a 7-page list of…

  8. Audio direct broadcast satellites

    NASA Technical Reports Server (NTRS)

    Miller, J. E.

    1983-01-01

    Satellite sound broadcasting is, as the name implies, the use of satellite techniques and technology to broadcast directly from space to low-cost, consumer-quality receivers the types of sound programs commonly received in the AM and FM broadcast bands. It would be a ubiquitous service available to the general public in the home, in the car, and out in the open.

  9. Space-, object-, and feature-based attention interact to organize visual scenes

    PubMed Central

    Behrmann, Marlene

    2014-01-01

    Biased-competition accounts of attentional processing propose that attention arises from distributed interactions within and among different types of perceptual representations (e.g., spatial, featural, and object-based). Although considerable research has examined the facilitation in processing afforded by attending selectively to spatial locations, or to features, or to objects, surprisingly little research has addressed a key prediction of the biased-competition account: that attending to any stimulus should give rise to simultaneous interactions across all the types of perceptual representations encompassed by that stimulus. Here we show that, when an object in a visual display is cued, space-, feature-, and object-based forms of attention interact to enhance processing of that object and to create a scene-wide pattern of attentional facilitation. These results provide evidence to support the biased-competition framework and suggest that attention might be thought of as a mechanism by which multiple, disparate bottom-up, and even top-down, visual perceptual representations are coordinated and preferentially enhanced. PMID:22006523

  10. Do features of public open spaces vary between urban and rural areas?

    PubMed

    Veitch, Jenny; Salmon, Jo; Ball, Kylie; Crawford, David; Timperio, Anna

    2013-02-01

    Parks are an important setting for physical activity and specific park features have been shown to be associated with park visitation and physical activity. Most park-based research has been conducted in urban settings with few studies examining rural parks. This study examined differences in features of parks in urban compared with rural areas. In 2009/10 a tool was developed to audit 433 urban and 195 rural parks located in disadvantaged areas of Victoria, Australia. Features assessed included: access; lighting/safety; aesthetics; amenities; paths; outdoor courts/ovals; informal play spaces; and playgrounds (number, diversity, age appropriateness and safety of play equipment). Rural parks scored higher for aesthetics compared with urban parks (5.08 vs 4.44). Urban parks scored higher for access (4.64 vs 3.89), lighting/safety (2.01 vs 1.76), and diversity of play equipment (7.37 vs 6.24), and were more likely to have paths suitable for walking/cycling (58.8% vs 40.9%) and play equipment for older children (68.2% vs 17.1%). Although the findings cannot be generalized to all urban and rural parks, the results may be used to inform advocacy for park development in rural areas to create parks that are more supportive of physical activity for children and adults. Copyright © 2012 Elsevier Inc. All rights reserved.

  11. Development of Sensitivity to Spacing Versus Feature Changes in Pictures of Houses: Evidence for Slow Development of a General Spacing Detection Mechanism?

    ERIC Educational Resources Information Center

    Robbins, Rachel A.; Shergill, Yaadwinder; Maurer, Daphne; Lewis, Terri L.

    2011-01-01

    Adults are expert at recognizing faces, in part because of exquisite sensitivity to the spacing of facial features. Children are poorer than adults at recognizing facial identity and less sensitive to spacing differences. Here we examined the specificity of the immaturity by comparing the ability of 8-year-olds, 14-year-olds, and adults to…

  12. Development of Sensitivity to Spacing Versus Feature Changes in Pictures of Houses: Evidence for Slow Development of a General Spacing Detection Mechanism?

    ERIC Educational Resources Information Center

    Robbins, Rachel A.; Shergill, Yaadwinder; Maurer, Daphne; Lewis, Terri L.

    2011-01-01

    Adults are expert at recognizing faces, in part because of exquisite sensitivity to the spacing of facial features. Children are poorer than adults at recognizing facial identity and less sensitive to spacing differences. Here we examined the specificity of the immaturity by comparing the ability of 8-year-olds, 14-year-olds, and adults to…

  13. Diffraction of SH-waves by topographic features in a layered transversely isotropic half-space

    NASA Astrophysics Data System (ADS)

    Ba, Zhenning; Liang, Jianwen; Zhang, Yanju

    2017-01-01

    The scattering of plane SH-waves by topographic features in a layered transversely isotropic (TI) half-space is investigated by using an indirect boundary element method (IBEM). Firstly, the anti-plane dynamic stiffness matrix of the layered TI half-space is established and the free fields are solved by using the direct stiffness method. Then, Green's functions are derived for uniformly distributed loads acting on an inclined line in a layered TI half-space and the scattered fields are constructed with the deduced Green's functions. Finally, the free fields are added to the scattered ones to obtain the global dynamic responses. The method is verified by comparing results with the published isotropic ones. Both the steady-state and transient dynamic responses are evaluated and discussed. Numerical results in the frequency domain show that surface motions for the TI media can be significantly different from those for the isotropic case, which are strongly dependent on the anisotropy property, incident angle and incident frequency. Results in the time domain show that the material anisotropy has important effects on the maximum duration and maximum amplitudes of the time histories.

  14. Using MPEG-7 audio descriptors for music querying

    NASA Astrophysics Data System (ADS)

    Gruhne, M.; Dittmar, C.

    2006-08-01

    Due to the growing amount of digital audio an increasing need to automatically categorize music and to create self-controlled and suitable playlists has been emerged. A few approaches to this task relying on low-level features have been published so far. Unfortunately the results utilizing those technologies are not sufficient yet. This paper gives an introduction how to enhance the results with regard to the perceptual similarity using different high-level descriptors and a powerful interaction between the algorithm and the user to consider his preferences. A successful interaction between server and client requires a powerful standardized query language. This paper describes the tools of the MPEG-7 Audio standard in detail and gives examples of already established query languages. Furthermore the requirements of a multimedia query language are identified and its application is exemplified by an automatic audio creation system using a query language.

  15. Audio signal recognition for speech, music, and environmental sounds

    NASA Astrophysics Data System (ADS)

    Ellis, Daniel P. W.

    2003-10-01

    Human listeners are very good at all kinds of sound detection and identification tasks, from understanding heavily accented speech to noticing a ringing phone underneath music playing at full blast. Efforts to duplicate these abilities on computer have been particularly intense in the area of speech recognition, and it is instructive to review which approaches have proved most powerful, and which major problems still remain. The features and models developed for speech have found applications in other audio recognition tasks, including musical signal analysis, and the problems of analyzing the general ``ambient'' audio that might be encountered by an auditorily endowed robot. This talk will briefly review statistical pattern recognition for audio signals, giving examples in several of these domains. Particular emphasis will be given to common aspects and lessons learned.

  16. Virtual environment display for a 3D audio room simulation

    NASA Technical Reports Server (NTRS)

    Chapin, William L.; Foster, Scott H.

    1992-01-01

    The development of a virtual environment simulation system integrating a 3D acoustic audio model with an immersive 3D visual scene is discussed. The system complements the acoustic model and is specified to: allow the listener to freely move about the space, a room of manipulable size, shape, and audio character, while interactively relocating the sound sources; reinforce the listener's feeling of telepresence in the acoustical environment with visual and proprioceptive sensations; enhance the audio with the graphic and interactive components, rather than overwhelm or reduce it; and serve as a research testbed and technology transfer demonstration. The hardware/software design of two demonstration systems, one installed and one portable, are discussed through the development of four iterative configurations.

  17. Virtual environment display for a 3D audio room simulation

    NASA Technical Reports Server (NTRS)

    Chapin, William L.; Foster, Scott H.

    1992-01-01

    The development of a virtual environment simulation system integrating a 3D acoustic audio model with an immersive 3D visual scene is discussed. The system complements the acoustic model and is specified to: allow the listener to freely move about the space, a room of manipulable size, shape, and audio character, while interactively relocating the sound sources; reinforce the listener's feeling of telepresence in the acoustical environment with visual and proprioceptive sensations; enhance the audio with the graphic and interactive components, rather than overwhelm or reduce it; and serve as a research testbed and technology transfer demonstration. The hardware/software design of two demonstration systems, one installed and one portable, are discussed through the development of four iterative configurations.

  18. Digital Audio Application to Short Wave Broadcasting

    NASA Technical Reports Server (NTRS)

    Chen, Edward Y.

    1997-01-01

    Digital audio is becoming prevalent not only in consumer electornics, but also in different broadcasting media. Terrestrial analog audio broadcasting in the AM and FM bands will be eventually be replaced by digital systems.

  19. Digital Audio Application to Short Wave Broadcasting

    NASA Technical Reports Server (NTRS)

    Chen, Edward Y.

    1997-01-01

    Digital audio is becoming prevalent not only in consumer electornics, but also in different broadcasting media. Terrestrial analog audio broadcasting in the AM and FM bands will be eventually be replaced by digital systems.

  20. Space weathering effects in Diviner Lunar Radiometer multispectral infrared measurements of the lunar Christiansen Feature: Characteristics and mitigation

    NASA Astrophysics Data System (ADS)

    Lucey, Paul G.; Greenhagen, Benjamin T.; Song, Eugenie; Arnold, Jessica A.; Lemelin, Myriam; Hanna, Kerri Donaldson; Bowles, Neil E.; Glotch, Timothy D.; Paige, David A.

    2017-02-01

    Multispectral infrared measurements by the Diviner Lunar Radiometer Experiment on the Lunar Renaissance Orbiter enable the characterization of the position of the Christiansen Feature, a thermal infrared spectral feature that laboratory work has shown is proportional to the bulk silica content of lunar surface materials. Diviner measurements show that the position of this feature is also influenced by the changes in optical and physical properties of the lunar surface with exposure to space, the process known as space weathering. Large rayed craters and lunar swirls show corresponding Christiansen Feature anomalies. The space weathering effect is likely due to differences in thermal gradients in the optical surface imposed by the space weathering control of albedo. However, inspected at high resolution, locations with extreme compositions and Christiansen Feature wavelength positions - silica-rich and olivine-rich areas - do not have extreme albedos, and fall off the albedo- Christiansen Feature wavelength position trend occupied by most of the Moon. These areas demonstrate that the Christiansen Feature wavelength position contains compositional information and is not solely dictated by albedo. An optical maturity parameter derived from near-IR measurements is used to partly correct Diviner data for space weathering influences.

  1. Radioactive Decay: Audio Data Collection

    ERIC Educational Resources Information Center

    Struthers, Allan

    2009-01-01

    Many phenomena generate interesting audible time series. This data can be collected and processed using audio software. The free software package "Audacity" is used to demonstrate the process by recording, processing, and extracting click times from an inexpensive radiation detector. The high quality of the data is demonstrated with a simple…

  2. The Audio-Visual Man.

    ERIC Educational Resources Information Center

    Babin, Pierre, Ed.

    A series of twelve essays discuss the use of audiovisuals in religious education. The essays are divided into three sections: one which draws on the ideas of Marshall McLuhan and other educators to explore the newest ideas about audiovisual language and faith, one that describes how to learn and use the new language of audio and visual images, and…

  3. Radioactive Decay: Audio Data Collection

    ERIC Educational Resources Information Center

    Struthers, Allan

    2009-01-01

    Many phenomena generate interesting audible time series. This data can be collected and processed using audio software. The free software package "Audacity" is used to demonstrate the process by recording, processing, and extracting click times from an inexpensive radiation detector. The high quality of the data is demonstrated with a simple…

  4. Engaging Students with Audio Feedback

    ERIC Educational Resources Information Center

    Cann, Alan

    2014-01-01

    Students express widespread dissatisfaction with academic feedback. Teaching staff perceive a frequent lack of student engagement with written feedback, much of which goes uncollected or unread. Published evidence shows that audio feedback is highly acceptable to students but is underused. This paper explores methods to produce and deliver audio…

  5. Audio-Visual Teaching Machines.

    ERIC Educational Resources Information Center

    Dorsett, Loyd G.

    An audiovisual teaching machine (AVTM) presents programed audio and visual material simultaneously to a student and accepts his response. If his response is correct, the machine proceeds with the lesson; if it is incorrect, the machine so indicates and permits another choice (linear) or automatically presents supplementary material (branching).…

  6. Audio-Visual Materials Catalog.

    ERIC Educational Resources Information Center

    Anderson (M.D.) Hospital and Tumor Inst., Houston, TX.

    This catalog lists 27 audiovisual programs produced by the Department of Medical Communications of the University of Texas M. D. Anderson Hospital and Tumor Institute for public distribution. Video tapes, 16 mm. motion pictures and slide/audio series are presented dealing mostly with cancer and related subjects. The programs are intended for…

  7. Audio/ Videoconferencing Packages: Low Cost

    ERIC Educational Resources Information Center

    Treblay, Remy; Fyvie, Barb; Koritko, Brenda

    2005-01-01

    A comparison was conducted of "Voxwire MeetingRoom" and "iVocalize" v4.1.0.3, both Web-conferencing products using voice-over-Internet protocol (VoIP) to provide unlimited, inexpensive, international audio communication, and high-quality Web-conferencing fostering collaborative learning. The study used the evaluation criteria used in earlier…

  8. Audio-Tutorial in Art History.

    ERIC Educational Resources Information Center

    Cohen, Kathleen

    An audio-tutorial approach for an art history course taught at San Jose State College in the fall semester 1973 is evaluated by the Project Director and by an Evaluation Consultant. The teaching method combines several techniques: a weekly film presentation, filmstrips and audio tapes to be used in individual audio-tutorial sessions, a study guide…

  9. Audio Frequency Analysis in Mobile Phones

    ERIC Educational Resources Information Center

    Aguilar, Horacio Munguía

    2016-01-01

    A new experiment using mobile phones is proposed in which its audio frequency response is analyzed using the audio port for inputting external signal and getting a measurable output. This experiment shows how the limited audio bandwidth used in mobile telephony is the main cause of the poor speech quality in this service. A brief discussion is…

  10. 36 CFR 1002.12 - Audio disturbances.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 36 Parks, Forests, and Public Property 3 2014-07-01 2014-07-01 false Audio disturbances. 1002.12... RECREATION § 1002.12 Audio disturbances. (a) The following are prohibited: (1) Operating motorized equipment or machinery such as an electric generating plant, motor vehicle, motorized toy, or an audio device...

  11. 36 CFR 1002.12 - Audio disturbances.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 36 Parks, Forests, and Public Property 3 2012-07-01 2012-07-01 false Audio disturbances. 1002.12... RECREATION § 1002.12 Audio disturbances. (a) The following are prohibited: (1) Operating motorized equipment or machinery such as an electric generating plant, motor vehicle, motorized toy, or an audio device...

  12. 36 CFR 2.12 - Audio disturbances.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 36 Parks, Forests, and Public Property 1 2014-07-01 2014-07-01 false Audio disturbances. 2.12... RESOURCE PROTECTION, PUBLIC USE AND RECREATION § 2.12 Audio disturbances. (a) The following are prohibited..., motorized toy, or an audio device, such as a radio, television set, tape deck or musical instrument, in a...

  13. 36 CFR 1002.12 - Audio disturbances.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 36 Parks, Forests, and Public Property 3 2011-07-01 2011-07-01 false Audio disturbances. 1002.12... RECREATION § 1002.12 Audio disturbances. (a) The following are prohibited: (1) Operating motorized equipment or machinery such as an electric generating plant, motor vehicle, motorized toy, or an audio device...

  14. 36 CFR 1002.12 - Audio disturbances.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 36 Parks, Forests, and Public Property 3 2010-07-01 2010-07-01 false Audio disturbances. 1002.12... RECREATION § 1002.12 Audio disturbances. (a) The following are prohibited: (1) Operating motorized equipment or machinery such as an electric generating plant, motor vehicle, motorized toy, or an audio device...

  15. 36 CFR 2.12 - Audio disturbances.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 36 Parks, Forests, and Public Property 1 2010-07-01 2010-07-01 false Audio disturbances. 2.12... RESOURCE PROTECTION, PUBLIC USE AND RECREATION § 2.12 Audio disturbances. (a) The following are prohibited..., motorized toy, or an audio device, such as a radio, television set, tape deck or musical instrument, in a...

  16. 36 CFR 2.12 - Audio disturbances.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 36 Parks, Forests, and Public Property 1 2012-07-01 2012-07-01 false Audio disturbances. 2.12... RESOURCE PROTECTION, PUBLIC USE AND RECREATION § 2.12 Audio disturbances. (a) The following are prohibited..., motorized toy, or an audio device, such as a radio, television set, tape deck or musical instrument, in a...

  17. Audio Frequency Analysis in Mobile Phones

    ERIC Educational Resources Information Center

    Aguilar, Horacio Munguía

    2016-01-01

    A new experiment using mobile phones is proposed in which its audio frequency response is analyzed using the audio port for inputting external signal and getting a measurable output. This experiment shows how the limited audio bandwidth used in mobile telephony is the main cause of the poor speech quality in this service. A brief discussion is…

  18. 36 CFR 2.12 - Audio disturbances.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 36 Parks, Forests, and Public Property 1 2011-07-01 2011-07-01 false Audio disturbances. 2.12... RESOURCE PROTECTION, PUBLIC USE AND RECREATION § 2.12 Audio disturbances. (a) The following are prohibited..., motorized toy, or an audio device, such as a radio, television set, tape deck or musical instrument, in a...

  19. 36 CFR 2.12 - Audio disturbances.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 36 Parks, Forests, and Public Property 1 2013-07-01 2013-07-01 false Audio disturbances. 2.12... RESOURCE PROTECTION, PUBLIC USE AND RECREATION § 2.12 Audio disturbances. (a) The following are prohibited..., motorized toy, or an audio device, such as a radio, television set, tape deck or musical instrument, in a...

  20. 50 CFR 27.72 - Audio equipment.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... 50 Wildlife and Fisheries 9 2012-10-01 2012-10-01 false Audio equipment. 27.72 Section 27.72 Wildlife and Fisheries UNITED STATES FISH AND WILDLIFE SERVICE, DEPARTMENT OF THE INTERIOR (CONTINUED) THE... Audio equipment. The operation or use of audio devices including radios, recording and playback devices...

  1. 50 CFR 27.72 - Audio equipment.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 50 Wildlife and Fisheries 8 2011-10-01 2011-10-01 false Audio equipment. 27.72 Section 27.72 Wildlife and Fisheries UNITED STATES FISH AND WILDLIFE SERVICE, DEPARTMENT OF THE INTERIOR (CONTINUED) THE... Audio equipment. The operation or use of audio devices including radios, recording and playback devices...

  2. Complete fold annotation of the human proteome using a novel structural feature space

    NASA Astrophysics Data System (ADS)

    Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong

    2017-04-01

    Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.

  3. Complete fold annotation of the human proteome using a novel structural feature space

    PubMed Central

    Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong

    2017-01-01

    Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families. PMID:28406174

  4. Complete fold annotation of the human proteome using a novel structural feature space.

    PubMed

    Middleton, Sarah A; Illuminati, Joseph; Kim, Junhyong

    2017-04-13

    Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this method by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.

  5. Complete fold annotation of the human proteome using a novel structural feature space

    DOE PAGES

    Middleton, Sarah A.; Illuminati, Joseph; Kim, Junhyong

    2017-04-13

    Recognition of protein structural fold is the starting point for many structure prediction tools and protein function inference. Fold prediction is computationally demanding and recognizing novel folds is difficult such that the majority of proteins have not been annotated for fold classification. Here we describe a new machine learning approach using a novel feature space that can be used for accurate recognition of all 1,221 currently known folds and inference of unknown novel folds. We show that our method achieves better than 94% accuracy even when many folds have only one training example. We demonstrate the utility of this methodmore » by predicting the folds of 34,330 human protein domains and showing that these predictions can yield useful insights into potential biological function, such as prediction of RNA-binding ability. Finally, our method can be applied to de novo fold prediction of entire proteomes and identify candidate novel fold families.« less

  6. Joint spatial-spectral feature space clustering for speech activity detection from ECoG signals.

    PubMed

    Kanas, Vasileios G; Mporas, Iosif; Benz, Heather L; Sgarbas, Kyriakos N; Bezerianos, Anastasios; Crone, Nathan E

    2014-04-01

    Brain-machine interfaces for speech restoration have been extensively studied for more than two decades. The success of such a system will depend in part on selecting the best brain recording sites and signal features corresponding to speech production. The purpose of this study was to detect speech activity automatically from electrocorticographic signals based on joint spatial-frequency clustering of the ECoG feature space. For this study, the ECoG signals were recorded while a subject performed two different syllable repetition tasks. We found that the optimal frequency resolution to detect speech activity from ECoG signals was 8 Hz, achieving 98.8% accuracy by employing support vector machines as a classifier. We also defined the cortical areas that held the most information about the discrimination of speech and nonspeech time intervals. Additionally, the results shed light on the distinct cortical areas associated with the two syllables repetition tasks and may contribute to the development of portable ECoG-based communication.

  7. Interactive Feature Space Explorer© for multi-modal magnetic resonance imaging.

    PubMed

    Özcan, Alpay; Türkbey, Barış; Choyke, Peter L; Akin, Oguz; Aras, Ömer; Mun, Seong K

    2015-07-01

    Wider information content of multi-modal biomedical imaging is advantageous for detection, diagnosis and prognosis of various pathologies. However, the necessity to evaluate a large number images might hinder these advantages and reduce the efficiency. Herein, a new computer aided approach based on the utilization of feature space (FS) with reduced reliance on multiple image evaluations is proposed for research and routine clinical use. The method introduces the physician experience into the discovery process of FS biomarkers for addressing biological complexity, e.g., disease heterogeneity. This, in turn, elucidates relevant biophysical information which would not be available when automated algorithms are utilized. Accordingly, the prototype platform was designed and built for interactively investigating the features and their corresponding anatomic loci in order to identify pathologic FS regions. While the platform might be potentially beneficial in decision support generally and specifically for evaluating outlier cases, it is also potentially suitable for accurate ground truth determination in FS for algorithm development. Initial assessments conducted on two different pathologies from two different institutions provided valuable biophysical perspective. Investigations of the prostate magnetic resonance imaging data resulted in locating a potential aggressiveness biomarker in prostate cancer. Preliminary findings on renal cell carcinoma imaging data demonstrated potential for characterization of disease subtypes in the FS. Copyright © 2015 Elsevier Inc. All rights reserved.

  8. Interactive Feature Space Explorer© for Multi–Modal Magnetic Resonance Imaging

    PubMed Central

    Türkbey, Barış; Choyke, Peter L.; Akin, Oguz; Aras, Ömer; Mun, Seong K.

    2015-01-01

    Wider information content of multi–modal biomedical imaging is advantageous for detection, diagnosis and prognosis of various pathologies. However, the necessity to evaluate a large number images might hinder these advantages and reduce the efficiency. Herein, a new computer aided approach based on the utilization of feature space (FS) with reduced reliance on multiple image evaluations is proposed for research and routine clinical use. The method introduces the physician experience into the discovery process of FS biomarkers for addressing biological complexity, e.g., disease heterogeneity. This, in turn, elucidates relevant biophysical information which would not be available when automated algorithms are utilized. Accordingly, the prototype platform was designed and built for interactively investigating the features and their corresponding anatomic loci in order to identify pathologic FS regions. While the platform might be potentially beneficial in decision support generally and specifically for evaluating outlier cases, it is also potentially suitable for accurate ground truth determination in FS for algorithm development. Initial assessments conducted on two different pathologies from two different institutions provided valuable biophysical perspective. Investigations of the prostate magnetic resonance imaging data resulted in locating a potential aggressiveness biomarker in prostate cancer. Preliminary findings on renal cell carcinoma imaging data demonstrated potential for characterization of disease subtypes in the FS. PMID:25868623

  9. From filters to features: scale-space analysis of edge and blur coding in human vision.

    PubMed

    Georgeson, Mark A; May, Keith A; Freeman, Tom C A; Hesse, Gillian S

    2007-10-19

    To make vision possible, the visual nervous system must represent the most informative features in the light pattern captured by the eye. Here we use Gaussian scale-space theory to derive a multiscale model for edge analysis and we test it in perceptual experiments. At all scales there are two stages of spatial filtering. An odd-symmetric, Gaussian first derivative filter provides the input to a Gaussian second derivative filter. Crucially, the output at each stage is half-wave rectified before feeding forward to the next. This creates nonlinear channels selectively responsive to one edge polarity while suppressing spurious or "phantom" edges. The two stages have properties analogous to simple and complex cells in the visual cortex. Edges are found as peaks in a scale-space response map that is the output of the second stage. The position and scale of the peak response identify the location and blur of the edge. The model predicts remarkably accurately our results on human perception of edge location and blur for a wide range of luminance profiles, including the surprising finding that blurred edges look sharper when their length is made shorter. The model enhances our understanding of early vision by integrating computational, physiological, and psychophysical approaches.

  10. [The new method monitoring crop water content based on NIR-Red spectrum feature space].

    PubMed

    Cheng, Xiao-juan; Xu, Xin-gang; Chen, Tian-en; Yang, Gui-jun; Li, Zhen-hai

    2014-06-01

    Moisture content is an important index of crop water stress condition, timely and effective monitoring of crop water content is of great significance for evaluating crop water deficit balance and guiding agriculture irrigation. The present paper was trying to build a new crop water index for winter wheat vegetation water content based on NIR-Red spectral space. Firstly, canopy spectrums of winter wheat with narrow-band were resampled according to relative spectral response function of HJ-CCD and ZY-3. Then, a new index (PWI) was set up to estimate vegetation water content of winter wheat by improveing PDI (perpendicular drought index) and PVI (perpendicular vegetation index) based on NIR-Red spectral feature space. The results showed that the relationship between PWI and VWC (vegetation water content) was stable based on simulation of wide-band multispectral data HJ-CCD and ZY-3 with R2 being 0.684 and 0.683, respectively. And then VWC was estimated by using PWI with the R2 and RMSE being 0.764 and 0.764, 3.837% and 3.840%, respectively. The results indicated that PWI has certain feasibility to estimate crop water content. At the same time, it provides a new method for monitoring crop water content using remote sensing data HJ-CCD and ZY-3.

  11. Can we discriminate between apnea and hypopnea using audio signals?

    PubMed

    Halevi, M; Dafna, E; Tarasiuk, A; Zigel, Y

    2016-08-01

    Obstructive sleep apnea (OSA) affects up to 14% of the population. OSA is characterized by recurrent apneas and hypopneas during sleep. The apnea-hypopnea index (AHI) is frequently used as a measure of OSA severity. In the current study, we explored the acoustic characteristics of hypopnea in order to distinguish it from apnea. We hypothesize that we can find audio-based features that can discriminate between apnea, hypopnea and normal breathing events. Whole night audio recordings were performed using a non-contact microphone on 44 subjects, simultaneously with the polysomnography study (PSG). Recordings were segmented into 2015 apnea, hypopnea, and normal breath events and were divided to design and validation groups. A classification system was built using a 3-class cubic-kernelled support vector machine (SVM) classifier. Its input is a 36-dimensional audio-based feature vector that was extracted from each event. Three-class accuracy rate using the hold-out method was 84.7%. A two-class model to separate apneic events (apneas and hypopneas) from normal breath exhibited accuracy rate of 94.7%. Here we show that it is possible to detect apneas or hypopneas from whole night audio signals. This might provide more insight about a patient's level of upper airway obstruction during sleep. This approach may be used for OSA severity screening and AHI estimation.

  12. Two-dimensional audio watermark for MPEG AAC audio

    NASA Astrophysics Data System (ADS)

    Tachibana, Ryuki

    2004-06-01

    Since digital music is often stored in a compressed file, it is desirable that an audio watermarking method in a content management system handles compressed files. Using an audio watermarking method that directly manipulates compressed files makes it unnecessary to decompress the files before embedding or detection, so more files can be processed per unit time. However, it is difficult to detect a watermark in a compressed file that has been compressed after the file was watermarked. This paper proposes an MPEG Advanced Audio Coding (AAC) bitstream watermarking method using a two-dimensional pseudo-random array. Detection is done by correlating the absolute values of the recovered MDCT coefficients and the pseudo-random array. Since the embedding algorithm uses the same pseudo-random values for two adjacent overlapping frames and the detection algorithm selects the better frame in the two by comparing detected watermark strengths, it is possible to detect a watermark from a compressed file that was compressed after the watermark was embedded in the original uncompressed file. Though the watermark is not detected as clearly in this case, the watermark can still be detected even when the watermark was embedded in a compressed file and the file was then decompressed, trimmed, and compressed again.

  13. Using Touch Screen Audio-CASI to Obtain Data on Sensitive Topics

    PubMed Central

    Cooley, Philip C.; Rogers, Susan M.; Turner, Charles F.; Al-Tayyib, Alia A.; Willis, Gordon; Ganapathi, Laxminarayana

    2011-01-01

    This paper describes a new interview data collection system that uses a laptop personal computer equipped with a touch-sensitive video monitor. The touch-screen-based audio computer-assisted self-interviewing system, or touch screen audio-CASI, enhances the ease of use of conventional audio CASI systems while simultaneously providing the privacy of self-administered questionnaires. We describe touch screen audio-CASI design features and operational characteristics. In addition, we present data from a recent clinic-based experiment indicating that the touch audio-CASI system is stable, robust, and suitable for administering relatively long and complex questionnaires on sensitive topics, including drug use and sexual behaviors associated with HIV and other sexually transmitted diseases. PMID:22081744

  14. Using Touch Screen Audio-CASI to Obtain Data on Sensitive Topics.

    PubMed

    Cooley, Philip C; Rogers, Susan M; Turner, Charles F; Al-Tayyib, Alia A; Willis, Gordon; Ganapathi, Laxminarayana

    2001-05-01

    This paper describes a new interview data collection system that uses a laptop personal computer equipped with a touch-sensitive video monitor. The touch-screen-based audio computer-assisted self-interviewing system, or touch screen audio-CASI, enhances the ease of use of conventional audio CASI systems while simultaneously providing the privacy of self-administered questionnaires. We describe touch screen audio-CASI design features and operational characteristics. In addition, we present data from a recent clinic-based experiment indicating that the touch audio-CASI system is stable, robust, and suitable for administering relatively long and complex questionnaires on sensitive topics, including drug use and sexual behaviors associated with HIV and other sexually transmitted diseases.

  15. Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion.

    PubMed

    Gebru, Israel; Ba, Sileye; Li, Xiaofei; Horaud, Radu

    2017-01-05

    Speaker diarization consists of assigning speech signals to people engaged in a dialogue. An audio-visual spatiotemporal diarization model is proposed. The model is well suited for challenging scenarios that consist of several participants engaged in multi-party interaction while they move around and turn their heads towards the other participants rather than facing the cameras and the microphones. Multiple-person visual tracking is combined with multiple speech-source localization in order to tackle the speech-to-person association problem. The latter is solved within a novel audio-visual fusion method on the following grounds: binaural spectral features are first extracted from a microphone pair, then a supervised audio-visual alignment technique maps these features onto an image, and finally a semisupervised clustering method assigns binaural spectral features to visible persons. The main advantage of this method over previous work is that it processes in a principled way speech signals uttered simultaneously by multiple persons. The diarization itself is cast into a latent-variable temporal graphical model that infers speaker identities and speech turns, based on the output of an audio-visual association process, executed at each time slice, and on the dynamics of the diarization variable itself. The proposed formulation yields an efficient exact inference procedure. A novel dataset, that contains audio-visual training data as well as a number of scenarios involving several participants engaged in formal and informal dialogue, is introduced. The proposed method is thoroughly tested and benchmarked with respect to several state-of-the art diarization algorithms.

  16. Audio-visual interactions in environment assessment.

    PubMed

    Preis, Anna; Kociński, Jędrzej; Hafke-Dys, Honorata; Wrzosek, Małgorzata

    2015-08-01

    The aim of the study was to examine how visual and audio information influences audio-visual environment assessment. Original audio-visual recordings were made at seven different places in the city of Poznań. Participants of the psychophysical experiments were asked to rate, on a numerical standardized scale, the degree of comfort they would feel if they were in such an environment. The assessments of audio-visual comfort were carried out in a laboratory in four different conditions: (a) audio samples only, (b) original audio-visual samples, (c) video samples only, and (d) mixed audio-visual samples. The general results of this experiment showed a significant difference between the investigated conditions, but not for all the investigated samples. There was a significant improvement in comfort assessment when visual information was added (in only three out of 7 cases), when conditions (a) and (b) were compared. On the other hand, the results show that the comfort assessment of audio-visual samples could be changed by manipulating the audio rather than the video part of the audio-visual sample. Finally, it seems, that people could differentiate audio-visual representations of a given place in the environment based rather of on the sound sources' compositions than on the sound level. Object identification is responsible for both landscape and soundscape grouping.

  17. How actions shape perception: learning action-outcome relations and predicting sensory outcomes promote audio-visual temporal binding.

    PubMed

    Desantis, Andrea; Haggard, Patrick

    2016-12-16

    To maintain a temporally-unified representation of audio and visual features of objects in our environment, the brain recalibrates audio-visual simultaneity. This process allows adjustment for both differences in time of transmission and time for processing of audio and visual signals. In four experiments, we show that the cognitive processes for controlling instrumental actions also have strong influence on audio-visual recalibration. Participants learned that right and left hand button-presses each produced a specific audio-visual stimulus. Following one action the audio preceded the visual stimulus, while for the other action audio lagged vision. In a subsequent test phase, left and right button-press generated either the same audio-visual stimulus as learned initially, or the pair associated with the other action. We observed recalibration of simultaneity only for previously-learned audio-visual outcomes. Thus, learning an action-outcome relation promotes temporal grouping of the audio and visual events within the outcome pair, contributing to the creation of a temporally unified multisensory object. This suggests that learning action-outcome relations and the prediction of perceptual outcomes can provide an integrative temporal structure for our experiences of external events.

  18. How actions shape perception: learning action-outcome relations and predicting sensory outcomes promote audio-visual temporal binding

    PubMed Central

    Desantis, Andrea; Haggard, Patrick

    2016-01-01

    To maintain a temporally-unified representation of audio and visual features of objects in our environment, the brain recalibrates audio-visual simultaneity. This process allows adjustment for both differences in time of transmission and time for processing of audio and visual signals. In four experiments, we show that the cognitive processes for controlling instrumental actions also have strong influence on audio-visual recalibration. Participants learned that right and left hand button-presses each produced a specific audio-visual stimulus. Following one action the audio preceded the visual stimulus, while for the other action audio lagged vision. In a subsequent test phase, left and right button-press generated either the same audio-visual stimulus as learned initially, or the pair associated with the other action. We observed recalibration of simultaneity only for previously-learned audio-visual outcomes. Thus, learning an action-outcome relation promotes temporal grouping of the audio and visual events within the outcome pair, contributing to the creation of a temporally unified multisensory object. This suggests that learning action-outcome relations and the prediction of perceptual outcomes can provide an integrative temporal structure for our experiences of external events. PMID:27982063

  19. Mining potential biomarkers associated with space flight in Caenorhabditis elegans experienced Shenzhou-8 mission with multiple feature selection techniques.

    PubMed

    Zhao, Lei; Gao, Ying; Mi, Dong; Sun, Yeqing

    To identify the potential biomarkers associated with space flight, a combined algorithm, which integrates the feature selection techniques, was used to deal with the microarray datasets of Caenorhabditis elegans obtained in the Shenzhou-8 mission. Compared with the ground control treatment, a total of 86 differentially expressed (DE) genes in responses to space synthetic environment or space radiation environment were identified by two filter methods. And then the top 30 ranking genes were selected by the random forest algorithm. Gene Ontology annotation and functional enrichment analyses showed that these genes were mainly associated with metabolism process. Furthermore, clustering analysis showed that 17 genes among these are positive, including 9 for space synthetic environment and 8 for space radiation environment only. These genes could be used as the biomarkers to reflect the space environment stresses. In addition, we also found that microgravity is the main stress factor to change the expression patterns of biomarkers for the short-duration spaceflight.

  20. [The new method monitoring agricultural drought based on SWIR-Red spectrum feature space].

    PubMed

    Feng, Hai-Xia; Qin, Qi-Ming; Li, Bin-Yong; Liu, Fang; Jiang, Hong-Bo; Dong, Heng; Wang, Jin-Liang; Liu, Ming-Chao; Zhang, Ning

    2011-11-01

    Drought was a chronic, natural disaster, and Remote sensing drought monitoring had become a potential research field. In the present, short-wave infrared and red bands which sensitive to moisture variation were selected to monitor farmland drought conditions by analyzing the spectral characteristics of vegetation and soil. The goal of this paper was to provide a new method of drought monitoring--normalized drought monitoring index (NPDI), based on new constructed spectrum feature space by the difference of SWIR and Red and the sum of SWIR and Red. Field surveyed soil moisture verified NPDI model, and the result showed that NDPI and MPDI model could effectively monitor agricultural drought, and that had high correlation with soil moisture. The R2 was 0.583 and 0.438 with soil water of 10 cm. The monitoring effect of NPDI model was better than the MPDL. This model was further improvement to PDI and MPDI, and it could monitor the drought condition of different vegetation coverage and whole growing season. It has high application potential and popularization value.

  1. Formation of regularly spaced networks as a general feature of actin bundle condensation by entropic forces

    NASA Astrophysics Data System (ADS)

    Huber, Florian; Strehle, Dan; Schnauß, Jörg; Käs, Josef

    2015-04-01

    Biopolymer networks contribute mechanical integrity as well as functional organization to living cells. One of their major constituents, the protein actin, is present in a large variety of different network architectures, ranging from extensive networks to densely packed bundles. The shape of the network is directly linked to its mechanical properties and essential physiological functions. However, a profound understanding of architecture-determining mechanisms and their physical constraints remains elusive. We use experimental bottom-up systems to study the formation of confined actin networks by entropic forces. Experiments based on molecular crowding as well as counterion condensation reveal a generic tendency of homogeneous filament solutions to aggregate into regular actin bundle networks connected by aster-like centers. The network architecture is found to critically rely on network formation history. Starting from identical biochemical compositions, we observe drastic changes in network architecture as a consequence of initially biased filament orientation or mixing-induced perturbations. Our experiments suggest that the tendency to form regularly spaced bundle networks is a rather general feature of isotropic, homogeneous filament solutions subject to uniform attractive interactions. Due to the fundamental nature of the considered interactions, we expect that the investigated type of network formation further implies severe physical constraints for cytoskeleton self-organization on the more complex level of living cells.

  2. NFL Films audio, video, and film production facilities

    NASA Astrophysics Data System (ADS)

    Berger, Russ; Schrag, Richard C.; Ridings, Jason J.

    2003-04-01

    The new NFL Films 200,000 sq. ft. headquarters is home for the critically acclaimed film production that preserves the NFL's visual legacy week-to-week during the football season, and is also the technical plant that processes and archives football footage from the earliest recorded media to the current network broadcasts. No other company in the country shoots more film than NFL Films, and the inclusion of cutting-edge video and audio formats demands that their technical spaces continually integrate the latest in the ever-changing world of technology. This facility houses a staggering array of acoustically sensitive spaces where music and sound are equal partners with the visual medium. Over 90,000 sq. ft. of sound critical technical space is comprised of an array of sound stages, music scoring stages, audio control rooms, music writing rooms, recording studios, mixing theaters, video production control rooms, editing suites, and a screening theater. Every production control space in the building is designed to monitor and produce multi channel surround sound audio. An overview of the architectural and acoustical design challenges encountered for each sophisticated listening, recording, viewing, editing, and sound critical environment will be discussed.

  3. Automatic Recognition of Solar Features for Developing Data Driven Prediction Models of Solar Activity and Space Weather

    DTIC Science & Technology

    2013-05-01

    Aschwanden, M. J. 2005, Physics of the Solar Corona . An Introduction with Problems and Solutions (2nd edition), ed. Aschwanden, M. J. Balasubramaniam, K...AFRL-OSR-VA-TR-2013-0020 Automatic Recognition of Solar Features for Developing Data Driven Prediction Models of Solar Activity...Automatic Recognition of Solar Features for Developing Data Driven Prediction Models of Solar Activity and Space Weather 5a. CONTRACT NUMBER FA9550-09

  4. Video genre categorization and representation using audio-visual information

    NASA Astrophysics Data System (ADS)

    Ionescu, Bogdan; Seyerlehner, Klaus; Rasche, Christoph; Vertan, Constantin; Lambert, Patrick

    2012-04-01

    We propose an audio-visual approach to video genre classification using content descriptors that exploit audio, color, temporal, and contour information. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At the temporal structure level, we consider action content in relation to human perception. Color perception is quantified using statistics of color distribution, elementary hues, color properties, and relationships between colors. Further, we compute statistics of contour geometry and relationships. The main contribution of our work lies in harnessing the descriptive power of the combination of these descriptors in genre classification. Validation was carried out on over 91 h of video footage encompassing 7 common video genres, yielding average precision and recall ratios of 87% to 100% and 77% to 100%, respectively, and an overall average correct classification of up to 97%. Also, experimental comparison as part of the MediaEval 2011 benchmarking campaign demonstrated the efficiency of the proposed audio-visual descriptors over other existing approaches. Finally, we discuss a 3-D video browsing platform that displays movies using feature-based coordinates and thus regroups them according to genre.

  5. SNR-adaptive stream weighting for audio-MES ASR.

    PubMed

    Lee, Ki-Seung

    2008-08-01

    Myoelectric signals (MESs) from the speaker's mouth region have been successfully shown to improve the noise robustness of automatic speech recognizers (ASRs), thus promising to extend their usability in implementing noise-robust ASR. In the recognition system presented herein, extracted audio and facial MES features were integrated by a decision fusion method, where the likelihood score of the audio-MES observation vector was given by a linear combination of class-conditional observation log-likelihoods of two classifiers, using appropriate weights. We developed a weighting process adaptive to SNRs. The main objective of the paper involves determining the optimal SNR classification boundaries and constructing a set of optimum stream weights for each SNR class. These two parameters were determined by a method based on a maximum mutual information criterion. Acoustic and facial MES data were collected from five subjects, using a 60-word vocabulary. Four types of acoustic noise including babble, car, aircraft, and white noise were acoustically added to clean speech signals with SNR ranging from -14 to 31 dB. The classification accuracy of the audio ASR was as low as 25.5%. Whereas, the classification accuracy of the MES ASR was 85.2%. The classification accuracy could be further improved by employing the proposed audio-MES weighting method, which was as high as 89.4% in the case of babble noise. A similar result was also found for the other types of noise.

  6. Audio-visual event detection based on mining of semantic audio-visual labels

    NASA Astrophysics Data System (ADS)

    Goh, King-Shy; Miyahara, Koji; Radhakrishnan, Regunathan; Xiong, Ziyou; Divakaran, Ajay

    2003-12-01

    Removing commercials from television programs is a much sought-after feature for a personal video recorder. In this paper, we employ an unsupervised clustering scheme (CM_Detect) to detect commercials in television programs. Each program is first divided into W8-minute chunks, and we extract audio and visual features from each of these chunks. Next, we apply k-means clustering to assign each chunk with a commercial/program label. In contrast to other methods, we do not make any assumptions regarding the program content. Thus, our method is highly content-adaptive and computationally inexpensive. Through empirical studies on various content, including American news, Japanese news, and sports programs, we demonstrate that our method is able to filter out most of the commercials without falsely removing the regular program.

  7. Audio-visual gender recognition

    NASA Astrophysics Data System (ADS)

    Liu, Ming; Xu, Xun; Huang, Thomas S.

    2007-11-01

    Combining different modalities for pattern recognition task is a very promising field. Basically, human always fuse information from different modalities to recognize object and perform inference, etc. Audio-Visual gender recognition is one of the most common task in human social communication. Human can identify the gender by facial appearance, by speech and also by body gait. Indeed, human gender recognition is a multi-modal data acquisition and processing procedure. However, computational multimodal gender recognition has not been extensively investigated in the literature. In this paper, speech and facial image are fused to perform a mutli-modal gender recognition for exploring the improvement of combining different modalities.

  8. Three-Dimensional Audio Client Library

    NASA Technical Reports Server (NTRS)

    Rizzi, Stephen A.

    2005-01-01

    The Three-Dimensional Audio Client Library (3DAudio library) is a group of software routines written to facilitate development of both stand-alone (audio only) and immersive virtual-reality application programs that utilize three-dimensional audio displays. The library is intended to enable the development of three-dimensional audio client application programs by use of a code base common to multiple audio server computers. The 3DAudio library calls vendor-specific audio client libraries and currently supports the AuSIM Gold-Server and Lake Huron audio servers. 3DAudio library routines contain common functions for (1) initiation and termination of a client/audio server session, (2) configuration-file input, (3) positioning functions, (4) coordinate transformations, (5) audio transport functions, (6) rendering functions, (7) debugging functions, and (8) event-list-sequencing functions. The 3DAudio software is written in the C++ programming language and currently operates under the Linux, IRIX, and Windows operating systems.

  9. Noise-Canceling Helmet Audio System

    NASA Technical Reports Server (NTRS)

    Seibert, Marc A.; Culotta, Anthony J.

    2007-01-01

    A prototype helmet audio system has been developed to improve voice communication for the wearer in a noisy environment. The system was originally intended to be used in a space suit, wherein noise generated by airflow of the spacesuit life-support system can make it difficult for remote listeners to understand the astronaut s speech and can interfere with the astronaut s attempt to issue vocal commands to a voice-controlled robot. The system could be adapted to terrestrial use in helmets of protective suits that are typically worn in noisy settings: examples include biohazard, fire, rescue, and diving suits. The system (see figure) includes an array of microphones and small loudspeakers mounted at fixed positions in a helmet, amplifiers and signal-routing circuitry, and a commercial digital signal processor (DSP). Notwithstanding the fixed positions of the microphones and loudspeakers, the system can accommodate itself to any normal motion of the wearer s head within the helmet. The system operates in conjunction with a radio transceiver. An audio signal arriving via the transceiver intended to be heard by the wearer is adjusted in volume and otherwise conditioned and sent to the loudspeakers. The wearer s speech is collected by the microphones, the outputs of which are logically combined (phased) so as to form a microphone- array directional sensitivity pattern that discriminates in favor of sounds coming from vicinity of the wearer s mouth and against sounds coming from elsewhere. In the DSP, digitized samples of the microphone outputs are processed to filter out airflow noise and to eliminate feedback from the loudspeakers to the microphones. The resulting conditioned version of the wearer s speech signal is sent to the transceiver.

  10. Machinery running state identification based on discriminant semi-supervised local tangent space alignment for feature fusion and extraction

    NASA Astrophysics Data System (ADS)

    Su, Zuqiang; Xiao, Hong; Zhang, Yi; Tang, Baoping; Jiang, Yonghua

    2017-04-01

    Extraction of sensitive features is a challenging but key task in data-driven machinery running state identification. Aimed at solving this problem, a method for machinery running state identification that applies discriminant semi-supervised local tangent space alignment (DSS-LTSA) for feature fusion and extraction is proposed. Firstly, in order to extract more distinct features, the vibration signals are decomposed by wavelet packet decomposition WPD, and a mixed-domain feature set consisted of statistical features, autoregressive (AR) model coefficients, instantaneous amplitude Shannon entropy and WPD energy spectrum is extracted to comprehensively characterize the properties of machinery running state(s). Then, the mixed-dimension feature set is inputted into DSS-LTSA for feature fusion and extraction to eliminate redundant information and interference noise. The proposed DSS-LTSA can extract intrinsic structure information of both labeled and unlabeled state samples, and as a result the over-fitting problem of supervised manifold learning and blindness problem of unsupervised manifold learning are overcome. Simultaneously, class discrimination information is integrated within the dimension reduction process in a semi-supervised manner to improve sensitivity of the extracted fusion features. Lastly, the extracted fusion features are inputted into a pattern recognition algorithm to achieve the running state identification. The effectiveness of the proposed method is verified by a running state identification case in a gearbox, and the results confirm the improved accuracy of the running state identification.

  11. Dynamic and scalable audio classification by collective network of binary classifiers framework: an evolutionary approach.

    PubMed

    Kiranyaz, Serkan; Mäkinen, Toni; Gabbouj, Moncef

    2012-10-01

    In this paper, we propose a novel framework based on a collective network of evolutionary binary classifiers (CNBC) to address the problems of feature and class scalability. The main goal of the proposed framework is to achieve a high classification performance over dynamic audio and video repositories. The proposed framework adopts a "Divide and Conquer" approach in which an individual network of binary classifiers (NBC) is allocated to discriminate each audio class. An evolutionary search is applied to find the best binary classifier in each NBC with respect to a given criterion. Through the incremental evolution sessions, the CNBC framework can dynamically adapt to each new incoming class or feature set without resorting to a full-scale re-training or re-configuration. Therefore, the CNBC framework is particularly designed for dynamically varying databases where no conventional static classifiers can adapt to such changes. In short, it is entirely a novel topology, an unprecedented approach for dynamic, content/data adaptive and scalable audio classification. A large set of audio features can be effectively used in the framework, where the CNBCs make appropriate selections and combinations so as to achieve the highest discrimination among individual audio classes. Experiments demonstrate a high classification accuracy (above 90%) and efficiency of the proposed framework over large and dynamic audio databases. Copyright © 2012 Elsevier Ltd. All rights reserved.

  12. Multiple-output support vector machine regression with feature selection for arousal/valence space emotion assessment.

    PubMed

    Torres-Valencia, Cristian A; Álvarez, Mauricio A; Orozco-Gutiérrez, Alvaro A

    2014-01-01

    Human emotion recognition (HER) allows the assessment of an affective state of a subject. Until recently, such emotional states were described in terms of discrete emotions, like happiness or contempt. In order to cover a high range of emotions, researchers in the field have introduced different dimensional spaces for emotion description that allow the characterization of affective states in terms of several variables or dimensions that measure distinct aspects of the emotion. One of the most common of such dimensional spaces is the bidimensional Arousal/Valence space. To the best of our knowledge, all HER systems so far have modelled independently, the dimensions in these dimensional spaces. In this paper, we study the effect of modelling the output dimensions simultaneously and show experimentally the advantages in modeling them in this way. We consider a multimodal approach by including features from the Electroencephalogram and a few physiological signals. For modelling the multiple outputs, we employ a multiple output regressor based on support vector machines. We also include an stage of feature selection that is developed within an embedded approach known as Recursive Feature Elimination (RFE), proposed initially for SVM. The results show that several features can be eliminated using the multiple output support vector regressor with RFE without affecting the performance of the regressor. From the analysis of the features selected in smaller subsets via RFE, it can be observed that the signals that are more informative into the arousal and valence space discrimination are the EEG, Electrooculogram/Electromiogram (EOG/EMG) and the Galvanic Skin Response (GSR).

  13. Digital Audio Radio Field Tests

    NASA Technical Reports Server (NTRS)

    Hollansworth, James E.

    1997-01-01

    Radio history continues to be made at the NASA Lewis Research Center with the beginning of phase two of Digital Audio Radio testing conducted by the Consumer Electronic Manufacturers Association (a sector of the Electronic Industries Association and the National Radio Systems Committee) and cosponsored by the Electronic Industries Association and the National Association of Broadcasters. The bulk of the field testing of the four systems should be complete by the end of October 1996, with results available soon thereafter. Lewis hosted phase one of the testing process, which included laboratory testing of seven proposed digital audio radio systems and modes (see the following table). Two of the proposed systems operate in two modes, thus making a total of nine systems for testing. These nine systems are divided into the following types of transmission: in-band on channel (IBOC), in-band adjacent channel (IBAC), and new bands - the L-band (1452 to 1492 MHz) and the S-band (2310 to 2360 MHz).

  14. Digital Audio Radio Field Tests

    NASA Technical Reports Server (NTRS)

    Hollansworth, James E.

    1997-01-01

    Radio history continues to be made at the NASA Lewis Research Center with the beginning of phase two of Digital Audio Radio testing conducted by the Consumer Electronic Manufacturers Association (a sector of the Electronic Industries Association and the National Radio Systems Committee) and cosponsored by the Electronic Industries Association and the National Association of Broadcasters. The bulk of the field testing of the four systems should be complete by the end of October 1996, with results available soon thereafter. Lewis hosted phase one of the testing process, which included laboratory testing of seven proposed digital audio radio systems and modes (see the following table). Two of the proposed systems operate in two modes, thus making a total of nine systems for testing. These nine systems are divided into the following types of transmission: in-band on channel (IBOC), in-band adjacent channel (IBAC), and new bands - the L-band (1452 to 1492 MHz) and the S-band (2310 to 2360 MHz).

  15. Using virtual 3D audio in multispeech channel and multimedia environments

    NASA Astrophysics Data System (ADS)

    Orosz, Michael D.; Karplus, Walter J.; Balakrishnan, Jerry D.

    2000-08-01

    The advantages and disadvantages of using virtual 3-D audio in mission-critical, multimedia display interfaces were evaluated. The 3D audio platform seems to be an especially promising candidate for aircraft cockpits, flight control rooms, and other command and control environments in which operators must make mission-critical decisions while handling demanding and routine tasks. Virtual audio signal processing creates the illusion for a listener wearing conventional earphones that each of a multiplicity of simultaneous speech or audio channels is originating from a different, program- specified location in virtual space. To explore the possible uses of this new, readily available technology, a test bed simulating some of the conditions experienced by the chief flight test coordinator at NASA's Dryden Flight Research Center was designed and implemented. Thirty test subjects simultaneously performed routine tasks requiring constant hand-eye coordination, while monitoring four speech channels, each generating continuous speech signals, for the occurrence of pre-specified keywords. Performance measures included accuracy in identifying the keywords, accuracy in identifying the speaker of the keyword, and response time. We found substantial improvements on all of these measures when comparing virtual audio with conventional, monaural transmissions. We also explored the effect on operator performance of different spatial configurations of the audio sources in 3-D space, simulated movement (dither) in the source locations, and of providing graphical redundancy. Some of these manipulations were less effective and may even decrease performance efficiency, even though they improve some aspects of the virtual space simulation.

  16. The Concept of Audio-Tutorial Teaching.

    ERIC Educational Resources Information Center

    Husband, D. D.; Postlethwait, S. N.

    Audio-tutorial teaching, originated at Purdue University, employs an audio tape as a vehicle for guiding the student through a series of learning experiences. The student may be directed to use his textbook for an explanation of a diagram, to refer to his laboratory manual, or to observe the specimens and experimental materials prepared for his…

  17. Digital Audio Sampling for Film and Video.

    ERIC Educational Resources Information Center

    Stanton, Michael J.

    Digital audio sampling is explained, and some of its implications in digital sound applications are discussed. Digital sound equipment is rapidly replacing analog recording devices as the state-of-the-art in audio technology. The philosophy of digital recording involves doing away with the continuously variable analog waveforms and turning the…

  18. Digital Advances in Contemporary Audio Production.

    ERIC Educational Resources Information Center

    Shields, Steven O.

    Noting that a revolution in sonic high fidelity occurred during the 1980s as digital-based audio production methods began to replace traditional analog modes, this paper offers both an overview of digital audio theory and descriptions of some of the related digital production technologies that have begun to emerge from the mating of the computer…

  19. Dual Audio Television: The First Public Broadcast.

    ERIC Educational Resources Information Center

    Borton, Terry; And Others

    A study was conducted in conjunction with the first public announced broadcast of dual audio television--a new method of combining simultaneous radio instruction and commercial entertainment TV. The study was designed to determine the audience which would be attracted to dual audio, the practicality of such instruction, and the correlation between…

  20. Dual Audio TV Instruction: A Broadcast Experiment.

    ERIC Educational Resources Information Center

    Borton, Terry; And Others

    An experiment assessed the potential effectiveness of "dual audio television instruction" (DATI) as a mass education medium. The DATI consisted of a radio program heard by children while they watched television shows. The audio instructor did not talk when the television characters spoke, but used the "quiet" times to help with…

  1. Independent component analysis for audio signal separation

    NASA Astrophysics Data System (ADS)

    Wellhausen, Jens; Gnann, Volker

    2005-10-01

    In this paper an audio separation algorithm is presented, which is based on Independent Component Analysis (ICA). Audio separation could be the basis for many applications for example in the field of telecommunications, quality enhancement of audio recordings or audio classification tasks. Well known ICA algorithms are not usable for real-world recordings at the time, because they are designed for signal mixtures based on linear and over time constant mixing matrices. To adapt a standard ICA algorithm for real-world two-channel auditory scenes with two audio sources, the input audio streams are segmented in the time domain and a constant mixing matrix within a segment is assumed. The next steps are a time-delay estimation for each audio source in the mixture and a determination of the number of existing sources. In the following processing steps, for each source the input signals are time shifted and a standard ICA for linear mixtures is performed. After that, the remaining tasks are an evaluation of the ICA results and the construction of the resulting audio streams containing the separated sources.

  2. Digital Audio: A Sound Design Element.

    ERIC Educational Resources Information Center

    Barron, Ann; Varnadoe, Susan

    1992-01-01

    Discussion of incorporating audio into videodiscs for multimedia educational applications highlights a project developed for the Navy that used digital audio in an interactive video delivery system (IVDS) for training sonar operators. Storage constraints with videodiscs are explained, design requirements for the IVDS are described, and production…

  3. Digital Audio: A Sound Design Element.

    ERIC Educational Resources Information Center

    Barron, Ann; Varnadoe, Susan

    1992-01-01

    Discussion of incorporating audio into videodiscs for multimedia educational applications highlights a project developed for the Navy that used digital audio in an interactive video delivery system (IVDS) for training sonar operators. Storage constraints with videodiscs are explained, design requirements for the IVDS are described, and production…

  4. Digital Advances in Contemporary Audio Production.

    ERIC Educational Resources Information Center

    Shields, Steven O.

    Noting that a revolution in sonic high fidelity occurred during the 1980s as digital-based audio production methods began to replace traditional analog modes, this paper offers both an overview of digital audio theory and descriptions of some of the related digital production technologies that have begun to emerge from the mating of the computer…

  5. Audio-visual simultaneity judgments.

    PubMed

    Zampini, Massimiliano; Guest, Steve; Shore, David I; Spence, Charles

    2005-04-01

    The relative spatiotemporal correspondence between sensory events affects multisensory integration across a variety of species; integration is maximal when stimuli in different sensory modalities are presented from approximately the same position at about the same time. In the present study, we investigated the influence of spatial and temporal factors on audio-visual simultaneity perception in humans. Participants made unspeeded simultaneous versus successive discrimination responses to pairs of auditory and visual stimuli presented at varying stimulus onset asynchronies from either the same or different spatial positions using either the method of constant stimuli (Experiments 1 and 2) or psychophysical staircases (Experiment 3). The participants in all three experiments were more likely to report the stimuli as being simultaneous when they originated from the same spatial position than when they came from different positions, demonstrating that the apparent perception of multisensory simultaneity is dependent on the relative spatial position from which stimuli are presented.

  6. Audio-video decision support for patients: the documentary genré as a basis for decision aids.

    PubMed

    Volandes, Angelo E; Barry, Michael J; Wood, Fiona; Elwyn, Glyn

    2013-09-01

    Decision support tools are increasingly using audio-visual materials. However, disagreement exists about the use of audio-visual materials as they may be subjective and biased. This is a literature review of the major texts for documentary film studies to extrapolate issues of objectivity and bias from film to decision support tools. The key features of documentary films are that they attempt to portray real events and that the attempted reality is always filtered through the lens of the filmmaker. The same key features can be said of decision support tools that use audio-visual materials. Three concerns arising from documentary film studies as they apply to the use of audio-visual materials in decision support tools include whose perspective matters (stakeholder bias), how to choose among audio-visual materials (selection bias) and how to ensure objectivity (editorial bias). Decision science needs to start a debate about how audio-visual materials are to be used in decision support tools. Simply because audio-visual materials may be subjective and open to bias does not mean that we should not use them. Methods need to be found to ensure consensus around balance and editorial control, such that audio-visual materials can be used. © 2011 John Wiley & Sons Ltd.

  7. Improvements of ModalMax High-Fidelity Piezoelectric Audio Device

    NASA Technical Reports Server (NTRS)

    Woodard, Stanley E.

    2005-01-01

    ModalMax audio speakers have been enhanced by innovative means of tailoring the vibration response of thin piezoelectric plates to produce a high-fidelity audio response. The ModalMax audio speakers are 1 mm in thickness. The device completely supplants the need to have a separate driver and speaker cone. ModalMax speakers can perform the same applications of cone speakers, but unlike cone speakers, ModalMax speakers can function in harsh environments such as high humidity or extreme wetness. New design features allow the speakers to be completely submersed in salt water, making them well suited for maritime applications. The sound produced from the ModalMax audio speakers has sound spatial resolution that is readily discernable for headset users.

  8. Content-based audio authentication using a hierarchical patchwork watermark embedding

    NASA Astrophysics Data System (ADS)

    Gulbis, Michael; Müller, Erika

    2010-05-01

    Content-based audio authentication watermarking techniques extract perceptual relevant audio features, which are robustly embedded into the audio file to protect. Manipulations of the audio file are detected on the basis of changes between the original embedded feature information and the anew extracted features during verification. The main challenges of content-based watermarking are on the one hand the identification of a suitable audio feature to distinguish between content preserving and malicious manipulations. On the other hand the development of a watermark, which is robust against content preserving modifications and able to carry the whole authentication information. The payload requirements are significantly higher compared to transaction watermarking or copyright protection. Finally, the watermark embedding should not influence the feature extraction to avoid false alarms. Current systems still lack a sufficient alignment of watermarking algorithm and feature extraction. In previous work we developed a content-based audio authentication watermarking approach. The feature is based on changes in DCT domain over time. A patchwork algorithm based watermark was used to embed multiple one bit watermarks. The embedding process uses the feature domain without inflicting distortions to the feature. The watermark payload is limited by the feature extraction, more precisely the critical bands. The payload is inverse proportional to segment duration of the audio file segmentation. Transparency behavior was analyzed in dependence of segment size and thus the watermark payload. At a segment duration of about 20 ms the transparency shows an optimum (measured in units of Objective Difference Grade). Transparency and/or robustness are fast decreased for working points beyond this area. Therefore, these working points are unsuitable to gain further payload, needed for the embedding of the whole authentication information. In this paper we present a hierarchical extension

  9. Authenticity examination of compressed audio recordings using detection of multiple compression and encoders' identification.

    PubMed

    Korycki, Rafal

    2014-05-01

    Since the appearance of digital audio recordings, audio authentication has been becoming increasingly difficult. The currently available technologies and free editing software allow a forger to cut or paste any single word without audible artifacts. Nowadays, the only method referring to digital audio files commonly approved by forensic experts is the ENF criterion. It consists in fluctuation analysis of the mains frequency induced in electronic circuits of recording devices. Therefore, its effectiveness is strictly dependent on the presence of mains signal in the recording, which is a rare occurrence. Recently, much attention has been paid to authenticity analysis of compressed multimedia files and several solutions were proposed for detection of double compression in both digital video and digital audio. This paper addresses the problem of tampering detection in compressed audio files and discusses new methods that can be used for authenticity analysis of digital recordings. Presented approaches consist in evaluation of statistical features extracted from the MDCT coefficients as well as other parameters that may be obtained from compressed audio files. Calculated feature vectors are used for training selected machine learning algorithms. The detection of multiple compression covers up tampering activities as well as identification of traces of montage in digital audio recordings. To enhance the methods' robustness an encoder identification algorithm was developed and applied based on analysis of inherent parameters of compression. The effectiveness of tampering detection algorithms is tested on a predefined large music database consisting of nearly one million of compressed audio files. The influence of compression algorithms' parameters on the classification performance is discussed, based on the results of the current study.

  10. Automatic Recognition of Solar Features for Developing Data Driven Prediction Models of Solar Activity and Space Weather

    DTIC Science & Technology

    2012-07-06

    Ephemeral Brightening,” 2nd ATST – East Workshop In Solar Physics: Magnetic Fields From The Photosphere To The Corona , Washington D.C., Mar 2012. [6...AFRL-RV-PS- AFRL-RV-PS- TR-2012-0133 TR-2012-0133 AUTOMATIC RECOGNITION OF SOLAR FEATURES FOR DEVELOPING DATA DRIVEN PREDICTION MODELS OF... SOLAR ACTIVITY AND SPACE WEATHER Jason Jackiewicz New Mexico State University Department of Astronomy PO Box 30001, MSC 4500 Las

  11. Performance enhancement for audio-visual speaker identification using dynamic facial muscle model.

    PubMed

    Asadpour, Vahid; Towhidkhah, Farzad; Homayounpour, Mohammad Mehdi

    2006-10-01

    Science of human identification using physiological characteristics or biometry has been of great concern in security systems. However, robust multimodal identification systems based on audio-visual information has not been thoroughly investigated yet. Therefore, the aim of this work to propose a model-based feature extraction method which employs physiological characteristics of facial muscles producing lip movements. This approach adopts the intrinsic properties of muscles such as viscosity, elasticity, and mass which are extracted from the dynamic lip model. These parameters are exclusively dependent on the neuro-muscular properties of speaker; consequently, imitation of valid speakers could be reduced to a large extent. These parameters are applied to a hidden Markov model (HMM) audio-visual identification system. In this work, a combination of audio and video features has been employed by adopting a multistream pseudo-synchronized HMM training method. Noise robust audio features such as Mel-frequency cepstral coefficients (MFCC), spectral subtraction (SS), and relative spectra perceptual linear prediction (J-RASTA-PLP) have been used to evaluate the performance of the multimodal system once efficient audio feature extraction methods have been utilized. The superior performance of the proposed system is demonstrated on a large multispeaker database of continuously spoken digits, along with a sentence that is phonetically rich. To evaluate the robustness of algorithms, some experiments were performed on genetically identical twins. Furthermore, changes in speaker voice were simulated with drug inhalation tests. In 3 dB signal to noise ratio (SNR), the dynamic muscle model improved the identification rate of the audio-visual system from 91 to 98%. Results on identical twins revealed that there was an apparent improvement on the performance for the dynamic muscle model-based system, in which the identification rate of the audio-visual system was enhanced from 87

  12. Implementing Audio-CASI on Windows’ Platforms

    PubMed Central

    Cooley, Philip C.; Turner, Charles F.

    2011-01-01

    Audio computer-assisted self interviewing (Audio-CASI) technologies have recently been shown to provide important and sometimes dramatic improvements in the quality of survey measurements. This is particularly true for measurements requiring respondents to divulge highly sensitive information such as their sexual, drug use, or other sensitive behaviors. However, DOS-based Audio-CASI systems that were designed and adopted in the early 1990s have important limitations. Most salient is the poor control they provide for manipulating the video presentation of survey questions. This article reports our experiences adapting Audio-CASI to Microsoft Windows 3.1 and Windows 95 platforms. Overall, our Windows-based system provided the desired control over video presentation and afforded other advantages including compatibility with a much wider array of audio devices than our DOS-based Audio-CASI technologies. These advantages came at the cost of increased system requirements --including the need for both more RAM and larger hard disks. While these costs will be an issue for organizations converting large inventories of PCS to Windows Audio-CASI today, this will not be a serious constraint for organizations and individuals with small inventories of machines to upgrade or those purchasing new machines today. PMID:22081743

  13. Implementing Audio-CASI on Windows' Platforms.

    PubMed

    Cooley, Philip C; Turner, Charles F

    1998-01-01

    Audio computer-assisted self interviewing (Audio-CASI) technologies have recently been shown to provide important and sometimes dramatic improvements in the quality of survey measurements. This is particularly true for measurements requiring respondents to divulge highly sensitive information such as their sexual, drug use, or other sensitive behaviors. However, DOS-based Audio-CASI systems that were designed and adopted in the early 1990s have important limitations. Most salient is the poor control they provide for manipulating the video presentation of survey questions. This article reports our experiences adapting Audio-CASI to Microsoft Windows 3.1 and Windows 95 platforms. Overall, our Windows-based system provided the desired control over video presentation and afforded other advantages including compatibility with a much wider array of audio devices than our DOS-based Audio-CASI technologies. These advantages came at the cost of increased system requirements --including the need for both more RAM and larger hard disks. While these costs will be an issue for organizations converting large inventories of PCS to Windows Audio-CASI today, this will not be a serious constraint for organizations and individuals with small inventories of machines to upgrade or those purchasing new machines today.

  14. High-Fidelity Piezoelectric Audio Device

    NASA Technical Reports Server (NTRS)

    Woodward, Stanley E.; Fox, Robert L.; Bryant, Robert G.

    2003-01-01

    ModalMax is a very innovative means of harnessing the vibration of a piezoelectric actuator to produce an energy efficient low-profile device with high-bandwidth high-fidelity audio response. The piezoelectric audio device outperforms many commercially available speakers made using speaker cones. The piezoelectric device weighs substantially less (4 g) than the speaker cones which use magnets (10 g). ModalMax devices have extreme fabrication simplicity. The entire audio device is fabricated by lamination. The simplicity of the design lends itself to lower cost. The piezoelectric audio device can be used without its acoustic chambers and thereby resulting in a very low thickness of 0.023 in. (0.58 mm). The piezoelectric audio device can be completely encapsulated, which makes it very attractive for use in wet environments. Encapsulation does not significantly alter the audio response. Its small size (see Figure 1) is applicable to many consumer electronic products, such as pagers, portable radios, headphones, laptop computers, computer monitors, toys, and electronic games. The audio device can also be used in automobile or aircraft sound systems.

  15. Feature selection for multimodal emotion recognition in the arousal-valence space.

    PubMed

    Torres, Cristian A; Orozco, Álvaro A; Álvarez, Mauricio A

    2013-01-01

    Emotion recognition is a challenging research problem with a significant scientific interest. Most of the emotion assessment studies have focused on the analysis of facial expressions. Recently, it has been shown that the simultaneous use of several biosignals taken from the patient may improve the classification accuracy. An open problem in this area is to identify which biosignals are more relevant for emotion recognition. In this paper, we perform Recursive Feature Elimination (RFE) to select a subset of features that allows emotion classification. Experiments are carried out over a multimodal database with arousal and valence annotations, and a diverse range of features extracted from physiological, neurophysiological, and video signals. Results show that several features can be eliminated while still preserving classification accuracy in setups of 2 and 3 classes. Using a small subset of the features, it is possible to reach 70% accuracy for arousal and 60% accuracy for valence in some experiments. Experimentally, it is shown that the Galvanic Skin Response (GSR) is relevant for arousal classification, while the electroencephalogram (EEG) is relevant for valence.

  16. Universal features of four-dimensional superconformal field theory on conic space

    NASA Astrophysics Data System (ADS)

    Zhou, Yang

    2015-08-01

    Following the set up in arXiv:1408.3393, we study 4 d superconformal field theories on conic spaces. We show that the universal part of supersymmetric Rényi entropy S q across a spherical entangling surface in the limit q → 0 is proportional to a linear combination of central charges, 3 c - 2 a. This is equivalent to a similar statement about the free energy of SCFTs on conic space or hyperbolic space in the corresponding limit. We first derive the asymptotic formula by the free field computation in the presence of a U (1) R-symmetry background and then provide an independent derivation by studying theories on with a particular scaling , which thus confirms the validity of the formula for general interacting SCFTs. Finally we revisit the supersymmetric Rényi entropy of generel SCFTs and find a simple formula for it in terms of central charges a and c.

  17. Extraction of ions and electrons from audio frequency plasma source

    NASA Astrophysics Data System (ADS)

    Haleem, N. A.; Abdelrahman, M. M.; Ragheb, M. S.

    2016-09-01

    Herein, the extraction of high ion / electron current from an audio frequency (AF) nitrogen gas discharge (10 - 100 kHz) is studied and investigated. This system is featured by its small size (L= 20 cm and inner diameter = 3.4 cm) and its capacitive discharge electrodes inside the tube and its high discharge pressure ˜ 0.3 Torr, without the need of high vacuum system or magnetic fields. The extraction system of ion/electron current from the plasma is a very simple electrode that allows self-beam focusing by adjusting its position from the source exit. The working discharge conditions were applied at a frequency from 10 to 100 kHz, power from 50 - 500 W and the gap distance between the plasma meniscus surface and the extractor electrode extending from 3 to 13 mm. The extracted ion/ electron current is found mainly dependent on the discharge power, the extraction gap width and the frequency of the audio supply. SIMION 3D program version 7.0 package is used to generate a simulation of ion trajectories as a reference to compare and to optimize the experimental extraction beam from the present audio frequency plasma source using identical operational conditions. The focal point as well the beam diameter at the collector area is deduced. The simulations showed a respectable agreement with the experimental results all together provide the optimizing basis of the extraction electrode construction and its parameters for beam production.

  18. Temporal Structure and Complexity Affect Audio-Visual Correspondence Detection

    PubMed Central

    Denison, Rachel N.; Driver, Jon; Ruff, Christian C.

    2013-01-01

    Synchrony between events in different senses has long been considered the critical temporal cue for multisensory integration. Here, using rapid streams of auditory and visual events, we demonstrate how humans can use temporal structure (rather than mere temporal coincidence) to detect multisensory relatedness. We find psychophysically that participants can detect matching auditory and visual streams via shared temporal structure for crossmodal lags of up to 200 ms. Performance on this task reproduced features of past findings based on explicit timing judgments but did not show any special advantage for perfectly synchronous streams. Importantly, the complexity of temporal patterns influences sensitivity to correspondence. Stochastic, irregular streams – with richer temporal pattern information – led to higher audio-visual matching sensitivity than predictable, rhythmic streams. Our results reveal that temporal structure and its complexity are key determinants for human detection of audio-visual correspondence. The distinctive emphasis of our new paradigms on temporal patterning could be useful for studying special populations with suspected abnormalities in audio-visual temporal perception and multisensory integration. PMID:23346067

  19. The Effects of Audio-Visual Recorded and Audio Recorded Listening Tasks on the Accuracy of Iranian EFL Learners' Oral Production

    ERIC Educational Resources Information Center

    Drood, Pooya; Asl, Hanieh Davatgari

    2016-01-01

    The ways in which task in classrooms has developed and proceeded have receive great attention in the field of language teaching and learning in the sense that they draw attention of learners to the competing features such as accuracy, fluency, and complexity. English audiovisual and audio recorded materials have been widely used by teachers and…

  20. Neuromagnetic Evidence for a Featural Distinction of English Consonants: Sensor- and Source-Space Data

    ERIC Educational Resources Information Center

    Scharinger, Mathias; Merickel, Jennifer; Riley, Joshua; Idsardi, William J.

    2011-01-01

    Speech sounds can be classified on the basis of their underlying articulators or on the basis of the acoustic characteristics resulting from particular articulatory positions. Research in speech perception suggests that distinctive features are based on both articulatory and acoustic information. In recent years, neuroelectric and neuromagnetic…

  1. Regional shape-based feature space for segmenting biomedical images using neural networks

    NASA Astrophysics Data System (ADS)

    Sundaramoorthy, Gopal; Hoford, John D.; Hoffman, Eric A.

    1993-07-01

    In biomedical images, structure of interest, particularly the soft tissue structures, such as the heart, airways, bronchial and arterial trees often have grey-scale and textural characteristics similar to other structures in the image, making it difficult to segment them using only gray- scale and texture information. However, these objects can be visually recognized by their unique shapes and sizes. In this paper we discuss, what we believe to be, a novel, simple scheme for extracting features based on regional shapes. To test the effectiveness of these features for image segmentation (classification), we use an artificial neural network and a statistical cluster analysis technique. The proposed shape-based feature extraction algorithm computes regional shape vectors (RSVs) for all pixels that meet a certain threshold criteria. The distance from each such pixel to a boundary is computed in 8 directions (or in 26 directions for a 3-D image). Together, these 8 (or 26) values represent the pixel's (or voxel's) RSV. All RSVs from an image are used to train a multi-layered perceptron neural network which uses these features to 'learn' a suitable classification strategy. To clearly distinguish the desired object from other objects within an image, several examples from inside and outside the desired object are used for training. Several examples are presented to illustrate the strengths and weaknesses of our algorithm. Both synthetic and actual biomedical images are considered. Future extensions to this algorithm are also discussed.

  2. Neuromagnetic Evidence for a Featural Distinction of English Consonants: Sensor- and Source-Space Data

    ERIC Educational Resources Information Center

    Scharinger, Mathias; Merickel, Jennifer; Riley, Joshua; Idsardi, William J.

    2011-01-01

    Speech sounds can be classified on the basis of their underlying articulators or on the basis of the acoustic characteristics resulting from particular articulatory positions. Research in speech perception suggests that distinctive features are based on both articulatory and acoustic information. In recent years, neuroelectric and neuromagnetic…

  3. Geometry and dimensionality reduction of feature spaces in primary visual cortex

    NASA Astrophysics Data System (ADS)

    Barbieri, Davide

    2015-09-01

    Some geometric properties of the wavelet analysis performed by visual neurons are discussed and compared with experimental data. In particular, several relationships between the cortical morphologies and the parametric dependencies of extracted features are formalized and considered from a harmonic analysis point of view.

  4. Study on the construction of multi-dimensional Remote Sensing feature space for hydrological drought

    NASA Astrophysics Data System (ADS)

    Xiang, Daxiang; Tan, Debao; Cui, Yuanlai; Wen, Xiongfei; Shen, Shaohong; Li, Zhe

    2014-03-01

    Hydrological drought refers to an abnormal water shortage caused by precipitation and surface water shortages or a groundwater imbalance. Hydrological drought is reflected in a drop of surface water, decrease of vegetation productivity, increase of temperature difference between day and night and so on. Remote sensing permits the observation of surface water, vegetation, temperature and other information from a macro perspective. This paper analyzes the correlation relationship and differentiation of both remote sensing and surface measured indicators, after the selection and extraction a series of representative remote sensing characteristic parameters according to the spectral characterization of surface features in remote sensing imagery, such as vegetation index, surface temperature and surface water from HJ-1A/B CCD/IRS data. Finally, multi-dimensional remote sensing features such as hydrological drought are built on a intelligent collaborative model. Further, for the Dong-ting lake area, two drought events are analyzed for verification of multi-dimensional features using remote sensing data with different phases and field observation data. The experiments results proved that multi-dimensional features are a good method for hydrological drought.

  5. Audio recording and reproduction in CARROUSO: Getting closer to perfection?

    NASA Astrophysics Data System (ADS)

    Teutsch, Heinz; Spors, Sascha; Buchner, Herbert; Rabenstein, Rudolf; Kellermann, Walter

    2002-05-01

    State-of-the-art systems for spatial audio reproduction utilize two to six discrete playback channels. A problem inherent to these systems is the relatively small area where the listener is able to experience a true 3-D sound sensation. This so-called ``sweet spot'' can be significantly enlarged by using loudspeaker arrays in combination with wave field synthesis (WFS) technology, initially developed at Delft University. By following this approach, actual sonic spaces can be reproduced in their entirety and not only discrete multichannel representations thereof. While loudspeaker arrays can be used to reproduce sound fields, microphone arrays can be used for sound field capture and analysis. Having high-quality audio reproduction in mind, microphone array designs are presented that need to fulfill stricter requirements than what has been traditionally considered for microphone array applications. Information on acoustic source position is essential for WFS-based rendering techniques. As will be shown, joint audio-video object tracking proves to be efficient for this task. Moreover, full-duplex applications based on WFS technology, like high-quality teleconferencing or remote music teaching, call for sophisticated multichannel acoustic echo cancellation algorithms. The European project ``CARROUSO'' aims at developing, integrating, and building a real-time system that embraces all previously described technologies in an MPEG-4 context.

  6. Developmental features of the neonatal brain: MR imaging. Part II. Ventricular size and extracerebral space.

    PubMed

    McArdle, C B; Richardson, C J; Nicholas, D A; Mirfakhraee, M; Hayden, C K; Amparo, E G

    1987-01-01

    Magnetic resonance (MR) imaging with a 0.6-T magnet was performed on 51 neonates, aged 29-42 weeks postconception. In 45 neonates, the ventricular/brain ratio (V/B) at the level of the frontal horns and midbody of the lateral ventricles ranged from 0.26 to 0.34. In six other infants a V/B of 0.36 or greater was associated with either cerebral atrophy or obstructive hydrocephalus. The width of the extracerebral space measured along specified points varied little in the neonatal period and ranged from 0 to 4 mm in 48 infants. Extracerebral space widths of 5-6 mm were seen in three other infants with severe asphyxia. Prominence of the subarachnoid space overlying the posterior parietal lobes is normal in neonates and should not be confused with cerebral atrophy. The authors conclude that V/B ratios of 0.26-0.34 and extracerebral space widths of 0-4 mm represent the normal range, and that neonates whose measurements exceed these values should be followed up.

  7. Perceived Danger in Urban Public Space. The Impacts of Physical Features and Personal Factors

    ERIC Educational Resources Information Center

    Blobaum, Anke; Hunecke, Marcel

    2005-01-01

    What are the most relevant factors influencing perceived danger in urban public space? To answer this question, a field experiment of students(N = 122) was carried out on a German university campus within which perceived danger was analyzed under systematic variation of lighting, prospect, and opportunities of escape. Two standardized…

  8. A Study of Audio Tape: Part II

    ERIC Educational Resources Information Center

    Reen, Noel K.

    1975-01-01

    To evaluate reel audio tape, tests were performed to identify: signal-to-noise ratio, total harmonic distortion, dynamic response, frequency response, biased and virgin tape noise, dropout susceptibility and oxide coating uniformity. (SCC)

  9. Audio-visual affective expression recognition

    NASA Astrophysics Data System (ADS)

    Huang, Thomas S.; Zeng, Zhihong

    2007-11-01

    Automatic affective expression recognition has attracted more and more attention of researchers from different disciplines, which will significantly contribute to a new paradigm for human computer interaction (affect-sensitive interfaces, socially intelligent environments) and advance the research in the affect-related fields including psychology, psychiatry, and education. Multimodal information integration is a process that enables human to assess affective states robustly and flexibly. In order to understand the richness and subtleness of human emotion behavior, the computer should be able to integrate information from multiple sensors. We introduce in this paper our efforts toward machine understanding of audio-visual affective behavior, based on both deliberate and spontaneous displays. Some promising methods are presented to integrate information from both audio and visual modalities. Our experiments show the advantage of audio-visual fusion in affective expression recognition over audio-only or visual-only approaches.

  10. The Audio Description as a Physics Teaching Tool

    ERIC Educational Resources Information Center

    Cozendey, Sabrina; Costa, Maria da Piedade

    2016-01-01

    This study analyses the use of audio description in teaching physics concepts, aiming to determine the variables that influence the understanding of the concept. One education resource was audio described. For make the audio description the screen was freezing. The video with and without audio description should be presented to students, so that…

  11. The Audio Description as a Physics Teaching Tool

    ERIC Educational Resources Information Center

    Cozendey, Sabrina; Costa, Maria da Piedade

    2016-01-01

    This study analyses the use of audio description in teaching physics concepts, aiming to determine the variables that influence the understanding of the concept. One education resource was audio described. For make the audio description the screen was freezing. The video with and without audio description should be presented to students, so that…

  12. Far-space neglect in conjunction but not feature search following transcranial magnetic stimulation over right posterior parietal cortex.

    PubMed

    Mahayana, Indra T; Liu, Chia-Lun; Chang, Chi Fu; Hung, Daisy L; Tzeng, Ovid J L; Juan, Chi-Hung; Muggleton, Neil G

    2014-02-01

    Near- and far-space coding in the human brain is a dynamic process. Areas in dorsal, as well as ventral visual association cortex, including right posterior parietal cortex (rPPC), right frontal eye field (rFEF), and right ventral occipital cortex (rVO), have been shown to be important in visuospatial processing, but the involvement of these areas when the information is in near or far space remains unclear. There is a need for investigations of these representations to help explain the pathophysiology of hemispatial neglect, and the role of near and far space is crucial to this. We used a conjunction visual search task using an elliptical array to investigate the effects of transcranial magnetic stimulation delivered over rFEF, rPPC, and rVO on the processing of targets in near and far space and at a range of horizontal eccentricities. As in previous studies, we found that rVO was involved in far-space search, and rFEF was involved regardless of the distance to the array. It was found that rPPC was involved in search only in far space, with a neglect-like effect when the target was located in the most eccentric locations. No effects were seen for any site for a feature search task. As the search arrays had higher predictability with respect to target location than is often the case, these data may form a basis for clarifying both the role of PPC in visual search and its contribution to neglect, as well as the importance of near and far space in these.

  13. Analysis of E. coli promoter recognition problem in dinucleotide feature space.

    PubMed

    Rani, T Sobha; Bhavani, S Durga; Bapi, Raju S

    2007-03-01

    Patterns in the promoter sequences within a species are known to be conserved but there exist many exceptions to this rule which makes the promoter recognition a complex problem. Although many complex feature extraction schemes coupled with several classifiers have been proposed for promoter recognition in the current literature, the problem is still open. A dinucleotide global feature extraction method is proposed for the recognition of sigma-70 promoters in Escherichia coli in this article. The positive data set consists of sigma-70 promoters with known transcription starting points which are part of regulonDB and promec databases. Four different kinds of negative data sets are considered, two of them biological sets (Gordon et al., 2003) and the other two synthetic data sets. Our results reveal that a single-layer perceptron using dinucleotide features is able to achieve an accuracy of 80% against a background of biological non-promoters and 96% for random data sets. A scheme for locating the promoter regions in a given genome sequence is proposed. A deeper analysis of the data set shows that there is a bifurcation of the data set into two distinct classes, a majority class and a minority class. Our results point out that majority class constituting the majority promoter and the majority non-promoter signal is linearly separable. Also the minority class is linearly separable. We further show that the feature extraction and classification methods proposed in the paper are generic enough to be applied to the more complex problem of eucaryotic promoter recognition. We present Drosophila promoter recognition as a case study. http://202.41.85.117/htmfiles/faculty/tsr/tsr.html.

  14. Filling the void-enriching the feature space of successful stopping.

    PubMed

    Huster, René J; Schneider, Signe; Lavallee, Christina F; Enriquez-Geppert, Stefanie; Herrmann, Christoph S

    2017-03-01

    The ability to inhibit behavior is crucial for adaptation in a fast changing environment and is commonly studied with the stop signal task. Current EEG research mainly focuses on the N200 and P300 ERPs and corresponding activity in the theta and delta frequency range, thereby leaving us with a limited understanding of the mechanisms of response inhibition. Here, 15 functional networks were estimated from time-frequency transformed EEG recorded during processing of a visual stop signal task. Cortical sources underlying these functional networks were reconstructed, and a total of 45 features, each representing spectrally and temporally coherent activity, were extracted to train a classifier to differentiate between go and stop trials. A classification accuracy of 85.55% for go and 83.85% for stop trials was achieved. Features capturing fronto-central delta- and theta activity, parieto-occipital alpha, fronto-central as well as right frontal beta activity were highly discriminating between trial-types. However, only a single network, comprising a feature defined by oscillatory activity below 12 Hz, was associated with a generator in the opercular region of the right inferior frontal cortex and showed the expected associations with behavioral inhibition performance. This study pioneers by providing a detailed ranking of neural features regarding their information content for stop and go differentiation at the single-trial level, and may further be the first to identify a scalp EEG marker of the inhibitory control network. This analysis allows for the characterization of the temporal dynamics of response inhibition by matching electrophysiological phenomena to cortical generators and behavioral inhibition performance. Hum Brain Mapp 38:1333-1346, 2017. © 2016 Wiley Periodicals, Inc.

  15. The Effect Of 3D Audio And Other Audio Techniques On Virtual Reality Experience.

    PubMed

    Brinkman, Willem-Paul; Hoekstra, Allart R D; van Egmond, René

    2015-01-01

    Three studies were conducted to examine the effect of audio on people's experience in a virtual world. The first study showed that people could distinguish between mono, stereo, Dolby surround and 3D audio of a wasp. The second study found significant effects for audio techniques on people's self-reported anxiety, presence, and spatial perception. The third study found that adding sound to a visual virtual world had a significant effect on people's experience (including heart rate), while it found no difference in experience between stereo and 3D audio.

  16. A study of some features of ac and dc electric power systems for a space station

    NASA Technical Reports Server (NTRS)

    Hanania, J. I.

    1983-01-01

    This study analyzes certain selected topics in rival dc and high frequency ac electric power systems for a Space Station. The interaction between the Space Station and the plasma environment is analyzed, leading to a limit on the voltage for the solar array and a potential problem with resonance coupling at high frequencies. Certain problems are pointed out in the concept of a rotary transformer, and further development work is indicated in connection with dc circuit switching, special design of a transmission conductor for the ac system, and electric motors. The question of electric shock hazards, particularly at high frequency, is also explored. and a problem with reduced skin resistance and therefore increased hazard with high frequency ac is pointed out. The study concludes with a comparison of the main advantages and disadvantages of the two rival systems, and it is suggested that the choice between the two should be made after further studies and development work are completed.

  17. Dermoscopic diagnosis of melanoma in a 4D space constructed by active contour extracted features.

    PubMed

    Mete, Mutlu; Sirakov, Nikolay Metodiev

    2012-10-01

    Dermoscopy, also known as epiluminescence microscopy, is a major imaging technique used in the assessment of melanoma and other diseases of skin. In this study we propose a computer aided method and tools for fast and automated diagnosis of malignant skin lesions using non-linear classifiers. The method consists of three main stages: (1) skin lesion features extraction from images; (2) features measurement and digitization; and (3) skin lesion binary diagnosis (classification), using the extracted features. A shrinking active contour (S-ACES) extracts color regions boundaries, the number of colors, and lesion's boundary, which is used to calculate the abrupt boundary. Quantification methods for measurements of asymmetry and abrupt endings in skin lesions are elaborated to approach the second stage of the method. The total dermoscopy score (TDS) formula of the ABCD rule is modeled as linear support vector machines (SVM). Further a polynomial SVM classifier is developed. To validate the proposed framework a dataset of 64 lesion images were selected from a collection with a ground truth. The lesions were classified as benign or malignant by the TDS based model and the SVM polynomial classifier. Comparing the results, we showed that the latter model has a better f-measure then the TDS-based model (linear classifier) in the classification of skin lesions into two groups, malignant and benign. Copyright © 2012 Elsevier Ltd. All rights reserved.

  18. Ophthalmic Features of Outpatient Children Diagnosed with Intracranial Space-Occupying Lesions by Ophthalmologists.

    PubMed

    Alswaina, Nayef; Elkhamary, Sahar M; Shammari, Mansour A; Khan, Arif O

    2015-01-01

    Brain tumors in children often involve the visual system, but most retrospective series are by neurologists or oncologists. In this study we highlight the ophthalmic findings of outpatient children with visual complaints and/or strabismus who, based on ophthalmic examination, were suspected to and confirmed to harbor intracranial space-occupying lesions by magnetic resonance imaging (MRI). Retrospective case series of children (less than 18 years) who for visual complaints and/or strabismus underwent cranial MRI at a referral eye hospital (2005-2012), which revealed intracranial space-occupying lesions. Exclusion criteria were known preexisting orbital or ocular trauma, ocular tumor, or neurological disease. For 26 patients (3 months-17 years; mean 7 years; median 9 years; and 14 boys), the most common clinical presentation was decreased vision with disc pallor (10) or swelling (three). Other presentations were strabismus with disc pallor or swelling (four; two of which were left sixth nerve palsies), acquired esotropia with diplopia (three; one bilateral and two left sixth nerve palsies), acquired exotropia (four; two of which were bilateral third nerve palsies, one of which was left partial third nerve palsy, and one of which was associated with headache), nystagmus (one), and disc swelling with headache (one). Most lesions were in the sellar/suprasellar space (10), posterior fossa (six), or optic nerve/chasm (four). The majority of outpatient children diagnosed by ophthalmologists with intracranial space-occupying lesions presented with disc swelling or pallor in the context of decreased vision or strabismus. Two strabismus profiles that did not include disc swelling or pallor were acquired sixth nerve palsy and acquired exotropia (with ptosis (third nerve palsy), nystagmus, or headache).

  19. Multilingual Video and Audio News Alerting

    DTIC Science & Technology

    2004-01-01

    Multilingual Video and Audio News Alerting David D. Palmer, Patrick Bray, Marc Reichman, Katherine Rhodes, Noah White Virage Advanced...enable searching of multilingual video news sources by a monolingual speaker. In addition to full search capabilities, the system also enables real...DATE 2004 2. REPORT TYPE 3. DATES COVERED 00-00-2004 to 00-00-2004 4. TITLE AND SUBTITLE Multilingual Video and Audio News Alerting 5a

  20. The features of radiation dose variations onboard ISS and Mir space station: comparative study.

    PubMed

    Tverskaya, L V; Panasyuk, M I; Reizman, S Ya; Sosnovets, E N; Teltsov, M V; Tsetlin, V V

    2004-01-01

    The dynamics of the ISS-measured radiation dose variations since August 2000 is studied. Use is made of the data obtained with the R-16 instrument, which consists of two ionization chambers behind different shielding thicknesses. The doses recorded during solar energetic particle (SEP) events are compared with the data obtained also by R-16 on Mir space station. The SEP events in the solar maximum of the current cycle make a much smaller contribution to the radiation dose compared with the October 1989 event recorded on Mir space station. In the latter event, the proton intensity was peaking during a strong magnetic storm. The storm-time effect of solar proton geomagnetic cutoff decreases on dose variations is estimated. The dose variations on Mir space stations due to formation of a new radiation belt of high-energy protons and electrons during a sudden commencement of March 24, 1991 storm are also studied. It was for the first time throughout the ISS and Mir dose measurement period that the counting rates recorded by both R-16 channels on ISS in 2001-2002 were nearly the same during some time intervals. This effect may arise from the decreases of relativistic electron fluxes in the outer radiation belt. c2004 COSPAR. Published by Elsevier Ltd. All rights reserved.

  1. Design and implementation of an audio indicator

    NASA Astrophysics Data System (ADS)

    Zheng, Shiyong; Li, Zhao; Li, Biqing

    2017-04-01

    This page proposed an audio indicator which designed by using C9014, LED by operational amplifier level indicator, the decimal count/distributor of CD4017. The experimental can control audibly neon and holiday lights through the signal. Input audio signal after C9014 composed of operational amplifier for power amplifier, the adjust potentiometer extraction amplification signal input voltage CD4017 distributors make its drive to count, then connect the LED display running situation of the circuit. This simple audio indicator just use only U1 and can produce two colors LED with the audio signal tandem come pursuit of the running effect, from LED display the running of the situation takes can understand the general audio signal. The variation in the audio and the frequency of the signal and the corresponding level size. In this light can achieve jump to change, slowly, atlas, lighting four forms, used in home, hotel, discos, theater, advertising and other fields, and a wide range of USES, rU1h life in a modern society.

  2. Self-synchronization for spread spectrum audio watermarks after time scale modification

    NASA Astrophysics Data System (ADS)

    Nadeau, Andrew; Sharma, Gaurav

    2014-02-01

    De-synchronizing operations such as insertion, deletion, and warping pose significant challenges for watermarking. Because these operations are not typical for classical communications, watermarking techniques such as spread spectrum can perform poorly. Conversely, specialized synchronization solutions can be challenging to analyze/ optimize. This paper addresses desynchronization for blind spread spectrum watermarks, detected without reference to any unmodified signal, using the robustness properties of short blocks. Synchronization relies on dynamic time warping to search over block alignments to find a sequence with maximum correlation to the watermark. This differs from synchronization schemes that must first locate invariant features of the original signal, or estimate and reverse desynchronization before detection. Without these extra synchronization steps, analysis for the proposed scheme builds on classical SS concepts and allows characterizes the relationship between the size of search space (number of detection alignment tests) and intrinsic robustness (continuous search space region covered by each individual detection test). The critical metrics that determine the search space, robustness, and performance are: time-frequency resolution of the watermarking transform, and blocklength resolution of the alignment. Simultaneous robustness to (a) MP3 compression, (b) insertion/deletion, and (c) time-scale modification is also demonstrated for a practical audio watermarking scheme developed in the proposed framework.

  3. The detection and observation of meteoroid and space debris impact features on the Solar Max satellite

    NASA Technical Reports Server (NTRS)

    Warren, J. L.; Allton, J. H.; Dardano, C. B.; Holder, J. A.; Marlow, R. R.; Zook, H. A.; Clanton, U. S.; Schultz, R. A.

    1989-01-01

    The results of optical observations on the entire population of impact pits and holes discovered on nearly all of the returned parts of the Solar Maximum Mission satellite are described. Some statistically significant new results on impact dynamics are obtained as well as new number vs. size distributions for craters and holes ranging from 40 to 900 microns. The local spacecraft geometric shielding corrections are calculated and applied to the data. Some selected compositional results are presented which indicate that the Space Shuttle can be a source of impacting debris.

  4. STS-44 onboard (in-space) crew portrait features 'Trash Man' Hennen

    NASA Technical Reports Server (NTRS)

    1991-01-01

    STS-44 crewmembers, wearing mission polo shirts, pose for a second onboard (in-space) portrait on the middeck of Atlantis, Orbiter Vehicle (OV) 104. This portrait was intended to focus on Payload Specialist Thomas J. Hennen, holding extended duration orbiter (EDO) trash compactor handles, who earned the nickname, 'Trash Man' during the flight. It was Hennen who 'starred' in a brief onboard video production about the disposal of trash and waste. Others in the picture are (front row) Commander Frederick D. Gregory (left) and Mission Specialist (MS) James S. Voss and (back row left to right) MS Mario Runco, Jr, MS F. Story Musgrave, and Pilot Terence T. Henricks.

  5. Competing descriptions of diffusion profiles with two features: Surface space-charge layer versus fast grain-boundary diffusion

    NASA Astrophysics Data System (ADS)

    Schraknepper, H.; De Souza, R. A.

    2016-02-01

    Two different physical processes, (i) fast grain-boundary diffusion (FGBD) of oxygen and (ii) hindered oxygen diffusion in a surface space-charge layer, yield oxygen isotope diffusion profiles in a similar form. Two features are observed, with the short, sharp profile close to the surface being followed by a longer, shallower profile. In this study, we develop a procedure for deciding which of the two descriptions applies to experimentally measured profiles. Specifically, we solve Fick's second law, using finite-element simulations, to obtain oxygen isotope diffusion profiles for the two cases. Each set of profiles is then analysed in terms of the competing description. In this manner, we derive falsifiable conditions that allow physical processes to be assigned unambiguously to the two features of such isotope profiles. Applying these conditions to experimental profiles for SrTiO3 single crystals published in the literature, we find that FGBD is an invalid model for describing the diffusion processes.

  6. Navigation for the Blind through Audio-Based Virtual Environments.

    PubMed

    Sánchez, Jaime; Sáenz, Mauricio; Pascual-Leone, Alvaro; Merabet, Lotfi

    2010-01-01

    We present the design, development and an initial study changes and adaptations related to navigation that take place in the brain, by incorporating an Audio-Based Environments Simulator (AbES) within a neuroimaging environment. This virtual environment enables a blind user to navigate through a virtual representation of a real space in order to train his/her orientation and mobility skills. Our initial results suggest that this kind of virtual environment could be highly efficient as a testing, training and rehabilitation platform for learning and navigation.

  7. On the scaling features of magnetic field fluctuations at non-MHD scales in turbulent space plasmas

    NASA Astrophysics Data System (ADS)

    Consolini, G.; Giannattasio, F.; Yordanova, E.; Vörös, Z.; Marcucci, M. F.; Echim, M.; Chang, T.

    2016-11-01

    In several different contexts space plasmas display intermittent turbulence at magneto-hydro-dynamic (MHD) scales, which manifests in anomalous scaling features of the structure functions of the magnetic field increments. Moving to smaller scales, i.e. below the ion-cyclotron and/or ion inertial length, these scaling features are still observed, even though its is not clear if these scaling features are still anomalous or not. Here, we investigate the nature of scaling properties of magnetic field increments at non-MHD scales for a period of fast solar wind to investigate the occurrence or not of multifractal features and collapsing of probability distribution functions (PDFs) using the novel Rank-Ordered Multifractal Analysis (ROMA) method, which is more sensitive than the traditional structure function approach. We find a strong evidence for the occurrence of a near mono-scaling behavior, which suggests that the observed turbulent regime at non-MHD scales mainly displays a mono-fractal nature of magnetic field increments. The results are discussed in terms of a non-compact fractal structure of the dissipation field.

  8. A Visual Analytics Approach Using the Exploration of Multidimensional Feature Spaces for Content-Based Medical Image Retrieval.

    PubMed

    Kumar, Ashnil; Nette, Falk; Klein, Karsten; Fulham, Michael; Kim, Jinman

    2015-09-01

    Content-based image retrieval (CBIR) is a search technique based on the similarity of visual features and has demonstrated potential benefits for medical diagnosis, education, and research. However, clinical adoption of CBIR is partially hindered by the difference between the computed image similarity and the user's search intent, the semantic gap, with the end result that relevant images with outlier features may not be retrieved. Furthermore, most CBIR algorithms do not provide intuitive explanations as to why the retrieved images were considered similar to the query (e.g., which subset of features were similar), hence, it is difficult for users to verify if relevant images, with a small subset of outlier features, were missed. Users, therefore, resort to examining irrelevant images and there are limited opportunities to discover these "missed" images. In this paper, we propose a new approach to medical CBIR by enabling a guided visual exploration of the search space through a tool, called visual analytics for medical image retrieval (VAMIR). The visual analytics approach facilitates interactive exploration of the entire dataset using the query image as a point-of-reference. We conducted a user study and several case studies to demonstrate the capabilities of VAMIR in the retrieval of computed tomography images and multimodality positron emission tomography and computed tomography images.

  9. Features of Self-organization in Space Plasma: Generation and Evolution of Extremely Thin Current Sheets

    NASA Astrophysics Data System (ADS)

    Kropotkin, A. P.; Domrin, V. I.

    2005-12-01

    Thin current sheets (TCS) have been earlier shown to be specific structures forming in magnetized plasmas under a variety of conditions. We argue that fast evolution which may follow the slow, quasi-static process of TCS initial formation, can be started by a fast loss of equilibrium in the system produced by localized (e.g. tearing) instability, and MHD disturbance propagating along TCS from instability location. That fast evolution involves formation of extremely thin embedded structures, with scales ranging down to the ion Larmor radius, and it must be analysed based on plasma kinetics. The analysis has been carried out by means of kinetic simulation using the hybrid technique. The process appears to have quite different features, i.e. to follow various dynamical paths, depending on initial conditions. We demonstrate in particular the key role of the magnetic field normal component. The dominating features may be either those of a slow switch-off shock or those of a stationary forced current sheet (FCS), with its extremely anisotropic ion distributions. In both cases, however, at later stages, the process appears to be spontaneously self-sustained, as a finite magnitude MHD disturbance of a rarefaction wave type propagates back over the background plasma outside the CS. Transformation of electromagnetic energy into the energy of plasma flows occurs at the TCS in both cases, providing an effect of magnetic field "annihilation" which is a necessary constituent of fast magnetic reconnection.

  10. Improvement of information fusion-based audio steganalysis

    NASA Astrophysics Data System (ADS)

    Kraetzer, Christian; Dittmann, Jana

    2010-01-01

    In the paper we extend an existing information fusion based audio steganalysis approach by three different kinds of evaluations: The first evaluation addresses the so far neglected evaluations on sensor level fusion. Our results show that this fusion removes content dependability while being capable of achieving similar classification rates (especially for the considered global features) if compared to single classifiers on the three exemplarily tested audio data hiding algorithms. The second evaluation enhances the observations on fusion from considering only segmental features to combinations of segmental and global features, with the result of a reduction of the required computational complexity for testing by about two magnitudes while maintaining the same degree of accuracy. The third evaluation tries to build a basis for estimating the plausibility of the introduced steganalysis approach by measuring the sensibility of the models used in supervised classification of steganographic material against typical signal modification operations like de-noising or 128kBit/s MP3 encoding. Our results show that for some of the tested classifiers the probability of false alarms rises dramatically after such modifications.

  11. Our Understanding of Space Weather features responsible for geostationary satellite anamolies (P39)

    NASA Astrophysics Data System (ADS)

    Rajaram, G.; et al.

    2006-11-01

    girija60@yahoo.com The topic of space weather effects on operational anomalies on spacecraft is one of considerable research investigation, with both pure and applied aspects. This is because of the very high costs involved in fabricating and operating spacecraft, and in insuring them against the harmful effects of space weather. This is more true for geostationary satellites than of low-orbiting spacecraft, as the former operate in the high-risk environment of the Earth’s outer radiation belts, with its large vagaries in spatial and time variations of high- energy electron and proton distributions (BAKER 2003). Without doubt, plasma and magnetic field emissions from active regions on the Sun are the root cause for spacecraft anomalies. Our study for 2005 shows that over 95% of anomalies can be related to some definite activity on the Sun, ranging from high-speed solar wind streams with their 27-day recurrence patterns/coronal holes/coronal mass ejections preceded by X or M type of flares/and magnetic cloud events. The most energetic solar activity events are generally accompanied by a large rise in solar MeV proton densities at geo-stationary orbit (WILKINSON 1994), and they account for definite anomalies classified as SEU (Single Event Upsets which most often are reversible through resetting commands). Any particles in the low energy ranges (eV and keV, and these could be of magnetospheric or ionospheric origin), are believed to cause external charging effects in exposed parts of the spacecraft such as solar power arrays and power cables. These mainly result in power losses which are debilitating over a period of time. The most dangerous and often irrecoverable damage is due to electronics in the 1-5 MeV range which cause deep dielectric discharge of arc type in semi-conductors comprising spacecraft instruments. Following major solar activity, the populations of these rise to more than (5x103) particles/cm2.ster.sec, with large spatial and time variations (LOVE

  12. [Monitoring of farmland drought based on LST-LAI spectral feature space].

    PubMed

    Sui, Xin-Xin; Qin, Qi-Ming; Dong, Heng; Wang, Jin-Liang; Meng, Qing-Ye; Liu, Ming-Chao

    2013-01-01

    Farmland drought has the characteristics of wide range and seriously affecting on agricultural production, so real-time dynamic monitored has been a challenging problem. By using MODIS land products, and constructing the spectral space of LST and LAI, the temperature LAI drought index (TLDI) was put forward and validated using ground-measured 0-10 cm averaged soil moisture of Ningxia farmland. The results show that the coefficient of determination (R2) of both them varies from 0.43 to 0.86. Compared to TVDI, the TLDI has higher accuracy for farmland moisture monitoring, and solves the saturation of NDVI during the late development phases of the crop. Furthermore, directly using MODIS land products LST and LAI and avoiding the complicated process of using the original MODIS data provide a new technical process to the regular operation of farmland drought monitoring.

  13. Spatial domain entertainment audio decompression/compression

    NASA Astrophysics Data System (ADS)

    Chan, Y. K.; Tam, Ka Him K.

    2014-02-01

    The ARM7 NEON processor with 128bit SIMD hardware accelerator requires a peak performance of 13.99 Mega Cycles per Second for MP3 stereo entertainment quality decoding. For similar compression bit rate, OGG and AAC is preferred over MP3. The Patent Cooperation Treaty Application dated 28/August/2012 describes an audio decompression scheme producing a sequence of interleaving "min to Max" and "Max to min" rising and falling segments. The number of interior audio samples bound by "min to Max" or "Max to min" can be {0|1|…|N} audio samples. The magnitudes of samples, including the bounding min and Max, are distributed as normalized constants within the 0 and 1 of the bounding magnitudes. The decompressed audio is then a "sequence of static segments" on a frame by frame basis. Some of these frames needed to be post processed to elevate high frequency. The post processing is compression efficiency neutral and the additional decoding complexity is only a small fraction of the overall decoding complexity without the need of extra hardware. Compression efficiency can be speculated as very high as source audio had been decimated and converted to a set of data with only "segment length and corresponding segment magnitude" attributes. The PCT describes how these two attributes are efficiently coded by the PCT innovative coding scheme. The PCT decoding efficiency is obviously very high and decoding latency is basically zero. Both hardware requirement and run time is at least an order of magnitude better than MP3 variants. The side benefit is ultra low power consumption on mobile device. The acid test on how such a simplistic waveform representation can indeed reproduce authentic decompressed quality is benchmarked versus OGG(aoTuv Beta 6.03) by three pair of stereo audio frames and one broadcast like voice audio frame with each frame consisting 2,028 samples at 44,100KHz sampling frequency.

  14. Could Audio-Described Films Benefit from Audio Introductions? An Audience Response Study

    ERIC Educational Resources Information Center

    Romero-Fresco, Pablo; Fryer, Louise

    2013-01-01

    Introduction: Time constraints limit the quantity and type of information conveyed in audio description (AD) for films, in particular the cinematic aspects. Inspired by introductory notes for theatre AD, this study developed audio introductions (AIs) for "Slumdog Millionaire" and "Man on Wire." Each AI comprised 10 minutes of…

  15. Could Audio-Described Films Benefit from Audio Introductions? An Audience Response Study

    ERIC Educational Resources Information Center

    Romero-Fresco, Pablo; Fryer, Louise

    2013-01-01

    Introduction: Time constraints limit the quantity and type of information conveyed in audio description (AD) for films, in particular the cinematic aspects. Inspired by introductory notes for theatre AD, this study developed audio introductions (AIs) for "Slumdog Millionaire" and "Man on Wire." Each AI comprised 10 minutes of…

  16. Features of space-charge-limited emission in foil-less diodes

    SciTech Connect

    Wu, Ping; Yuan, Keliang; Liu, Guozhi; Sun, Jun

    2014-12-15

    Space-charge-limited (SCL) current can always be obtained from the blade surface of annular cathodes in foil-less diodes which are widely used in O-type relativistic high power microwave generators. However, there is little theoretical analysis regarding it due to the mathematical complexity, and almost all formulas about the SCL current in foil-less diodes are based on numerical simulation results. This paper performs an initial trial in calculation of the SCL current from annular cathodes theoretically under the ultra-relativistic assumption and the condition of infinitely large guiding magnetic field. The numerical calculation based on the theoretical research is coherent with the particle-in-cell (PIC) simulation result to some extent under a diode voltage of 850 kV. Despite that the theoretical research gives a much larger current than the PIC simulation (41.3 kA for the former and 9.7 kA for the latter), which is induced by the ultra-relativistic assumption in the theoretical research, they both show the basic characteristic of emission from annular cathodes in foil-less diodes, i.e., the emission enhancement at the cathode blade edges, especially at the outer edge. This characteristic is confirmed to some extent in our experimental research of cathode plasma photographing under the same diode voltage and a guiding magnetic field of 4 T.

  17. Audio stream classification for multimedia database search

    NASA Astrophysics Data System (ADS)

    Artese, M.; Bianco, S.; Gagliardi, I.; Gasparini, F.

    2013-03-01

    Search and retrieval of huge archives of Multimedia data is a challenging task. A classification step is often used to reduce the number of entries on which to perform the subsequent search. In particular, when new entries of the database are continuously added, a fast classification based on simple threshold evaluation is desirable. In this work we present a CART-based (Classification And Regression Tree [1]) classification framework for audio streams belonging to multimedia databases. The database considered is the Archive of Ethnography and Social History (AESS) [2], which is mainly composed of popular songs and other audio records describing the popular traditions handed down generation by generation, such as traditional fairs, and customs. The peculiarities of this database are that it is continuously updated; the audio recordings are acquired in unconstrained environment; and for the non-expert human user is difficult to create the ground truth labels. In our experiments, half of all the available audio files have been randomly extracted and used as training set. The remaining ones have been used as test set. The classifier has been trained to distinguish among three different classes: speech, music, and song. All the audio files in the dataset have been previously manually labeled into the three classes above defined by domain experts.

  18. iACP-GAEnsC: Evolutionary genetic algorithm based ensemble classification of anticancer peptides by utilizing hybrid feature space.

    PubMed

    Akbar, Shahid; Hayat, Maqsood; Iqbal, Muhammad; Jan, Mian Ahmad

    2017-06-01

    Cancer is a fatal disease, responsible for one-quarter of all deaths in developed countries. Traditional anticancer therapies such as, chemotherapy and radiation, are highly expensive, susceptible to errors and ineffective techniques. These conventional techniques induce severe side-effects on human cells. Due to perilous impact of cancer, the development of an accurate and highly efficient intelligent computational model is desirable for identification of anticancer peptides. In this paper, evolutionary intelligent genetic algorithm-based ensemble model, 'iACP-GAEnsC', is proposed for the identification of anticancer peptides. In this model, the protein sequences are formulated, using three different discrete feature representation methods, i.e., amphiphilic Pseudo amino acid composition, g-Gap dipeptide composition, and Reduce amino acid alphabet composition. The performance of the extracted feature spaces are investigated separately and then merged to exhibit the significance of hybridization. In addition, the predicted results of individual classifiers are combined together, using optimized genetic algorithm and simple majority technique in order to enhance the true classification rate. It is observed that genetic algorithm-based ensemble classification outperforms than individual classifiers as well as simple majority voting base ensemble. The performance of genetic algorithm-based ensemble classification is highly reported on hybrid feature space, with an accuracy of 96.45%. In comparison to the existing techniques, 'iACP-GAEnsC' model has achieved remarkable improvement in terms of various performance metrics. Based on the simulation results, it is observed that 'iACP-GAEnsC' model might be a leading tool in the field of drug design and proteomics for researchers. Copyright © 2017 Elsevier B.V. All rights reserved.

  19. Estimating Species Distributions Across Space Through Time and with Features of the Environment

    SciTech Connect

    Kelling, S.; Fink, D.; Hochachka, W.; Rosenberg, K.; Cook, R.; Damoulas, C.; Silva, C.; Michener, W.

    2013-01-01

    Complete guidance for mastering the tools and techniques of the digital revolution With the digital revolution opening up tremendous opportunities in many fields, there is a growing need for skilled professionals who can develop data-intensive systems and extract information and knowledge from them. This book frames for the first time a new systematic approach for tackling the challenges of data-intensive computing, providing decision makers and technical experts alike with practical tools for dealing with our exploding data collections. Emphasizing data-intensive thinking and interdisciplinary collaboration, The Data Bonanza: Improving Knowledge Discovery in Science, Engineering, and Business examines the essential components of knowledge discovery, surveys many of the current research efforts worldwide, and points to new areas for innovation. Complete with a wealth of examples and DISPEL-based methods demonstrating how to gain more from data in real-world systems, the book: Outlines the concepts and rationale for implementing data-intensive computing in organizations Covers from the ground up problem-solving strategies for data analysis in a data-rich world Introduces techniques for data-intensive engineering using the Data-Intensive Systems Process Engineering Language DISPEL Features in-depth case studies in customer relations, environmental hazards, seismology, and more Showcases successful applications in areas ranging from astronomy and the humanities to transport engineering Includes sample program snippets throughout the text as well as additional materials on a companion website The Data Bonanza is a must-have guide for information strategists, data analysts, and engineers in business, research, and government, and for anyone wishing to be on the cutting edge of data mining, machine learning, databases, distributed systems, or large-scale computing.

  20. [The features of adaptation and disadaptation of the human cardiovascular system in the space flight conditions].

    PubMed

    Kotovskaia, A R; Fomina, G A

    2010-01-01

    The work was aimed at analysis and generalization of the hemodynamic data collected over 20 years from 26 cosmonauts flown 8 to 438 days aboard orbital stations Salyut 7 and Mir. The paper presents the results of ultrasonic investigations of the heart, arterial and venous peripheral vessels in different parts of human body, and measurements of leg veins capacity with the use of occlusive plethysmograpy. It was shown that in the resting condition such prime hemodynamic parameters as the pumping function of the heart and blood supply of the brain, and integral parameters, i.e. arterial pressure and heat rate, were best "protected" as they demonstrated stability throughout long exposure in microgravity. In the absence of gravitational stimulation, arterial resistance went down in essentially all vascular regions below the heart level; to put it differently, the anti-gravity distribution of the vascular tone was annulled gradually as unneeded in microgravity. Compared with the data about arteries, venous hemodynamics was found to be particularly sensitive considering the early advent and significance of changes. Venous return slowed down, resistance of the lower body vessels declined and capacity of the leg venous net increased. Functional testing with the lower body negative pressure revealed degradation of the gravity-dependent reactions that became more conspicuous as flight duration extended further. Cardiovascular deconditioning showed itself clearly on return to Earth's gravity by decreased g-tolerance during re-entry and orthostatic instability post flight. These investigations provided objective evidence for multifactorial genesis of orthostatic instability during space flight including blood redistribution, altered tone regulation of leg's venous and arterial vessels and hypovolemia.

  1. Feature to space conversion during target selection in the dorsolateral and ventrolateral prefrontal cortex of monkeys.

    PubMed

    Inoue, Masato; Mikami, Akichika

    2010-03-01

    To investigate the neuronal mechanism of the process of selection of a target from an array of stimuli, we analysed neuronal activity of the lateral prefrontal cortex during the response period of a serial probe reproduction task. During the response period of this task, monkeys were trained to select a memorized target object from an array of three objects and make a saccadic eye movement toward it. Of 611 neurons, 74 neurons showed visual response and 56 neurons showed presaccadic activity during the response period. Among visual neurons, 27 showed array- and target-selectivity. All of these array- and target-selective visual responses were recorded from the ventrolateral prefrontal cortex (VLPFC). Among 56 neurons with presaccadic activity, nine showed target-selective activity, 17 showed target- and direction-selective activity, and 23 showed direction-selective activity. The target-selective, and the target- and direction-selective activities were recorded from the VLPFC, and the direction-selective activities were recorded from VLPFC and dorsolateral prefrontal cortex (DLPFC). The starting time of the activity was earlier for the target-selective, and target- and direction-selective activities in VLPFC, intermediate for the direction-selective activities in VLPFC, and later for the direction-selective activities in DLPFC. These results suggest that VLPFC plays a role in the process of selection of a target object from an array of stimuli, VLPFC and DLPFC play a role in determining the location of the target in space, and DLPFC plays a role in selecting a direction and making a decision to generate a saccadic eye movement.

  2. Transitioning from Analog to Digital Audio Recording in Childhood Speech Sound Disorders

    ERIC Educational Resources Information Center

    Shriberg, Lawrence D.; Mcsweeny, Jane L.; Anderson, Bruce E.; Campbell, Thomas F.; Chial, Michael R.; Green, Jordan R.; Hauner, Katherina K.; Moore, Christopher A.; Rusiewicz, Heather L.; Wilson, David L.

    2005-01-01

    Few empirical findings or technical guidelines are available on the current transition from analog to digital audio recording in childhood speech sound disorders. Of particular concern in the present context was whether a transition from analog- to digital-based transcription and coding of prosody and voice features might require re-standardizing…

  3. An Audio-Visual Tutorial Laboratory Program for Introductory Geology. Final Report.

    ERIC Educational Resources Information Center

    Sweet, Walter C.; Bates, Robert L.

    Described is an audio-visual tutorial laboratory program designed to provide a uniform, regularly reinforced, programmatic introduction to a limited set of geologic concepts and features; to provide a sequence of problem-solving exercises on which the student can work as an individual and in which he is required repeatedly to use all elements of…

  4. Advanced Audio Interface for Phonetic Speech Recognition in a High Noise Environment

    DTIC Science & Technology

    2000-01-01

    Standard Object Systems, Inc. (SOS) has used its existing technology in phonetic speech recognition, audio signal processing, and multilingual ...interfaces to speech recognition software. It uses auditory features in speech recognition training, and provides applications to multilingual spoken language...translation. As a future Phase 3 commercial product, this system will perform real time multilingual speech recognition in noisy vehicles, offices and

  5. MRI with synchronized audio to evaluate velopharyngeal insufficiency.

    PubMed

    Maturo, Stephen; Silver, Amanda; Nimkin, Katherine; Sagar, Pallavi; Ashland, Jean; van der Kouwe, Andre J W; Hartnick, Christopher

    2012-11-01

    To demonstrate the feasibility of simultaneous-acquired magnetic resonance imaging (MRI) and high-quality synchronized audio recording for evaluating velopharyngeal closure. Institutional Review Board-approved case series. Tertiary care hospital. Three healthy adult volunteers with a normal speech pattern. MRI with simultaneous recorded audio files evaluating velopharyngeal closure. Precise imaging and audio coordination of specific phonatory tasks. Synchronization of MRI and audio in all three adults. Our novel imaging and audio protocol provides simultaneous acquired MRI with synchronized high quality audio for evaluating velopharyngeal closure. This technique may provide the opportunity to improve diagnosis and surgical planning in patients with velopharyngeal insufficiency.

  6. Audio Haptic Videogaming for Developing Wayfinding Skills in Learners Who are Blind.

    PubMed

    Sánchez, Jaime; de Borba Campos, Marcia; Espinoza, Matías; Merabet, Lotfi B

    2014-01-01

    Interactive digital technologies are currently being developed as a novel tool for education and skill development. Audiopolis is an audio and haptic based videogame designed for developing orientation and mobility (O&M) skills in people who are blind. We have evaluated the cognitive impact of videogame play on O&M skills by assessing performance on a series of behavioral tasks carried out in both indoor and outdoor virtual spaces. Our results demonstrate that the use of Audiopolis had a positive impact on the development and use of O&M skills in school-aged learners who are blind. The impact of audio and haptic information on learning is also discussed.

  7. Nonlinear dynamic macromodeling techniques for audio systems

    NASA Astrophysics Data System (ADS)

    Ogrodzki, Jan; Bieńkowski, Piotr

    2015-09-01

    This paper develops a modelling method and a models identification technique for the nonlinear dynamic audio systems. Identification is performed by means of a behavioral approach based on a polynomial approximation. This approach makes use of Discrete Fourier Transform and Harmonic Balance Method. A model of an audio system is first created and identified and then it is simulated in real time using an algorithm of low computational complexity. The algorithm consists in real time emulation of the system response rather than in simulation of the system itself. The proposed software is written in Python language using object oriented programming techniques. The code is optimized for a multithreads environment.

  8. DWT-Based High Capacity Audio Watermarking

    NASA Astrophysics Data System (ADS)

    Fallahpour, Mehdi; Megías, David

    This letter suggests a novel high capacity robust audio watermarking algorithm by using the high frequency band of the wavelet decomposition, for which the human auditory system (HAS) is not very sensitive to alteration. The main idea is to divide the high frequency band into frames and then, for embedding, the wavelet samples are changed based on the average of the relevant frame. The experimental results show that the method has very high capacity (about 5.5kbps), without significant perceptual distortion (ODG in [-1, 0] and SNR about 33dB) and provides robustness against common audio signal processing such as added noise, filtering, echo and MPEG compression (MP3).

  9. High performance MPEG-audio decoder IC

    NASA Technical Reports Server (NTRS)

    Thorn, M.; Benbassat, G.; Cyr, K.; Li, S.; Gill, M.; Kam, D.; Walker, K.; Look, P.; Eldridge, C.; Ng, P.

    1993-01-01

    The emerging digital audio and video compression technology brings both an opportunity and a new challenge to IC design. The pervasive application of compression technology to consumer electronics will require high volume, low cost IC's and fast time to market of the prototypes and production units. At the same time, the algorithms used in the compression technology result in complex VLSI IC's. The conflicting challenges of algorithm complexity, low cost, and fast time to market have an impact on device architecture and design methodology. The work presented in this paper is about the design of a dedicated, high precision, Motion Picture Expert Group (MPEG) audio decoder.

  10. High performance MPEG-audio decoder IC

    NASA Technical Reports Server (NTRS)

    Thorn, M.; Benbassat, G.; Cyr, K.; Li, S.; Gill, M.; Kam, D.; Walker, K.; Look, P.; Eldridge, C.; Ng, P.

    1993-01-01

    The emerging digital audio and video compression technology brings both an opportunity and a new challenge to IC design. The pervasive application of compression technology to consumer electronics will require high volume, low cost IC's and fast time to market of the prototypes and production units. At the same time, the algorithms used in the compression technology result in complex VLSI IC's. The conflicting challenges of algorithm complexity, low cost, and fast time to market have an impact on device architecture and design methodology. The work presented in this paper is about the design of a dedicated, high precision, Motion Picture Expert Group (MPEG) audio decoder.

  11. Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans (L)

    PubMed Central

    Bresch, Erik; Nielsen, Jon; Nayak, Krishna; Narayanan, Shrikanth

    2007-01-01

    This letter describes a data acquisition setup for recording, and processing, running speech from a person in a magnetic resonance imaging (MRI) scanner. The main focus is on ensuring synchronicity between image and audio acquisition, and in obtaining good signal to noise ratio to facilitate further speech analysis and modeling. A field-programmable gate array based hardware design for synchronizing the scanner image acquisition to other external data such as audio is described. The audio setup itself features two fiber optical microphones and a noise-canceling filter. Two noise cancellation methods are described including a novel approach using a pulse sequence specific model of the gradient noise of the MRI scanner. The setup is useful for scientific speech production studies. Sample results of speech and singing data acquired and processed using the proposed method are given. PMID:17069275

  12. Hierarchical structure for audio-video based semantic classification of sports video sequences

    NASA Astrophysics Data System (ADS)

    Kolekar, M. H.; Sengupta, S.

    2005-07-01

    A hierarchical structure for sports event classification based on audio and video content analysis is proposed in this paper. Compared to the event classifications in other games, those of cricket are very challenging and yet unexplored. We have successfully solved cricket video classification problem using a six level hierarchical structure. The first level performs event detection based on audio energy and Zero Crossing Rate (ZCR) of short-time audio signal. In the subsequent levels, we classify the events based on video features using a Hidden Markov Model implemented through Dynamic Programming (HMM-DP) using color or motion as a likelihood function. For some of the game-specific decisions, a rule-based classification is also performed. Our proposed hierarchical structure can easily be applied to any other sports. Our results are very promising and we have moved a step forward towards addressing semantic classification problems in general.

  13. Extrastriate Visual Areas Integrate Form Features over Space and Time to Construct Representations of Stationary and Rigidly Rotating Objects.

    PubMed

    McCarthy, J Daniel; Kohler, Peter J; Tse, Peter U; Caplovitz, Gideon Paul

    2015-11-01

    When an object moves behind a bush, for example, its visible fragments are revealed at different times and locations across the visual field. Nonetheless, a whole moving object is perceived. Unlike traditional modal and amodal completion mechanisms known to support spatial form integration when all parts of a stimulus are simultaneously visible, relatively little is known about the neural substrates of the spatiotemporal form integration (STFI) processes involved in generating coherent object representations from a succession visible fragments. We used fMRI to identify brain regions involved in two mechanisms supporting the representation of stationary and rigidly rotating objects whose form features are shown in succession: STFI and position updating. STFI allows past and present form cues to be integrated over space and time into a coherent object even when the object is not visible in any given frame. STFI can occur whether or not the object is moving. Position updating allows us to perceive a moving object, whether rigidly rotating or translating, even when its form features are revealed at different times and locations in space. Our results suggest that STFI is mediated by visual regions beyond V1 and V2. Moreover, although widespread cortical activation has been observed for other motion percepts derived solely from form-based analyses [Tse, P. U. Neural correlates of transformational apparent motion. Neuroimage, 31, 766-773, 2006; Krekelberg, B., Vatakis, A., & Kourtzi, Z. Implied motion from form in the human visual cortex. Journal of Neurophysiology, 94, 4373-4386, 2005], increased responses for the position updating that lead to rigidly rotating object representations were only observed in visual areas KO and possibly hMT+, indicating that this is a distinct and highly specialized type of processing.

  14. Extrastriate visual areas integrate form features over space and time to construct representations of stationary and rigidly rotating objects

    PubMed Central

    McCarthy, J. Daniel; Kohler, Peter J.; Tse, Peter U.; Caplovitz, Gideon Paul

    2016-01-01

    When an object moves behind a bush, for example, its visible fragments are revealed at different times and locations across the visual field. Nonetheless, a whole moving object is perceived. Unlike traditional modal and amodal completion mechanisms known to support spatial form integration when all parts of a stimulus are simultaneously visible, relatively little is known about the neural substrates of the spatiotemporal form integration processes involved in generating coherent object representations from a succession visible fragments. We use fMRI to identify brain regions involved in two mechanisms supporting the representation of stationary and rigidly rotating objects whose form features are shown in succession: Spatiotemporal Form Integration (STFI) and Position Updating. STFI allows past and present form cues to be integrated over space and time into a coherent object even when the object is not visible in any given frame. STFI can occur whether or not the object is moving. Position updating allows us to perceive a moving object, whether rigidly rotating or translating, even when its form features are revealed at different times and locations in space. Our results suggest that STFI is mediated by visual regions beyond V1 and V2. Moreover, while widespread cortical activation has been observed for other motion percepts derived solely from form-based analyses (Krekelberg, Vatakis, & Kourtzi, 2005; Tse, 2006), increased responses for the position updating that leads to rigidly rotating object representations were only observed in visual areas KO and possibly hMT+, indicating that this is a distinct and highly specialized type of processing. PMID:26226075

  15. Music information retrieval in compressed audio files: a survey

    NASA Astrophysics Data System (ADS)

    Zampoglou, Markos; Malamos, Athanasios G.

    2014-07-01

    In this paper, we present an organized survey of the existing literature on music information retrieval systems in which descriptor features are extracted directly from the compressed audio files, without prior decompression to pulse-code modulation format. Avoiding the decompression step and utilizing the readily available compressed-domain information can significantly lighten the computational cost of a music information retrieval system, allowing application to large-scale music databases. We identify a number of systems relying on compressed-domain information and form a systematic classification of the features they extract, the retrieval tasks they tackle and the degree in which they achieve an actual increase in the overall speed-as well as any resulting loss in accuracy. Finally, we discuss recent developments in the field, and the potential research directions they open toward ultra-fast, scalable systems.

  16. Three-dimensional audio versus head-down traffic alert and collision avoidance system displays.

    PubMed

    Begault, D R; Pittman, M T

    1996-01-01

    The advantage of a head-up auditory display for situational awareness was evaluated in an experiment designed to measure and compare the acquisition time for capturing visual targets under two conditions: standard head-down Traffic Alert and Collision Avoidance System display and three-dimensional (3-D) audio Traffic Alert and Collision Avoidance System presentation. (The technology used for 3-D audio presentation allows a stereo headphone user to potentially localize a sound at any externalized position in 3-D auditory space). Ten commercial airline crews were tested under full-mission simulation conditions at the NASA-Ames Crew-Vehicle Systems Research Facility Advanced Concepts Flight Simulator. Scenario software generated targets corresponding to aircraft that activated a 3-D aural advisory (the head-up auditory condition) or a standard, visual-audio TCAS advisory (map display with monaural audio alert). Results showed a significant difference in target acquisition time between the two conditions, favoring the 3-D audio Traffic Alert and Collision Avoidance System condition by 500 ms.

  17. Safe, Affordable, Convenient: Environmental Features of Malls and Other Public Spaces Used by Older Adults for Walking

    PubMed Central

    King, Diane K.; Allen, Peg; Jones, Dina L.; Marquez, David X.; Brown, David R.; Rosenberg, Dori; Janicek, Sarah; Allen, Laila; Belza, Basia

    2016-01-01

    Background Midlife and older adults use shopping malls for walking, but little research has examined mall characteristics that contribute to their walkability. Methods We used modified versions of the Centers for Disease Control and Prevention (CDC)-Healthy Aging Research Network (HAN) Environmental Audit and the System for Observing Play and Recreation in Communities (SOPARC) tool to systematically observe 443 walkers in 10 shopping malls. We also observed 87 walkers in 6 community-based nonmall/nongym venues where older adults routinely walked for physical activity. Results All venues had public transit stops and accessible parking. All malls and 67% of nonmalls had wayfinding aids, and most venues (81%) had an established circuitous walking route and clean, well-maintained public restrooms (94%). All venues had level floor surfaces, and one-half had benches along the walking route. Venues varied in hours of access, programming, tripping hazards, traffic control near entrances, and lighting. Conclusions Despite diversity in location, size, and purpose, the mall and nonmall venues audited shared numerous environmental features known to promote walking in older adults and few barriers to walking. Future research should consider programmatic features and outreach strategies to expand the use of malls and other suitable public spaces for walking. PMID:26181907

  18. Safe, Affordable, Convenient: Environmental Features of Malls and Other Public Spaces Used by Older Adults for Walking.

    PubMed

    King, Diane K; Allen, Peg; Jones, Dina L; Marquez, David X; Brown, David R; Rosenberg, Dori; Janicek, Sarah; Allen, Laila; Belza, Basia

    2016-03-01

    Midlife and older adults use shopping malls for walking, but little research has examined mall characteristics that contribute to their walkability. We used modified versions of the Centers for Disease Control and Prevention (CDC)-Healthy Aging Research Network (HAN) Environmental Audit and the System for Observing Play and Recreation in Communities (SOPARC) tool to systematically observe 443 walkers in 10 shopping malls. We also observed 87 walkers in 6 community-based nonmall/nongym venues where older adults routinely walked for physical activity. All venues had public transit stops and accessible parking. All malls and 67% of nonmalls had wayfinding aids, and most venues (81%) had an established circuitous walking route and clean, well-maintained public restrooms (94%). All venues had level floor surfaces, and one-half had benches along the walking route. Venues varied in hours of access, programming, tripping hazards, traffic control near entrances, and lighting. Despite diversity in location, size, and purpose, the mall and nonmall venues audited shared numerous environmental features known to promote walking in older adults and few barriers to walking. Future research should consider programmatic features and outreach strategies to expand the use of malls and other suitable public spaces for walking.

  19. Cross-modal retrieval of scripted speech audio

    NASA Astrophysics Data System (ADS)

    Owen, Charles B.; Makedon, Fillia

    1997-12-01

    This paper describes an approach to the problem of searching speech-based digital audio using cross-modal information retrieval. Audio containing speech (speech-based audio) is difficult to search. Open vocabulary speech recognition is advancing rapidly, but cannot yield high accuracy in either search or transcription modalities. However, text can be searched quickly and efficiently with high accuracy. Script- light digital audio is audio that has an available transcription. This is a surprisingly large class of content including legal testimony, broadcasting, dramatic productions and political meetings and speeches. An automatic mechanism for deriving the synchronization between the transcription and the audio allows for very accurate retrieval of segments of that audio. The mechanism described in this paper is based on building a transcription graph from the text and computing biphone probabilities for the audio. A modified beam search algorithm is presented to compute the alignment.

  20. Modeling of the ground-to-SSFMB link networking features using SPW

    NASA Technical Reports Server (NTRS)

    Watson, John C.

    1993-01-01

    This report describes the modeling and simulation of the networking features of the ground-to-Space Station Freedom manned base (SSFMB) link using COMDISCO signal processing work-system (SPW). The networking features modeled include the implementation of Consultative Committee for Space Data Systems (CCSDS) protocols in the multiplexing of digitized audio and core data into virtual channel data units (VCDU's) in the control center complex and the demultiplexing of VCDU's in the onboard baseband signal processor. The emphasis of this work has been placed on techniques for modeling the CCSDS networking features using SPW. The objectives for developing the SPW models are to test the suitability of SPW for modeling networking features and to develop SPW simulation models of the control center complex and space station baseband signal processor for use in end-to-end testing of the ground-to-SSFMB S-band single access forward (SSAF) link.

  1. Building Digital Audio Preservation Infrastructure and Workflows

    ERIC Educational Resources Information Center

    Young, Anjanette; Olivieri, Blynne; Eckler, Karl; Gerontakos, Theodore

    2010-01-01

    In 2009 the University of Washington (UW) Libraries special collections received funding for the digital preservation of its audio indigenous language holdings. The university libraries, where the authors work in various capacities, had begun digitizing image and text collections in 1997. Because of this, at the onset of the project, workflows (a…

  2. Improving Audio Quality in Distance Learning Applications.

    ERIC Educational Resources Information Center

    Richardson, Craig H.

    This paper discusses common causes of problems encountered with audio systems in distance learning networks and offers practical suggestions for correcting the problems. Problems and discussions are divided into nine categories: (1) acoustics, including reverberant classrooms leading to distorted or garbled voices, as well as one-dimensional audio…

  3. Transparency benchmarking on audio watermarks and steganography

    NASA Astrophysics Data System (ADS)

    Kraetzer, Christian; Dittmann, Jana; Lang, Andreas

    2006-02-01

    The evaluation of transparency plays an important role in the context of watermarking and steganography algorithms. This paper introduces a general definition of the term transparency in the context of steganography, digital watermarking and attack based evaluation of digital watermarking algorithms. For this purpose the term transparency is first considered individually for each of the three application fields (steganography, digital watermarking and watermarking algorithm evaluation). From the three results a general definition for the overall context is derived in a second step. The relevance and applicability of the definition given is evaluated in practise using existing audio watermarking and steganography algorithms (which work in time, frequency and wavelet domain) as well as an attack based evaluation suite for audio watermarking benchmarking - StirMark for Audio (SMBA). For this purpose selected attacks from the SMBA suite are modified by adding transparency enhancing measures using a psychoacoustic model. The transparency and robustness of the evaluated audio watermarking algorithms by using the original and modifid attacks are compared. The results of this paper show hat transparency benchmarking will lead to new information regarding the algorithms under observation and their usage. This information can result in concrete recommendations for modification, like the ones resulting from the tests performed here.

  4. Structuring Broadcast Audio for Information Access

    NASA Astrophysics Data System (ADS)

    Gauvain, Jean-Luc; Lamel, Lori

    2003-12-01

    One rapidly expanding application area for state-of-the-art speech recognition technology is the automatic processing of broadcast audiovisual data for information access. Since much of the linguistic information is found in the audio channel, speech recognition is a key enabling technology which, when combined with information retrieval techniques, can be used for searching large audiovisual document collections. Audio indexing must take into account the specificities of audio data such as needing to deal with the continuous data stream and an imperfect word transcription. Other important considerations are dealing with language specificities and facilitating language portability. At Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI), broadcast news transcription systems have been developed for seven languages: English, French, German, Mandarin, Portuguese, Spanish, and Arabic. The transcription systems have been integrated into prototype demonstrators for several application areas such as audio data mining, structuring audiovisual archives, selective dissemination of information, and topic tracking for media monitoring. As examples, this paper addresses the spoken document retrieval and topic tracking tasks.

  5. 50 CFR 27.72 - Audio equipment.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... 50 Wildlife and Fisheries 9 2013-10-01 2013-10-01 false Audio equipment. 27.72 Section 27.72 Wildlife and Fisheries UNITED STATES FISH AND WILDLIFE SERVICE, DEPARTMENT OF THE INTERIOR (CONTINUED) THE NATIONAL WILDLIFE REFUGE SYSTEM PROHIBITED ACTS Disturbing Violations: Filming, Photography, and Light...

  6. 50 CFR 27.72 - Audio equipment.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 50 Wildlife and Fisheries 9 2014-10-01 2014-10-01 false Audio equipment. 27.72 Section 27.72 Wildlife and Fisheries UNITED STATES FISH AND WILDLIFE SERVICE, DEPARTMENT OF THE INTERIOR (CONTINUED) THE NATIONAL WILDLIFE REFUGE SYSTEM PROHIBITED ACTS Disturbing Violations: Filming, Photography, and Light...

  7. Some Final Reflections on Dual Audio Television.

    ERIC Educational Resources Information Center

    Borton, Terry; And Others

    The Philadelphia City Schools engaged in a four-year program to develop and test dual audio television, a way to help children learn more from the massive amount of time they spend watching commercial television. The format consisted of an instructional radio broadcast that accompanied popular television shows and attempted to clarify and amplify…

  8. Solar Energy Audio-Visual Materials.

    ERIC Educational Resources Information Center

    Department of Housing and Urban Development, Washington, DC. Office of Policy Development and Research.

    This directory presents an annotated bibliography of non-print information resources dealing with solar energy. The document is divided by type of audio-visual medium, including: (1) Films, (2) Slides and Filmstrips, and (3) Videotapes. A fourth section provides addresses and telephone numbers of audiovisual aids sources, and lists the page…

  9. Audio-Visual Speech Perception Is Special

    ERIC Educational Resources Information Center

    Tuomainen, J.; Andersen, T.S.; Tiippana, K.; Sams, M.

    2005-01-01

    In face-to-face conversation speech is perceived by ear and eye. We studied the prerequisites of audio-visual speech perception by using perceptually ambiguous sine wave replicas of natural speech as auditory stimuli. When the subjects were not aware that the auditory stimuli were speech, they showed only negligible integration of auditory and…

  10. Study of audio speakers containing ferrofluid.

    PubMed

    Rosensweig, R E; Hirota, Y; Tsuda, S; Raj, K

    2008-05-21

    This work validates a method for increasing the radial restoring force on the voice coil in audio speakers containing ferrofluid. In addition, a study is made of factors influencing splash loss of the ferrofluid due to shock. Ferrohydrodynamic analysis is employed throughout to model behavior, and predictions are compared to experimental data.

  11. An ESL Audio-Script Writing Workshop

    ERIC Educational Resources Information Center

    Miller, Carla

    2012-01-01

    The roles of dialogue, collaborative writing, and authentic communication have been explored as effective strategies in second language writing classrooms. In this article, the stages of an innovative, multi-skill writing method, which embeds students' personal voices into the writing process, are explored. A 10-step ESL Audio Script Writing Model…

  12. Providing Students with Formative Audio Feedback

    ERIC Educational Resources Information Center

    Brearley, Francis Q.; Cullen, W. Rod

    2012-01-01

    The provision of timely and constructive feedback is increasingly challenging for busy academics. Ensuring effective student engagement with feedback is equally difficult. Increasingly, studies have explored provision of audio recorded feedback to enhance effectiveness and engagement with feedback. Few, if any, of these focus on purely formative…

  13. Apollo 11 Mission Audio - Day 1

    NASA Image and Video Library

    1969-07-16

    Audio from mission control during the launch of Apollo 11, which was the United States' first lunar landing mission. While astronauts Armstrong and Aldrin descended in the Lunar Module "Eagle" to explore the Sea of Tranquility region of the moon, astronaut Collins remained with the Command and Service Modules "Columbia" in lunar orbit.

  14. Spanish for Agricultural Purposes: The Audio Program.

    ERIC Educational Resources Information Center

    Mainous, Bruce H.; And Others

    The manual is meant to accompany and supplement the basic manual and to serve as support to the audio component of "Spanish for Agricultural Purposes," a one-semester course for North American agriculture specialists preparing to work in Latin America, consists of exercises to supplement readings presented in the course's basic manual and to…

  15. Audio/Visual Ratios in Commercial Filmstrips.

    ERIC Educational Resources Information Center

    Gulliford, Nancy L.

    Developed by the Westinghouse Electric Corporation, Video Audio Compressed (VIDAC) is a compressed time, variable rate, still picture television system. This technology made it possible for a centralized library of audiovisual materials to be transmitted over a television channel in very short periods of time. In order to establish specifications…

  16. Building Digital Audio Preservation Infrastructure and Workflows

    ERIC Educational Resources Information Center

    Young, Anjanette; Olivieri, Blynne; Eckler, Karl; Gerontakos, Theodore

    2010-01-01

    In 2009 the University of Washington (UW) Libraries special collections received funding for the digital preservation of its audio indigenous language holdings. The university libraries, where the authors work in various capacities, had begun digitizing image and text collections in 1997. Because of this, at the onset of the project, workflows (a…

  17. 47 CFR 73.403 - Digital audio broadcasting service requirements.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... programming stream at no direct charge to listeners. In addition, a broadcast radio station must simulcast its analog audio programming on one of its digital audio programming streams. The DAB audio programming... analog programming service currently provided to listeners. (b) Emergency information. The emergency...

  18. A Study of Alternative Delivery Systems for Audio Instruction.

    ERIC Educational Resources Information Center

    Conner, Pat A.

    The Newark Audio Instruction Project determined and evaluated the cost benefits and efficiency factors in using FM main channel and subchannel, the school intercom system, and audio cassettes for delivery of instructional programming for grades K-9, and developed a master plan for integrating radio/audio instruction into the curriculum. A…

  19. To Make a Long Story Short: Abridged Audio at 10.

    ERIC Educational Resources Information Center

    Annichiarico, Mark

    1996-01-01

    Examines the history of abridged audio publishing 10 years after the formation of the Audio Publishers Association. Topics include abridged versus unabridged versions for bookstores and libraries; vendors and publishers; future possibilities for CDs and DVD (Digital Versatile Disc); and audio leasing for libraries. (LRW)

  20. 36 CFR § 1002.12 - Audio disturbances.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 36 Parks, Forests, and Public Property 3 2013-07-01 2012-07-01 true Audio disturbances. § 1002.12... RECREATION § 1002.12 Audio disturbances. (a) The following are prohibited: (1) Operating motorized equipment or machinery such as an electric generating plant, motor vehicle, motorized toy, or an audio device...

  1. 47 CFR 87.483 - Audio visual warning systems.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 47 Telecommunication 5 2014-10-01 2014-10-01 false Audio visual warning systems. 87.483 Section 87... AVIATION SERVICES Stations in the Radiodetermination Service § 87.483 Audio visual warning systems. An audio visual warning system (AVWS) is a radar-based obstacle avoidance system. AVWS activates...

  2. 47 CFR 10.520 - Common audio attention signal.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 47 Telecommunication 1 2014-10-01 2014-10-01 false Common audio attention signal. 10.520 Section... Equipment Requirements § 10.520 Common audio attention signal. A Participating CMS Provider and equipment manufacturers may only market devices for public use under part 10 that include an audio attention signal that...

  3. 43 CFR 8365.2-2 - Audio devices.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 43 Public Lands: Interior 2 2014-10-01 2014-10-01 false Audio devices. 8365.2-2 Section 8365.2-2..., DEPARTMENT OF THE INTERIOR RECREATION PROGRAMS VISITOR SERVICES Rules of Conduct § 8365.2-2 Audio devices. On... audio device such as a radio, television, musical instrument, or other noise producing device or...

  4. 47 CFR 10.520 - Common audio attention signal.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 47 Telecommunication 1 2011-10-01 2011-10-01 false Common audio attention signal. 10.520 Section... Equipment Requirements § 10.520 Common audio attention signal. A Participating CMS Provider and equipment manufacturers may only market devices for public use under part 10 that include an audio attention signal that...

  5. 43 CFR 8365.2-2 - Audio devices.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 43 Public Lands: Interior 2 2011-10-01 2011-10-01 false Audio devices. 8365.2-2 Section 8365.2-2..., DEPARTMENT OF THE INTERIOR RECREATION PROGRAMS VISITOR SERVICES Rules of Conduct § 8365.2-2 Audio devices. On... audio device such as a radio, television, musical instrument, or other noise producing device or...

  6. 43 CFR 8365.2-2 - Audio devices.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... 43 Public Lands: Interior 2 2012-10-01 2012-10-01 false Audio devices. 8365.2-2 Section 8365.2-2..., DEPARTMENT OF THE INTERIOR RECREATION PROGRAMS VISITOR SERVICES Rules of Conduct § 8365.2-2 Audio devices. On... audio device such as a radio, television, musical instrument, or other noise producing device or...

  7. 43 CFR 8365.2-2 - Audio devices.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... 43 Public Lands: Interior 2 2013-10-01 2013-10-01 false Audio devices. 8365.2-2 Section 8365.2-2..., DEPARTMENT OF THE INTERIOR RECREATION PROGRAMS VISITOR SERVICES Rules of Conduct § 8365.2-2 Audio devices. On... audio device such as a radio, television, musical instrument, or other noise producing device or...

  8. 47 CFR 10.520 - Common audio attention signal.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... 47 Telecommunication 1 2013-10-01 2013-10-01 false Common audio attention signal. 10.520 Section... Equipment Requirements § 10.520 Common audio attention signal. A Participating CMS Provider and equipment manufacturers may only market devices for public use under part 10 that include an audio attention signal that...

  9. Guidelines for Audio-Visual Services in Academic Libraries.

    ERIC Educational Resources Information Center

    Association of Coll. and Research Libraries, Chicago, IL.

    The purpose of these guidelines, prepared by the Audio-Visual Committee of the Association of College and Research Libraries, is to supply basic assistance to those academic libraries that will assume all or a major portion of an audio-visual program. They attempt to assist librarians to recognize and develop their audio-visual responsibilities…

  10. 47 CFR 73.403 - Digital audio broadcasting service requirements.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 47 Telecommunication 4 2014-10-01 2014-10-01 false Digital audio broadcasting service requirements... SERVICES RADIO BROADCAST SERVICES Digital Audio Broadcasting § 73.403 Digital audio broadcasting service requirements. (a) Broadcast radio stations using IBOC must transmit at least one over-the-air digital...

  11. 47 CFR 73.403 - Digital audio broadcasting service requirements.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... 47 Telecommunication 4 2013-10-01 2013-10-01 false Digital audio broadcasting service requirements... SERVICES RADIO BROADCAST SERVICES Digital Audio Broadcasting § 73.403 Digital audio broadcasting service requirements. (a) Broadcast radio stations using IBOC must transmit at least one over-the-air digital...

  12. 47 CFR 73.403 - Digital audio broadcasting service requirements.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... 47 Telecommunication 4 2012-10-01 2012-10-01 false Digital audio broadcasting service requirements... SERVICES RADIO BROADCAST SERVICES Digital Audio Broadcasting § 73.403 Digital audio broadcasting service requirements. (a) Broadcast radio stations using IBOC must transmit at least one over-the-air digital...

  13. 47 CFR 73.403 - Digital audio broadcasting service requirements.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 47 Telecommunication 4 2011-10-01 2011-10-01 false Digital audio broadcasting service requirements... SERVICES RADIO BROADCAST SERVICES Digital Audio Broadcasting § 73.403 Digital audio broadcasting service requirements. (a) Broadcast radio stations using IBOC must transmit at least one over-the-air digital...

  14. Head Tracking of Auditory, Visual, and Audio-Visual Targets

    PubMed Central

    Leung, Johahn; Wei, Vincent; Burgess, Martin; Carlile, Simon

    2016-01-01

    The ability to actively follow a moving auditory target with our heads remains unexplored even though it is a common behavioral response. Previous studies of auditory motion perception have focused on the condition where the subjects are passive. The current study examined head tracking behavior to a moving auditory target along a horizontal 100° arc in the frontal hemisphere, with velocities ranging from 20 to 110°/s. By integrating high fidelity virtual auditory space with a high-speed visual presentation we compared tracking responses of auditory targets against visual-only and audio-visual “bisensory” stimuli. Three metrics were measured—onset, RMS, and gain error. The results showed that tracking accuracy (RMS error) varied linearly with target velocity, with a significantly higher rate in audition. Also, when the target moved faster than 80°/s, onset and RMS error were significantly worst in audition the other modalities while responses in the visual and bisensory conditions were statistically identical for all metrics measured. Lastly, audio-visual facilitation was not observed when tracking bisensory targets. PMID:26778952

  15. Three-dimensional audio using loudspeakers

    NASA Astrophysics Data System (ADS)

    Gardner, William G.

    1997-12-01

    3-D audio systems, which can surround a listener with sounds at arbitrary locations, are an important part of immersive interfaces. A new approach is presented for implementing 3-D audio using a pair of conventional loudspeakers. The new idea is to use the tracked position of the listener's head to optimize the acoustical presentation, and thus produce a much more realistic illusion over a larger listening area than existing loudspeaker 3-D audio systems. By using a remote head tracker, for instance based on computer vision, an immersive audio environment can be created without donning headphones or other equipment. The general approach to a 3-D audio system is to reconstruct the acoustic pressures at the listener's ears that would result from the natural listening situation to be simulated. To accomplish this using loudspeakers requires that first, the ear signals corresponding to the target scene are synthesized by appropriately encoding directional cues, a process known as 'binaural synthesis,' and second, these signals are delivered to the listener by inverting the transmission paths that exist from the speakers to the listener, a process known as 'crosstalk cancellation.' Existing crosstalk cancellation systems only function at a fixed listening location; when the listener moves away from the equalization zone, the 3-D illusion is lost. Steering the equalization zone to the tracked listener preserves the 3-D illusion over a large listening volume, thus simulating a reconstructed soundfield, and also provides dynamic localization cues by maintaining stationary external sound sources during head motion. This dissertation will discuss the theory, implementation, and testing of a head-tracked loudspeaker 3-D audio system. Crosstalk cancellers that can be steered to the location of a tracked listener will be described. The objective performance of these systems has been evaluated using simulations and acoustical measurements made at the ears of human subjects. Many

  16. Demonstration of pattern transfer into sub-100 nm polysilicon line/space features patterned with extreme ultraviolet lithography

    SciTech Connect

    Cardinale, G. F.; Henderson, C. C.; Goldsmith, J. E. M.; Mangat, P. J. S.; Cobb, J.; Hector, S. D.

    1999-11-01

    In two separate experiments, we have successfully demonstrated the transfer of dense- and loose-pitch line/space (L/S) photoresist features, patterned with extreme ultraviolet (EUV) lithography, into an underlying hard mask material. In both experiments, a deep-UV photoresist ({approx}90 nm thick) was spin cast in bilayer format onto a hard mask (50-90 nm thick) and was subsequently exposed to EUV radiation using a 10x reduction EUV exposure system. The EUV reticle was fabricated at Motorola (Tempe, AZ) using a subtractive process with Ta-based absorbers on Mo/Si multilayer mask blanks. In the first set of experiments, following the EUV exposures, the L/S patterns were transferred first into a SiO{sub 2} hard mask (60 nm thick) using a reactive ion etch (RIE), and then into polysilicon (350 nm thick) using a triode-coupled plasma RIE etcher at the University of California, Berkeley, microfabrication facilities. The latter etch process, which produced steep (>85 degree sign ) sidewalls, employed a HBr/Cl chemistry with a large (>10:1) etch selectivity of polysilicon to silicon dioxide. In the second set of experiments, hard mask films of SiON (50 nm thick) and SiO{sub 2} (87 nm thick) were used. A RIE was performed at Motorola using a halogen gas chemistry that resulted in a hard mask-to-photoresist etch selectivity >3:1 and sidewall profile angles {>=}85 degree sign . Line edge roughness (LER) and linewidth critical dimension (CD) measurements were performed using Sandia's GORA(c) CD digital image analysis software. Low LER values (6-9 nm, 3{sigma}, one side) and good CD linearity (better than 10%) were demonstrated for the final pattern-transferred dense polysilicon L/S features from 80 to 175 nm. In addition, pattern transfer (into polysilicon) of loose-pitch (1:2) L/S features with CDs{>=}60 nm was demonstrated. (c) 1999 American Vacuum Society.

  17. Predicting the perception of performed dynamics in music audio with ensemble learning.

    PubMed

    Elowsson, Anders; Friberg, Anders

    2017-03-01

    By varying the dynamics in a musical performance, the musician can convey structure and different expressions. Spectral properties of most musical instruments change in a complex way with the performed dynamics, but dedicated audio features for modeling the parameter are lacking. In this study, feature extraction methods were developed to capture relevant attributes related to spectral characteristics and spectral fluctuations, the latter through a sectional spectral flux. Previously, ground truths ratings of performed dynamics had been collected by asking listeners to rate how soft/loud the musicians played in a set of audio files. The ratings, averaged over subjects, were used to train three different machine learning models, using the audio features developed for the study as input. The highest result was produced from an ensemble of multilayer perceptrons with an R(2) of 0.84. This result seems to be close to the upper bound, given the estimated uncertainty of the ground truth data. The result is well above that of individual human listeners of the previous listening experiment, and on par with the performance achieved from the average rating of six listeners. Features were analyzed with a factorial design, which highlighted the importance of source separation in the feature extraction.

  18. Audio-vestibular signs and symptoms in Chiari malformation type i. Case series and literature review.

    PubMed

    Guerra Jiménez, Gloria; Mazón Gutiérrez, Ángel; Marco de Lucas, Enrique; Valle San Román, Natalia; Martín Laez, Rubén; Morales Angulo, Carmelo

    2015-01-01

    Chiari malformation is an alteration of the base of the skull with herniation through the foramen magnum of the brain stem and cerebellum. Although the most common presentation is occipital headache, the association of audio-vestibular symptoms is not rare. The aim of our study was to describe audio-vestibular signs and symptoms in Chiari malformation type i (CM-I). We performed a retrospective observational study of patients referred to our unit during the last 5 years. We also carried out a literature review of audio-vestibular signs and symptoms in this disease. There were 9 patients (2 males and 7 females), with an average age of 42.8 years. Five patients presented a Ménière-like syndrome; 2 cases, a recurrent vertigo with peripheral features; one patient showed a sudden hearing loss; and one case suffered a sensorineural hearing loss with early childhood onset. The most common audio-vestibular symptom indicated in the literature in patients with CM-I is unsteadiness (49%), followed by dizziness (18%), nystagmus (15%) and hearing loss (15%). Nystagmus is frequently horizontal (74%) or down-beating (18%). Other audio-vestibular signs and symptoms are tinnitus (11%), aural fullness (10%) and hyperacusis (1%). Occipital headache that increases with Valsalva manoeuvres and hand paresthesias are very suggestive symptoms. The appearance of audio-vestibular manifestations in CM-I makes it common to refer these patients to neurotologists. Unsteadiness, vertiginous syndromes and sensorineural hearing loss are frequent. Nystagmus, especially horizontal and down-beating, is not rare. It is important for neurotologists to familiarise themselves with CM-I symptoms to be able to consider it in differential diagnosis. Copyright © 2014 Elsevier España, S.L.U. y Sociedad Española de Otorrinolaringología y Patología Cérvico-Facial. All rights reserved.

  19. Comparing Audio and Video Data for Rating Communication

    PubMed Central

    Williams, Kristine; Herman, Ruth; Bontempo, Daniel

    2013-01-01

    Video recording has become increasingly popular in nursing research, adding rich nonverbal, contextual, and behavioral information. However, benefits of video over audio data have not been well established. We compared communication ratings of audio versus video data using the Emotional Tone Rating Scale. Twenty raters watched video clips of nursing care and rated staff communication on 12 descriptors that reflect dimensions of person-centered and controlling communication. Another group rated audio-only versions of the same clips. Interrater consistency was high within each group with ICC (2,1) for audio = .91, and video = .94. Interrater consistency for both groups combined was also high with ICC (2,1) for audio and video = .95. Communication ratings using audio and video data were highly correlated. The value of video being superior to audio recorded data should be evaluated in designing studies evaluating nursing care. PMID:23579475

  20. Comparing audio and video data for rating communication.

    PubMed

    Williams, Kristine; Herman, Ruth; Bontempo, Daniel

    2013-09-01

    Video recording has become increasingly popular in nursing research, adding rich nonverbal, contextual, and behavioral information. However, benefits of video over audio data have not been well established. We compared communication ratings of audio versus video data using the Emotional Tone Rating Scale. Twenty raters watched video clips of nursing care and rated staff communication on 12 descriptors that reflect dimensions of person-centered and controlling communication. Another group rated audio-only versions of the same clips. Interrater consistency was high within each group with Interclass Correlation Coefficient (ICC) (2,1) for audio .91, and video = .94. Interrater consistency for both groups combined was also high with ICC (2,1) for audio and video = .95. Communication ratings using audio and video data were highly correlated. The value of video being superior to audio-recorded data should be evaluated in designing studies evaluating nursing care.

  1. Dirt Cheap 3D Spatial Audio

    DTIC Science & Technology

    2005-10-01

    Dirt Cheap 3D Spatial Audio Eric Klein1 Greg S. Schmidt12 Erik B. Tomlin1 Dennis G. Brown1 1Virtual Reality Laboratory, Naval Research Laboratory...ADDRESS(ES) Naval Research Laboratory,Virtual Reality Laboratory,4555 Overlook Ave. SW,Washington,DC,20375 8. PERFORMING ORGANIZATION REPORT NUMBER 9...hardware with our 4-wall immersive virtual reality room at the Virtual Reality Laboratory, part of the Naval Research Laboratory in Washington D.C. We

  2. Auditory Depth Control Using Reverberation Cue in Virtual Audio Environment

    NASA Astrophysics Data System (ADS)

    Moon, Han-Gil; Choi, Yulee

    To provide the distance information of a sound source in virtual audio space, we must have some information about effective distance cues and there must be some way to handle them properly. It is well known that the conventional cues comprise loudness, spectral information, reverberation and binaural information. Some research works have shown that most of these cues can give listeners only limited distance information. Among these cues, reverberation can give listeners effective distance information, but the implementation using this cue is not a simple problem because there are no well-defined parameters and methods. This paper discusses methods to control the perceived auditory depth with the reverberation cue. A two-stepped linear envelope method and an artificial reverberator, which can control the early reflection slope of an impulse response but does not alter the reverberation time, are proposed as solutions. To validate these methods, subjective assessment was performed.

  3. Audio processing technology for law enforcement

    NASA Astrophysics Data System (ADS)

    Walter, Sharon M.; Cofano, Maria; Ratley, Roy J.

    1999-01-01

    The Air Force Research Laboratory Multi-Sensor Exploitation Branch (AFRL/IFEC) has been a Department of Defense leader in research and development (R&D) in speech and audio processing for over 25 years. Their primary thrust in these R&D areas has focused on developing technology to improve the collection, handling, identification, and intelligibility of military communication signals. The National Law Enforcement and Corrections Technology Center for the Northeast (NLECTC-NE) is collocated with the AFRL Rome Research Sited at the Griffiss Technology park in upstate New York. The NLECTC-NE supports sixteen (16) states in the northeast sector of the United States, and is funded and supported by the National Institute of Justice (NIJ). Since the inception of the NLECTC-NE in 1995, the AFRL Rome Research Site has expanded the military applications of their expertise to address law enforcement and corrections requirements. AFRL/IFEC's speech and audio processing technology is unique and particularly appropriate for application to law enforcement requirements. It addresses the similar military need for time-critical decisions and actions, operation within noisy environments, and use by uncooperative speakers in tactical, real-time applications. Audio and speech processing technology for both application domains must also often deal with short utterance communications (less than five seconds of speech) and transmission-to-transmission channel variability.

  4. Capacity-optimized mp2 audio watermarking

    NASA Astrophysics Data System (ADS)

    Steinebach, Martin; Dittmann, Jana

    2003-06-01

    Today a number of audio watermarking algorithms have been proposed, some of them at a quality making them suitable for commercial applications. The focus of most of these algorithms is copyright protection. Therefore, transparency and robustness are the most discussed and optimised parameters. But other applications for audio watermarking can also be identified stressing other parameters like complexity or payload. In our paper, we introduce a new mp2 audio watermarking algorithm optimised for high payload. Our algorithm uses the scale factors of an mp2 file for watermark embedding. They are grouped and masked based on a pseudo-random pattern generated from a secret key. In each group, we embed one bit. Depending on the bit to embed, we change the scale factors by adding 1 where necessary until it includes either more even or uneven scale factors. An uneven group has a 1 embedded, an even group a 0. The same rule is later applied to detect the watermark. The group size can be increased or decreased for transparency/payload trade-off. We embed 160 bits or more in an mp2 file per second without reducing perceived quality. As an application example, we introduce a prototypic Karaoke system displaying song lyrics embedded as a watermark.

  5. Audio-visual speech experience with age influences perceived audio-visual asynchrony in speech.

    PubMed

    Alm, Magnus; Behne, Dawn

    2013-10-01

    Previous research indicates that perception of audio-visual (AV) synchrony changes in adulthood. Possible explanations for these age differences include a decline in hearing acuity, a decline in cognitive processing speed, and increased experience with AV binding. The current study aims to isolate the effect of AV experience by comparing synchrony judgments from 20 young adults (20 to 30 yrs) and 20 normal-hearing middle-aged adults (50 to 60 yrs), an age range for which a decline of cognitive processing speed is expected to be minimal. When presented with AV stop consonant syllables with asynchronies ranging from 440 ms audio-lead to 440 ms visual-lead, middle-aged adults showed significantly less tolerance for audio-lead than young adults. Middle-aged adults also showed a greater shift in their point of subjective simultaneity than young adults. Natural audio-lead asynchronies are arguably more predictable than natural visual-lead asynchronies, and this predictability may render audio-lead thresholds more prone to experience-related fine-tuning.

  6. Feature-space assessment of electrical impedance tomography coregistered with computed tomography in detecting multiple contrast targets

    SciTech Connect

    Krishnan, Kalpagam; Liu, Jeff; Kohli, Kirpal

    2014-06-15

    Purpose: Fusion of electrical impedance tomography (EIT) with computed tomography (CT) can be useful as a clinical tool for providing additional physiological information about tissues, but requires suitable fusion algorithms and validation procedures. This work explores the feasibility of fusing EIT and CT images using an algorithm for coregistration. The imaging performance is validated through feature space assessment on phantom contrast targets. Methods: EIT data were acquired by scanning a phantom using a circuit, configured for injecting current through 16 electrodes, placed around the phantom. A conductivity image of the phantom was obtained from the data using electrical impedance and diffuse optical tomography reconstruction software (EIDORS). A CT image of the phantom was also acquired. The EIT and CT images were fused using a region of interest (ROI) coregistration fusion algorithm. Phantom imaging experiments were carried out on objects of different contrasts, sizes, and positions. The conductive medium of the phantoms was made of a tissue-mimicking bolus material that is routinely used in clinical radiation therapy settings. To validate the imaging performance in detecting different contrasts, the ROI of the phantom was filled with distilled water and normal saline. Spatially separated cylindrical objects of different sizes were used for validating the imaging performance in multiple target detection. Analyses of the CT, EIT and the EIT/CT phantom images were carried out based on the variations of contrast, correlation, energy, and homogeneity, using a gray level co-occurrence matrix (GLCM). A reference image of the phantom was simulated using EIDORS, and the performances of the CT and EIT imaging systems were evaluated and compared against the performance of the EIT/CT system using various feature metrics, detectability, and structural similarity index measures. Results: In detecting distilled and normal saline water in bolus medium, EIT as a stand

  7. Feature-space assessment of electrical impedance tomography coregistered with computed tomography in detecting multiple contrast targets.

    PubMed

    Krishnan, Kalpagam; Liu, Jeff; Kohli, Kirpal

    2014-06-01

    Fusion of electrical impedance tomography (EIT) with computed tomography (CT) can be useful as a clinical tool for providing additional physiological information about tissues, but requires suitable fusion algorithms and validation procedures. This work explores the feasibility of fusing EIT and CT images using an algorithm for coregistration. The imaging performance is validated through feature space assessment on phantom contrast targets. EIT data were acquired by scanning a phantom using a circuit, configured for injecting current through 16 electrodes, placed around the phantom. A conductivity image of the phantom was obtained from the data using electrical impedance and diffuse optical tomography reconstruction software (EIDORS). A CT image of the phantom was also acquired. The EIT and CT images were fused using a region of interest (ROI) coregistration fusion algorithm. Phantom imaging experiments were carried out on objects of different contrasts, sizes, and positions. The conductive medium of the phantoms was made of a tissue-mimicking bolus material that is routinely used in clinical radiation therapy settings. To validate the imaging performance in detecting different contrasts, the ROI of the phantom was filled with distilled water and normal saline. Spatially separated cylindrical objects of different sizes were used for validating the imaging performance in multiple target detection. Analyses of the CT, EIT and the EIT/CT phantom images were carried out based on the variations of contrast, correlation, energy, and homogeneity, using a gray level co-occurrence matrix (GLCM). A reference image of the phantom was simulated using EIDORS, and the performances of the CT and EIT imaging systems were evaluated and compared against the performance of the EIT/CT system using various feature metrics, detectability, and structural similarity index measures. In detecting distilled and normal saline water in bolus medium, EIT as a stand-alone imaging system showed

  8. Audio-visual onset differences are used to determine syllable identity for ambiguous audio-visual stimulus pairs

    PubMed Central

    ten Oever, Sanne; Sack, Alexander T.; Wheat, Katherine L.; Bien, Nina; van Atteveldt, Nienke

    2013-01-01

    Content and temporal cues have been shown to interact during audio-visual (AV) speech identification. Typically, the most reliable unimodal cue is used more strongly to identify specific speech features; however, visual cues are only used if the AV stimuli are presented within a certain temporal window of integration (TWI). This suggests that temporal cues denote whether unimodal stimuli belong together, that is, whether they should be integrated. It is not known whether temporal cues also provide information about the identity of a syllable. Since spoken syllables have naturally varying AV onset asynchronies, we hypothesize that for suboptimal AV cues presented within the TWI, information about the natural AV onset differences can aid in speech identification. To test this, we presented low-intensity auditory syllables concurrently with visual speech signals, and varied the stimulus onset asynchronies (SOA) of the AV pair, while participants were instructed to identify the auditory syllables. We revealed that specific speech features (e.g., voicing) were identified by relying primarily on one modality (e.g., auditory). Additionally, we showed a wide window in which visual information influenced auditory perception, that seemed even wider for congruent stimulus pairs. Finally, we found a specific response pattern across the SOA range for syllables that were not reliably identified by the unimodal cues, which we explained as the result of the use of natural onset differences between AV speech signals. This indicates that temporal cues not only provide information about the temporal integration of AV stimuli, but additionally convey information about the identity of AV pairs. These results provide a detailed behavioral basis for further neuro-imaging and stimulation studies to unravel the neurofunctional mechanisms of the audio-visual-temporal interplay within speech perception. PMID:23805110

  9. A Novel Method Using Abstract Convex Underestimation in Ab-Initio Protein Structure Prediction for Guiding Search in Conformational Feature Space.

    PubMed

    Hao, Xiao-Hu; Zhang, Gui-Jun; Zhou, Xiao-Gen; Yu, Xu-Feng

    2016-01-01

    To address the searching problem of protein conformational space in ab-initio protein structure prediction, a novel method using abstract convex underestimation (ACUE) based on the framework of evolutionary algorithm was proposed. Computing such conformations, essential to associate structural and functional information with gene sequences, is challenging due to the high-dimensionality and rugged energy surface of the protein conformational space. As a consequence, the dimension of protein conformational space should be reduced to a proper level. In this paper, the high-dimensionality original conformational space was converted into feature space whose dimension is considerably reduced by feature extraction technique. And, the underestimate space could be constructed according to abstract convex theory. Thus, the entropy effect caused by searching in the high-dimensionality conformational space could be avoided through such conversion. The tight lower bound estimate information was obtained to guide the searching direction, and the invalid searching area in which the global optimal solution is not located could be eliminated in advance. Moreover, instead of expensively calculating the energy of conformations in the original conformational space, the estimate value is employed to judge if the conformation is worth exploring to reduce the evaluation time, thereby making computational cost lower and the searching process more efficient. Additionally, fragment assembly and the Monte Carlo method are combined to generate a series of metastable conformations by sampling in the conformational space. The proposed method provides a novel technique to solve the searching problem of protein conformational space. Twenty small-to-medium structurally diverse proteins were tested, and the proposed ACUE method was compared with It Fix, HEA, Rosetta and the developed method LEDE without underestimate information. Test results show that the ACUE method can more rapidly and more

  10. Combining Video, Audio and Lexical Indicators of Affect in Spontaneous Conversation via Particle Filtering

    PubMed Central

    Savran, Arman; Cao, Houwei; Shah, Miraj; Nenkova, Ani; Verma, Ragini

    2013-01-01

    We present experiments on fusing facial video, audio and lexical indicators for affect estimation during dyadic conversations. We use temporal statistics of texture descriptors extracted from facial video, a combination of various acoustic features, and lexical features to create regression based affect estimators for each modality. The single modality regressors are then combined using particle filtering, by treating these independent regression outputs as measurements of the affect states in a Bayesian filtering framework, where previous observations provide prediction about the current state by means of learned affect dynamics. Tested on the Audio-visual Emotion Recognition Challenge dataset, our single modality estimators achieve substantially higher scores than the official baseline method for every dimension of affect. Our filtering-based multi-modality fusion achieves correlation performance of 0.344 (baseline: 0.136) and 0.280 (baseline: 0.096) for the fully continuous and word level sub challenges, respectively. PMID:25300451

  11. High-performance combination method of electric network frequency and phase for audio forgery detection in battery-powered devices.

    PubMed

    Savari, Maryam; Abdul Wahab, Ainuddin Wahid; Anuar, Nor Badrul

    2016-09-01

    Audio forgery is any act of tampering, illegal copy and fake quality in the audio in a criminal way. In the last decade, there has been increasing attention to the audio forgery detection due to a significant increase in the number of forge in different type of audio. There are a number of methods for forgery detection, which electric network frequency (ENF) is one of the powerful methods in this area for forgery detection in terms of accuracy. In spite of suitable accuracy of ENF in a majority of plug-in powered devices, the weak accuracy of ENF in audio forgery detection for battery-powered devices, especially in laptop and mobile phone, can be consider as one of the main obstacles of the ENF. To solve the ENF problem in terms of accuracy in battery-powered devices, a combination method of ENF and phase feature is proposed. From experiment conducted, ENF alone give 50% and 60% accuracy for forgery detection in mobile phone and laptop respectively, while the proposed method shows 88% and 92% accuracy respectively, for forgery detection in battery-powered devices. The results lead to higher accuracy for forgery detection with the combination of ENF and phase feature.

  12. A digital audio/video interleaving system. [for Shuttle Orbiter

    NASA Technical Reports Server (NTRS)

    Richards, R. W.

    1978-01-01

    A method of interleaving an audio signal with its associated video signal for simultaneous transmission or recording, and the subsequent separation of the two signals, is described. Comparisons are made between the new audio signal interleaving system and the Skylab Pam audio/video interleaving system, pointing out improvements gained by using the digital audio/video interleaving system. It was found that the digital technique is the simplest, most effective and most reliable method for interleaving audio and/or other types of data into the video signal for the Shuttle Orbiter application. Details of the design of a multiplexer capable of accommodating two basic data channels, each consisting of a single 31.5-kb/s digital bit stream are given. An adaptive slope delta modulation system is introduced to digitize audio signals, producing a high immunity of work intelligibility to channel errors, primarily due to the robust nature of the delta-modulation algorithm.

  13. Audio signal analysis in combination with noncontact bio-motion data to successfully monitor snoring.

    PubMed

    Flanagan, David; Arvaneh, Mahnaz; Zaffaroni, Alberto

    2014-01-01

    This paper proposes a novel algorithm for automatic detection of snoring in sleep by combining non-contact bio-motion data with audio data. The audio data is captured using low end Android Smartphones in a non-clinical environment to mimic a possible user-friendly commercial product for sleep audio monitoring. However snore detection becomes a more challenging problem as the recorded signal has lower quality compared to those recorded in clinical environment. To have an accurate classification of snore/non-snore, we first compare a range of commonly used features extracted from the audio signal to find the best subject-independent features. Thereafter, bio-motion data is used to further improve the classification accuracy by identifying episodes which contain high amounts of body movements. High body movement indicates that the subject is turning, coughing or leaving the bed; during these instances snoring does not occur. The proposed algorithm is evaluated using the data recorded over 25 sessions from 7 healthy subjects who are suspected to be regular snorers. Our experimental results showed that the best subject-independent features for snore/non-snore classification are the energy of frequency band 3150-3650 Hz, zero crossing rate and 1st predictor coefficient of linear predictive coding. The proposed features yielded an average classification accuracy of 84.35%. The introduction of bio-motion data significantly improved the results by an average of 5.87% (p<;0.01). This work is the first study that successfully used bio-motion data to improve the accuracy of snore/non-snore classification.

  14. STS-29 MS Bagian juggles audio cassettes on Discovery's, OV-103's, middeck

    NASA Image and Video Library

    1989-03-18

    STS29-02-033 (3-18 March 1989) --- In what appears to be a juggling act in the microgravity of space, James P. Bagian, a physician, is actually attempting to organize audio cassettes. Other frames taken during the flight document Bagian's medical testing of his fellow crewmembers. This photographic frame was among NASA's third STS-29 photo release. Monday, March 20, 1989. Crewmembers were Astronauts Michael L. Coats, John E. Blaha, James F. Buchli, Robert C. Springer and James P. Bagian.

  15. Quality improvement in the biomedical communications audio visual unit.

    PubMed

    Stein, D

    1994-01-01

    The audio visual unit is a highly visible and most frequently requested service provided by a biomedical communications department. To continuously improve service, delivery, and operation of an audio visual communications function is a challenge. This article presents a field-tested audit tool for describing audio visual unit strengths and limitations. The tool suggests four assessment areas: the service plan, the client communication network, training and managing staff, and standard operating policy and procedures.

  16. Stuttering and speech naturalness: audio and audiovisual judgments.

    PubMed

    Martin, R R; Haroldson, S K

    1992-06-01

    Unsophisticated raters, using 9-point interval scales, judged speech naturalness and stuttering severity of recorded stutterer and nonstutterer speech samples. Raters judged separately the audio-only and audiovisual presentations of each sample. For speech naturalness judgments of stutterer samples, raters invariably judged the audiovisual presentation more unnatural than the audio presentation of the same sample; but for the nonstutterer samples, there was no difference between audio and audiovisual naturalness ratings. Stuttering severity ratings did not differ significantly between audio and audiovisual presentations of the same samples. Rater reliability, interrater agreement, and intrarater agreement for speech naturalness judgments were assessed.

  17. A content-based digital audio watermarking algorithm

    NASA Astrophysics Data System (ADS)

    Zhang, Liping; Zhao, Yi; Xu, Wen Li

    2015-12-01

    Digital audio watermarking embeds inaudible information into digital audio data for the purposes of copyright protection, ownership verification, covert communication, and/or auxiliary data carrying. In this paper, we present a novel watermarking scheme to embed a meaningful gray image into digital audio by quantizing the wavelet coefficients (using integer lifting wavelet transform) of audio samples. Our audio-dependent watermarking procedure directly exploits temporal and frequency perceptual masking of the human auditory system (HAS) to guarantee that the embedded watermark image is inaudible and robust. The watermark is constructed by utilizing still image compression technique, breaking each audio clip into smaller segments, selecting the perceptually significant audio segments to wavelet transform, and quantizing the perceptually significant wavelet coefficients. The proposed watermarking algorithm can extract the watermark image without the help from the original digital audio signals. We also demonstrate the robustness of that watermarking procedure to audio degradations and distortions, e.g., those that result from noise adding, MPEG compression, low pass filtering, resampling, and requantization.

  18. Audio scene segmentation for video with generic content

    NASA Astrophysics Data System (ADS)

    Niu, Feng; Goela, Naveen; Divakaran, Ajay; Abdel-Mottaleb, Mohamed

    2008-01-01

    In this paper, we present a content-adaptive audio texture based method to segment video into audio scenes. The audio scene is modeled as a semantically consistent chunk of audio data. Our algorithm is based on "semantic audio texture analysis." At first, we train GMM models for basic audio classes such as speech, music, etc. Then we define the semantic audio texture based on those classes. We study and present two types of scene changes, those corresponding to an overall audio texture change and those corresponding to a special "transition marker" used by the content creator, such as a short stretch of music in a sitcom or silence in dramatic content. Unlike prior work using genre specific heuristics, such as some methods presented for detecting commercials, we adaptively find out if such special transition markers are being used and if so, which of the base classes are being used as markers without any prior knowledge about the content. Our experimental results show that our proposed audio scene segmentation works well across a wide variety of broadcast content genres.

  19. Features of motivation of the crewmembers in an enclosed space at atmospheric pressure changes during breathing inert gases.

    NASA Astrophysics Data System (ADS)

    Komarevcev, Sergey

    Since the 1960s, our psychologists are working on experimenting with small groups in isolation .It was associated with the beginning of spaceflight and necessity to study of human behaviors in ways different from the natural habitat of man .Those, who study human behavior especially in isolation, know- that the behavior in isolation markedly different from that in the natural situаtions. It associated with the development of new, more adaptive behaviors (1) What are the differences ? First of all , isolation is achieved by the fact ,that the group is in a closed space. How experiments show - the crew members have changed the basic personality traits, such as motivation Statement of the problem and methods. In our experimentation we were interested in changing the features of human motivation (strength, stability and direction of motivation) in terms of a closed group in the modified atmosphere pressure and breathing inert gases. Also, we were interested in particular external and internal motivation of the individual in the circumstances. To conduct experimentation , we used an experimental barocomplex GVK -250 , which placed a group of six mаns. A task was to spend fifteen days in isolation on barokomplex when breathing oxigen - xenon mixture of fifteen days in isolation on the same complex when breathing oxygen- helium mixture and fifteen days of isolation on the same complex when breathing normal air All this time, the subjects were isolated under conditions of atmospheric pressure changes , closer to what you normally deal divers. We assumed that breathing inert mixtures can change the strength and stability , and with it , the direction and stability of motivation. To check our results, we planned on using the battery of psychological techniques : 1. Schwartz technique that measures personal values and behavior in society, DORS procedure ( measurement of fatigue , monotony , satiety and stress ) and riffs that give the test once a week. Our assumption is

  20. Quantization and psychoacoustic model in audio coding in advanced audio coding

    NASA Astrophysics Data System (ADS)

    Brzuchalski, Grzegorz

    2011-10-01

    This paper presents complete optimized architecture of Advanced Audio Coder quantization with Huffman coding. After that psychoacoustic model theory is presented and few algorithms described: standard Two Loop Search, its modifications, Genetic, Just Noticeable Level Difference, Trellis-Based and its modification: Cascaded Trellis-Based Algorithm.

  1. Instructional Audio Guidelines: Four Design Principles to Consider for Every Instructional Audio Design Effort

    ERIC Educational Resources Information Center

    Carter, Curtis W.

    2012-01-01

    This article contends that instructional designers and developers should attend to four particular design principles when creating instructional audio. Support for this view is presented by referencing the limited research that has been done in this area, and by indicating how and why each of the four principles is important to the design process.…

  2. Elicitation of attributes for the evaluation of audio-on-audio interference.

    PubMed

    Francombe, Jon; Mason, Russell; Dewhirst, Martin; Bech, Søren

    2014-11-01

    An experiment to determine the perceptual attributes of the experience of listening to a target audio program in the presence of an audio interferer was performed. The first stage was a free elicitation task in which a total of 572 phrases were produced. In the second stage, a consensus vocabulary procedure was used to reduce these phrases into a comprehensive set of attributes. Groups of experienced and inexperienced listeners determined nine and eight attributes, respectively. These attribute sets were combined by the listeners to produce a final set of 12 attributes: masking, calming, distraction, separation, confusion, annoyance, environment, chaotic, balance and blend, imagery, response to stimuli over time, and short-term response to stimuli. In the third stage, a simplified ranking procedure was used to select only the most useful and relevant attributes. Four attributes were selected: distraction, annoyance, balance and blend, and confusion. Ratings using these attributes were collected in the fourth stage, and a principal component analysis performed. This suggested two dimensions underlying the perception of an audio-on-audio interference situation: The first dimension was labeled "distraction" and accounted for 89% of the variance; the second dimension, accounting for 10% of the variance, was labeled "balance and blend."

  3. Audio-Visual, Visuo-Tactile and Audio-Tactile Correspondences in Preschoolers.

    PubMed

    Nava, Elena; Grassi, Massimo; Turati, Chiara

    2016-01-01

    Interest in crossmodal correspondences has recently seen a renaissance thanks to numerous studies in human adults. Yet, still very little is known about crossmodal correspondences in children, particularly in sensory pairings other than audition and vision. In the current study, we investigated whether 4-5-year-old children match auditory pitch to the spatial motion of visual objects (audio-visual condition). In addition, we investigated whether this correspondence extends to touch, i.e., whether children also match auditory pitch to the spatial motion of touch (audio-tactile condition) and the spatial motion of visual objects to touch (visuo-tactile condition). In two experiments, two different groups of children were asked to indicate which of two stimuli fitted best with a centrally located third stimulus (Experiment 1), or to report whether two presented stimuli fitted together well (Experiment 2). We found sensitivity to the congruency of all of the sensory pairings only in Experiment 2, suggesting that only under specific circumstances can these correspondences be observed. Our results suggest that pitch-height correspondences for audio-visual and audio-tactile combinations may still be weak in preschool children, and speculate that this could be due to immature linguistic and auditory cues that are still developing at age five.

  4. Investigating the impact of audio instruction and audio-visual biofeedback for lung cancer radiation therapy

    NASA Astrophysics Data System (ADS)

    George, Rohini

    Lung cancer accounts for 13% of all cancers in the Unites States and is the leading cause of deaths among both men and women. The five-year survival for lung cancer patients is approximately 15%.(ACS facts & figures) Respiratory motion decreases accuracy of thoracic radiotherapy during imaging and delivery. To account for respiration, generally margins are added during radiation treatment planning, which may cause a substantial dose delivery to normal tissues and increase the normal tissue toxicity. To alleviate the above-mentioned effects of respiratory motion, several motion management techniques are available which can reduce the doses to normal tissues, thereby reducing treatment toxicity and allowing dose escalation to the tumor. This may increase the survival probability of patients who have lung cancer and are receiving radiation therapy. However the accuracy of these motion management techniques are inhibited by respiration irregularity. The rationale of this thesis was to study the improvement in regularity of respiratory motion by breathing coaching for lung cancer patients using audio instructions and audio-visual biofeedback. A total of 331 patient respiratory motion traces, each four minutes in length, were collected from 24 lung cancer patients enrolled in an IRB-approved breathing-training protocol. It was determined that audio-visual biofeedback significantly improved the regularity of respiratory motion compared to free breathing and audio instruction, thus improving the accuracy of respiratory gated radiotherapy. It was also observed that duty cycles below 30% showed insignificant reduction in residual motion while above 50% there was a sharp increase in residual motion. The reproducibility of exhale based gating was higher than that of inhale base gating. Modeling the respiratory cycles it was found that cosine and cosine 4 models had the best correlation with individual respiratory cycles. The overall respiratory motion probability distribution

  5. Stream/bounce event perception reveals a temporal limit of motion correspondence based on surface feature over space and time.

    PubMed

    Kawachi, Yousuke; Kawabe, Takahiro; Gyoba, Jiro

    2011-01-01

    We examined how stream/bounce event perception is affected by motion correspondence based on the surface features of moving objects passing behind an occlusion. In the stream/bounce display two identical objects moving across each other in a two-dimensional display can be perceived as either streaming through or bouncing off each other at coincidence. Here, surface features such as colour (Experiments 1 and 2) or luminance (Experiment 3) were switched between the two objects at coincidence. The moment of coincidence was invisible to observers due to an occluder. Additionally, the presentation of the moving objects was manipulated in duration after the feature switch at coincidence. The results revealed that a postcoincidence duration of approximately 200 ms was required for the visual system to stabilize judgments of stream/bounce events by determining motion correspondence between the objects across the occlusion on the basis of the surface feature. The critical duration was similar across motion speeds of objects and types of surface features. Moreover, controls (Experiments 4a-4c) showed that cognitive bias based on feature (colour/luminance) congruency across the occlusion could not fully account for the effects of surface features on the stream/bounce judgments. We discuss the roles of motion correspondence, visual feature processing, and attentive tracking in the stream/bounce judgments.

  6. Stream/bounce event perception reveals a temporal limit of motion correspondence based on surface feature over space and time

    PubMed Central

    Kawachi, Yousuke; Kawabe, Takahiro; Gyoba, Jiro

    2011-01-01

    We examined how stream/bounce event perception is affected by motion correspondence based on the surface features of moving objects passing behind an occlusion. In the stream/bounce display two identical objects moving across each other in a two-dimensional display can be perceived as either streaming through or bouncing off each other at coincidence. Here, surface features such as colour (Experiments 1 and 2) or luminance (Experiment 3) were switched between the two objects at coincidence. The moment of coincidence was invisible to observers due to an occluder. Additionally, the presentation of the moving objects was manipulated in duration after the feature switch at coincidence. The results revealed that a postcoincidence duration of approximately 200 ms was required for the visual system to stabilize judgments of stream/bounce events by determining motion correspondence between the objects across the occlusion on the basis of the surface feature. The critical duration was similar across motion speeds of objects and types of surface features. Moreover, controls (Experiments 4a–4c) showed that cognitive bias based on feature (colour/luminance) congruency across the occlusion could not fully account for the effects of surface features on the stream/bounce judgments. We discuss the roles of motion correspondence, visual feature processing, and attentive tracking in the stream/bounce judgments. PMID:23145236

  7. Cortical Integration of Audio-Visual Information

    PubMed Central

    Vander Wyk, Brent C.; Ramsay, Gordon J.; Hudac, Caitlin M.; Jones, Warren; Lin, David; Klin, Ami; Lee, Su Mei; Pelphrey, Kevin A.

    2013-01-01

    We investigated the neural basis of audio-visual processing in speech and non-speech stimuli. Physically identical auditory stimuli (speech and sinusoidal tones) and visual stimuli (animated circles and ellipses) were used in this fMRI experiment. Relative to unimodal stimuli, each of the multimodal conjunctions showed increased activation in largely non-overlapping areas. The conjunction of Ellipse and Speech, which most resembles naturalistic audiovisual speech, showed higher activation in the right inferior frontal gyrus, fusiform gyri, left posterior superior temporal sulcus, and lateral occipital cortex. The conjunction of Circle and Tone, an arbitrary audio-visual pairing with no speech association, activated middle temporal gyri and lateral occipital cortex. The conjunction of Circle and Speech showed activation in lateral occipital cortex, and the conjunction of Ellipse and Tone did not show increased activation relative to unimodal stimuli. Further analysis revealed that middle temporal regions, although identified as multimodal only in the Circle-Tone condition, were more strongly active to Ellipse-Speech or Circle-Speech, but regions that were identified as multimodal for Ellipse-Speech were always strongest for Ellipse-Speech. Our results suggest that combinations of auditory and visual stimuli may together be processed by different cortical networks, depending on the extent to which speech or non-speech percepts are evoked. PMID:20709442

  8. Fungal flora on board the Mir-Space Station, identification by morphological features and ribosomal DNA sequences.

    PubMed

    Makimura, K; Hanazawa, R; Takatori, K; Tamura, Y; Fujisaki, R; Nishiyama, Y; Abe, S; Uchida, K; Kawamura, Y; Ezaki, T; Yamaguchi, H

    2001-01-01

    This report is on the morphological and molecular biological identification, using 18S- and ITS1-rDNA sequences, of the "space fungi" isolated on board the Russian Mir-Space Station as the major constituents of the fungal flora. The six fungal strains were isolated from air by using an air sampler or from condensation. Strains were identified as Penicillium chrysogenum, Aspergillus versicolor, or Penicillium sp. by both methods. The species of space fungi were common saprophytic fungi in our living environment, potential pathogens, and allergens. This study concluded that the environment on board the space station Mir allows the growth of potentially pathogenic fungi as true in residential areas on the earth. Therefore, to prevent infection or other health disorders caused by these fungi, easy and reliable methods should be established to survey the fungal flora in a space station.

  9. A Content-Adaptive Analysis and Representation Framework for Audio Event Discovery from "Unscripted" Multimedia

    NASA Astrophysics Data System (ADS)

    Radhakrishnan, Regunathan; Divakaran, Ajay; Xiong, Ziyou; Otsuka, Isao

    2006-12-01

    We propose a content-adaptive analysis and representation framework to discover events using audio features from "unscripted" multimedia such as sports and surveillance for summarization. The proposed analysis framework performs an inlier/outlier-based temporal segmentation of the content. It is motivated by the observation that "interesting" events in unscripted multimedia occur sparsely in a background of usual or "uninteresting" events. We treat the sequence of low/mid-level features extracted from the audio as a time series and identify subsequences that are outliers. The outlier detection is based on eigenvector analysis of the affinity matrix constructed from statistical models estimated from the subsequences of the time series. We define the confidence measure on each of the detected outliers as the probability that it is an outlier. Then, we establish a relationship between the parameters of the proposed framework and the confidence measure. Furthermore, we use the confidence measure to rank the detected outliers in terms of their departures from the background process. Our experimental results with sequences of low- and mid-level audio features extracted from sports video show that "highlight" events can be extracted effectively as outliers from a background process using the proposed framework. We proceed to show the effectiveness of the proposed framework in bringing out suspicious events from surveillance videos without any a priori knowledge. We show that such temporal segmentation into background and outliers, along with the ranking based on the departure from the background, can be used to generate content summaries of any desired length. Finally, we also show that the proposed framework can be used to systematically select "key audio classes" that are indicative of events of interest in the chosen domain.

  10. Availability of feature-oriented scanning probe microscopy for remote-controlled measurements on board a space laboratory or planet exploration Rover.

    PubMed

    Lapshin, Rostislav V

    2009-06-01

    Prospects for a feature-oriented scanning (FOS) approach to investigations of sample surfaces, at the micrometer and nanometer scales, with the use of scanning probe microscopy under space laboratory or planet exploration rover conditions, are examined. The problems discussed include decreasing sensitivity of the onboard scanning probe microscope (SPM) to temperature variations, providing autonomous operation, implementing the capabilities for remote control, self-checking, self-adjustment, and self-calibration. A number of topical problems of SPM measurements in outer space or on board a planet exploration rover may be solved via the application of recently proposed FOS methods.

  11. Person identification for mobile robot using audio-visual modality

    NASA Astrophysics Data System (ADS)

    Kim, Young-Ouk; Chin, Sehoon; Lee, Jihoon; Paik, Joonki

    2005-10-01

    Recently, we experienced significant advancement in intelligent service robots. The remarkable features of an intelligent robot include tracking and identification of person using biometric features. The human-robot interaction is very important because it is one of the final goals of an intelligent service robot. Many researches are concentrating in two fields. One is self navigation of a mobile robot and the other is human-robot interaction in natural environment. In this paper we will present an effective person identification method for HRI (Human Robot Interaction) using two different types of expert systems. However, most of mobile robots run under uncontrolled and complicated environment. It means that face and speech information can't be guaranteed under varying conditions, such as lighting, noisy sound, orientation of a robot. According to a value of illumination and signal to noise ratio around mobile a robot, our proposed fuzzy rule make a reasonable person identification result. Two embedded HMM (Hidden Marhov Model) are used for each visual and audio modality to identify person. The performance of our proposed system and experimental results are compared with single modality identification and simply mixed method of two modality.

  12. 47 CFR 10.520 - Common audio attention signal.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... 10.520 Telecommunication FEDERAL COMMUNICATIONS COMMISSION GENERAL COMMERCIAL MOBILE ALERT SYSTEM... manufacturers may only market devices for public use under part 10 that include an audio attention signal that... interval between each repetition. (b) For devices that have polyphonic capabilities, the audio...

  13. An Audio Stream Redirector for the Ethernet Speaker

    ERIC Educational Resources Information Center

    Mandrekar, Ishan; Prevelakis, Vassilis; Turner, David Michael

    2004-01-01

    The authors have developed the "Ethernet Speaker" (ES), a network-enabled single board computer embedded into a conventional audio speaker. Audio streams are transmitted in the local area network using multicast packets, and the ES can select any one of them and play it back. A key requirement for the ES is that it must be capable of playing any…

  14. Audio Podcasting in a Tablet PC-Enhanced Biochemistry Course

    ERIC Educational Resources Information Center

    Lyles, Heather; Robertson, Brian; Mangino, Michael; Cox, James R.

    2007-01-01

    This report describes the effects of making audio podcasts of all lectures in a large, basic biochemistry course promptly available to students. The audio podcasts complement a previously described approach in which a tablet PC is used to annotate PowerPoint slides with digital ink to produce electronic notes that can be archived. The fundamentals…

  15. Selected Audio-Visual Materials for Consumer Education. [New Version.

    ERIC Educational Resources Information Center

    Johnston, William L.

    Ninety-two films, filmstrips, multi-media kits, slides, and audio cassettes, produced between 1964 and 1974, are listed in this selective annotated bibliography on consumer education. The major portion of the bibliography is devoted to films and filmstrips. The main topics of the audio-visual materials include purchasing, advertising, money…

  16. Use of Video and Audio Texts in EFL Listening Test

    ERIC Educational Resources Information Center

    Basal, Ahmet; Gülözer, Kaine; Demir, Ibrahim

    2015-01-01

    The study aims to discover whether audio or video modality in a listening test is more beneficial to test takers. In this study, the posttest-only control group design was utilized and quantitative data were collected in order to measure participant performances concerning two types of modality (audio or video) in a listening test. The…

  17. Teaching Audio Playwriting: The Pedagogy of Drama Podcasting

    ERIC Educational Resources Information Center

    Eshelman, David J.

    2016-01-01

    This article suggests how teaching artists can develop practical coursework in audio playwriting. To prepare students to work in the reemergent audio drama medium, the author created a seminar course called Radio Theatre Writing, taught at Arkansas Tech University in the fall of 2014. The course had three sections. First, it focused on…

  18. Recent Audio-Visual Materials on the Soviet Union.

    ERIC Educational Resources Information Center

    Clarke, Edith Campbell

    1981-01-01

    Identifies and describes audio-visual materials (films, filmstrips, and audio cassette tapes) about the Soviet Union which have been produced since 1977. For each entry, information is presented on title, time required, date of release, cost (purchase and rental), and an abstract. (DB)

  19. Beyond Podcasting: Creative Approaches to Designing Educational Audio

    ERIC Educational Resources Information Center

    Middleton, Andrew

    2009-01-01

    This paper discusses a university-wide pilot designed to encourage academics to creatively explore learner-centred applications for digital audio. Participation in the pilot was diverse in terms of technical competence, confidence and contextual requirements and there was little prior experience of working with digital audio. Many innovative…

  20. Effect of Audio vs. Video on Aural Discrimination of Vowels

    ERIC Educational Resources Information Center

    McCrocklin, Shannon

    2012-01-01

    Despite the growing use of media in the classroom, the effects of using of audio versus video in pronunciation teaching has been largely ignored. To analyze the impact of the use of audio or video training on aural discrimination of vowels, 61 participants (all students at a large American university) took a pre-test followed by two training…

  1. Tune in the Net with RealAudio.

    ERIC Educational Resources Information Center

    Buchanan, Larry

    1997-01-01

    Describes how to connect to the RealAudio Web site to download a player that provides sound from Web pages to the computer through streaming technology. Explains hardware and software requirements and provides addresses for other RealAudio Web sites are provided, including weather information and current news. (LRW)

  2. Audio Podcasting in a Tablet PC-Enhanced Biochemistry Course

    ERIC Educational Resources Information Center

    Lyles, Heather; Robertson, Brian; Mangino, Michael; Cox, James R.

    2007-01-01

    This report describes the effects of making audio podcasts of all lectures in a large, basic biochemistry course promptly available to students. The audio podcasts complement a previously described approach in which a tablet PC is used to annotate PowerPoint slides with digital ink to produce electronic notes that can be archived. The fundamentals…

  3. A Case Study on Audio Feedback with Geography Undergraduates

    ERIC Educational Resources Information Center

    Rodway-Dyer, Sue; Knight, Jasper; Dunne, Elizabeth

    2011-01-01

    Several small-scale studies have suggested that audio feedback can help students to reflect on their learning and to develop deep learning approaches that are associated with higher attainment in assessments. For this case study, Geography undergraduates were given audio feedback on a written essay assignment, alongside traditional written…

  4. Technical Evaluation Report. 65. Video-Conferencing with Audio Software

    ERIC Educational Resources Information Center

    Baggaley, Jon; Klaas, Jim

    2006-01-01

    An online conference is illustrated using the format of a TV talk show. The conference combined live audio discussion with visual images spontaneously selected by the moderator in the manner of a TV control-room director. A combination of inexpensive online collaborative tools was used for the event, based on the browser-based audio-conferencing…

  5. Using Audio Books to Improve Reading and Academic Performance

    ERIC Educational Resources Information Center

    Montgomery, Joel R.

    2009-01-01

    This article highlights significant research about what below grade-level reading means in middle school classrooms and suggests a tested approach to improve reading comprehension levels significantly by using audio books. The use of these audio books can improve reading and academic performance for both English language learners (ELLs) and for…

  6. Use of Audio Modification in Science Vocabulary Assessment

    ERIC Educational Resources Information Center

    Adiguzel, Tufan

    2011-01-01

    The purposes of this study were to examine the utilization of audio modification in vocabulary assessment in school subject areas, specifically in elementary science, and to present a web-based key vocabulary assessment tool for the elementary school level. Audio-recorded readings were used to replace independent student readings as the task…

  7. Teaching Audio Playwriting: The Pedagogy of Drama Podcasting

    ERIC Educational Resources Information Center

    Eshelman, David J.

    2016-01-01

    This article suggests how teaching artists can develop practical coursework in audio playwriting. To prepare students to work in the reemergent audio drama medium, the author created a seminar course called Radio Theatre Writing, taught at Arkansas Tech University in the fall of 2014. The course had three sections. First, it focused on…

  8. A Case Study on Audio Feedback with Geography Undergraduates

    ERIC Educational Resources Information Center

    Rodway-Dyer, Sue; Knight, Jasper; Dunne, Elizabeth

    2011-01-01

    Several small-scale studies have suggested that audio feedback can help students to reflect on their learning and to develop deep learning approaches that are associated with higher attainment in assessments. For this case study, Geography undergraduates were given audio feedback on a written essay assignment, alongside traditional written…

  9. Making the Most of Audio. Technology in Language Learning Series.

    ERIC Educational Resources Information Center

    Barley, Anthony

    Prepared for practicing language teachers, this book's aim is to help them make the most of audio, a readily accessible resource. The book shows, with the help of numerous practical examples, how a range of language skills can be developed. Most examples are in French. Chapters cover the following information: (1) making the most of audio (e.g.,…

  10. A Multi-User Shared Hardware Audio Response System.

    ERIC Educational Resources Information Center

    Voth, Brian

    A flexible surface disc device serves as the basis for the design of a computer operated multi-user shared hardware audio response system. The device employs a slow moving flexible, air supported surface in contact with a large number of heads. Relatively instantaneous distribution of audio from any portion of the disc to the users in accomplished…

  11. 47 CFR 10.520 - Common audio attention signal.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... 10.520 Telecommunication FEDERAL COMMUNICATIONS COMMISSION GENERAL COMMERCIAL MOBILE ALERT SYSTEM... manufacturers may only market devices for public use under part 10 that include an audio attention signal that... interval between each repetition. (b) For devices that have polyphonic capabilities, the audio attention...

  12. Making the Most of Audio. Technology in Language Learning Series.

    ERIC Educational Resources Information Center

    Barley, Anthony

    Prepared for practicing language teachers, this book's aim is to help them make the most of audio, a readily accessible resource. The book shows, with the help of numerous practical examples, how a range of language skills can be developed. Most examples are in French. Chapters cover the following information: (1) making the most of audio (e.g.,…

  13. TVAR modeling of EEG to detect audio distraction during simulated driving

    NASA Astrophysics Data System (ADS)

    Dahal, Nabaraj; (Nanda Nandagopal, D.; Cocks, Bernadine; Vijayalakshmi, Ramasamy; Dasari, Naga; Gaertner, Paul

    2014-06-01

    Objective. The objective of our current study was to look for the EEG correlates that can reveal the engaged state of the brain while undertaking cognitive tasks. Specifically, we aimed to identify EEG features that could detect audio distraction during simulated driving. Approach. Time varying autoregressive (TVAR) analysis using Kalman smoother was carried out on short time epochs of EEG data collected from participants as they undertook two simulated driving tasks. TVAR coefficients were then used to construct all pole model enabling the identification of EEG features that could differentiate normal driving from audio distracted driving. Main results. Pole analysis of the TVAR model led to the visualization of event related synchronization/desynchronization (ERS/ERD) patterns in the form of pole displacements in pole plots of the temporal EEG channels in the z plane enabling the differentiation of the two driving conditions. ERS in the EEG data has been demonstrated during audio distraction as an associated phenomenon. Significance. Visualizing the ERD/ERS phenomenon in terms of pole displacement is a novel approach. Although ERS/ERD has previously been demonstrated as reliable when applied to motor related tasks, it is believed to be the first time that it has been applied to investigate human cognitive phenomena such as attention and distraction. Results confirmed that distracted/non-distracted driving states can be identified using this approach supporting its applicability to cognition research.

  14. Transitioning from analog to digital audio recording in childhood speech sound disorders

    PubMed Central

    Shriberg, Lawrence D.; McSweeny, Jane L.; Anderson, Bruce E.; Campbell, Thomas F.; Chial, Michael R.; Green, Jordan R.; Hauner, Katherina K.; Moore, Christopher A.; Rusiewicz, Heather L.; Wilson, David L.

    2014-01-01

    Few empirical findings or technical guidelines are available on the current transition from analog to digital audio recording in childhood speech sound disorders. Of particular concern in the present context was whether a transition from analog- to digital-based transcription and coding of prosody and voice features might require re-standardizing a reference database for research in childhood speech sound disorders. Two research transcribers with different levels of experience glossed, transcribed, and prosody-voice coded conversational speech samples from eight children with mild to severe speech disorders of unknown origin. The samples were recorded, stored, and played back using representative analog and digital audio systems. Effect sizes calculated for an array of analog versus digital comparisons ranged from negligible to medium, with a trend for participants’ speech competency scores to be slightly lower for samples obtained and transcribed using the digital system. We discuss the implications of these and other findings for research and clinical practise. PMID:16019779

  15. ESA personal communications and digital audio broadcasting systems based on non-geostationary satellites

    NASA Technical Reports Server (NTRS)

    Logalbo, P.; Benedicto, J.; Viola, R.

    1993-01-01

    Personal Communications and Digital Audio Broadcasting are two new services that the European Space Agency (ESA) is investigating for future European and Global Mobile Satellite systems. ESA is active in promoting these services in their various mission options including non-geostationary and geostationary satellite systems. A Medium Altitude Global Satellite System (MAGSS) for global personal communications at L and S-band, and a Multiregional Highly inclined Elliptical Orbit (M-HEO) system for multiregional digital audio broadcasting at L-band are described. Both systems are being investigated by ESA in the context of future programs, such as Archimedes, which are intended to demonstrate the new services and to develop the technology for future non-geostationary mobile communication and broadcasting satellites.

  16. A direct broadcast satellite-audio experiment

    NASA Technical Reports Server (NTRS)

    Vaisnys, Arvydas; Abbe, Brian; Motamedi, Masoud

    1992-01-01

    System studies have been carried out over the past three years at the Jet Propulsion Laboratory (JPL) on digital audio broadcasting (DAB) via satellite. The thrust of the work to date has been on designing power and bandwidth efficient systems capable of providing reliable service to fixed, mobile, and portable radios. It is very difficult to predict performance in an environment which produces random periods of signal blockage, such as encountered in mobile reception where a vehicle can quickly move from one type of terrain to another. For this reason, some signal blockage mitigation techniques were built into an experimental DAB system and a satellite experiment was conducted to obtain both qualitative and quantitative measures of performance in a range of reception environments. This paper presents results from the experiment and some conclusions on the effectiveness of these blockage mitigation techniques.

  17. Horatio Audio-Describes Shakespeare's "Hamlet": Blind and Low-Vision Theatre-Goers Evaluate an Unconventional Audio Description Strategy

    ERIC Educational Resources Information Center

    Udo, J. P.; Acevedo, B.; Fels, D. I.

    2010-01-01

    Audio description (AD) has been introduced as one solution for providing people who are blind or have low vision with access to live theatre, film and television content. However, there is little research to inform the process, user preferences and presentation style. We present a study of a single live audio-described performance of Hart House…

  18. Horatio Audio-Describes Shakespeare's "Hamlet": Blind and Low-Vision Theatre-Goers Evaluate an Unconventional Audio Description Strategy

    ERIC Educational Resources Information Center

    Udo, J. P.; Acevedo, B.; Fels, D. I.

    2010-01-01

    Audio description (AD) has been introduced as one solution for providing people who are blind or have low vision with access to live theatre, film and television content. However, there is little research to inform the process, user preferences and presentation style. We present a study of a single live audio-described performance of Hart House…

  19. Audio-Visual Speech Cue Combination

    PubMed Central

    Arnold, Derek H.; Tear, Morgan; Schindel, Ryan; Roseboom, Warrick

    2010-01-01

    Background Different sources of sensory information can interact, often shaping what we think we have seen or heard. This can enhance the precision of perceptual decisions relative to those made on the basis of a single source of information. From a computational perspective, there are multiple reasons why this might happen, and each predicts a different degree of enhanced precision. Relatively slight improvements can arise when perceptual decisions are made on the basis of multiple independent sensory estimates, as opposed to just one. These improvements can arise as a consequence of probability summation. Greater improvements can occur if two initially independent estimates are summated to form a single integrated code, especially if the summation is weighted in accordance with the variance associated with each independent estimate. This form of combination is often described as a Bayesian maximum likelihood estimate. Still greater improvements are possible if the two sources of information are encoded via a common physiological process. Principal Findings Here we show that the provision of simultaneous audio and visual speech cues can result in substantial sensitivity improvements, relative to single sensory modality based decisions. The magnitude of the improvements is greater than can be predicted on the basis of either a Bayesian maximum likelihood estimate or a probability summation. Conclusion Our data suggest that primary estimates of speech content are determined by a physiological process that takes input from both visual and auditory processing, resulting in greater sensitivity than would be possible if initially independent audio and visual estimates were formed and then subsequently combined. PMID:20419130

  20. Attention to sound improves auditory reliability in audio-tactile spatial optimal integration.

    PubMed

    Vercillo, Tiziana; Gori, Monica

    2015-01-01

    The role of attention on multisensory processing is still poorly understood. In particular, it is unclear whether directing attention toward a sensory cue dynamically reweights cue reliability during integration of multiple sensory signals. In this study, we investigated the impact of attention in combining audio-tactile signals in an optimal fashion. We used the Maximum Likelihood Estimation (MLE) model to predict audio-tactile spatial localization on the body surface. We developed a new audio-tactile device composed by several small units, each one consisting of a speaker and a tactile vibrator independently controllable by external software. We tested participants in an attentional and a non-attentional condition. In the attentional experiment, participants performed a dual task paradigm: they were required to evaluate the duration of a sound while performing an audio-tactile spatial task. Three unisensory or multisensory stimuli, conflictual or not conflictual sounds and vibrations arranged along the horizontal axis, were presented sequentially. In the primary task participants had to evaluate in a space bisection task the position of the second stimulus (the probe) with respect to the others (the standards). In the secondary task they had to report occasionally changes in duration of the second auditory stimulus. In the non-attentional task participants had only to perform the primary task (space bisection). Our results showed an enhanced auditory precision (and auditory weights) in the auditory attentional condition with respect to the control non-attentional condition. The results of this study support the idea that modality-specific attention modulates multisensory integration.

  1. Enabling Low-Power, Multi-Modal Neural Interfaces Through a Common, Low-Bandwidth Feature Space.

    PubMed

    Irwin, Zachary T; Thompson, David E; Schroeder, Karen E; Tat, Derek M; Hassani, Ali; Bullard, Autumn J; Woo, Shoshana L; Urbanchek, Melanie G; Sachs, Adam J; Cederna, Paul S; Stacey, William C; Patil, Parag G; Chestek, Cynthia A

    2016-05-01

    Brain-Machine Interfaces (BMIs) have shown great potential for generating prosthetic control signals. Translating BMIs into the clinic requires fully implantable, wireless systems; however, current solutions have high power requirements which limit their usability. Lowering this power consumption typically limits the system to a single neural modality, or signal type, and thus to a relatively small clinical market. Here, we address both of these issues by investigating the use of signal power in a single narrow frequency band as a decoding feature for extracting information from electrocorticographic (ECoG), electromyographic (EMG), and intracortical neural data. We have designed and tested the Multi-modal Implantable Neural Interface (MINI), a wireless recording system which extracts and transmits signal power in a single, configurable frequency band. In prerecorded datasets, we used the MINI to explore low frequency signal features and any resulting tradeoff between power savings and decoding performance losses. When processing intracortical data, the MINI achieved a power consumption 89.7% less than a more typical system designed to extract action potential waveforms. When processing ECoG and EMG data, the MINI achieved similar power reductions of 62.7% and 78.8%. At the same time, using the single signal feature extracted by the MINI, we were able to decode all three modalities with less than a 9% drop in accuracy relative to using high-bandwidth, modality-specific signal features. We believe this system architecture can be used to produce a viable, cost-effective, clinical BMI.

  2. Thermal and neutron-physical features of the nuclear reactor for a power pulsation plant for space applications

    NASA Astrophysics Data System (ADS)

    Gordeev, É. G.; Kaminskii, A. S.; Konyukhov, G. V.; Pavshuk, V. A.; Turbina, T. A.

    2012-05-01

    We have explored the possibility of creating small-size reactors with a high power output with the provision of thermal stability and nuclear safety under standard operating conditions and in emergency situations. The neutron-physical features of such a reactor have been considered and variants of its designs preserving the main principles and approaches of nuclear rocket engine technology are presented.

  3. Materials Science Research Hardware for Application on the International Space Station: an Overview of Typical Hardware Requirements and Features

    NASA Technical Reports Server (NTRS)

    Schaefer, D. A.; Cobb, S.; Fiske, M. R.; Srinivas, R.

    2000-01-01

    NASA's Marshall Space Flight Center (MSFC) is the lead center for Materials Science Microgravity Research. The Materials Science Research Facility (MSRF) is a key development effort underway at MSFC. The MSRF will be the primary facility for microgravity materials science research on board the International Space Station (ISS) and will implement the NASA Materials Science Microgravity Research Program. It will operate in the U.S. Laboratory Module and support U. S. Microgravity Materials Science Investigations. This facility is being designed to maintain the momentum of the U.S. role in microgravity materials science and support NASA's Human Exploration and Development of Space (HEDS) Enterprise goals and objectives for Materials Science. The MSRF as currently envisioned will consist of three Materials Science Research Racks (MSRR), which will be deployed to the International Space Station (ISS) in phases, Each rack is being designed to accommodate various Experiment Modules, which comprise processing facilities for peer selected Materials Science experiments. Phased deployment will enable early opportunities for the U.S. and International Partners, and support the timely incorporation of technology updates to the Experiment Modules and sensor devices.

  4. Noninvertible watermarking methods for MPEG-encoded audio

    NASA Astrophysics Data System (ADS)

    Qiao, Lintian; Nahrstedt, Klara

    1999-04-01

    Nowadays the multimedia technology in distributed environments becomes realistic and the multimedia copyright protection issue becomes more and more important. Various digital watermarking techniques have been proposed in recent years as the methods to protect the copyright of multimedia data. Although, conceptually, these techniques can be easily extended for protecting digital audio data, it is challenging to apply these techniques to MPEG Audio streams because we need to design the watermarking schemes working directly in the compressed data domain. In this paper, we present watermarking methods which will embed the watermark directly into the MPEG audio bit streams rather than going through expensive decoding/encoding process in order to apply watermarking schemes in uncompressed data domain. Among the two presented schemes, one embeds the watermark into the Scale Factors of the MPEG audio streams and another one embeds the watermark into the MPEG encoded samples. Our experimental results show that both methods perform well and the distortion could be controlled at the minimal level. While we use MPEG Audio Layer II streams in our experimental tests, the proposed schemes can be applied to MPEG Audio Layer I and III. Furthermore, by enforcing creation of the watermark through a standard encryption function such as DES, the proposed schemes will be successful in resolving rightful ownership of watermarked MPEG audio.

  5. An inconclusive digital audio authenticity examination: a unique case.

    PubMed

    Koenig, Bruce E; Lacey, Douglas S

    2012-01-01

    This case report sets forth an authenticity examination of 35 encrypted, proprietary-format digital audio files containing recorded telephone conversations between two codefendants in a criminal matter. The codefendant who recorded the conversations did so on a recording system he developed; additionally, he was both a forensic audio authenticity examiner, who had published and presented in the field, and was the head of a professional audio society's writing group for authenticity standards. The authors conducted the examination of the recordings following nine laboratory steps of the peer-reviewed and published 11-step digital audio authenticity protocol. Based considerably on the codefendant's direct involvement with the development of the encrypted audio format, his experience in the field of forensic audio authenticity analysis, and the ease with which the audio files could be accessed, converted, edited in the gap areas, and reconstructed in such a way that the processes were undetected, the authors concluded that the recordings could not be scientifically authenticated through accepted forensic practices.

  6. How we give personalised audio feedback after summative OSCEs.

    PubMed

    Harrison, Christopher J; Molyneux, Adrian J; Blackwell, Sara; Wass, Valerie J

    2015-04-01

    Students often receive little feedback after summative objective structured clinical examinations (OSCEs) to enable them to improve their performance. Electronic audio feedback has shown promise in other educational areas. We investigated the feasibility of electronic audio feedback in OSCEs. An electronic OSCE system was designed, comprising (1) an application for iPads allowing examiners to mark in the key consultation skill domains, provide "tick-box" feedback identifying strengths and difficulties, and record voice feedback; (2) a feedback website giving students the opportunity to view/listen in multiple ways to the feedback. Acceptability of the audio feedback was investigated, using focus groups with students and questionnaires with both examiners and students. 87 (95%) students accessed the examiners' audio comments; 83 (90%) found the comments useful and 63 (68%) reported changing the way they perform a skill as a result of the audio feedback. They valued its highly personalised, relevant nature and found it much more useful than written feedback. Eighty-nine per cent of examiners gave audio feedback to all students on their stations. Although many found the method easy, lack of time was a factor. Electronic audio feedback provides timely, personalised feedback to students after a summative OSCE provided enough time is allocated to the process.

  7. Effects of aging on audio-visual speech integration.

    PubMed

    Huyse, Aurélie; Leybaert, Jacqueline; Berthommier, Frédéric

    2014-10-01

    This study investigated the impact of aging on audio-visual speech integration. A syllable identification task was presented in auditory-only, visual-only, and audio-visual congruent and incongruent conditions. Visual cues were either degraded or unmodified. Stimuli were embedded in stationary noise alternating with modulated noise. Fifteen young adults and 15 older adults participated in this study. Results showed that older adults had preserved lipreading abilities when the visual input was clear but not when it was degraded. The impact of aging on audio-visual integration also depended on the quality of the visual cues. In the visual clear condition, the audio-visual gain was similar in both groups and analyses in the framework of the fuzzy-logical model of perception confirmed that older adults did not differ from younger adults in their audio-visual integration abilities. In the visual reduction condition, the audio-visual gain was reduced in the older group, but only when the noise was stationary, suggesting that older participants could compensate for the loss of lipreading abilities by using the auditory information available in the valleys of the noise. The fuzzy-logical model of perception confirmed the significant impact of aging on audio-visual integration by showing an increased weight of audition in the older group.

  8. Sampling Function of Degree 2 for DVD-Audio

    NASA Astrophysics Data System (ADS)

    Toraichi, Kazuo; Nakamura, Koji

    Authors have been studying Fluency Information Theory that generalizes Shannon’s sampling theorem and its applications. Among the practical application of the research, the Fluency DAC that is developed as the Digital-to-analog converter for CD audio could have received objective valuation including receipt Golden Sound Award in 1988. In recent, DVD-Audio that deal with maximum sampling rate of 192 kHz has appeared. Due to the introduction of DVD audio that requires four times the sampling rate of nowadays CD audio, the request for developing a new Fluency DAC for DVD audio was initiated. From such requirements, the research for developing the Fluency DAC for DVD-Audio has been started. The result of the research could revive awards in local contest in Japan audio apparatus at 2000 and 2001. As the initial report on our project in developing the Fluency DAC that is capable of dealing with a maximum sampling rate of 192kHz, in this paper we aimed to derive the sampling function that acts as the impulse response for such a D/A converter.

  9. Characteristics of Abductive Inquiry in Earth and Space Science: An Undergraduate Teacher Prospective Case Study

    NASA Astrophysics Data System (ADS)

    Ramalis, T. R.; Liliasari; Herdiwidjaya, D.

    2016-08-01

    The purpose this case study was to describe characteristic features learning activities in the domain of earth and space science. Context of this study is earth and space learning activities on three groups of student teachers prospective, respectively on the subject of the shape and size of Earth, land and sea breeze, and moon's orbit. The analysis is conducted qualitatively from activity data and analyze students doing project work, student worksheets, group project report documents, note and audio recordings of discussion. Research findings identified the type of abduction: theoretical models abduction, factual abduction, and law abduction during the learning process. Implications for science inquiry learning as well as relevant research were suggested.

  10. Taenia crassiceps injection into the subarachnoid space of rats simulates radiological and morphological features of racemose neurocysticercosis.

    PubMed

    Hamamoto Filho, Pedro Tadao; Fabro, Alexandre Todorovic; Rodrigues, Marianna Vaz; Bazan, Rodrigo; Vulcano, Luiz Carlos; Biondi, Germano Francisco; Zanini, Marco Antônio

    2017-01-01

    Neurocysticercosis is a major public health concern. Although its eradication appears feasible, the disease remains endemic in developing countries and has emerged again in Europe and in the USA. Basic studies on neurocysticercosis are needed to better understand the pathophysiologic mechanisms and, consequently, to improve treatment perspectives. Much has been published on experimental parenchymal neurocysticercosis, but there are no experimental models of racemose neurocysticercosis. Cysts of Taenia crassiceps were injected into the subarachnoid space of 11 rats. After 4 months, magnetic resonance imaging (MRI) was performed to verify the occurrence of ventricular dilatation and the distribution of cysts in the cerebrospinal fluid compartments. The histologic assessment was done focusing on changes in the ependyma, choroid plexus, and brain parenchyma. MRI and histologic assessment confirmed the findings similar to those seen in human racemose neurocysticercosis including enlargement of the basal cisterns, hydrocephalus, and inflammatory infiltration through the ependyma and choroid plexus into cerebrospinal fluid spaces. We developed a simple model of racemose neurocysticercosis by injecting cysts of T. crassiceps into the subarachnoid space of rats. This model can help understand the pathophysiologic mechanisms of the disease.

  11. The spacing of the interstellar 6.2 and 7.7 micron emission features as an indicator of polycyclic aromatic hydrocarbon size

    NASA Technical Reports Server (NTRS)

    Hudgins, D. M.; Allamandola, L. J.

    1999-01-01

    A database of astrophysically relevant, infrared spectral measurements on a wide variety of neutral as well as positively and negatively charged polycyclic aromatic hydrocarbons (PAHs), ranging in size from C10H8 through C48H20, is now available to extend the interstellar PAH model. Beyond simply indicating general characteristics of the carriers, this collection of data now makes it possible to conduct a more thorough interpretation of the details of the interstellar spectra and thereby derive deeper insights into the nature of the emitting material and conditions in the emission zones. This Letter is the first such implementation of this spectral database. The infrared spectra of PAH cations, the main PAH form in the most energetic emission zones, are usually dominated by a few strong features in the 1650-1100 cm-1 (6.1-9.1 microns) region that tend to cluster the vicinity of the interstellar emission bands at 1610 and 1320 cm-1 (6.2 and 7.6 microns), but with spacings typically somewhat less than that observed in the canonical interstellar spectrum. However, the spectra in the database show that this spacing increases steadily with molecular size. Extrapolation of this trend indicates that PAHs in the 50-80 carbon atom size range are entirely consistent with the observed interstellar spacing. Furthermore, the profile of the 1610 cm-1 (6.2 microns) interstellar band indicates that PAHs containing as few as 20 carbon atoms contribute to this feature.

  12. The Spacing of the Interstellar 6.2 Microns and 7.7 Microns Emission Features as an Indicator of PAH Size

    NASA Technical Reports Server (NTRS)

    Hudgins, Douglas M.; Allamandola, L. J.; Mead, Susan (Technical Monitor)

    1998-01-01

    A database of astrophysically relevant, infrared spectral measurements on a wide variety of neutral as well as positively and negatively charged polycyclic aromatic hydrocarbons ranging in size from C10H8 through C48H20 is now available to extend the interstellar PAH model. Beyond simply indicating general characteristics of the carriers, this collection of data now makes it possible to conduct a more thorough interpretation of the details of the interstellar spectra and thereby derive deeper insights into the nature of the emitting material and conditions in the emission zones. This paper is the first such implementation of this spectral database. The infrared spectra of PAH cations, the main PAH form in the most energetic emission zones, are usually dominated by a few strong features in the 1650 - 1100 per centimeter (6.1 - 9.1 microns) region which tend to cluster in the vicinity of the interstellar emission bands at 1610 per centimeter and 1320 per centimeter (6.2 and 7.6 microns) but with spacings typically somewhat less than that observed in the canonical interstellar spectrum. However, the spectra in the database show that this spacing increases steadily with molecular size. Extrapolation of this trend indicates that PAHS in the 50 to 80 carbon atom size range are entirely consistent with the observed interstellar spacing. Furthermore, the profile of the 1610 per centimeter (6.2 microns) interstellar band indicates that PAHS containing as few as 20 carbon atoms contribute to this feature.

  13. A review of lossless audio compression standards and algorithms

    NASA Astrophysics Data System (ADS)

    Muin, Fathiah Abdul; Gunawan, Teddy Surya; Kartiwi, Mira; Elsheikh, Elsheikh M. A.

    2017-09-01

    Over the years, lossless audio compression has gained popularity as researchers and businesses has become more aware of the need for better quality and higher storage demand. This paper will analyse various lossless audio coding algorithm and standards that are used and available in the market focusing on Linear Predictive Coding (LPC) specifically due to its popularity and robustness in audio compression, nevertheless other prediction methods are compared to verify this. Advanced representation of LPC such as LSP decomposition techniques are also discussed within this paper.

  14. Musical examination to bridge audio data and sheet music

    NASA Astrophysics Data System (ADS)

    Pan, Xunyu; Cross, Timothy J.; Xiao, Liangliang; Hei, Xiali

    2015-03-01

    The digitalization of audio is commonly implemented for the purpose of convenient storage and transmission of music and songs in today's digital age. Analyzing digital audio for an insightful look at a specific musical characteristic, however, can be quite challenging for various types of applications. Many existing musical analysis techniques can examine a particular piece of audio data. For example, the frequency of digital sound can be easily read and identified at a specific section in an audio file. Based on this information, we could determine the musical note being played at that instant, but what if you want to see a list of all the notes played in a song? While most existing methods help to provide information about a single piece of the audio data at a time, few of them can analyze the available audio file on a larger scale. The research conducted in this work considers how to further utilize the examination of audio data by storing more information from the original audio file. In practice, we develop a novel musical analysis system Musicians Aid to process musical representation and examination of audio data. Musicians Aid solves the previous problem by storing and analyzing the audio information as it reads it rather than tossing it aside. The system can provide professional musicians with an insightful look at the music they created and advance their understanding of their work. Amateur musicians could also benefit from using it solely for the purpose of obtaining feedback about a song they were attempting to play. By comparing our system's interpretation of traditional sheet music with their own playing, a musician could ensure what they played was correct. More specifically, the system could show them exactly where they went wrong and how to adjust their mistakes. In addition, the application could be extended over the Internet to allow users to play music with one another and then review the audio data they produced. This would be particularly

  15. Towards Structural Analysis of Audio Recordings in the Presence of Musical Variations

    NASA Astrophysics Data System (ADS)

    Müller, Meinard; Kurth, Frank

    2006-12-01

    One major goal of structural analysis of an audio recording is to automatically extract the repetitive structure or, more generally, the musical form of the underlying piece of music. Recent approaches to this problem work well for music, where the repetitions largely agree with respect to instrumentation and tempo, as is typically the case for popular music. For other classes of music such as Western classical music, however, musically similar audio segments may exhibit significant variations in parameters such as dynamics, timbre, execution of note groups, modulation, articulation, and tempo progression. In this paper, we propose a robust and efficient algorithm for audio structure analysis, which allows to identify musically similar segments even in the presence of large variations in these parameters. To account for such variations, our main idea is to incorporate invariance at various levels simultaneously: we design a new type of statistical features to absorb microvariations, introduce an enhanced local distance measure to account for local variations, and describe a new strategy for structure extraction that can cope with the global variations. Our experimental results with classical and popular music show that our algorithm performs successfully even in the presence of significant musical variations.

  16. Development of Audio Watermark Technology to be Extracted Contactlessly by Cell Phone

    NASA Astrophysics Data System (ADS)

    Modegi, Toshio

    We have proposed a novel audio watermarking technology, which embeds a set of bitstream data by changing two-channel stereo locations of lower frequency components in an embedding target audio signal. This method features nearly lossless embedding, robustness against lossy data compression or analogue conversion, and enables contactless asynchronous detection of embedded watermarks through speaker and microphone devices without the original audio signals. Then, we propose several extended monaural embedding methods, which embed data by changing level balances between two sets of lower frequency components, which are divided by either frequency or temporal dimensions. In these level changes, we remove lower frequency components of the left channel signal and add the removed components to the corresponding components of the right channel of given stereo signal in order to support also nearly lossless embedding. These methods enable watermark detection from only the left channel signal by a single monaural microphone, and we have confirmed detection capability by a cell phone. In this paper, we describe abstracts of our proposed watermark embedding and extracting algorithms, experimental results of watermark extraction precision.

  17. Biomedical image representation approach using visualness and spatial information in a concept feature space for interactive region-of-interest-based retrieval

    PubMed Central

    Rahman, Md. Mahmudur; Antani, Sameer K.; Demner-Fushman, Dina; Thoma, George R.

    2015-01-01

    Abstract. This article presents an approach to biomedical image retrieval by mapping image regions to local concepts where images are represented in a weighted entropy-based concept feature space. The term “concept” refers to perceptually distinguishable visual patches that are identified locally in image regions and can be mapped to a glossary of imaging terms. Further, the visual significance (e.g., visualness) of concepts is measured as the Shannon entropy of pixel values in image patches and is used to refine the feature vector. Moreover, the system can assist the user in interactively selecting a region-of-interest (ROI) and searching for similar image ROIs. Further, a spatial verification step is used as a postprocessing step to improve retrieval results based on location information. The hypothesis that such approaches would improve biomedical image retrieval is validated through experiments on two different data sets, which are collected from open access biomedical literature. PMID:26730398

  18. Biomedical image representation approach using visualness and spatial information in a concept feature space for interactive region-of-interest-based retrieval.

    PubMed

    Rahman, Md Mahmudur; Antani, Sameer K; Demner-Fushman, Dina; Thoma, George R

    2015-10-01

    This article presents an approach to biomedical image retrieval by mapping image regions to local concepts where images are represented in a weighted entropy-based concept feature space. The term "concept" refers to perceptually distinguishable visual patches that are identified locally in image regions and can be mapped to a glossary of imaging terms. Further, the visual significance (e.g., visualness) of concepts is measured as the Shannon entropy of pixel values in image patches and is used to refine the feature vector. Moreover, the system can assist the user in interactively selecting a region-of-interest (ROI) and searching for similar image ROIs. Further, a spatial verification step is used as a postprocessing step to improve retrieval results based on location information. The hypothesis that such approaches would improve biomedical image retrieval is validated through experiments on two different data sets, which are collected from open access biomedical literature.

  19. Performance Characterization of Loctite (Registered Trademark) 242 and 271 Liquid Locking Compounds (LLCs) as a Secondary Locking Feature for International Space Station (ISS) Fasteners

    NASA Technical Reports Server (NTRS)

    Dube, Michael J.; Gamwell, Wayne R.

    2011-01-01

    Several International Space Station (ISS) hardware components use Loctite (and other polymer based liquid locking compounds (LLCs)) as a means of meeting the secondary (redundant) locking feature requirement for fasteners. The primary locking method is the fastener preload, with the application of the Loctite compound which when cured is intended to resist preload reduction. The reliability of these compounds has been questioned due to a number of failures during ground testing. The ISS Program Manager requested the NASA Engineering and Safety Center (NESC) to characterize and quantify sensitivities of Loctite being used as a secondary locking feature. The findings and recommendations provided in this investigation apply to the anaerobic LLCs Loctite 242 and 271. No other anaerobic LLCs were evaluated for this investigation. This document contains the findings and recommendations of the NESC investigation

  20. Nonlinear dimensionality reduction of CT histogram based feature space for predicting recurrence-free survival in non-small-cell lung cancer

    NASA Astrophysics Data System (ADS)

    Kawata, Y.; Niki, N.; Ohmatsu, H.; Aokage, K.; Kusumoto, M.; Tsuchida, T.; Eguchi, K.; Kaneko, M.

    2015-03-01

    Advantages of CT scanners with high resolution have allowed the improved detection of lung cancers. In the recent release of positive results from the National Lung Screening Trial (NLST) in the US showing that CT screening does in fact have a positive impact on the reduction of lung cancer related mortality. While this study does show the efficacy of CT based screening, physicians often face the problems of deciding appropriate management strategies for maximizing patient survival and for preserving lung function. Several key manifold-learning approaches efficiently reveal intrinsic low-dimensional structures latent in high-dimensional data spaces. This study was performed to investigate whether the dimensionality reduction can identify embedded structures from the CT histogram feature of non-small-cell lung cancer (NSCLC) space to improve the performance in predicting the likelihood of RFS for patients with NSCLC.

  1. Analysis of musical expression in audio signals

    NASA Astrophysics Data System (ADS)

    Dixon, Simon

    2003-01-01

    In western art music, composers communicate their work to performers via a standard notation which specificies the musical pitches and relative timings of notes. This notation may also include some higher level information such as variations in the dynamics, tempo and timing. Famous performers are characterised by their expressive interpretation, the ability to convey structural and emotive information within the given framework. The majority of work on audio content analysis focusses on retrieving score-level information; this paper reports on the extraction of parameters describing the performance, a task which requires a much higher degree of accuracy. Two systems are presented: BeatRoot, an off-line beat tracking system which finds the times of musical beats and tracks changes in tempo throughout a performance, and the Performance Worm, a system which provides a real-time visualisation of the two most important expressive dimensions, tempo and dynamics. Both of these systems are being used to process data for a large-scale study of musical expression in classical and romantic piano performance, which uses artificial intelligence (machine learning) techniques to discover fundamental patterns or principles governing expressive performance.

  2. Personal audio with a planar bright zone.

    PubMed

    Coleman, Philip; Jackson, Philip J B; Olik, Marek; Pedersen, Jan Abildgaard

    2014-10-01

    Reproduction of multiple sound zones, in which personal audio programs may be consumed without the need for headphones, is an active topic in acoustical signal processing. Many approaches to sound zone reproduction do not consider control of the bright zone phase, which may lead to self-cancellation problems if the loudspeakers surround the zones. Conversely, control of the phase in a least-squares sense comes at a cost of decreased level difference between the zones and frequency range of cancellation. Single-zone approaches have considered plane wave reproduction by focusing the sound energy in to a point in the wavenumber domain. In this article, a planar bright zone is reproduced via planarity control, which constrains the bright zone energy to impinge from a narrow range of angles via projection in to a spatial domain. Simulation results using a circular array surrounding two zones show the method to produce superior contrast to the least-squares approach, and superior planarity to the contrast maximization approach. Practical performance measurements obtained in an acoustically treated room verify the conclusions drawn under free-field conditions.

  3. Minimally radiating sources for personal audio.

    PubMed

    Elliott, Stephen J; Cheer, Jordan; Murfet, Harry; Holland, Keith R

    2010-10-01

    In order to reduce annoyance from the audio output of personal devices, it is necessary to maintain the sound level at the user position while minimizing the levels elsewhere. If the dark zone, within which the sound is to be minimized, extends over the whole far field of the source, the problem reduces to that of minimizing the radiated sound power while maintaining the pressure level at the user position. It is shown analytically that the optimum two-source array then has a hypercardioid directivity and gives about 7 dB reduction in radiated sound power, compared with a monopole producing the same on-axis pressure. The performance of other linear arrays is studied using monopole simulations for the motivating example of a mobile phone. The trade-off is investigated between the performance in reducing radiated noise, and the electrical power required to drive the array for different numbers of elements. It is shown for both simulations and experiments conducted on a small array of loudspeakers under anechoic conditions, that both two and three element arrays provide a reasonable compromise between these competing requirements. The implementation of the two-source array in a coupled enclosure is also shown to reduce the electrical power requirements.

  4. Localization performance with synthesized directional audio

    NASA Astrophysics Data System (ADS)

    Agnew, Jeffrey R.; German, Valencia; Calhoun, Gloria L.; Ericson, Mark A.

    1990-07-01

    This report summarizes three studies designed to measure and compare the ability of subjects to localize sounds in azimuth, via headphones, generated by two prototype auditory localization cue synthesizers. In the first study, performance differences were found between the two synthesizers in certain areas of the azimuth plane. Additionally, the design of a synthesizer (e.g., resolution and interpolating between head-related transfer functions (HRTF's)) can impact the perceived direction of the acoustic signals. Previous research with directional audio suggests that the veridicality of 3-D auditory displays could be optimized if individualized HRTF's are employed to synthesize the virtual sound sources, particularly in elevation. However, data from this experiment suggest that this design requirement can be relaxed, especially if only azimuth information is to be conveyed by the localization synthesizer. In the second study, two response methods for measuring localization performance were evaluated. No performance differences were found when subjects either verbally reported angular estimates or pointed to a circle to indicate the perceived direction of the target stimuli. In the third study, performance was impacted by manipulating the bandwidth of the acoustic signal and head movement.

  5. Audio-visual recording in the surgery: do patients mind?

    PubMed Central

    Campbell, I. K.

    1982-01-01

    The results of questionnaires completed by 145 patients following audio-visual recording of their consultations are analysed. It is concluded that the technique is well accepted and non-intrusive. ImagesFigure 1.Figure 2. PMID:7143315

  6. Audio-visual temporal perception in children with restored hearing.

    PubMed

    Gori, Monica; Chilosi, Anna; Forli, Francesca; Burr, David

    2017-05-01

    It is not clear how audio-visual temporal perception develops in children with restored hearing. In this study we measured temporal discrimination thresholds with an audio-visual temporal bisection task in 9 deaf children with restored audition, and 22 typically hearing children. In typically hearing children, audition was more precise than vision, with no gain in multisensory conditions (as previously reported in Gori et al. (2012b)). However, deaf children with restored audition showed similar thresholds for audio and visual thresholds and some evidence of gain in audio-visual temporal multisensory conditions. Interestingly, we found a strong correlation between auditory weighting of multisensory signals and quality of language: patients who gave more weight to audition had better language skills. Similarly, auditory thresholds for the temporal bisection task were also a good predictor of language skills. This result supports the idea that the temporal auditory processing is associated with language development. Copyright © 2017. Published by Elsevier Ltd.

  7. Audio CAPTCHA for SIP-Based VoIP

    NASA Astrophysics Data System (ADS)

    Soupionis, Yannis; Tountas, George; Gritzalis, Dimitris

    Voice over IP (VoIP) introduces new ways of communication, while utilizing existing data networks to provide inexpensive voice communications worldwide as a promising alternative to the traditional PSTN telephony. SPam over Internet Telephony (SPIT) is one potential source of future annoyance in VoIP. A common way to launch a SPIT attack is the use of an automated procedure (bot), which generates calls and produces audio advertisements. In this paper, our goal is to design appropriate CAPTCHA to fight such bots. We focus on and develop audio CAPTCHA, as the audio format is more suitable for VoIP environments and we implement it in a SIP-based VoIP environment. Furthermore, we suggest and evaluate the specific attributes that audio CAPTCHA should incorporate in order to be effective, and test it against an open source bot implementation.

  8. Video genre classification using multimodal features

    NASA Astrophysics Data System (ADS)

    Jin, Sung Ho; Bae, Tae Meon; Choo, Jin Ho; Ro, Yong Man

    2003-12-01

    We propose a video genre classification method using multimodal features. The proposed method is applied for the preprocessing of automatic video summarization or the retrieval and classification of broadcasting video contents. Through a statistical analysis of low-level and middle-level audio-visual features in video, the proposed method can achieve good performance in classifying several broadcasting genres such as cartoon, drama, music video, news, and sports. In this paper, we adopt MPEG-7 audio-visual descriptors as multimodal features of video contents and evaluate the performance of the classification by feeding the features into a decision tree-based classifier which is trained by CART. The experimental results show that the proposed method can recognize several broadcasting video genres with a high accuracy and the classification performance with multimodal features is superior to the one with unimodal features in the genre classification.

  9. 17 CFR 232.304 - Graphic, image, audio and video material.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... 17 Commodity and Securities Exchanges 2 2012-04-01 2012-04-01 false Graphic, image, audio and... Submissions § 232.304 Graphic, image, audio and video material. (a) If a filer includes graphic, image, audio..., image, audio or video material is presented in the delivered version, or they may be listed in an...

  10. 17 CFR 232.304 - Graphic, image, audio and video material.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... 17 Commodity and Securities Exchanges 2 2013-04-01 2013-04-01 false Graphic, image, audio and... Submissions § 232.304 Graphic, image, audio and video material. (a) If a filer includes graphic, image, audio..., image, audio or video material is presented in the delivered version, or they may be listed in an...

  11. 17 CFR 232.304 - Graphic, image, audio and video material.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... 17 Commodity and Securities Exchanges 3 2014-04-01 2014-04-01 false Graphic, image, audio and... Submissions § 232.304 Graphic, image, audio and video material. (a) If a filer includes graphic, image, audio..., image, audio or video material is presented in the delivered version, or they may be listed in an...

  12. 17 CFR 232.304 - Graphic, image, audio and video material.

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... 17 Commodity and Securities Exchanges 2 2011-04-01 2011-04-01 false Graphic, image, audio and... Submissions § 232.304 Graphic, image, audio and video material. (a) If a filer includes graphic, image, audio..., image, audio or video material is presented in the delivered version, or they may be listed in an...

  13. 37 CFR 201.27 - Initial notice of distribution of digital audio recording devices or media.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... distribution of digital audio recording devices or media. 201.27 Section 201.27 Patents, Trademarks, and... Initial notice of distribution of digital audio recording devices or media. (a) General. This section..., any digital audio recording device or digital audio recording medium in the United States....

  14. 37 CFR 201.28 - Statements of Account for digital audio recording devices or media.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... digital audio recording devices or media. 201.28 Section 201.28 Patents, Trademarks, and Copyrights U.S... of Account for digital audio recording devices or media. (a) General. This section prescribes rules... United States any digital audio recording device or digital audio recording medium. (b) Definitions....

  15. 37 CFR 201.27 - Initial notice of distribution of digital audio recording devices or media.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... distribution of digital audio recording devices or media. 201.27 Section 201.27 Patents, Trademarks, and... § 201.27 Initial notice of distribution of digital audio recording devices or media. (a) General. This... and distribute, any digital audio recording device or digital audio recording medium in the...

  16. 37 CFR 201.28 - Statements of Account for digital audio recording devices or media.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... digital audio recording devices or media. 201.28 Section 201.28 Patents, Trademarks, and Copyrights... of Account for digital audio recording devices or media. (a) General. This section prescribes rules... United States any digital audio recording device or digital audio recording medium. (b) Definitions....

  17. 37 CFR 201.28 - Statements of Account for digital audio recording devices or media.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... digital audio recording devices or media. 201.28 Section 201.28 Patents, Trademarks, and Copyrights... of Account for digital audio recording devices or media. (a) General. This section prescribes rules... United States any digital audio recording device or digital audio recording medium. (b) Definitions....

  18. 37 CFR 201.28 - Statements of Account for digital audio recording devices or media.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... digital audio recording devices or media. 201.28 Section 201.28 Patents, Trademarks, and Copyrights... of Account for digital audio recording devices or media. (a) General. This section prescribes rules... United States any digital audio recording device or digital audio recording medium. (b) Definitions....

  19. 37 CFR 201.27 - Initial notice of distribution of digital audio recording devices or media.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... distribution of digital audio recording devices or media. 201.27 Section 201.27 Patents, Trademarks, and... Initial notice of distribution of digital audio recording devices or media. (a) General. This section..., any digital audio recording device or digital audio recording medium in the United States....

  20. 37 CFR 201.27 - Initial notice of distribution of digital audio recording devices or media.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... distribution of digital audio recording devices or media. 201.27 Section 201.27 Patents, Trademarks, and... Initial notice of distribution of digital audio recording devices or media. (a) General. This section..., any digital audio recording device or digital audio recording medium in the United States....

  1. Getting Started with CD Audio in HyperCard.

    ERIC Educational Resources Information Center

    Decker, Donald A.

    1992-01-01

    This article examines the use of the Voyager Compact Disk (CD) AudioStack to provide HyperCard stacks designed to promote language learning with the ability to play on common precisely specified portions of off-the-shelf audio compact disks in a CD-ROM drive. Four German and Russian HyperCard stacks are described and their construction outlined.…

  2. Studies on a Spatialized Audio Interface for Sonar

    DTIC Science & Technology

    2011-10-03

    the proper collection of sound atoms are provided by some other acoustic source. In the case of binaural perception, particularly with multiple...University, provided applied binaural research on human auditory processing of spatialized (or 3D) audio over headphones. A comprehensive summary is...provided applied binaural research on human auditory processing of spatialized (or 3D) audio over headphones. A comprehensive summary is presented of a

  3. Getting Started with CD Audio in HyperCard.

    ERIC Educational Resources Information Center

    Decker, Donald A.

    1992-01-01

    This article examines the use of the Voyager Compact Disk (CD) AudioStack to provide HyperCard stacks designed to promote language learning with the ability to play on common precisely specified portions of off-the-shelf audio compact disks in a CD-ROM drive. Four German and Russian HyperCard stacks are described and their construction outlined.…

  4. Concepts, strategies and potentials using hypo-g and other features of the space environment for commercialization using higher plants

    NASA Technical Reports Server (NTRS)

    Krikorian, A. D.

    1985-01-01

    Opportunities for releasing, capturing, constructing and/or fixing the differential expressions or response potentials of the higher plant genome in the hypo-g environment for commercialization are explored. General strategies include improved plant-growing, crop and forestry production systems which conserve soil, water, labor and energy resources, and nutritional partitioning and mobilization of nutrients and synthates. Tissue and cell culture techniques of commercial potential include the growing and manipulation of cultured plant cells in vitro in a bioreactor to produce biologicals and secondary plants of economic value. The facilitation of plant breeding, the cloning of specific pathogen-free materials, the elimination of growing point or apex viruses, and the increase of plant yield are other O-g applications. The space environment may be advantageous in somatic embryogenesis, the culture of alkaloids, and the development of completely new crop plant germ plasm.

  5. Concepts, strategies and potentials using hypo-g and other features of the space environment for commercialization using higher plants

    NASA Technical Reports Server (NTRS)

    Krikorian, A. D.

    1985-01-01

    Opportunities for releasing, capturing, constructing and/or fixing the differential expressions or response potentials of the higher plant genome in the hypo-g environment for commercialization are explored. General strategies include improved plant-growing, crop and forestry production systems which conserve soil, water, labor and energy resources, and nutritional partitioning and mobilization of nutrients and synthates. Tissue and cell culture techniques of commercial potential include the growing and manipulation of cultured plant cells in vitro in a bioreactor to produce biologicals and secondary plants of economic value. The facilitation of plant breeding, the cloning of specific pathogen-free materials, the elimination of growing point or apex viruses, and the increase of plant yield are other O-g applications. The space environment may be advantageous in somatic embryogenesis, the culture of alkaloids, and the development of completely new crop plant germ plasm.

  6. Experiences with audio feedback in a veterinary curriculum.

    PubMed

    Rhind, Susan M; Pettigrew, Graham W; Spiller, Jo; Pearson, Geoff T

    2013-01-01

    On a national scale in the United Kingdom, student surveys have served to highlight areas within higher education that are not achieving high student satisfaction. Of particular concern to the veterinary and medical disciplines are the persistently poor levels of student satisfaction with academic feedback compared to students in other subjects. In this study we describe experiences with audio feedback trials in a veterinary curriculum. Students received audio feedback on either an in-course laboratory practical report or on an in-course multiple-choice test. Shortly after receiving their feedback, students were surveyed using an electronic questionnaire. In both courses, more students strongly agreed that audio feedback was helpful compared to either text-based (course A) or whole-class (course B) feedback. When asked to reflect on the helpfulness of various types of feedback they had received, audio feedback was rated less helpful than individual discussion with a member of staff (course A and course B), more helpful than peer discussion or automated feedback (course A and course B), and more helpful than written comments or whole-class review sessions (course B). From a faculty perspective, in course A, use of audio feedback was more efficient than handwritten feedback. In course B, the additional time commitment required was approximately 5 hours. Major themes in the qualitative data included the personal and individual nature of the feedback, quantity of feedback, improvement in students' insight into the process of marking, and the capacity of audio feedback to encourage and motivate.

  7. Galileo Galilei (gg) Space Test of the Weak Equivalence Principle to 10-17: the Key Features

    NASA Astrophysics Data System (ADS)

    Nobili, A. M.

    2014-01-01

    Galileo Galilei (GG) is a small satellite to fly in a low altitude, near circular, Sun-synchronous orbit around the Earth with the goal of testing in its field the weak equivalence principle to 10-17. It would improve the best tests with slowly rotating torsion balances by four orders of magnitude. The sensor is a differential accelerometer with two concentric coaxial test cylinders of different composition rotating together with the whole satellite around the symmetry axis, and weakly coupled in the plane perpendicular to it as a 2D mechanical oscillator. One axis rotation at 1 Hz, to be provided once and for all at launch, stabilizes the satellite and up-converts the signal from the low orbital frequency of ≃1.7 · 10-4 Hz to the much higher 1 Hz rotation frequency, needing neither motor nor bearings (passive rotation). For test masses suspended in orbit the major gain is a signal from Earth almost 500 times stronger than for the same masses suspended on ground. In GG both the sensor and the spacecraft are designed to fully exploit the properties of space in order to push the test four orders of magnitude better than slowly rotating torsion balances on ground.

  8. Digital Audio Radio Broadcast Systems Laboratory Testing Nearly Complete

    NASA Technical Reports Server (NTRS)

    2005-01-01

    Radio history continues to be made at the NASA Lewis Research Center with the completion of phase one of the digital audio radio (DAR) testing conducted by the Consumer Electronics Group of the Electronic Industries Association. This satellite, satellite/terrestrial, and terrestrial digital technology will open up new audio broadcasting opportunities both domestically and worldwide. It will significantly improve the current quality of amplitude-modulated/frequency-modulated (AM/FM) radio with a new digitally modulated radio signal and will introduce true compact-disc-quality (CD-quality) sound for the first time. Lewis is hosting the laboratory testing of seven proposed digital audio radio systems and modes. Two of the proposed systems operate in two modes each, making a total of nine systems being tested. The nine systems are divided into the following types of transmission: in-band on-channel (IBOC), in-band adjacent-channel (IBAC), and new bands. The laboratory testing was conducted by the Consumer Electronics Group of the Electronic Industries Association. Subjective assessments of the audio recordings for each of the nine systems was conducted by the Communications Research Center in Ottawa, Canada, under contract to the Electronic Industries Association. The Communications Research Center has the only CCIR-qualified (Consultative Committee for International Radio) audio testing facility in North America. The main goals of the U.S. testing process are to (1) provide technical data to the Federal Communication Commission (FCC) so that it can establish a standard for digital audio receivers and transmitters and (2) provide the receiver and transmitter industries with the proper standards upon which to build their equipment. In addition, the data will be forwarded to the International Telecommunications Union to help in the establishment of international standards for digital audio receivers and transmitters, thus allowing U.S. manufacturers to compete in the

  9. Multifunction audio digitizer for communications systems

    NASA Technical Reports Server (NTRS)

    Monford, L. G., Jr.

    1971-01-01

    Digitizer accomplishes both N bit pulse code modulation /PCM/ and delta modulation, and provides modulation indicating variable signal gain and variable sidetone. Other features include - low package count, variable clock rate to optimize bandwidth, and easily expanded PCM output.

  10. GK Per (Nova Persei 1901): HUBBLE SPACE TELESCOPE IMAGERY AND SPECTROSCOPY OF THE EJECTA, AND FIRST SPECTRUM OF THE JET-LIKE FEATURE

    SciTech Connect

    Shara, Michael M.; Zurek, David; Mizusawa, Trisha; De Marco, Orsola; Williams, Robert; Livio, Mario

    2012-06-15

    We have imaged the ejecta of GK Persei (Nova Persei 1901 A.D.) with the Hubble Space Telescope (HST), whose 0.1 arcsec resolution reveals hundreds of cometary-like structures with long axes aligned toward GK Per. One or both ends of the structures often show a brightness enhancement relative to the structures' middle sections, but there is no simple regularity to their morphologies (in contrast with, for example, the Helix nebula). Some of structures' morphologies suggest the presence of slow-moving or stationary material with which the ejecta is colliding, while others suggest shaping from a wind emanating from GK Per itself. The most detailed expansion map of any classical nova's ejecta was created by comparing HST images taken in successive years. Wide Field and Planetary Camera 2 narrowband images and Space Telescope Imaging Spectrograph spectra demonstrate that the physical conditions in this nova's ejecta vary strongly on spatial scales much smaller than those of the ejecta. Directly measuring accurate densities and compositions, and hence masses of this and other nova shells, will demand data at least as resolved spatially as those presented here. The filling factor of the ejecta is 1% or less, and the nova ejecta mass must be less than 10{sup -4} M{sub Sun }. A modest fraction of the emission nebulosities vary in brightness by up to a factor of two on timescales of one year. Finally, we present the deepest images yet obtained of a jet-like feature outside the main body of GK Per nebulosity, and the first spectrum of that feature. Dominated by strong, narrow emission lines of [N II], [O II], [O III], and [S II], this feature is probably a shock due to ejected material running into stationary interstellar matter, slowly moving ejecta from a previous nova episode, or circumbinary matter present before 1901. An upper limit to the mass of the jet is of order a few times 10{sup -6} M{sub Sun }. If the jet mass is close to this limit then the jet might be an

  11. #NASATweetup @NASA_Langley Audio Slideshow

    NASA Image and Video Library

    NASA Langley Research Center's first tweet-up involved a diverse group of more than 40 that included an astronaut's daughter, a physics student from Wisconsin, one of NASA's newest space camp crew ...

  12. Why 8-Year-Olds Cannot Tell the Difference between Steve Martin and Paul Newman: Factors Contributing to the Slow Development of Sensitivity to the Spacing of Facial Features

    ERIC Educational Resources Information Center

    Mondloch, Catherine J.; Dobson, Kate S.; Parsons, Julie; Maurer, Daphne

    2004-01-01

    Children are nearly as sensitive as adults to some cues to facial identity (e.g., differences in the shape of internal features and the external contour), but children are much less sensitive to small differences in the spacing of facial features. To identify factors that contribute to this pattern, we compared 8-year-olds' sensitivity to spacing…

  13. Talker variability in audio-visual speech perception.

    PubMed

    Heald, Shannon L M; Nusbaum, Howard C

    2014-01-01

    A change in talker is a change in the context for the phonetic interpretation of acoustic patterns of speech. Different talkers have different mappings between acoustic patterns and phonetic categories and listeners need to adapt to these differences. Despite this complexity, listeners are adept at comprehending speech in multiple-talker contexts, albeit at a slight but measurable performance cost (e.g., slower recognition). So far, this talker variability cost has been demonstrated only in audio-only speech. Other research in single-talker contexts have shown, however, that when listeners are able to see a talker's face, speech recognition is improved under adverse listening (e.g., noise or distortion) conditions that can increase uncertainty in the mapping between acoustic patterns and phonetic categories. Does seeing a talker's face reduce the cost of word recognition in multiple-talker contexts? We used a speeded word-monitoring task in which listeners make quick judgments about target word recognition in single- and multiple-talker contexts. Results show faster recognition performance in single-talker conditions compared to multiple-talker conditions for both audio-only and audio-visual speech. However, recognition time in a multiple-talker context was slower in the audio-visual condition compared to audio-only condition. These results suggest that seeing a talker's face during speech perception may slow recognition by increasing the importance of talker identification, signaling to the listener a change in talker has occurred.

  14. Talker variability in audio-visual speech perception

    PubMed Central

    Heald, Shannon L. M.; Nusbaum, Howard C.

    2014-01-01

    A change in talker is a change in the context for the phonetic interpretation of acoustic patterns of speech. Different talkers have different mappings between acoustic patterns and phonetic categories and listeners need to adapt to these differences. Despite this complexity, listeners are adept at comprehending speech in multiple-talker contexts, albeit at a slight but measurable performance cost (e.g., slower recognition). So far, this talker variability cost has been demonstrated only in audio-only speech. Other research in single-talker contexts have shown, however, that when listeners are able to see a talker’s face, speech recognition is improved under adverse listening (e.g., noise or distortion) conditions that can increase uncertainty in the mapping between acoustic patterns and phonetic categories. Does seeing a talker’s face reduce the cost of word recognition in multiple-talker contexts? We used a speeded word-monitoring task in which listeners make quick judgments about target word recognition in single- and multiple-talker contexts. Results show faster recognition performance in single-talker conditions compared to multiple-talker conditions for both audio-only and audio-visual speech. However, recognition time in a multiple-talker context was slower in the audio-visual condition compared to audio-only condition. These results suggest that seeing a talker’s face during speech perception may slow recognition by increasing the importance of talker identification, signaling to the listener a change in talker has occurred. PMID:25076919

  15. Audio-visual integration through the parallel visual pathways.

    PubMed

    Kaposvári, Péter; Csete, Gergő; Bognár, Anna; Csibri, Péter; Tóth, Eszter; Szabó, Nikoletta; Vécsei, László; Sáry, Gyula; Kincses, Zsigmond Tamás

    2015-10-22

    Audio-visual integration has been shown to be present in a wide range of different conditions, some of which are processed through the dorsal, and others through the ventral visual pathway. Whereas neuroimaging studies have revealed integration-related activity in the brain, there has been no imaging study of the possible role of segregated visual streams in audio-visual integration. We set out to determine how the different visual pathways participate in this communication. We investigated how audio-visual integration can be supported through the dorsal and ventral visual pathways during the double flash illusion. Low-contrast and chromatic isoluminant stimuli were used to drive preferably the dorsal and ventral pathways, respectively. In order to identify the anatomical substrates of the audio-visual interaction in the two conditions, the psychophysical results were correlated with the white matter integrity as measured by diffusion tensor imaging.The psychophysiological data revealed a robust double flash illusion in both conditions. A correlation between the psychophysical results and local fractional anisotropy was found in the occipito-parietal white matter in the low-contrast condition, while a similar correlation was found in the infero-temporal white matter in the chromatic isoluminant condition. Our results indicate that both of the parallel visual pathways may play a role in the audio-visual interaction.

  16. Audio Visual Integration with Competing Sources in the Framework of Audio Visual Speech Scene Analysis.

    PubMed

    Ganesh, Attigodu Chandrashekara; Berthommier, Frédéric; Schwartz, Jean-Luc

    2016-01-01

    We introduce "Audio-Visual Speech Scene Analysis" (AVSSA) as an extension of the two-stage Auditory Scene Analysis model towards audiovisual scenes made of mixtures of speakers. AVSSA assumes that a coherence index between the auditory and the visual input is computed prior to audiovisual fusion, enabling to determine whether the sensory inputs should be bound together. Previous experiments on the modulation of the McGurk effect by audiovisual coherent vs. incoherent contexts presented before the McGurk target have provided experimental evidence supporting AVSSA. Indeed, incoherent contexts appear to decrease the McGurk effect, suggesting that they produce lower audiovisual coherence hence less audiovisual fusion. The present experiments extend the AVSSA paradigm by creating contexts made of competing audiovisual sources and measuring their effect on McGurk targets. The competing audiovisual sources have respectively a high and a low audiovisual coherence (that is, large vs. small audiovisual comodulations in time). The first experiment involves contexts made of two auditory sources and one video source associated to either the first or the second audio source. It appears that the McGurk effect is smaller after the context made of the visual source associated to the auditory source with less audiovisual coherence. In the second experiment with the same stimuli, the participants are asked to attend to either one or the other source. The data show that the modulation of fusion depends on the attentional focus. Altogether, these two experiments shed light on audiovisual binding, the AVSSA process and the role of attention.

  17. FIRST ULTRAVIOLET REFLECTANCE SPECTRA OF PLUTO AND CHARON BY THE HUBBLE SPACE TELESCOPE COSMIC ORIGINS SPECTROGRAPH: DETECTION OF ABSORPTION FEATURES AND EVIDENCE FOR TEMPORAL CHANGE

    SciTech Connect

    Stern, S. A.; Spencer, J. R.; Shinn, A.; Cunningham, N. J.; Hain, M. J.

    2012-01-15

    We have observed the mid-UV spectra of both Pluto and its large satellite, Charon, at two rotational epochs using the Hubble Space Telescope (HST) Cosmic Origins Spectrograph (COS) in 2010. These are the first HST/COS measurements of Pluto and Charon. Here we describe the observations and our reduction of them, and present the albedo spectra, average mid-UV albedos, and albedo slopes we derive from these data. These data reveal evidence for a strong absorption feature in the mid-UV spectrum of Pluto; evidence for temporal change in Pluto's spectrum since the 1990s is reported, and indirect evidence for a near-UV spectral absorption on Charon is also reported.

  18. Cosmic Ray Monitoring and Space Dangerous Phenomena, 1. Search of Features In Cosmic Rays What Can Be Used For Forecasting of Major Geomagnetic Storms

    NASA Astrophysics Data System (ADS)

    Dorman, L. I.; Pustil'Nik, L. A.; Sternlieb, A.; Zukerman, I. G.

    According to NOAA Space Weather Scales, geomagnetic storms of scales G5 (3- hour index of geomagnetic activity Kp=9), G4 (Kp=8) and G3 (Kp=7) are dangerous for people technology and health (influence on power systems, on spacecraft oper- ations, on HF radio-communications and others). To prevent these serious damages will be very important to forecast dangerous geomagnetic storms. In many papers it was shown that in principle for this forecasting can be used data on CR intensity and CR anisotropy changing before SC of major geomagnetic storms accompanied by sufficient Forbush-decreases (e.g., Dorman et al., 1995, 1999). In this paper we con- sider over 100 major geomagnetic storms and for each case we analyze hourly data of many NM for 8 days with SC in the 4-st day of 8-days period (that before SC we have at least 3 full days). We determine what part of major geomagnetic storms is accompanied CR intensity and CR anisotropy changing before SC, and what part of major geomagnetic storms does not show any features what can be used for forecast- ing. We estimate also how these parts depend from the index of geomagnetic activ- ity Kp. REFERENCES: Dorman L.I., et al. "Cosmic-ray forecasting features for big Forbush-decreases". Nuclear Physics B, Vol. 49A, pp. 136-144. (1995). L.I.Dorman, et al, "Cosmic ray Forbush-decrease as indicators of space dangerous phenomenon and possible use of cosmic ray data for their prediction", Proc. of 26-th Intern. Cos- mic Ray Conference, Salt Lake City, Vol. 6, p. 476-479, (1999).

  19. Joint application of audio spectral envelope and tonality index in an e-asthma monitoring system.

    PubMed

    Wiśniewski, Marcin; Zieliński, Tomasz P

    2015-05-01

    This paper presents in detail a recently introduced highly efficient method for automatic detection of asthmatic wheezing in breathing sounds. The fluctuation in the audio spectral envelope (ASE) from the MPEG-7 standard and the value of the tonality index (TI) from the MPEG-2 Audio specification are jointly used as discriminative features for wheezy sounds, while the support vector machine (SVM) with a polynomial kernel serves as a classifier. The advantages of the proposed approach are described in the paper (e.g., detecting weak wheezes, very good ROC characteristics, independence from noise color). Since the method is not computationally complex, it is suitable for remote asthma monitoring using mobile devices (personal medical assistants). The main contribution of this paper consists of presenting all the implementation details concerning the proposed approach for the first time, i.e., the pseudocode of the method and adjusting the values of the ASE and TI parameters after which only one (not two) FFT is required for analysis of a next overlapping signal fragment. The efficiency of the method has also been additionally confirmed by the AdaBoost classifier with a built-in mechanism to feature ranking, as well as a previously performed minimal-redundancy-maximal-relevance test.

  20. Audio Watermarking Based on HAS and Neural Networks in DCT Domain

    NASA Astrophysics Data System (ADS)

    Tsai, Hung-Hsu; Cheng, Ji-Shiung; Yu, Pao-Ta

    2003-12-01

    We propose a new intelligent audio watermarking method based on the characteristics of the HAS and the techniques of neural networks in the DCT domain. The method makes the watermark imperceptible by using the audio masking characteristics of the HAS. Moreover, the method exploits a neural network for memorizing the relationships between the original audio signals and the watermarked audio signals. Therefore, the method is capable of extracting watermarks without original audio signals. Finally, the experimental results are also included to illustrate that the method significantly possesses robustness to be immune against common attacks for the copyright protection of digital audio.

  1. Say What? The Role of Audio in Multimedia Video

    NASA Astrophysics Data System (ADS)

    Linder, C. A.; Holmes, R. M.

    2011-12-01

    Audio, including interviews, ambient sounds, and music, is a critical-yet often overlooked-part of an effective multimedia video. In February 2010, Linder joined scientists working on the Global Rivers Observatory Project for two weeks of intensive fieldwork in the Congo River watershed. The team's goal was to learn more about how climate change and deforestation are impacting the river system and coastal ocean. Using stills and video shot with a lightweight digital SLR outfit and audio recorded with a pocket-sized sound recorder, Linder documented the trials and triumphs of working in the heart of Africa. Using excerpts from the six-minute Congo multimedia video, this presentation will illustrate how to record and edit an engaging audio track. Topics include interview technique, collecting ambient sounds, choosing and using music, and editing it all together to educate and entertain the viewer.

  2. Designing optical disk systems into audio/video products

    NASA Astrophysics Data System (ADS)

    Yang, Jiandong

    2008-12-01

    Optical disk systems are still widely used in players in consumer electronics and automotive applications, although more and more audio and video contents are played from other medias such as flash memory and hard disk drive based devices. There are various architectures with the integrations of audio and video (A/V) decoders and optical disk servo components to reduced product BOM cost. Some issues are addressed for designing optical disk playing modules into an audio or video products. Servo implementation including tracking, seeking and rotating control needs to consider the characteristics of low cost mechanisms and non-ideal disks. When optical disk systems are used in portable or moving environments, the approaches from servo control side and electronic can be helpful for anti-shock. Special handlings to defect disks are important to playability.

  3. Music Identification System Using MPEG-7 Audio Signature Descriptors

    PubMed Central

    You, Shingchern D.; Chen, Wei-Hwa; Chen, Woei-Kae

    2013-01-01

    This paper describes a multiresolution system based on MPEG-7 audio signature descriptors for music identification. Such an identification system may be used to detect illegally copied music circulated over the Internet. In the proposed system, low-resolution descriptors are used to search likely candidates, and then full-resolution descriptors are used to identify the unknown (query) audio. With this arrangement, the proposed system achieves both high speed and high accuracy. To deal with the problem that a piece of query audio may not be inside the system's database, we suggest two different methods to find the decision threshold. Simulation results show that the proposed method II can achieve an accuracy of 99.4% for query inputs both inside and outside the database. Overall, it is highly possible to use the proposed system for copyright control. PMID:23533359

  4. Multi-channel spatialization system for audio signals

    NASA Technical Reports Server (NTRS)

    Begault, Durand R. (Inventor)

    1995-01-01

    Synthetic head related transfer functions (HRTF's) for imposing reprogramable spatial cues to a plurality of audio input signals included, for example, in multiple narrow-band audio communications signals received simultaneously are generated and stored in interchangeable programmable read only memories (PROM's) which store both head related transfer function impulse response data and source positional information for a plurality of desired virtual source locations. The analog inputs of the audio signals are filtered and converted to digital signals from which synthetic head related transfer functions are generated in the form of linear phase finite impulse response filters. The outputs of the impulse response filters are subsequently reconverted to analog signals, filtered, mixed and fed to a pair of headphones.

  5. Multi-channel spatialization systems for audio signals

    NASA Technical Reports Server (NTRS)

    Begault, Durand R. (Inventor)

    1993-01-01

    Synthetic head related transfer functions (HRTF's) for imposing reprogrammable spatial cues to a plurality of audio input signals included, for example, in multiple narrow-band audio communications signals received simultaneously are generated and stored in interchangeable programmable read only memories (PROM's) which store both head related transfer function impulse response data and source positional information for a plurality of desired virtual source locations. The analog inputs of the audio signals are filtered and converted to digital signals from which synthetic head related transfer functions are generated in the form of linear phase finite impulse response filters. The outputs of the impulse response filters are subsequently reconverted to analog signals, filtered, mixed, and fed to a pair of headphones.

  6. Highlight summarization in golf videos using audio signals

    NASA Astrophysics Data System (ADS)

    Kim, Hyoung-Gook; Kim, Jin Young

    2008-01-01

    In this paper, we present an automatic summarization of highlights in golf videos based on audio information alone without video information. The proposed highlight summarization system is carried out based on semantic audio segmentation and detection on action units from audio signals. Studio speech, field speech, music, and applause are segmented by means of sound classification. Swing is detected by the methods of impulse onset detection. Sounds like swing and applause form a complete action unit, while studio speech and music parts are used to anchor the program structure. With the advantage of highly precise detection of applause, highlights are extracted effectively. Our experimental results obtain high classification precision on 18 golf games. It proves that the proposed system is very effective and computationally efficient to apply the technology to embedded consumer electronic devices.

  7. The Application of Acoustic Measurements and Audio Recordings for Diagnosis of In-Flight Hardware Anomalies

    NASA Technical Reports Server (NTRS)

    Welsh, David; Denham, Samuel; Allen, Christopher

    2011-01-01

    In many cases, an initial symptom of hardware malfunction is unusual or unexpected acoustic noise. Many industries such as automotive, heating and air conditioning, and petro-chemical processing use noise and vibration data along with rotating machinery analysis techniques to identify noise sources and correct hardware defects. The NASA/Johnson Space Center Acoustics Office monitors the acoustic environment of the International Space Station (ISS) through periodic sound level measurement surveys. Trending of the sound level measurement survey results can identify in-flight hardware anomalies. The crew of the ISS also serves as a "detection tool" in identifying unusual hardware noises; in these cases the spectral analysis of audio recordings made on orbit can be used to identify hardware defects that are related to rotating components such as fans, pumps, and compressors. In this paper, three examples of the use of sound level measurements and audio recordings for the diagnosis of in-flight hardware anomalies are discussed: identification of blocked inter-module ventilation (IMV) ducts, diagnosis of abnormal ISS Crew Quarters rack exhaust fan noise, and the identification and replacement of a defective flywheel assembly in the Treadmill with Vibration Isolation (TVIS) hardware. In each of these examples, crew time was saved by identifying the off nominal component or condition that existed and in directing in-flight maintenance activities to address and correct each of these problems.

  8. KENNEDY SPACE CENTER, FLA. - The Rocket Garden at the KSC Visitor Complex features eight authentic rockets from the past, including a Mercury-Atlas rocket. The garden also features a climb-in Mercury, Gemino and Apollo capsule replicas, seating pods and informative graphic elements.

    NASA Image and Video Library

    2003-07-22

    KENNEDY SPACE CENTER, FLA. - The Rocket Garden at the KSC Visitor Complex features eight authentic rockets from the past, including a Mercury-Atlas rocket. The garden also features a climb-in Mercury, Gemino and Apollo capsule replicas, seating pods and informative graphic elements.

  9. Online feature selection with streaming features.

    PubMed

    Wu, Xindong; Yu, Kui; Ding, Wei; Wang, Hao; Zhu, Xingquan

    2013-05-01

    We propose a new online feature selection framework for applications with streaming features where the knowledge of the full feature space is unknown in advance. We define streaming features as features that flow in one by one over time whereas the number of training examples remains fixed. This is in contrast with traditional online learning methods that only deal with sequentially added observations, with little attention being paid to streaming features. The critical challenges for Online Streaming Feature Selection (OSFS) include 1) the continuous growth of feature volumes over time, 2) a large feature space, possibly of unknown or infinite size, and 3) the unavailability of the entire feature set before learning starts. In the paper, we present a novel Online Streaming Feature Selection method to select strongly relevant and nonredundant features on the fly. An efficient Fast-OSFS algorithm is proposed to improve feature selection performance. The proposed algorithms are evaluated extensively on high-dimensional datasets and also with a real-world case study on impact crater detection. Experimental results demonstrate that the algorithms achieve better compactness and higher prediction accuracy than existing streaming feature selection algorithms.

  10. Three dimensional audio versus head down TCAS displays

    NASA Technical Reports Server (NTRS)

    Begault, Durand R.; Pittman, Marc T.

    1994-01-01

    The advantage of a head up auditory display was evaluated in an experiment designed to measure and compare the acquisition time for capturing visual targets under two conditions: Standard head down traffic collision avoidance system (TCAS) display, and three-dimensional (3-D) audio TCAS presentation. Ten commercial airline crews were tested under full mission simulation conditions at the NASA Ames Crew-Vehicle Systems Research Facility Advanced Concepts Flight Simulator. Scenario software generated targets corresponding to aircraft which activated a 3-D aural advisory or a TCAS advisory. Results showed a significant difference in target acquisition time between the two conditions, favoring the 3-D audio TCAS condition by 500 ms.

  11. Cover signal specific steganalysis: the impact of training on the example of two selected audio steganalysis approaches

    NASA Astrophysics Data System (ADS)

    Kraetzer, Christian; Dittmann, Jana

    2008-02-01

    The main goals of this paper are to show the impact of the basic assumptions for the cover channel characteristics as well as the impact of different training/testing set generation strategies on the statistical detectability of exemplary chosen audio hiding approaches known from steganography and watermarking. Here we have selected exemplary five steganography algorithms and four watermarking algorithms. The channel characteristics for two different chosen audio cover channels (an application specific exemplary scenario of VoIP steganography and universal audio steganography) are formalised and their impact on decisions in the steganalysis process, especially on the strategies applied for training/ testing set generation, are shown. Following the assumptions on the cover channel characteristics either cover dependent or cover independent training and testing can be performed, using either correlated or non-correlated training and test sets. In comparison to previous work, additional frequency domain features are introduced for steganalysis and the performance (in terms of classification accuracy) of Bayesian classifiers and multinomial logistic regression models is compared with the results of SVM classification. We show that the newly implemented frequency domain features increase the classification accuracy achieved in SVM classification. Furthermore it is shown on the example of VoIP steganalysis that channel character specific evaluation performs better than tests without focus on a specific channel (i.e. universal steganalysis). A comparison of test results for cover dependent and independent training and testing shows that the latter performs better for all nine algorithms evaluated here and the used SVM based classifier.

  12. Fiber-channel audio video standard for military and commercial aircraft product lines

    NASA Astrophysics Data System (ADS)

    Keller, Jack E.

    2002-08-01

    Fibre channel is an emerging high-speed digital network technology that combines to make inroads into the avionics arena. The suitability of fibre channel for such applications is largely due to its flexibility in these key areas: Network topologies can be configured in point-to-point, arbitrated loop or switched fabric connections. The physical layer supports either copper or fiber optic implementations with a Bit Error Rate of less than 10-12. Multiple Classes of Service are available. Multiple Upper Level Protocols are supported. Multiple high speed data rates offer open ended growth paths providing speed negotiation within a single network. Current speeds supported by commercially available hardware are 1 and 2 Gbps providing effective data rates of 100 and 200 MBps respectively. Such networks lend themselves well to the transport of digital video and audio data. This paper summarizes an ANSI standard currently in the final approval cycle of the InterNational Committee for Information Technology Standardization (INCITS). This standard defines a flexible mechanism whereby digital video, audio and ancillary data are systematically packaged for transport over a fibre channel network. The basic mechanism, called a container, houses audio and video content functionally grouped as elements of the container called objects. Featured in this paper is a specific container mapping called Simple Parametric Digital Video (SPDV) developed particularly to address digital video in avionics systems. SPDV provides pixel-based video with associated ancillary data typically sourced by various sensors to be processed and/or distributed in the cockpit for presentation via high-resolution displays. Also highlighted in this paper is a streamlined Upper Level Protocol (ULP) called Frame Header Control Procedure (FHCP) targeted for avionics systems where the functionality of a more complex ULP is not required.

  13. Multi-modal gesture recognition using integrated model of motion, audio and video

    NASA Astrophysics Data System (ADS)

    Goutsu, Yusuke; Kobayashi, Takaki; Obara, Junya; Kusajima, Ikuo; Takeichi, Kazunari; Takano, Wataru; Nakamura, Yoshihiko

    2015-07-01

    Gesture recognition is used in many practical applications such as human-robot interaction, medical rehabilitation and sign language. With increasing motion sensor development, multiple data sources have become available, which leads to the rise of multi-modal gesture recognition. Since our previous approach to gesture recognition depends on a unimodal system, it is difficult to classify similar motion patterns. In order to solve this problem, a novel approach which integrates motion, audio and video models is proposed by using dataset captured by Kinect. The proposed system can recognize observed gestures by using three models. Recognition results of three models are integrated by using the proposed framework and the output becomes the final result. The motion and audio models are learned by using Hidden Markov Model. Random Forest which is the video classifier is used to learn the video model. In the experiments to test the performances of the proposed system, the motion and audio models most suitable for gesture recognition are chosen by varying feature vectors and learning methods. Additionally, the unimodal and multi-modal models are compared with respect to recognition accuracy. All the experiments are conducted on dataset provided by the competition organizer of MMGRC, which is a workshop for Multi-Modal Gesture Recognition Challenge. The comparison results show that the multi-modal model composed of three models scores the highest recognition rate. This improvement of recognition accuracy means that the complementary relationship among three models improves the accuracy of gesture recognition. The proposed system provides the application technology to understand human actions of daily life more precisely.

  14. Speech Music Discrimination Using Class-Specific Features

    DTIC Science & Technology

    2004-08-01

    Speech Music Discrimination Using Class-Specific Features Thomas Beierholm...between speech and music . Feature extraction is class-specific and can therefore be tailored to each class meaning that segment size, model orders...interest. Some of the applications of audio signal classification are speech/ music classification [1], acoustical environmental classification [2][3

  15. The Air Operations Simulation Centre Audio System.

    DTIC Science & Technology

    1998-04-01

    The Enable Retrigger feature enables the user to specify whether the selected sound is to be treated as a one-shot edge- triggered sound which plays...to completion once triggered , or a level-sensitive sound which restarts and plays only while its control variable is non-zero. See Appendix B...mock-up are well advanced. Aircraft noises are modelled and include engine turbine whine, afterburner roar, air- conditioning and wind noises

  16. Audio Signal Processing Using Time-Frequency Approaches: Coding, Classification, Fingerprinting, and Watermarking

    NASA Astrophysics Data System (ADS)

    Umapathy, K.; Ghoraani, B.; Krishnan, S.

    2010-12-01

    Audio signals are information rich nonstationary signals that play an important role in our day-to-day communication, perception of environment, and entertainment. Due to its non-stationary nature, time- or frequency-only approaches are inadequate in analyzing these signals. A joint time-frequency (TF) approach would be a better choice to efficiently process these signals. In this digital era, compression, intelligent indexing for content-based retrieval, classification, and protection of digital audio content are few of the areas that encapsulate a majority of the audio signal processing applications. In this paper, we present a comprehensive array of TF methodologies that successfully address applications in all of the above mentioned areas. A TF-based audio coding scheme with novel psychoacoustics model, music classification, audio classification of environmental sounds, audio fingerprinting, and audio watermarking will be presented to demonstrate the advantages of using time-frequency approaches in analyzing and extracting information from audio signals.

  17. The Role of Audio Media in the Lives of Children.

    ERIC Educational Resources Information Center

    Christenson, Peter G.; Lindlof, Thomas R.

    Mass communication researchers have largely ignored the role of audio media and popular music in the lives of children, yet the available evidence shows that children do listen. Extant studies yield a consistent developmental portrait of childrens' listening frequency, but there is a notable lack of programatic research over the past decade, one…

  18. Improved Techniques for Automatic Chord Recognition from Music Audio Signals

    ERIC Educational Resources Information Center

    Cho, Taemin

    2014-01-01

    This thesis is concerned with the development of techniques that facilitate the effective implementation of capable automatic chord transcription from music audio signals. Since chord transcriptions can capture many important aspects of music, they are useful for a wide variety of music applications and also useful for people who learn and perform…

  19. The Use of Audio in Computer-Based Instruction.

    ERIC Educational Resources Information Center

    Koroghlanian, Carol M.; Sullivan, Howard J.

    This study investigated the effects of audio and text density on the achievement, time-in-program, and attitudes of 134 undergraduates. Data concerning the subjects' preexisting computer skills and experience, as well as demographic information, were also collected. The instruction in visual design principles was delivered by computer and included…

  20. A Decade of Experimentation with the Audio-Tutorial System.

    ERIC Educational Resources Information Center

    Mott, Charles J.

    The audio-tutorial systems in earth science was developed at St. Petersburg Junior College in response to an unacceptable but not atypical student failure rate at the community college level. This description of the program includes: (1) history of program development, (2) use of student tutors, (3) experimentation with a student paced, mastery…

  1. Audio-Visual Aid in Teaching "Fatty Liver"

    ERIC Educational Resources Information Center

    Dash, Sambit; Kamath, Ullas; Rao, Guruprasad; Prakash, Jay; Mishra, Snigdha

    2016-01-01

    Use of audio visual tools to aid in medical education is ever on a rise. Our study intends to find the efficacy of a video prepared on "fatty liver," a topic that is often a challenge for pre-clinical teachers, in enhancing cognitive processing and ultimately learning. We prepared a video presentation of 11:36 min, incorporating various…

  2. Dual Audio TV Instruction: A Mass Broadcast Simulation.

    ERIC Educational Resources Information Center

    Borton, Terry; And Others

    Intended as a method of increasing children's learning from commercial television by providing supplementary educational commentary on a simultaneous radio broadcast, this study was designed to explore the potential of dual audio instruction as a mass educational system by determining the medium's practicality, the rate of voluntary usage, the…

  3. Design and Development of a Computerized Audio-Video Laboratory.

    ERIC Educational Resources Information Center

    Behnke, Ralph R.; Derry, James O.

    1984-01-01

    Describes the components, functions, and applications of a computer-based audio/video learning laboratory at Texas Christian University that permits on-line communication between student learners and instructional staff. The laboratory gives instructors feedback from student learners during and after instruction and enables students to evaluate…

  4. The Audio-Visual Marketing Handbook for Independent Schools.

    ERIC Educational Resources Information Center

    Griffith, Tom

    This how-to booklet offers specific advice on producing video or slide/tape programs for marketing independent schools. Five chapters present guidelines for various stages in the process: (1) Audio-Visual Marketing in Context (aesthetics and economics of audiovisual marketing); (2) A Question of Identity (identifying the audience and deciding on…

  5. Sounds in CD-ROM--Integrating Audio in Multimedia Products.

    ERIC Educational Resources Information Center

    Rosebush, Judson

    1992-01-01

    Describes how audio technology is being integrated into CD-ROMs to create multimedia products. Computer hardware and software are discussed, including the use of HyperCard to combine still pictures, moving video pictures, and sound; and specific new multimedia products produced by the Voyager Company are described. (LRW)

  6. Digital Audio Broadcasting in the Short Wave Bands

    NASA Technical Reports Server (NTRS)

    Vaisnys, Arvydas

    1998-01-01

    For many decades the Short Wae broadcasting service has used high power, double-sideband AM signals to reach audiences far and wide. While audio quality was usually not very high, inexpensive receivers could be used to tune into broadcasts fro distant countries.

  7. Use of NdBFe magnets in audio deck sliders

    NASA Astrophysics Data System (ADS)

    Simonen, S. A. E.; Basak, A.

    1992-07-01

    Rotary motors and strings are usually employed for the movement of sliders on audio decks in studios to control the volume of sound in various channels. These sliding mechanisms can be made more reliable and compact by using dc stepping linear motors built with high energy product NdBFe magnets.

  8. Audio-visual enhancement of speech in noise.

    PubMed

    Girin, L; Schwartz, J L; Feng, G

    2001-06-01

    A key problem for telecommunication or human-machine communication systems concerns speech enhancement in noise. In this domain, a certain number of techniques exist, all of them based on an acoustic-only approach--that is, the processing of the audio corrupted signal using audio information (from the corrupted signal only or additive audio information). In this paper, an audio-visual approach to the problem is considered, since it has been demonstrated in several studies that viewing the speaker's face improves message intelligibility, especially in noisy environments. A speech enhancement prototype system that takes advantage of visual inputs is developed. A filtering process approach is proposed that uses enhancement filters estimated with the help of lip shape information. The estimation process is based on linear regression or simple neural networks using a training corpus. A set of experiments assessed by Gaussian classification and perceptual tests demonstrates that it is indeed possible to enhance simple stimuli (vowel-plosive-vowel sequences) embedded in white Gaussian noise.

  9. Reading's SLiCK with New Audio Texts and Strategies.

    ERIC Educational Resources Information Center

    Boyle, Elizabeth A.; Washburn, Shari Gallin; Rosenberg, Michael S.; Connelly, Vincent J.; Brinckerhoff, Loring C.; Banerjee, Manju

    2002-01-01

    This article discusses challenges for secondary students with disabilities and alternative instructional methods that teachers of students with poor reading skills can use to convey content information effectively and efficiently. The use of audio textbooks on CD-ROMs is emphasized and the SLiCK strategy is explained as a support for the CD-ROM.…

  10. Infant Perception of Audio-Visual Speech Synchrony

    ERIC Educational Resources Information Center

    Lewkowicz, David J.

    2010-01-01

    Three experiments investigated perception of audio-visual (A-V) speech synchrony in 4- to 10-month-old infants. Experiments 1 and 2 used a convergent-operations approach by habituating infants to an audiovisually synchronous syllable (Experiment 1) and then testing for detection of increasing degrees of A-V asynchrony (366, 500, and 666 ms) or by…

  11. The 7 Habits of Highly Effective Families Audio System. [Audiotapes].

    ERIC Educational Resources Information Center

    Covey, Stephen R.

    Intended to help families build rewarding relationships, this set of audio tapes communicates the importance of family, shows the role of leadership in creating "beautiful family culture," and provides a frame of reference in which to solve problems. The first of the four tapes explores the concept of principles as the foundation of…

  12. The Timbre Toolbox: extracting audio descriptors from musical signals.

    PubMed

    Peeters, Geoffroy; Giordano, Bruno L; Susini, Patrick; Misdariis, Nicolas; McAdams, Stephen

    2011-11-01

    The analysis of musical signals to extract audio descriptors that can potentially characterize their timbre has been disparate and often too focused on a particular small set of sounds. The Timbre Toolbox provides a comprehensive set of descriptors that can be useful in perceptual research, as well as in music information retrieval and machine-learning approaches to content-based retrieval in large sound databases. Sound events are first analyzed in terms of various input representations (short-term Fourier transform, harmonic sinusoidal components, an auditory model based on the equivalent rectangular bandwidth concept, the energy envelope). A large number of audio descriptors are then derived from each of these representations to capture temporal, spectral, spectrotemporal, and energetic properties of the sound events. Some descriptors are global, providing a single value for the whole sound event, whereas others are time-varying. Robust descriptive statistics are used to characterize the time-varying descriptors. To examine the information redundancy across audio descriptors, correlational analysis followed by hierarchical clustering is performed. This analysis suggests ten classes of relatively independent audio descriptors, showing that the Timbre Toolbox is a multidimensional instrument for the measurement of the acoustical structure of complex sound signals.

  13. Audio-Visual Aid in Teaching "Fatty Liver"

    ERIC Educational Resources Information Center

    Dash, Sambit; Kamath, Ullas; Rao, Guruprasad; Prakash, Jay; Mishra, Snigdha

    2016-01-01

    Use of audio visual tools to aid in medical education is ever on a rise. Our study intends to find the efficacy of a video prepared on "fatty liver," a topic that is often a challenge for pre-clinical teachers, in enhancing cognitive processing and ultimately learning. We prepared a video presentation of 11:36 min, incorporating various…

  14. Audio-visual perception system for a humanoid robotic head.

    PubMed

    Viciana-Abad, Raquel; Marfil, Rebeca; Perez-Lorenzo, Jose M; Bandera, Juan P; Romero-Garces, Adrian; Reche-Lopez, Pedro

    2014-05-28

    One of the main issues within the field of social robotics is to endow robots with the ability to direct attention to people with whom they are interacting. Different approaches follow bio-inspired mechanisms, merging audio and visual cues to localize a person using multiple sensors. However, most of these fusion mechanisms have been used in fixed systems, such as those used in video-conference rooms, and thus, they may incur difficulties when constrained to the sensors with which a robot can be equipped. Besides, within the scope of interactive autonomous robots, there is a lack in terms of evaluating the benefits of audio-visual attention mechanisms, compared to only audio or visual approaches, in real scenarios. Most of the tests conducted have been within controlled environments, at short distances and/or with off-line performance measurements. With the goal of demonstrating the benefit of fusing sensory information with a Bayes inference for interactive robotics, this paper presents a system for localizing a person by processing visual and audio data. Moreover, the performance of this system is evaluated and compared via considering the technical limitations of unimodal systems. The experiments show the promise of the proposed approach for the proactive detection and tracking of speakers in a human-robot interactive framework.

  15. Sounds Good: Using Digital Audio for Evaluation Feedback

    ERIC Educational Resources Information Center

    Rotheram, Bob

    2009-01-01

    Feedback on student work is problematic for faculty and students in British higher education. Evaluation feedback takes faculty much time to produce and students are often dissatisfied with its quantity, timing, and clarity. The Sounds Good project has been experimenting with the use of digital audio for feedback, aiming to save faculty time and…

  16. The Audio-Visual Marketing Handbook for Independent Schools.

    ERIC Educational Resources Information Center

    Griffith, Tom

    This how-to booklet offers specific advice on producing video or slide/tape programs for marketing independent schools. Five chapters present guidelines for various stages in the process: (1) Audio-Visual Marketing in Context (aesthetics and economics of audiovisual marketing); (2) A Question of Identity (identifying the audience and deciding on…

  17. Developing a Framework for Effective Audio Feedback: A Case Study

    ERIC Educational Resources Information Center

    Hennessy, Claire; Forrester, Gillian

    2014-01-01

    The increase in the use of technology-enhanced learning in higher education has included a growing interest in new approaches to enhance the quality of feedback given to students. Audio feedback is one method that has become more popular, yet evaluating its role in feedback delivery is still an emerging area for research. This paper is based on a…

  18. Audio-Described Educational Materials: Ugandan Teachers' Experiences

    ERIC Educational Resources Information Center

    Wormnaes, Siri; Sellaeg, Nina

    2013-01-01

    This article describes and discusses a qualitative, descriptive, and exploratory study of how 12 visually impaired teachers in Uganda experienced audio-described educational video material for teachers and student teachers. The study is based upon interviews with these teachers and observations while they were using the material either…

  19. Audio-Visual Perception System for a Humanoid Robotic Head

    PubMed Central

    Viciana-Abad, Raquel; Marfil, Rebeca; Perez-Lorenzo, Jose M.; Bandera, Juan P.; Romero-Garces, Adrian; Reche-Lopez, Pedro

    2014-01-01

    One of the main issues within the field of social robotics is to endow robots with the ability to direct attention to people with whom they are interacting. Different approaches follow bio-inspired mechanisms, merging audio and visual cues to localize a person using multiple sensors. However, most of these fusion mechanisms have been used in fixed systems, such as those used in video-conference rooms, and thus, they may incur difficulties when constrained to the sensors with which a robot can be equipped. Besides, within the scope of interactive autonomous robots, there is a lack in terms of evaluating the benefits of audio-visual attention mechanisms, compared to only audio or visual approaches, in real scenarios. Most of the tests conducted have been within controlled environments, at short distances and/or with off-line performance measurements. With the goal of demonstrating the benefit of fusing sensory information with a Bayes inference for interactive robotics, this paper presents a system for localizing a person by processing visual and audio data. Moreover, the performance of this system is evaluated and compared via considering the technical limitations of unimodal systems. The experiments show the promise of the proposed approach for the proactive detection and tracking of speakers in a human-robot interactive framework. PMID:24878593

  20. Audio and Video Reflections to Promote Social Justice

    ERIC Educational Resources Information Center

    Boske, Christa

    2011-01-01

    Purpose: The purpose of this paper is to examine how 15 graduate students enrolled in a US school leadership preparation program understand issues of social justice and equity through a reflective process utilizing audio and/or video software. Design/methodology/approach: The study is based on the tradition of grounded theory. The researcher…