These are representative sample records from Science.gov related to your search topic.
For comprehensive and current results, perform a real-time search at Science.gov.
1

THE STUDY AND IMPLEMENTATION OF TEXT-TO-SPEECH SYSTEM FOR  

E-print Network

of translating the text user defined or export from database into audio. Fig.1 Text-to-speech system's functionsTHE STUDY AND IMPLEMENTATION OF TEXT-TO-SPEECH SYSTEM FOR AGRICULTURAL INFORMATION Huoguo Zheng ,*2 of information resource in rural, we R & D the text-to-speech system. The system includes two parts, software

Paris-Sud XI, Université de

2

Text-to-speech scripting interface for appropriate vocalisation of e-texts  

Microsoft Academic Search

Electronic texts carry important meta-information (such as tags in HTML) that most of the current Text-to-Speech (TtS) systems ignore during the production of the speech. We propose an approach to exploit this meta-information in order to achieve a detailed auditory representation of an e-text. The e-Text to Speech and Audio (e-TSA) Composer has been designed and developed as an XML

Gerasimos Xydas; Georgios Kouroupetroglou

2001-01-01

3

Audio-Visual Teaching Machines.  

ERIC Educational Resources Information Center

An audiovisual teaching machine (AVTM) presents programed audio and visual material simultaneously to a student and accepts his response. If his response is correct, the machine proceeds with the lesson; if it is incorrect, the machine so indicates and permits another choice (linear) or automatically presents supplementary material (branching).…

Dorsett, Loyd G.

4

TEXT-TO-SPEECH CONVERSION WITH STAGED NEURAL NETWORKS  

Microsoft Academic Search

This paper presents a series of staged artificial neural networks (ANNs) for phoneme recognition for text -to-speech applications. Applying ANNs for phoneme mapping for text -to-speech conversion creates a fast distributed recognition engine. This engine not only supports the mapping of missing words in the database, but it can also reduce contradictions related to different pronunciations for the same word.

FABIO ARCINIEGAS; MARK J. EMBRECHTS

5

A markup language for text-to-speech synthesis.   

E-print Network

Text-to-speech synthesizers must process text, and therefore require some knowledge of text structure. While many TTS systems allow for user control by means of ad hoc ‘escape sequences’, there remains to date no ...

Sproat, Richard; Taylor, Paul; Tanenblatt, Michael; Isard, Amy

1997-01-01

6

Neural networks for text-to-speech phoneme recognition  

Microsoft Academic Search

Presents two different artificial neural network (ANN) approaches for phoneme recognition for text-to-speech applications: staged backpropagation neural networks and self-organizing maps. Several current commercial approaches rely on an exhaustive dictionary approach for text-to-phoneme conversion. Applying neural networks to phoneme mapping for text-to-speech conversion creates a fast distributed recognition engine. This engine not only supports the mapping of missing words in

Mark J. Embrechts; Fabio Arciniegas

2000-01-01

7

Augmented Auditory Representation of e-Texts for Text-to-Speech Systems  

Microsoft Academic Search

Emerging electronic text formats include hierarchical structure and visualization related information that current Text-to-Speech (TtS) systems i gnore. In this paper we present a novel approach for composing detailed auditory representation of e-texts using sp eech and audio. Furthermore, we provide a scripting language (CAD scripts) for defining specific customizations on the operation of a TtS. CAD scripts can as

Gerasimos Xydas; Georgios Kouroupetroglou

2001-01-01

8

Choosing and Using Text-to-Speech Software  

ERIC Educational Resources Information Center

This article describes a computer-based technology for generating speech called text-to-speech (TTS). This software is ready for widespread use by libraries, other organizations, and individual users. It offers the affordable ability to turn just about any electronic text that is not image-based into an artificially spoken communication. The…

Peters, Tom; Bell, Lori

2007-01-01

9

Multilingual Text Analysis for Text-to-Speech Synthesis  

Microsoft Academic Search

We present a model of text analysis for text-to-speech (TTS) syn- thesis based on weighted finite-state transducers, which serves as the text-analysis module of the multilingual Bell Labs TTS system. The transducers are constructed using a lexical toolkit that allows declarative descriptions of lexicons, morphological rules, numeral- expansion rules, and phonological rules, inter alia. To date, the model has been

Richard Sproat

1996-01-01

10

A MARKUP LANGUAGE FOR TEXT-TO-SPEECH SYNTHESIS  

Microsoft Academic Search

Text-to-speech synthesizers must process text, and there- fore require some knowledge of text structure. While many TTS systems allow for user control by means of ad hoc 'escape sequences', there remains to date no ad- equate and generally agreed upon system-independent standard for marking up text for the purposes of synthe- sis. The present paper is a collaborative effort between

Richard Sproat; Paul Taylor; Michael Tanenblatt; Amy Isard

1997-01-01

11

Approaches for adaptive database reduction for Text-To-Speech synthesis Aleksandra Krul 1,2  

E-print Network

Approaches for adaptive database reduction for Text-To-Speech synthesis Aleksandra Krul 1,2 , G This paper raises the issue of speech database reduction adapted to a specific domain for Text-To-Speech (TTS measures. Index Terms: text-to-speech synthesis, corpus reduction, database adaptation, KL divergence 1

Yvon, François

12

Multilingual Number Transcription for Text-to-Speech Conversion R. San-Segundo1  

E-print Network

Multilingual Number Transcription for Text-to-Speech Conversion R. San-Segundo1 , J.M. Montero1 , M of a text to speech fully-trainable conversion system and its application to number transcription. The main of on expert rules. This paper proposes a general architecture based on statistical ma- chine translation

Edinburgh, University of

13

"Look What I Did!": Student Conferences with Text-to-Speech Software  

ERIC Educational Resources Information Center

The authors describe a strategy that empowers students to edit and revise their own writing. Students input their writing in to text-to-speech software that rereads the text aloud. While listening, students make necessary revisions and edits.

Young, Chase; Stover, Katie

2014-01-01

14

Concatenative Text-to-Speech Synthesis Based on Prototype Waveform Interpolation (A Time Frequency Approach)   

E-print Network

This paper presents some preliminary methods to apply the Time- Frequency Interpolation technique - TFI [3] to concatenative text-to-speech synthesis. The TFI technique described here is a pitch-synchronous time-frequency ...

Morais, Edmilson; Taylor, Paul; Violaro, Fabio

15

Using Text-to-Speech Reading Support for an Adult with Mild Aphasia and Cognitive Impairment  

ERIC Educational Resources Information Center

This single case study served to examine text-to-speech (TTS) effects on reading rate and comprehension in an individual with mild aphasia and cognitive impairment. Findings showed faster reading, given TTS presented at a normal speaking rate, but no significant comprehension changes. TTS may support reading in people with aphasia when time…

Harvey, Judy; Hux, Karen; Snell, Jeffry

2013-01-01

16

The Effects of Word Prediction and Text-to-Speech on the Writing Process of Translating  

ERIC Educational Resources Information Center

The purpose of this study was to determine the effects of the combination of word prediction and text-to-speech software on the writing process of translating. Participants for this study included 10 elementary and middle school students who had a diagnosis of disorder of written expression. A modified multiple case series was used to collect data…

Cunningham, Robert

2013-01-01

17

Audio steganalysis based on factor analysis and support vector machine  

NASA Astrophysics Data System (ADS)

Factor analysis (FA) is a multivariate statistical method for studying the correlation among a group of variables, and is able to minimize data redundancy and reveal hidden patterns. We used FA to reduce the dimension of feature vector and support vector machine to classify the audio data sets in the proposed audio steganalysis system. Then the system is utilized to detect the stego-audio signals embedded by wavelet domain LSB, quantization index method (QIM) and addition method (AM). The dimension of feature vector is decreased from 36 to 7 (LSB), 6 (QIM), 6 (AM) respectively. The detection rates are all greater than 91% and the detecting performance is maintained.

Yuan, Jinsha; Qi, Yincheng

2008-10-01

18

Faking it: Synthetic text-to-speech synthesis for u nder-resourced languages - Experimental design  

Microsoft Academic Search

Speech synthesis or text-to-speech (TTS) systems are currently available for a number of the world's major languages, but for thousands of the world's 'minor' languages no such technology is available. While awaiting the development of such technology, we would like to try the stop-gap solution of using an existing TTS system for a major language (the base language) to 'fake'

Harold Somers

19

A Hybrid Text-to-Speech System That Combines Concatenative and Statistical Synthesis Units  

Microsoft Academic Search

Concatenative synthesis and statistical synthesis are the two main approaches to text-to-speech (TTS) synthesis. Con- catenative TTS (CTTS) stores natural speech features segments, selected from a recorded speech database. Consequently, CTTS systems enable speech synthesis with natural quality. However, as the footprintofthe stored dataisreduced,desired segmentsare not alwaysavailableinthestoreddata,andaudiblediscontinuitiesmay result. On the other hand, statistical TTS (STTS) systems, in spite of having

Stas Tiomkin; David Malah; Slava Shechtman; Zvi Kons

2011-01-01

20

A rule-based text-to-speech system for Portuguese  

Microsoft Academic Search

The latest progress in the development of a text-to-speech system for Portuguese is described. The system comprises four major modules: text normalization, linguistic and phonetic processing, generation of the synthesizer parameters and synthesis. The present rule-based version, based on the Klatt80 formant synthesizer, has achieved promising results, namely in what concerns the performance of stress assignment, phonetic transcription, and prosodic

L. C. Oliviera; M. C. Viana; I. M. Trancoso

1992-01-01

21

F0 Analysis and Modeling for Cantonese Text-to-Speech  

Microsoft Academic Search

This paper presents a study on the control of fundamental frequency (F0) in Cantonese text-to-speech (TTS) systems. The surface F0 contour of an utterance is considered as the combination of tone-related local components and phrase-level long-term variation. A novel method of F0 normalization has been developed to effectively separate them. Statistical analysis is performed for the phrase curves and the

Yujia Li; Tan Lee; Yao Qian

2004-01-01

22

Orthographic Learning and the Role of Text-to-Speech Software in Dutch Disabled Readers.  

PubMed

In this study, we examined whether orthographic learning can be demonstrated in disabled readers learning to read in a transparent orthography (Dutch). In addition, we tested the effect of the use of text-to-speech software, a new form of direct instruction, on orthographic learning. Both research goals were investigated by replicating Share's self-teaching paradigm. A total of 65 disabled Dutch readers were asked to read eight stories containing embedded homophonic pseudoword targets (e.g., Blot/Blod), with or without the support of text-to-speech software. The amount of orthographic learning was assessed 3 or 7 days later by three measures of orthographic learning. First, the results supported the presence of orthographic learning during independent silent reading by demonstrating that target spellings were correctly identified more often, named more quickly, and spelled more accurately than their homophone foils. Our results support the hypothesis that all readers, even poor readers of transparent orthographies, are capable of developing word-specific knowledge. Second, a negative effect of text-to-speech software on orthographic learning was demonstrated in this study. This negative effect was interpreted as the consequence of passively listening to the auditory presentation of the text. We clarify how these results can be interpreted within current theoretical accounts of orthographic learning and briefly discuss implications for remedial interventions. PMID:23686998

Staels, Eva; Van den Broeck, Wim

2015-01-01

23

Advancements in text-to-speech technology and implications for AAC applications  

NASA Astrophysics Data System (ADS)

Intelligibility was the initial focus in text-to-speech (TTS) research, since it is clearly a necessary condition for the application of the technology. Sufficiently high intelligibility (approximating human speech) has been achieved in the last decade by the better formant-based and concatenative TTS systems. This led to commercially available TTS systems for highly motivated users, particularly the blind and vocally impaired. Some unnatural qualities of TTS were exploited by these users, such as very fast speaking rates and altered pitch ranges for flagging relevant information. Recently, the focus in TTS research has turned to improving naturalness, so that synthetic speech sounds more human and less robotic. Unit selection approaches to concatenative synthesis have dramatically improved TTS quality, although at the cost of larger and more complex systems. This advancement in naturalness has made TTS technology more acceptable to the general public. The vocally impaired appreciate a more natural voice with which to represent themselves when communicating with others. Unit selection TTS does not achieve such high speaking rates as the earlier TTS systems, however, which is a disadvantage to some AAC device users. An important new research emphasis is to improve and increase the range of emotional expressiveness of TTS.

Syrdal, Ann K.

2003-10-01

24

Segmental intelligibility of four currently used text-to-speech synthesis methods.  

PubMed

The study investigated the segmental intelligibility of four currently available text-to-speech (TTS) products under 0-dB and 5-dB signal-to-noise ratios. The products were IBM ViaVoice version 5.1, which uses formant coding, Festival version 1.4.2, a diphone-based LPC TTS product, AT&T Next-Gen, a half-phone-based TTS product that uses harmonic-plus-noise method for synthesis, and FlexVoice2, a hybrid TTS product that combines concatenative and formant coding techniques. Overall, concatenative techniques were more intelligible than formant or hybrid techniques, with formant coding slightly better at modeling vowels and concatenative techniques marginally better at synthesizing consonants. No TTS product was better at resisting noise interference than others, although all were more intelligible at 5 dB than at 0-dB SNR. The better TTS products in this study were, on the average, 22% less intelligible and had about 3 times more phoneme errors than human voice under comparable listening conditions. The hybrid TTS technology of FlexVoice had the lowest intelligibility and highest error rates. There were discernible patterns of errors for stops, fricatives, and nasals. Unrestricted TTS output--e-mail messages, news reports, and so on--under high noise conditions prevalent in automobiles, airports, etc. will likely challenge the listeners. PMID:12703720

Venkatagiri, Horabail S

2003-04-01

25

MIXED TYPE AUDIO CLASSIFICATION WITH SUPPORT VECTOR MACHINE Department of Computer Science  

E-print Network

MIXED TYPE AUDIO CLASSIFICATION WITH SUPPORT VECTOR MACHINE Lei Chen Department of Computer Science of Waterloo tozsu@uwaterloo.ca ABSTRACT Content-based classification of audio data is an important problem for various applications such as overall analysis of audio-visual streams, boundary detection of video story

Ã?zsu, M. Tamer

26

Content-Based Audio Classification and Retrieval by Support Vector Machines  

Microsoft Academic Search

Support vector machines (SVMs) have been recently proposed as a new learning algorithm for pattern recognition. In this paper, the SVMs with a binary tree recognition strategy are used to tackle the audio classification problem. We illustrate the potential of SVMs on a common audio database, which consists of 409 sounds of 16 classes. We compare the SVMs based classification

Guodong Guo; Stan Z. Li

2000-01-01

27

Large-vocabulary audio-visual speech recognition by machines and humans  

Microsoft Academic Search

We compare automatic recognition with human perception of audio-visual speech, in the large-vocabulary, continuous speech recognition (LVCSR) domain. Specifically, we study the bene- fit of the visual modality for both machines and humans, when combined with audio degraded by speech-babble noise at var- ious signal-to-noise ratios (SNRs). We first consider an auto- matic speechreading system with a pixel based visual

Gerasimos Potamianos; Chalapathy Neti; Giridharan Iyengar; Eric Helmuth

2001-01-01

28

Audio  

Microsoft Academic Search

\\u000a If you’re one of those who treat audio in Flash as an afterthought, think again. In many respects, audio is a major medium\\u000a for communicating your message. In this chapter, we dig into audio in Flash: where it comes from, what formats are supported,\\u000a and how to use it in Flash. Regardless whether you’re new to Flash or an old

Tom Green; David Stiller

29

Effects of Text-to-Speech Software on the Reading Rate and Comprehension Skills of High School Students with Specific Learning Disabilities  

ERIC Educational Resources Information Center

The purpose of this study was to examine the effects of a text-to-speech software program known as "Read Please" on the reading rate and reading comprehension accuracy of two high school students with specific learning disabilities (SLD) in reading. A single-subject A-B-A-B "withdrawal" research design (Alberto & Troutman, 2009) was used to…

Moorman, Amanda; Boon, Richard T.; Keller-Bell, Yolanda; Stagliano, Christina; Jeffs, Tara

2010-01-01

30

Listening to Revise: What a Study about Text-to-Speech Software Taught Us about Students' Expectations for Technology Use in the Writing Center  

ERIC Educational Resources Information Center

This is a story of a failed study. In 2007, the authors set out to demonstrate that Kurzweil 3000, an adaptive text-to-speech software program, would help any student revise with its read-aloud function and numerous writing tools. During the course of the study, the authors confronted their misconceptions about students' technology use and…

Conard-Salvo, Tammy; Spartz, John M.

2012-01-01

31

The Effects of Word Prediction and Text-to-Speech Technologies on the Narrative Writing Skills of Hispanic Students with Specific Learning Disabilities  

ERIC Educational Resources Information Center

A multiple-baseline design across subjects was used to investigate the effects of word prediction and text-to-speech alone and in combination on four narrative composition-writing skills (writing fluency, syntax, spelling accuracy, and overall organization) of six fifth-grade Hispanic boys with specific learning disabilities (SLD). Participants…

Silio, Monica C.; Barbetta, Patricia M.

2010-01-01

32

Time and spectral analysis methods with machine learning for the authentication of digital audio recordings.  

PubMed

This paper addresses the problem of tampering detection and discusses new methods that can be used for authenticity analysis of digital audio recordings. Nowadays, the only method referred to digital audio files commonly approved by forensic experts is the ENF criterion. It consists in fluctuation analysis of the mains frequency induced in electronic circuits of recording devices. Therefore, its effectiveness is strictly dependent on the presence of mains signal in the recording, which is a rare occurrence. This article presents the existing methods of time and spectral analysis along with their modifications as proposed by the author involving spectral analysis of residual signal enhanced by machine learning algorithms. The effectiveness of tampering detection methods described in this paper is tested on a predefined music database. The results are compared graphically using ROC-like curves. Furthermore, time-frequency plots are presented and enhanced by reassignment method in purpose of visual inspection of modified recordings. Using this solution, enables analysis of minimal changes of background sounds, which may indicate tampering. PMID:23481673

Korycki, Rafal

2013-07-10

33

DIXI - portuguese text-to-speech system  

Microsoft Academic Search

This paper describes the software architecture of the Portuguesetext-to-speech system DIXI1. The system has three major modules.The first one contains the text normalizer and searcheseachword in the lexicon. The second one is a multi-level rulebased module for lexical stress assignment, orthographic to phonetictranscription, metrically based prosodic patterning and forgenerating the evolution of the synthesizer parameters. The finalmodule is the Klatt

Luís C. Oliveira; Céu Viana; Isabel M. Trancoso

1991-01-01

34

Audio 2008: Audio Fixation  

ERIC Educational Resources Information Center

Take a look around the bus or subway and see just how many people are bumping along to an iPod or an MP3 player. What they are listening to is their secret, but the many signature earbuds in sight should give one a real sense of just how pervasive digital audio has become. This article describes how that popularity is mirrored in library audio

Kaye, Alan L.

2008-01-01

35

Fuzzy Logic-Based Audio Pattern Recognition  

NASA Astrophysics Data System (ADS)

Audio and audio-pattern recognition is becoming one of the most important technologies to automatically control embedded systems. Fuzzy logic may be the most important enabling methodology due to its ability to rapidly and economically model such application. An audio and audio-pattern recognition engine based on fuzzy logic has been developed for use in very low-cost and deeply embedded systems to automate human-to-machine and machine-to-machine interaction. This engine consists of simple digital signal-processing algorithms for feature extraction and normalization, and a set of pattern-recognition rules manually tuned or automatically tuned by a self-learning process.

Malcangi, M.

2008-11-01

36

Audio Mining  

NSDL National Science Digital Library

Occasionally referred to as audio indexing, audio mining is a computerized task involving the processing of an audio file, extracting the dialog and creating a textual transcript, and searching the transcript for certain words or phrases. Considering the amount of audio content on the Internet and other sources, it is clear that audio mining is a growing technology.To get an idea of what audio mining is and how it can be used, people can read this article from the Cutter Consortium (1). It lists six broad areas that can benefit from using the technology and briefly discusses each one. A more detailed introduction is offered on the Leavitt Communications Web site (2). This article delves into how audio mining works by giving a basic technical understanding of the process. A new method of searching an audio file, dubbed the "phonetic search engine," is compared to traditional methods in this white paper (3). A publication from the Compaq Cambridge Research Laboratory (4) discusses ways of collecting and analyzing information from an audio file. It also mentions SpeechBot, a Web-based tool for multimedia retrieval. Several papers can be downloaded from the home page of a research project studying the National Gallery of the Spoken Word (5). The repository is comprised of massive historical audio content, and the team at the University of Colorado is investigating phrase recognition to index the data. Have you ever had a tune stuck in your head, but not known the name of the artist or song title? The Musical Audio-Mining project (6) is working on ways to search for information about a song simply by humming part of it. Audio mining can also be used in the War on Terrorism, as is described in this article of Federal Computer Week (7). Massive amounts of recorded phone conversations are intercepted by the government each day, and audio mining would be an efficient way to sort through irrelevant material and catch suspicious activity. The World Wide Web Consortium released this draft of the Voice Extensible Markup Language (8), which could have applications for the audio mining community.

Leske, Cavin.

2002-01-01

37

Modem/Audio IntegrationModem/Audio Integration Concurrent Audio AndConcurrent Audio And  

E-print Network

#12;Modem/Audio IntegrationModem/Audio Integration #12;Concurrent Audio AndConcurrent Audio And Modem AccelerationModem Acceleration Dr. Rob MaherDr. Rob Maher Engineering ManagerEngineering ManagerIntroduction and Scope uu Impact of Audio/Modem AccelerationImpact of Audio/Modem Acceleration uu Features and Cost

Maher, Robert C.

38

BIBLIOGRAPHY Text-to-speech in Vocabulary Acquisition and Student  

E-print Network

native Arabic reading in reading paragraphs, sentences, and isolated words. Journal of Psycholinguistic Psycholinguistics, 24, 289-315. Handley, Z. & Hamel, M. (2005). Establishing a Methodology for Benchmarking Speech language. Applied Psycholinguistics, 19, 99-113. Pelletreau, Timothy R. (2006). Computer

39

RECENT ADVANCES IN MULTILINGUAL TEXT-TO-SPEECH SYNTHESIS  

Microsoft Academic Search

this paper we will discuss recent advances in multilingualtext-to-speech (TTS) synthesis research atAT&T Bell Laboratories. The TTS system developedat AT&T Bell Laboratories generates syntheticspeech by concatenating segments of natural speech.The architecture of the system is designed as a modularpipeline where each module handles one particularstep in the process of converting text into speech. Besidesconceptual and computational advantages, themodular structure has

Bernd M; Juergen Schroeter; Jan van Santen; Richard Sproat; Joseph Olive

1996-01-01

40

Voice quality interpolation for emotional text-to-speech synthesis  

Microsoft Academic Search

Synthesizing desired emotions using concatenative algorithms relies on collection of large databases. This paper focuses on the development and assessment of a simple algorithm to interpolate the intended vocal effort in existing databases in order to create new databases with intermediate levels of vocal effort. Three diphone databases in German with soft, modal, and loud voice qualities are processed with

Oytun Türk; Marc Schröder; Baris Bozkurt; Levent M. Arslan

2005-01-01

41

Detecting double compression of audio signal  

NASA Astrophysics Data System (ADS)

MP3 is the most popular audio format nowadays in our daily life, for example music downloaded from the Internet and file saved in the digital recorder are often in MP3 format. However, low bitrate MP3s are often transcoded to high bitrate since high bitrate ones are of high commercial value. Also audio recording in digital recorder can be doctored easily by pervasive audio editing software. This paper presents two methods for the detection of double MP3 compression. The methods are essential for finding out fake-quality MP3 and audio forensics. The proposed methods use support vector machine classifiers with feature vectors formed by the distributions of the first digits of the quantized MDCT (modified discrete cosine transform) coefficients. Extensive experiments demonstrate the effectiveness of the proposed methods. To the best of our knowledge, this piece of work is the first one to detect double compression of audio signal.

Yang, Rui; Shi, Yun Q.; Huang, Jiwu

2010-01-01

42

Using Audio  

Microsoft Academic Search

\\u000a This chapter covers the following topics:\\u000a \\u000a \\u000a \\u000a \\u000a  \\u000a \\u000a How to load sound files\\u000a \\u000a \\u000a \\u000a  \\u000a \\u000a How to control audio behavior\\u000a \\u000a \\u000a \\u000a  \\u000a \\u000a How to read and display audio ID3 information\\u000a \\u000a \\u000a \\u000a  \\u000a \\u000a How to display the sound spectrum\\u000a \\u000a \\u000a \\u000a  \\u000a \\u000a How to control sound volume and panning\\u000a \\u000a \\u000a \\u000a The value of sound is subtle and undervalued. Often, it makes the difference between a good site and

Sean McSharry

43

Audio Segmentation and Classification  

E-print Network

Audio Segmentation and Classification Abdillahi Hussein Omar Kgs. Lyngby 2005 #12;Preface The work describes the work done on the development of an audio segmentation and classification system. Many existing works on audio classification deal with the problem of classifying known homogeneous audio segments

44

AUDIO INDEXING Gal RICHARD  

E-print Network

AUDIO AUDIO INDEXING Gaël RICHARD Ecole Nationale Supérieure des Télécommunications (ENST) Speech and Image Processing Department 37-39, rue Dareau, 75014 Paris, France #12;Audio Indexing Gaël RICHARD Ecole audio data available nowadays and the spread of its use as a data source in many applications

Richard, Gaël

45

Overview of Audio Forensics  

Microsoft Academic Search

\\u000a Audio forensics applies the tools and techniques of audio engineering and digital signal processing to study audio data as\\u000a part of a legal proceeding or an official investigation of some kind. This chapter summarizes the principal audio forensic\\u000a tasks, including authentication, enhancement, and interpretation. The chapter explains the relevant procedural and historical\\u000a background, presents several examples of audio forensic applications,

Robert C. Maher

2010-01-01

46

Digital audio editing  

NASA Astrophysics Data System (ADS)

Contrasting editing methods particularly in broadcasting are reviewed and the requirements of digital audio editing are explored. A hierarchy of editing strategies with varying degrees of sophistication is proposed. The simplest is to cut the tape and use error concealment and electronic crossfading to smooth the splice. In a more advanced option, the concept of separate cut-point and edit-point is introduced, using an auxiliary data track to control a jump over the splice. The top level is a disc-based strategy which gives the user a flexible, nondestructive editing technique with advanced rehearsal facilities not possible with conventional methods. An experimental disc-based editor is developed. The design philosophy and implementation of this editor and a simulation of its performance are described. The man-machine interface, data formatting, systems level design and the software engineering of the project are reported.

McNally, G. W.; Gaskell, P. S.; Stirling, A. J.

1985-08-01

47

Audio Minimization: Applying 3D Audio Techniques to Multi-Stream Audio Interfaces  

E-print Network

Audio Minimization: Applying 3D Audio Techniques to Multi-Stream Audio Interfaces Yolanda Vazquez, stephen}@dcs.gla.ac.uk ABSTRACT Audio-driven eyes-free interactions in which simultaneous audio streams are employed can overload the user. We pro- pose the use of an audio minimization technique in a 3D audio

Williamson, John

48

Audio-visual affective expression recognition  

NASA Astrophysics Data System (ADS)

Automatic affective expression recognition has attracted more and more attention of researchers from different disciplines, which will significantly contribute to a new paradigm for human computer interaction (affect-sensitive interfaces, socially intelligent environments) and advance the research in the affect-related fields including psychology, psychiatry, and education. Multimodal information integration is a process that enables human to assess affective states robustly and flexibly. In order to understand the richness and subtleness of human emotion behavior, the computer should be able to integrate information from multiple sensors. We introduce in this paper our efforts toward machine understanding of audio-visual affective behavior, based on both deliberate and spontaneous displays. Some promising methods are presented to integrate information from both audio and visual modalities. Our experiments show the advantage of audio-visual fusion in affective expression recognition over audio-only or visual-only approaches.

Huang, Thomas S.; Zeng, Zhihong

2007-11-01

49

Developments Trends in Audio  

E-print Network

how to authenticate multimedia data have led to research in forgery detection, but studies in audio are generally still limited com- pared to those in image and video. Audio forgery techniques could be used on recent developments in detecting the forgery of digital audio files is still lacking. Some of the surveys

Kuo, C.-C. "Jay"

50

AudioNet  

NSDL National Science Digital Library

For Internauts with RealAudio 1.0 capability (a 14.4 modem) try AudioNet, the "Broadcast Network of the Internet." AudioNet offers live broadcasts of over ten different talk radio stations, including WOR--New York, WTEM--Washington D.C., and XTRA--San Diego. It also offers several music radio stations, a selection of audio books, and numerous live (and recent) sporting events such as NIT and NCAA Men's and Women's basketball games and college baseball games. http://www.audionet.com/ Free RealAudio 1.0 and 2.0 players can be downloaded from the above sites. RealAudio 2.0 players will play RealAudio 1.0 sites, but 1.0 players will not play 2.0 sites. For more information on this and other plug-ins, visit the Scout Toolkit: webtools/plugins.html

51

Topic in Depth - Audio Mining  

NSDL National Science Digital Library

Occasionally referred to as audio indexing, audio mining is a computerized task involving the processing of an audio file, extracting the dialog and creating a textual transcript, and searching the transcript for certain words or phrases. Considering the amount of audio content on the Internet and other sources, it is clear that audio mining is a growing technology of growing importance.

2010-09-15

52

Audio?visual aids  

Microsoft Academic Search

AUDIO BOOKS. Audio Book Company, St. Joseph, Michigan. 1956. 16 rpm disc recordings. About 60 minutes per record.GREAT TALES AND POEMS OF EDGAR ALLEN POE. 4 records. Sale: $4.95.THE TRIAL OF SOCRATES. 3 records. Sale: $3.95.THE BEST OF MARK TWAIN. 4 records. Sale: $4.95.THE AUTOBIOGRAPHY OF BENJAMIN FRANKLIN. 8 records. Sale: $8.95.ALICE IN WONDERLAND. 3 records. Sale: &3.95.THE AUDIO BOOKS

Earle E. Curtis; A. C. LaFollette; Helen Harton; Georgia Bowman; Wally Schneider

1958-01-01

53

Audio forensic examination  

Microsoft Academic Search

The field of audio forensics involves many topics familiar to the general audio digital signal processing (DSP) community, such as speech recognition, talker identification, and signal quality enhancement. There is potentially much to be gained by applying modern DSP theory to problems of interest to the forensics community, and this article is written to give the DSP audience some insight

Robert C. Maher

2009-01-01

54

Audio Engineers: Sound Weavers  

NSDL National Science Digital Library

Students are introduced to audio engineers, discovering the type of environment in which they work and exactly what they do on a day-to-day basis. Students come to realize that audio engineers help produce their favorite music and movies.

Integrated Teaching and Learning Program,

55

Designing audio aura  

Microsoft Academic Search

In this paper, we describe the process behind the design of Audio Aura. The goal of Audio Aura is to provide serendipitous information, via background auditory cues, that is tied to people's physical actions in the workplace. We used scenarios to explore issues in serendipitous information such as privacy and work practice. Our sound design was guided by a number

Elizabeth D. Mynatt; Maribeth Back; Roy Want; Michael Baer; Jason B. Ellis

1998-01-01

56

Mobile Audio Designs Monkey: A Tool for Audio Augmented Reality  

E-print Network

Mobile Audio Designs Monkey: A Tool for Audio Augmented Reality Bruce N. Walker and Kevin Stamper of audio Augmented Reality (AR) design, there is a significant problem. In order to develop audio for an AR the designer should be focusing on the creativity, design, and the logic of the Aug- mented Reality (AR), he

57

Advances in audio source seperation and multisource audio content retrieval  

E-print Network

Advances in audio source seperation and multisource audio content retrieval Emmanuel Vincent INRIA, Centre de Rennes - Bretagne Atlantique Campus de Beaulieu, F-35042 Rennes Cedex, France ABSTRACT Audio Audio Source Separation Toolkit (FASST) and discuss its advantages compared to earlier approaches

Paris-Sud XI, Université de

58

Perceptual coding of digital audio  

Microsoft Academic Search

During the last decade, CD-quality digital audio has essentially replaced analog audio. Emerging digital audio applications for network, wireless, and multimedia computing systems face a series of constraints such as reduced channel bandwidth, limited storage capacity, and low cost. These new applications have created a demand for high-quality digital audio delivery at low bit rates. In response to this need,

Ted Painter; Andreas Spanias

2000-01-01

59

Audio Engineering Society Convention Paper  

E-print Network

Audio Engineering Society Convention Paper Presented at the 112th Convention 2002 May 10­13 Munich. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 60 East 42 of the Audio Engineering Society. Intelligent Audio Source Separation using Independent Component Analysis

Mitianoudis, Nikolaos

60

Audio Engineering Society Convention Paper  

E-print Network

Audio Engineering Society Convention Paper Presented at the 120th Convention 2006 May 20­23 Paris. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 60 East 42 of the Audio Engineering Society. Parametric Representation of Multichannel Audio Based on Principal Component

Paris-Sud XI, Université de

61

DAILY / OVERNIGHT REQUEST FORM PROJECTORS: AUDIO  

E-print Network

DAILY / OVERNIGHT REQUEST FORM PROJECTORS: AUDIO: HUNTER COLLEGE Today's Date: ___ Overhead Projector ___ Public Address Unit ___ Ensign PA Audio-Visual Services ___ Slide Projector ___ Audio Cassette

Qiu, Weigang

62

LONG TERM REQUEST FORM PROJECTORS: AUDIO  

E-print Network

LONG · TERM REQUEST FORM PROJECTORS: AUDIO: HUNTER COLLEGE Today's Date: ___ Overhead Projector ___ Public Address Unit ___ Ensign PA Audio-Visual Services ___ Slide Projector ___ Audio Cassette Player

Qiu, Weigang

63

Forensic audio watermark detection  

NASA Astrophysics Data System (ADS)

Digital audio watermarking detection is often computational complex and requires at least as much audio information as required to embed a complete watermark. In some applications, especially real-time monitoring, this is an important drawback. The reason for this is the usage of sync sequences at the beginning of the watermark, allowing a decision about the presence only if at least the sync has been found and retrieved. We propose an alternative method for detecting the presence of a watermark. Based on the knowledge of the secret key used for embedding, we create a mark for all potential marking stages and then use a sliding window to test a given audio file on the presence of statistical characteristics caused by embedding. In this way we can detect a watermark in less than 1 second of audio.

Steinebach, Martin; Zmudzinski, Sascha; Petrautzki, Dirk

2012-03-01

64

Audio signal processor  

NASA Technical Reports Server (NTRS)

System provides automatic volume control for an audio amplifier or a voice communication system without introducing noise surges during pauses in the input, and without losing the initial signal when the input resumes.

Hymer, R. L.

1970-01-01

65

Acoustic chase : designing an interactive audio environment to stimulate human body movement  

E-print Network

An immersive audio environment was created that explores how humans react to commands imposed by a machine generating its acoustic stimuli on the basis of tracked body movement. In this environment, different states of ...

Schiessl, Simon Karl Josef, 1972-

2004-01-01

66

Using syntax in large-scale audio document translation  

Microsoft Academic Search

Recently, the use of syntax has very effectively improved machine translation (MT) quality in many text translation tasks. However, using syntax in speech translation poses additional challenges because of disfluencies and other spoken language phenomena, and of errors introduced by automatic speech recognition (ASR). In this paper, we investigate the effect of using syntax in a large-scale audio document translation

Jing Zheng; Necip Fazil Ayan; Wen Wang; David Burkett

2009-01-01

67

Machine Vision  

NSDL National Science Digital Library

An overview of a generic image-based machine vision system is provided on this Web site (1). The tutorial describes the main components of such a system, how its accuracy is measured, and what scientific and industrial applications benefit from machine vision. A more technical perspective of machine vision technology is given in an online publication of the Automated Imaging Association (2). Monthly feature articles discuss breaking issues related to machine vision, and several technical papers can be downloaded, which are sorted into categories such as three dimensional imaging and nanotechnology. Researchers from the MIT's Artificial Intelligence Laboratory (3) are investigating how to enable a computer to interpret visual and audio signals from its human user. By using machine perception systems to track the user's gaze, for example, the computer could ascertain the focus of the user's attention, thereby facilitating interaction between the human and the computer. The project's homepage includes numerous research papers, as well as video demonstrations of some of its systems. Machine vision is also finding its way into vehicles. A March 2003 news article (4) highlights a field test in Michigan of a collision avoidance system that uses, among other things, machine vision to warn drivers that they are approaching a slower or stopped object too quickly. The 3D Computer Vision Group at Carnegie Mellon University (5) is involved in several projects, including three dimensional object recognition and humanoid robot vision. Many of the group's recent publications are available for download. NASA's Mars Exploration Rovers, the second of which was launched in July 2003, have vision systems that will let them safely navigate rough terrain. These systems are described in this conference paper (6), including specifics of the stereo vision algorithm and insights into future missions. A new implementation of machine vision comes from a former researcher from Cambridge University. His shape recognition system, which is detailed in this news article (7), is reportedly much more related to human visual processes than existing techniques. For additional developments related to this evolving technology, Machine Vision News (8) has information about worldwide research and new applications of machine vision systems.

Leske, Cavin.

68

Efficient audio signal processing for embedded systems  

NASA Astrophysics Data System (ADS)

As mobile platforms continue to pack on more computational power, electronics manufacturers start to differentiate their products by enhancing the audio features. However, consumers also demand smaller devices that could operate for longer time, hence imposing design constraints. In this research, we investigate two design strategies that would allow us to efficiently process audio signals on embedded systems such as mobile phones and portable electronics. In the first strategy, we exploit properties of the human auditory system to process audio signals. We designed a sound enhancement algorithm to make piezoelectric loudspeakers sound ”richer" and "fuller." Piezoelectric speakers have a small form factor but exhibit poor response in the low-frequency region. In the algorithm, we combine psychoacoustic bass extension and dynamic range compression to improve the perceived bass coming out from the tiny speakers. We also developed an audio energy reduction algorithm for loudspeaker power management. The perceptually transparent algorithm extends the battery life of mobile devices and prevents thermal damage in speakers. This method is similar to audio compression algorithms, which encode audio signals in such a ways that the compression artifacts are not easily perceivable. Instead of reducing the storage space, however, we suppress the audio contents that are below the hearing threshold, therefore reducing the signal energy. In the second strategy, we use low-power analog circuits to process the signal before digitizing it. We designed an analog front-end for sound detection and implemented it on a field programmable analog array (FPAA). The system is an example of an analog-to-information converter. The sound classifier front-end can be used in a wide range of applications because programmable floating-gate transistors are employed to store classifier weights. Moreover, we incorporated a feature selection algorithm to simplify the analog front-end. A machine learning algorithm AdaBoost is used to select the most relevant features for a particular sound detection application. In this classifier architecture, we combine simple "base" analog classifiers to form a strong one. We also designed the circuits to implement the AdaBoost-based analog classifier.

Chiu, Leung Kin

69

Audio Engineering Society Convention Paper  

E-print Network

applications Augmented Reality Audio (ARA) is used in many mobile applications like geolocalized games, non Society. Augmented Reality Audio Editing Jacques Lemordant1 , Yohan Lasorsa1 1 INRIA Rhône-Alpes, France {jacques.lemordant, yohan.lasorsa}@inria.fr ABSTRACT The concept of augmented reality audio (ARA

Joseph Fourier Grenoble-I, Université

70

Audio Switch SpeakersAvotec  

E-print Network

Control Monitor Keyboard & Mouse Audio Switch Control SpeakersAvotec USB HUB USB HUB USB CAT5FORP USB Ext Stim PC Stim Mac Laptop Audio RCA RCA USB CAT5 FORPUSB Ext Video K&M Video Video Speakers Avotec Projector Video Switch VGA RCA RCA 3T Audio and Visual Stimulus Setup DVD Drive K&MSiemens RCA

Walker, Matthew P.

71

Audio Engineering Society Convention Paper  

E-print Network

Audio Engineering Society Convention Paper Presented at the 125th Convention 2008 October 2­5 San be obtained by sending request and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering

Jackson, Philip JB

72

Audio Engineering Society Convention Paper  

E-print Network

Audio Engineering Society Convention Paper Presented at the 117th Convention 2004 October 28­31 San for the contents. Additional papers may be obtained by sending request and remittance to Audio Engineering Society of the Audio Engineering Society. AES Technical Committee on Signal Processing Educational CD Project Robert C

Maher, Robert C.

73

Audio Engineering Society Convention Paper  

E-print Network

Audio Engineering Society Convention Paper Presented at the 120th Convention 2006 May 20­23 Paris. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 60 East 42 of the Audio Engineering Society. Acoustic Rendering for Color Information Ludovico Ausiello1 , Emanuele

Ferri, Massimo

74

Audio Engineering Society Convention Paper  

E-print Network

Audio Engineering Society Convention Paper Presented at the 122nd Convention 2007 May 5­8 Vienna be obtained by sending request and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering

Plumbley, Mark

75

Audio Engineering Society Convention Paper  

E-print Network

Audio Engineering Society Convention Paper Presented at the 127th Convention 2009 October 9­12 New be obtained by sending request and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering

Joseph Fourier Grenoble-I, Université

76

Audio Engineering Society Convention Paper  

E-print Network

Audio Engineering Society Convention Paper Presented at the 128th Convention 2010 May 22­25 London be obtained by sending request and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering

Paris-Sud XI, Université de

77

Audio Engineering Society Convention Paper  

E-print Network

Audio Engineering Society Convention Paper Presented at the 114th Convention 2003 March 22 for the contents. Additional papers may be obtained by sending request and remittance to Audio Engineering Society of the Audio Engineering Society. Real Time Object Based Coding Paul M. Brossier1 , Mark B. Sandler1 and Mark D

Plumbley, Mark

78

Audio Engineering Society Convention Paper  

E-print Network

Audio Engineering Society Convention Paper Presented at the 134th Convention 2013 May 4­7 Rome. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 60 East 42 of the Audio Engineering Society. On the Informed Source Separation Approach for Interactive Remixing in Stereo

Paris-Sud XI, Université de

79

Real World Audio  

NASA Technical Reports Server (NTRS)

Crystal River Engineering was originally featured in Spinoff 1992 with the Convolvotron, a high speed digital audio processing system that delivers three-dimensional sound over headphones. The Convolvotron was developed for Ames' research on virtual acoustic displays. Crystal River is a now a subsidiary of Aureal Semiconductor, Inc. and they together develop and market the technology, which is a 3-D (three dimensional) audio technology known commercially today as Aureal 3D (A-3D). The technology has been incorporated into video games, surround sound systems, and sound cards.

1998-01-01

80

3D Audio System  

NASA Technical Reports Server (NTRS)

Ames Research Center research into virtual reality led to the development of the Convolvotron, a high speed digital audio processing system that delivers three-dimensional sound over headphones. It consists of a two-card set designed for use with a personal computer. The Convolvotron's primary application is presentation of 3D audio signals over headphones. Four independent sound sources are filtered with large time-varying filters that compensate for motion. The perceived location of the sound remains constant. Possible applications are in air traffic control towers or airplane cockpits, hearing and perception research and virtual reality development.

1992-01-01

81

Audio Feedback -- Better Feedback?  

ERIC Educational Resources Information Center

National Student Survey (NSS) results show that many students are dissatisfied with the amount and quality of feedback they get for their work. This study reports on two case studies in which we tried to address these issues by introducing audio feedback to one undergraduate (UG) and one postgraduate (PG) class, respectively. In case study one…

Voelkel, Susanne; Mello, Luciane V.

2014-01-01

82

A ROBUST AUDIO FINGERPRINT'S BASED IDENTIFICATION METHOD An audio fingerprint is a small digest of an audio file  

E-print Network

A ROBUST AUDIO FINGERPRINT'S BASED IDENTIFICATION METHOD ABSTRACT An audio fingerprint is a small digest of an audio file computed from its main perceptual properties. Like human fingerprints, audio fingerprints allow to identify an audio file among a set of candidates but does not allow to re­ trieve any

Brun, Chez Luc

83

Audio issues in MIR evaluation vOverview of audio formats  

E-print Network

Audio issues in MIR evaluation vOverview of audio formats vPreferred presentation of audio files in an MIR testbed vA set of simple recommendations Audio Formats I 1. Apple à AIFF (Audio Interchange File Sample rates � Channels · 2 most common uncompressed formats · Digital Audio Workstations support both

Reiss, Josh

84

Audio distribution and Monitoring Circuit  

NASA Technical Reports Server (NTRS)

Versatile circuit accepts and distributes TV audio signals. Three-meter audio distribution and monitoring circuit provides flexibility in monitoring, mixing, and distributing audio inputs and outputs at various signal and impedance levels. Program material is simultaneously monitored on three channels, or single-channel version built to monitor transmitted or received signal levels, drive speakers, interface to building communications, and drive long-line circuits.

Kirkland, J. M.

1983-01-01

85

Digital audio measurements for broadcasting  

Microsoft Academic Search

Transmitting and storing audio signals in the digital domain is well-established in the broadcast industry. Analog audio has given way to the AES3 and Sony\\/Phillips digital interface format (SPDIF). AES3 datastreams are also embedded in SDI television signals. The first part of the paper is describing the digital audio signal itself to get a better understanding of the related problems.

KRESIMIR CRNKOVIC; MLADEN MALETIC; HRVOJE DOMITROVIC

2003-01-01

86

Audio Mastering as Musical Practice.  

E-print Network

??This thesis examines audio mastering as musical communication. Tasks including loudness management, harmonic balance, denoising, phase alignment, monitoring, effects application, and administrative responsibilities are of… (more)

Shelvock, Matt

2012-01-01

87

Audio Engineering Society Convention Paper 5484  

E-print Network

___________________________________ Audio Engineering Society Convention Paper 5484 Presented and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA; also see without direct permission from the Journal of the Audio Engineering Society

Paris-Sud XI, Université de

88

Designing Spatial Audio Interfaces to Support Multiple Audio Streams  

E-print Network

, multiple audio streams, divided-attention task. 1. INTRODUCTION AND BACKGROUND Auditory interfaces are used.gaime-project.org ABSTRACT Auditory interfaces offer a solution to the problem of effective eyes-free mobile interactions an experiment using a divided-attention task where a continuous podcast and an audio menu compete for attention

Williamson, John

89

Corpus design based on the Kullback-Leibler divergence for Text-To-Speech synthesis application  

E-print Network

-Leibler divergence to compute the score of each candidate sentence. The proposed cri- terion gives the possibility to the quality of the recorded textual corpus. Therefore, the corpus construction is a crucial step in building is defined by the units to be covered. However, each sentence in the textual corpus is a set of units, so

Yvon, François

90

Enhancing reading comprehension with text-to-speech (DECtalk) computer system  

Microsoft Academic Search

The study aimed at enhancing text comprehension, assessed through inferencing questions in expository prose passages, in adolescent readers. A total of 67 grades 6, 7, and 8 readers, dichotomized into above and below average subgroups in reading, first read unaided the 12 computerized passages of about 200 words each. They were then provided with explanations of difficult words and sentence

Che Kan Leong

1992-01-01

91

WORD AND SYLLABLE MODELS FOR GERMAN TEXT-TO-SPEECH SYNTHESIS  

E-print Network

- ery natural language has productive word formation processes, and the community of speakers and innovative processes as regular words are. Thus, in unlimited vocabulary scenarios we are not facing a memory lexical status by now, ex- amples of spontaneous novel word coinage by compounding are easy to come by

Möbius, Bernd

92

A Comparative Study of ID3 and Backpropagation for English Text-to-Speech Mapping  

Microsoft Academic Search

The performance of the error backpropagation(BP) and ID3 learning algorithms wascompared on the task of mapping Englishtext to phonemes and stresses. Under the distributedoutput code developed by Sejnowskiand Rosenberg, it is shown that BP consistentlyout-performs ID3 on this task by severalpercentage points. Three hypotheses explainingthis difference were explored: (a)ID3 is overfitting the training data, (b) BPis able to share hidden

Thomas G. Dietterich; Hermann Hild; Ghulum Bakiri

1990-01-01

93

Large-scale content-based audio retrieval from text queries  

Microsoft Academic Search

In content-based audio retrieval, the goal is to find sound recordings (audio documents) based on their acoustic fea- tures. This content-based approach differs from retrieval approaches that index media files using metadata such as file names and user tags. In this paper, we propose a machine learning approach for retrieving sounds that is novel in that it (1) uses free-form

Gal Chechik; Eugene Ie; Martin Rehn; Samy Bengio; Dick Lyon

2008-01-01

94

A Speech Driven Face Animation System Based on Machine Learning  

Microsoft Academic Search

Lip synchronization is the key issue in speech driven face animation system. In this paper, some clustering and machine learning methods are combined together to estimate face animation parameters from audio sequences and then apply the learning results to MPEG-4 based speech driven face animation system. Based on a large recorded audio-visual database, an unsupervised cluster algorithm is proposed to

CHEN Yi-Qiang; GAO Wen; WANG Zhao-Qi; JIANG Da-Long

2003-01-01

95

A UNIFIED APPROACH TO REAL TIME AUDIO-TO-SCORE AND AUDIO-TO-AUDIO ALIGNMENT USING SEQUENTIAL MONTECARLO INFERENCE TECHNIQUES  

E-print Network

A UNIFIED APPROACH TO REAL TIME AUDIO-TO-SCORE AND AUDIO-TO-AUDIO ALIGNMENT USING SEQUENTIAL discrete. The major contribution of this paper is addressing both problems of audio-to-score and audio-to-audio signal to music events. In this paper, we focus our attention on the alignment of a streaming audio

Paris-Sud XI, Université de

96

Audio Engineering Society Convention Paper  

E-print Network

-bit coded representation of the audio stream with a sample rate of 2.8 MHz, and Sigma Delta modula of the Audio Engineering Society. Description of Limit Cycles in Feedback Sigma Delta Modulators Derk Reefman1 for analysis of limit cycle behavior in feedforward sigma delta modulators (SDMs). However, the dynamics

Reiss, Josh

97

Audio-Visual Speech Recognition  

E-print Network

) Graphical Methods: Manifold Estimation 2) Local Graph Discriminant Features 2) Audio Noise 1) Beam Noise 1) Graphical Methods: Manifold Estimation 2) Local Graph Discriminant Features 2) Audio Noise 1 to transmit data from camera to digital video tape Facial Features: ­ Hair ­ Skin ­ Clothing ­ Obstructions

Hasegawa-Johnson, Mark

98

Phonetic Searching Of Digital Audio  

NSDL National Science Digital Library

A new method of searching an audio file, dubbed the "phonetic search engine," is compared to traditional methods in this white paper. The 10-page pdf document has images and graphs to illustrate the process and results. Topics covered include audio searching techniques, implementation of new search methods, as well as current and future applications of the technology.

Cardillo, Peter S.; Clements, Mark; Miller, Michael

2007-12-10

99

The Lowdown on Audio Downloads  

ERIC Educational Resources Information Center

First offered to public libraries in 2004, downloadable audiobooks have grown by leaps and bounds. According to the Audio Publishers Association, their sales today account for 21% of the spoken-word audio market. It hasn't been easy, however. WMA. DRM. MP3. AAC. File extensions small on letters but very big on consequences for librarians,…

Farrell, Beth

2010-01-01

100

Audio Engineering Society Convention Paper  

E-print Network

and recorded audio mixing where recording a single sound source with more than one signal path is required, while simultaneously using a microphone to record the amplifier. -Using a parallel digital sound effect A method for reducing comb-filtering effects due to delay time differences between audio signals in a sound

Reiss, Josh

101

Engaging Students with Audio Feedback  

ERIC Educational Resources Information Center

Students express widespread dissatisfaction with academic feedback. Teaching staff perceive a frequent lack of student engagement with written feedback, much of which goes uncollected or unread. Published evidence shows that audio feedback is highly acceptable to students but is underused. This paper explores methods to produce and deliver audio

Cann, Alan

2014-01-01

102

ASDF: AUDIO SCENE DESCRIPTION FORMAT  

Microsoft Academic Search

The Audio Scene Description Format (ASDF) is an col- laboratively evolving format for the storage and inter- change of static, dynamic and interactive spatial audio content. This position paper briefly describes the current status and raises a list of open questions which shall be addressed in the panel discussion.

Matthias Geier; Sascha Spors

103

Audio Watermarking with Error Correction  

E-print Network

In recent times, communication through the internet has tremendously facilitated the distribution of multimedia data. Although this is indubitably a boon, one of its repercussions is that it has also given impetus to the notorious issue of online music piracy. Unethical attempts can also be made to deliberately alter such copyrighted data and thus, misuse it. Copyright violation by means of unauthorized distribution, as well as unauthorized tampering of copyrighted audio data is an important technological and research issue. Audio watermarking has been proposed as a solution to tackle this issue. The main purpose of audio watermarking is to protect against possible threats to the audio data and in case of copyright violation or unauthorized tampering, authenticity of such data can be disputed by virtue of audio watermarking.

Chadha, Aman; Goel, Rishabh; Dave, Hiren; Roja, M Mani

2011-01-01

104

Video and audio over the Internet  

E-print Network

Video and audio over the Internet Thierry Turletti INRIA­Rodeo, France #12; 2 . Requirements . Multicasting . The MBone . Protocols for packet audio & video . Audio/video conferencing tools Outline #12; 3 . Multicast service . MBone connectivity . Real Time Protocol . Session Control Protocol . Audio/video

Turletti, Thierry

105

PARAMETRIC AUDIO CODING Bernd Edler, Heiko Purnhagen  

E-print Network

1 PARAMETRIC AUDIO CODING Bernd Edler, Heiko Purnhagen University of Hannover Laboratorium f low bit rate audio coding applications in mobile commu- nications or on the internet, parametric audio on the other hand. Both of these techniques usually represent the audio signal waveform in a way

106

A Tutorial on MPEG\\/Audio Compression  

Microsoft Academic Search

ABSTRACT This tutorial covers the theory behind MPEG\\/audio compression This algorithm was developed by the Motion Picture Experts Group (MPEG), as an International Organization for Standardization (ISO) standard for the high fidelity compression of digital audio The MPEG\\/audio compression standard is one part of a multiple part standard that addresses the compression of video - 2), the compression of audio

Davis Pan

1995-01-01

107

Digital Audio Compression By Davis Yen Pan  

E-print Network

Digital Audio Compression By Davis Yen Pan Abstract Compared to most digital data types, with the exception of digital video, the data rates associ- ated with uncompressed digital audio are substan- tial. Digital audio compression enables more effi- cient storage and transmission of audio data. The many forms

Lazar, Aurel A.

108

Audio Source Separation using Sparse Representations  

E-print Network

Audio Source Separation using Sparse Representations Andrew Nesbit1 , Maria G. Jafari1 , Emmanuel ABSTRACT We address the problem of audio source separation, namely, the recovery of audio signals from related to the windowing methods used in the MPEG audio coding framework. In considering the anechoic

Plumbley, Mark

109

Audio Source Separation usingAudio Source Separation usingAudio Source Separation usingAudio Source Separation using Sparse RepresentationsSparse RepresentationsSparse RepresentationsSparse Representations  

E-print Network

Audio Source Separation usingAudio Source Separation usingAudio Source Separation usingAudio Source the problem of audio source separation, namely, the recovery of audio signals from recordings of mixtures in the MPEG audio coding framework. In considering the anechoic (delays but no echoes) and determined (equal

Paris-Sud XI, Université de

110

Metrological digital audio reconstruction  

DOEpatents

Audio information stored in the undulations of grooves in a medium such as a phonograph record may be reconstructed, with little or no contact, by measuring the groove shape using precision metrology methods coupled with digital image processing and numerical analysis. The effects of damage, wear, and contamination may be compensated, in many cases, through image processing and analysis methods. The speed and data handling capacity of available computing hardware make this approach practical. Two examples used a general purpose optical metrology system to study a 50 year old 78 r.p.m. phonograph record and a commercial confocal scanning probe to study a 1920's celluloid Edison cylinder. Comparisons are presented with stylus playback of the samples and with a digitally re-mastered version of an original magnetic recording. There is also a more extensive implementation of this approach, with dedicated hardware and software.

Fadeyev; Vitaliy (Berkeley, CA), Haber; Carl (Berkeley, CA)

2004-02-19

111

Robot Command Interface Using an Audio-Visual Speech Recognition System  

NASA Astrophysics Data System (ADS)

In recent years audio-visual speech recognition has emerged as an active field of research thanks to advances in pattern recognition, signal processing and machine vision. Its ultimate goal is to allow human-computer communication using voice, taking into account the visual information contained in the audio-visual speech signal. This document presents a command's automatic recognition system using audio-visual information. The system is expected to control the laparoscopic robot da Vinci. The audio signal is treated using the Mel Frequency Cepstral Coefficients parametrization method. Besides, features based on the points that define the mouth's outer contour according to the MPEG-4 standard are used in order to extract the visual speech information.

Ceballos, Alexánder; Gómez, Juan; Prieto, Flavio; Redarce, Tanneguy

112

Audio Streaming over FM band between USRP1 and USRP2 Project Report  

E-print Network

Audio Streaming over FM band between USRP1 and USRP2 Phase II Project Report Kushal Shah Department is streamed over FM band from one host machine to another over a wireless channel using GNU Radio and USRPs, demodulation, etc., of the signal hardware independent. Thus, the radio system then only requires low cost

Yu, Chansu

113

Audio, Video and Audio-Visual Signatures for Short Video Clip Detection: Experiments on Trecvid2003  

E-print Network

Audio, Video and Audio-Visual Signatures for Short Video Clip Detection: Experiments on Trecvid2003.Senechal,Denis.Pellerin}@lis.inpg.fr, Laurent.Besacier@imag.fr ABSTRACT In this paper, we present the association of audio and video signatures for short video clip detection. First, we present an audio signature based on the spectral flatness measure

Paris-Sud XI, Université de

114

A Morphological Analysis of Audio Objects and their Control Methods for 3D Audio  

E-print Network

A Morphological Analysis of Audio Objects and their Control Methods for 3D Audio Justin Mathew technological improvements in audio reproduction systems increased the possibilities to spatialize sources in a listening envi- ronment. The spatialization of reproduced audio is highly depen- dent on the recording

Paris-Sud XI, Université de

115

Audio Conductor The Audio Conductor project aims to give users control  

E-print Network

Audio Conductor The Audio Conductor project aims to give users control of prerecorded music in real and gyroscope data back to a central unit. The central unit consists of an FPGA, which manages the audio and song selection can be recognized. The tempo of the output audio is then modified to match the user

Hero, Alfred O.

116

AUDIO ORACLE: A NEW ALGORITHM FOR FAST LEARNING OF AUDIO STRUCTURES  

E-print Network

AUDIO ORACLE: A NEW ALGORITHM FOR FAST LEARNING OF AUDIO STRUCTURES Shlomo Dubnov G´erard Assayag that we call audio factors. The new structure allows fast retrieval and recombination of sub of audio retrieval, question of partial similarity can be used to ask questions whether certain sound clip

Paris-Sud XI, Université de

117

50 CFR 27.72 - Audio equipment.  

Code of Federal Regulations, 2013 CFR

...OF THE INTERIOR (CONTINUED) THE NATIONAL WILDLIFE REFUGE SYSTEM PROHIBITED ACTS Disturbing Violations: Filming, Photography, and Light and Sound Equipment § 27.72 Audio equipment. The operation or use of audio devices including...

2013-10-01

118

Audio classification from time-frequency texture  

E-print Network

Time-frequency representations of audio signals often resemble texture images. This paper derives a simple audio classification algorithm based on treating sound spectrograms as texture images. The algorithm is inspired ...

Slotine, Jean-Jacques E.

119

Digital Audio Application to Short Wave Broadcasting  

NASA Technical Reports Server (NTRS)

Digital audio is becoming prevalent not only in consumer electornics, but also in different broadcasting media. Terrestrial analog audio broadcasting in the AM and FM bands will be eventually be replaced by digital systems.

Chen, Edward Y.

1997-01-01

120

HMM-Based Audio Keyword Generation  

Microsoft Academic Search

\\u000a With the exponential growth in the production creation of multimedia data, there is an increasing need for video semantic\\u000a analysis. Audio, as a significant part of video, provides important cues to human perception when humans are browsing and\\u000a understanding video contents. To detect semantic content by useful audio information, we introduce audio keywords which are\\u000a sets of specific audio sounds

Min Xu; Ling-yu Duan; Jianfei Cai; Liang-tien Chia; Changsheng Xu; Qi Tian

2004-01-01

121

Audio-visual integration in multimodal communication  

Microsoft Academic Search

We review recent research that examines audio-visual integration in multimodal communication. The topics include bimodality in human speech, human and automated lip reading, facial animation, lip synchronization, joint audio-video coding, and bimodal speaker verification. We also study the enabling technologies for these research topics, including automatic facial-feature tracking and audio-to-visual mapping. Recent progress in audio-visual research shows that joint processing

Tsuhan Chen; Ram R. Rao

1998-01-01

122

Mean Machines.  

ERIC Educational Resources Information Center

Suggests scales as alternative representations of numerical concepts and operations that can be used as arithmetic-mean machines, adding machines, multiplication machines, and geometric-mean machines. (ASK)

Flores, Alfinio

1998-01-01

123

50 CFR 27.72 - Audio equipment.  

Code of Federal Regulations, 2012 CFR

...9 2012-10-01 2012-10-01 false Audio equipment. 27.72 Section 27.72 Wildlife...Violations: Light and Sound Equipment § 27.72 Audio equipment. The operation or use of audio devices including radios, recording and...

2012-10-01

124

Acoustic Lexemes for Organizing Internet Audio  

E-print Network

Acoustic Lexemes for Organizing Internet Audio Michael A. Casey In this article, a method is proposed for automatic fine-scale audio description that draws inspiration from ontological sound automation of audio description at the level of sound objects for indexing and retrieving sound segments

Casey, Michael

125

Audio Morphing for Percussive Hybrid Sound Generation  

E-print Network

Audio Morphing for Percussive Hybrid Sound Generation Andrea Primavera1, Francesco Piazza1 should be addressed to Andrea Primavera (a.primavera@univpm.it) ABSTRACT The aim of audio morphing to obtain more realistic and perceptually relevant sounds. In this paper we present an automatic audio

Reiss, Josh

126

50 CFR 27.72 - Audio equipment.  

...9 2014-10-01 2014-10-01 false Audio equipment. 27.72 Section 27.72 Wildlife...and Light and Sound Equipment § 27.72 Audio equipment. The operation or use of audio devices including radios, recording and...

2014-10-01

127

A robust audio classification and segmentation method  

Microsoft Academic Search

In this paper, we present a robust algorithm for audio classification that is capable of segmenting and classifying an audio stream into speech, music, environment sound and silence. Audio classification is processed in two steps, which makes it suitable for different applications. The first step of the classification is speech and non-speech discrimination. In this step, a novel algorithm based

Lie Lu; Hao Jiang; HongJiang Zhang

2001-01-01

128

AUDIO-VISUAL AND MAINTENANCE SERVICES  

E-print Network

AUDIO-VISUAL AND MAINTENANCE SERVICES University Policy No: AD2510 Classification: Administration for the Use of Audio Visual Services PURPOSE 1.00 The purpose of this policy is to set out provisions for Audio Visual: services, equipment rental and maintenance on campus. DEFINITIONS For the purposes

Victoria, University of

129

36 CFR 2.12 - Audio disturbances.  

Code of Federal Regulations, 2012 CFR

...2012-07-01 2012-07-01 false Audio disturbances. 2.12 Section 2.12...PUBLIC USE AND RECREATION § 2.12 Audio disturbances. (a) The following are...motor vehicle, motorized toy, or an audio device, such as a radio,...

2012-07-01

130

50 CFR 27.72 - Audio equipment.  

Code of Federal Regulations, 2011 CFR

...8 2011-10-01 2011-10-01 false Audio equipment. 27.72 Section 27.72 Wildlife...Violations: Light and Sound Equipment § 27.72 Audio equipment. The operation or use of audio devices including radios, recording and...

2011-10-01

131

Robust audio watermarking in the time domain  

Microsoft Academic Search

The audio watermarking method proposed in this paper offers copyright protection to an audio signal by time domain processing. The strength of audio signal modifications is limited by the necessity to produce an output signal that is perceptually similar to the original one. The watermarking method presented here does not require the use of the original signal for watermark detection.

Paraskevi Bassia; Ioannis Pitas; Nikos Nikolaidis

2001-01-01

132

Robust Audio Hashing for Content Identification  

Microsoft Academic Search

Nowadays most audio content identification systems are based on watermarking technology. In this paper we present a different technology, referred to as robust audio hashing. By extracting robust features and translating them into a bit string, we get an object called a robust hash. Content can then be identified by comparing hash values of a received audio clip with the

Jaap Haitsma; Ton Kalker; Job Oostveen

2001-01-01

133

Joint watermarking of audio-visual data  

Microsoft Academic Search

Both audio and video watermarking enable copyright protection with owner or customer authentication and the detection of media manipulations. The available watermarking technology concentrates on single media like audio or video. But the typical multimedia stream consists of both video and audio data. Our goal is to provide a solution with robust and fragile aspects to guarantee authentication and integrity

Jana Dittmann; Martin Steinebach

2001-01-01

134

FLAC: Free Lossless Audio Codec  

NSDL National Science Digital Library

The Free Lossless Audio Codec (FLAC) is a method of compressing audio data. Whereas the widely known MP3 format sacrifices sound quality for a smaller compressed size, FLAC does not lose information in the encoding process. FLAC is an open source software project, and its homepage contains downloadable encoding utilities for most common operating systems. An excellent overview of the FLAC format is given, which explains the underlying architecture and features of the codec. This section also provides links to background information and research papers upon which the development of FLAC was based. Thorough documentation is included online.

135

Cluster: Metals. Course: Machine Shop. Research Project.  

ERIC Educational Resources Information Center

The set of 13 units is designed for use with an instructor in actual machine shop practice and is also keyed to audio visual and textual materials. Each unit contains a series of task packages which: specify prerequisites within the series (minimum is Unit 1); provide a narrative rationale for learning; list both general and specific objectives in…

Sanford - Lee County Schools, NC.

136

Quantitative characterisation of audio data by ordinal symbolic dynamics  

NASA Astrophysics Data System (ADS)

Ordinal symbolic dynamics has developed into a valuable method to describe complex systems. Recently, using the concept of transcripts, the coupling behaviour of systems was assessed, combining the properties of the symmetric group with information theoretic ideas. In this contribution, methods from the field of ordinal symbolic dynamics are applied to the characterisation of audio data. Coupling complexity between frequency bands of solo violin music, as a fingerprint of the instrument, is used for classification purposes within a support vector machine scheme. Our results suggest that coupling complexity is able to capture essential characteristics, sufficient to distinguish among different violins.

Aschenbrenner, T.; Monetti, R.; Amigó, J. M.; Bunk, W.

2013-06-01

137

Audio/ Videoconferencing Packages: High Cost  

ERIC Educational Resources Information Center

This report compares two integrated course delivery packages: "Centra 6" and "WebEx". Both applications feature asynchronous and synchronous audio communications for online education and training. They are relatively costly products, and provide useful comparisons with the two less expensive products to be evaluated in the following report #53.…

Murillo, Sonia; Rizzuto, Mary; Sawyers, Urel

2005-01-01

138

Audio/ Videoconferencing Packages: Low Cost  

ERIC Educational Resources Information Center

A comparison was conducted of "Voxwire MeetingRoom" and "iVocalize" v4.1.0.3, both Web-conferencing products using voice-over-Internet protocol (VoIP) to provide unlimited, inexpensive, international audio communication, and high-quality Web-conferencing fostering collaborative learning. The study used the evaluation criteria used in earlier…

Treblay, Remy; Fyvie, Barb; Koritko, Brenda

2005-01-01

139

Scan for Author Audio Interview  

E-print Network

and communications technology (ICT), such as a smartphone that communicates with a computer at the clinic, the CHWVIEWPOINT Scan for Author Audio Interview Primary Health Care in Low-Income Countries Building on Recent Achievements Jeffrey D. Sachs, PhD S MALL INVESTMENTS IN IMPROVED HEALTH OF THE POOR have

140

Acoustic Segmentation for Audio Browsers  

Microsoft Academic Search

Online digital audio is a rapidly growing resource, whichcan be accessed in rich new ways not previously possible.For example, it is possible to listen to just thoseportions of a long discussion whichinvolve a given subsetof people, or to instantly skip ahead to the next speaker.Providing this capability to users, however, requires generationof necessary indices, as well as an interface whichutilizes

Don Kimber; Lynn Wilcox

1996-01-01

141

Audio Engineering Society Convention Paper  

E-print Network

of Sound Recording, University of Surrey 2 Centre for Vision Speech and Signal Processing, University of the Audio Engineering Society. Objective Assessment of Spatial Localisation Attributes of Surround-Sound the listening area of various sound reproduction systems: mono, two-channel stereo (TCS), 3/2 stereo (i.e., 5

Jackson, Philip JB

142

Audio Engineering Society Convention Paper  

E-print Network

and seasonal changes in natural sound environments [1, 2]. Although digital storage of such recordings is now recording, the more common situation in recordings of natural sound environments is #12;Maher Extreme Time The increasing availability of audio recording devices capable of capturing hours, days, or weeks of continuous

Maher, Robert C.

143

Filtered Audio Demo Max Kamenetsky  

E-print Network

Filtered Audio Demo Max Kamenetsky In this demo you'll listen to a 10 second segment of music response, impulse and step responses, and snapshots of the input and output signals. First order lowpass; characteristic, would sound much more mu#15;ed.) The impulse response shows that this #12;lter smooths out

144

Ubiquitous audio: capturing spontaneous collaboration  

Microsoft Academic Search

Although talkhtg is an integral part of collaborative activ- ity, there has been little computer support for acquiring and accessing the contents of conversations. Our approach has focused on ubiquitous audio, or the unobtrusive capture of voice interactions in everyday work environments. Because the words themselves are not available for organizing the captured interactions, structure is derived from acoustical information

Debby Hindus; Chris Schmandt

1992-01-01

145

Authenticity examination of compressed audio recordings using detection of multiple compression and encoders' identification.  

PubMed

Since the appearance of digital audio recordings, audio authentication has been becoming increasingly difficult. The currently available technologies and free editing software allow a forger to cut or paste any single word without audible artifacts. Nowadays, the only method referring to digital audio files commonly approved by forensic experts is the ENF criterion. It consists in fluctuation analysis of the mains frequency induced in electronic circuits of recording devices. Therefore, its effectiveness is strictly dependent on the presence of mains signal in the recording, which is a rare occurrence. Recently, much attention has been paid to authenticity analysis of compressed multimedia files and several solutions were proposed for detection of double compression in both digital video and digital audio. This paper addresses the problem of tampering detection in compressed audio files and discusses new methods that can be used for authenticity analysis of digital recordings. Presented approaches consist in evaluation of statistical features extracted from the MDCT coefficients as well as other parameters that may be obtained from compressed audio files. Calculated feature vectors are used for training selected machine learning algorithms. The detection of multiple compression covers up tampering activities as well as identification of traces of montage in digital audio recordings. To enhance the methods' robustness an encoder identification algorithm was developed and applied based on analysis of inherent parameters of compression. The effectiveness of tampering detection algorithms is tested on a predefined large music database consisting of nearly one million of compressed audio files. The influence of compression algorithms' parameters on the classification performance is discussed, based on the results of the current study. PMID:24637036

Korycki, Rafal

2014-05-01

146

Automatic Audio Segmentation using a Measure of Audio Novelty  

Microsoft Academic Search

The paper describes methods for automatically locating points of significant change in music or audio, by analyzing local self-similarity. This method can find individual note boundaries or even natural segment boundaries such as verse\\/chorus or speech\\/music transitions, even in the absence of cues such as silence. This approach uses the signal to model itself, and thus does not rely on

Jonathan Foote; A. Distance Matrix Embedding

2000-01-01

147

A new audio device driver abstraction Dickon Reed  

E-print Network

A new audio device driver abstraction Dickon Reed Computer Laboratory, University of Cambridge­ ence ``Networks and Operating Systems Support for distributed audio and video'', 1998 Traditional audio device drivers are unable to pro­ vide low latency audio services and often do not pro­ vide audio mixing

Hand, Steven

148

Audio Branding meets Ambush Marketing  

Microsoft Academic Search

Ambush Marketing ist die Vorgehensweise von Unternehmen, dem Publikum eines Sport-Events durch Marketingmaßnahmen eine autorisierte Verbindung zu diesem Event zu signalisieren, obwohl die betreffenden Unternehmen keine Vermarktungsrechte an dieser Veranstaltung besitzen. Ambushed Audio Branding kennzeichnet dabei alle auditiven Ambush-Marketing-Maßnahmen im Rahmen von Sport-Events. Die auditiven Botschaften können dabei elektroakustisch, verbal oder instrumental-akustisch in Szene gesetzt werden. An der ESB Business

Gerd Nufer; Rainer Hirt

2011-01-01

149

A Secure Audio Teleconference System  

Microsoft Academic Search

Users of large communication networks often require a multi-party teleconferencing facility. The most common technique for\\u000a providing secure audio teleconferencing requires the speech of each participant to be returned to clear form in a bridge circuit\\u000a where it is combined with the speech of the other participants. The combined signal is then re-encrypted for distribution\\u000a to the conferees. This introduces

D. G. Steer; L. Strawczynski; Whitfield Diffie; Michael J. Wiener

1988-01-01

150

A Smart Audio User Interface to Access Large Digital Audio Archives on Handheld Devices  

NASA Astrophysics Data System (ADS)

Increasing availability of handheld players and other multimedia mobile devices with larger storage capacity has led to a rapid increases in the amount of audio information on a single device. This calls for new ways to efficiently access audio and multimedia information stored on such deeply embedded devices. Handsfree audio interfaces and semantic information retrieval are two primary requirements for the next generation of handheld multimedia devices. Voice and audio input can be used to build an audio-user interface, as well as to query large audio archives stored on these devices. Soft computing methods, such as fuzzy logic, were used to model a smart logic audio interface that reduces system complexity and increases reliability when accessing audio stored information.

Malcangi, Mario

2008-11-01

151

Aeronautical audio broadcasting via satellite  

NASA Technical Reports Server (NTRS)

A system design for aeronautical audio broadcasting, with C-band uplink and L-band downlink, via Inmarsat space segments is presented. Near-transparent-quality compression of 5-kHz bandwidth audio at 20.5 kbit/s is achieved based on a hybrid technique employing linear predictive modeling and transform-domain residual quantization. Concatenated Reed-Solomon/convolutional codes with quadrature phase shift keying are selected for bandwidth and power efficiency. RF bandwidth at 25 kHz per channel, and a decoded bit error rate at 10(exp -6) with E(sub b)/N(sub o) at 3.75 dB are obtained. An interleaver, scrambler, modem synchronization, and frame format were designed, and frequency-division multiple access was selected over code-division multiple access. A link budget computation based on a worst-case scenario indicates sufficient system power margins. Transponder occupancy analysis for 72 audio channels demonstrates ample remaining capacity to accommodate emerging aeronautical services.

Tzeng, Forrest F.

1993-01-01

152

Collusion-Resistant Audio Fingerprinting System in the Modulated Complex Lapped Transform Domain  

PubMed Central

Collusion-resistant fingerprinting paradigm seems to be a practical solution to the piracy problem as it allows media owners to detect any unauthorized copy and trace it back to the dishonest users. Despite the billionaire losses in the music industry, most of the collusion-resistant fingerprinting systems are devoted to digital images and very few to audio signals. In this paper, state-of-the-art collusion-resistant fingerprinting ideas are extended to audio signals and the corresponding parameters and operation conditions are proposed. Moreover, in order to carry out fingerprint detection using just a fraction of the pirate audio clip, block-based embedding and its corresponding detector is proposed. Extensive simulations show the robustness of the proposed system against average collusion attack. Moreover, by using an efficient Fast Fourier Transform core and standard computer machines it is shown that the proposed system is suitable for real-world scenarios. PMID:23762455

Garcia-Hernandez, Jose Juan; Feregrino-Uribe, Claudia; Cumplido, Rene

2013-01-01

153

Collusion-resistant audio fingerprinting system in the modulated complex lapped transform domain.  

PubMed

Collusion-resistant fingerprinting paradigm seems to be a practical solution to the piracy problem as it allows media owners to detect any unauthorized copy and trace it back to the dishonest users. Despite the billionaire losses in the music industry, most of the collusion-resistant fingerprinting systems are devoted to digital images and very few to audio signals. In this paper, state-of-the-art collusion-resistant fingerprinting ideas are extended to audio signals and the corresponding parameters and operation conditions are proposed. Moreover, in order to carry out fingerprint detection using just a fraction of the pirate audio clip, block-based embedding and its corresponding detector is proposed. Extensive simulations show the robustness of the proposed system against average collusion attack. Moreover, by using an efficient Fast Fourier Transform core and standard computer machines it is shown that the proposed system is suitable for real-world scenarios. PMID:23762455

Garcia-Hernandez, Jose Juan; Feregrino-Uribe, Claudia; Cumplido, Rene

2013-01-01

154

Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization)  

Microsoft Academic Search

In this paper, we introduce a non-linear enhancement tech- nique called Audio-Visual Codebook Dependent Cepstral Normalization (AVCDCN) and we consider its use with both audio-only and audio-visual speech recognition. AVCDCN is inspired from CDCN (1) (2), an audio-only en- hancement technique that approximates the non-linear ef- fect of noise on speech with a piece-wise constant func- tion. Our experiments show

Sabine Deligne; Gerasimos Potamianos; Chalapathy Neti

2002-01-01

155

Digital audio authentication by robust feature embedding  

NASA Astrophysics Data System (ADS)

We introduce an approach for verifying the integrity of digital audio recording by means of content-based integrity watermarking. Here an audio fingerprint is extracted from the Fourier domain and embedded as a digital watermark in the same domain. The design of the feature extraction allows a fine temporal resolution of the verification of the integrity. Experimental results show a good distinction between authentic and tampered audio content.

Zmudzinski, Sascha; Munir, Badar; Steinebach, Martin

2012-03-01

156

Jet Audio 5.14 Basic  

NSDL National Science Digital Library

For persons looking for a handy and powerful way to play numerous types of audio or video files, Jet Audio 5.14 Basic will be a welcome addition to their computer. With this latest version, users can broadcast over the internet, utilize the built-in equalizer, and control the speed of recordings, along with sixteen other features. Additionally, users can manipulate the appearance (or "skin") of Jet Audio, or create their own skin as well. Jet Audio 5.14 Basic is compatible with all systems running Windows 98 and higher.

157

Three-Dimensional Audio Client Library  

NASA Technical Reports Server (NTRS)

The Three-Dimensional Audio Client Library (3DAudio library) is a group of software routines written to facilitate development of both stand-alone (audio only) and immersive virtual-reality application programs that utilize three-dimensional audio displays. The library is intended to enable the development of three-dimensional audio client application programs by use of a code base common to multiple audio server computers. The 3DAudio library calls vendor-specific audio client libraries and currently supports the AuSIM Gold-Server and Lake Huron audio servers. 3DAudio library routines contain common functions for (1) initiation and termination of a client/audio server session, (2) configuration-file input, (3) positioning functions, (4) coordinate transformations, (5) audio transport functions, (6) rendering functions, (7) debugging functions, and (8) event-list-sequencing functions. The 3DAudio software is written in the C++ programming language and currently operates under the Linux, IRIX, and Windows operating systems.

Rizzi, Stephen A.

2005-01-01

158

47 CFR 73.403 - Digital audio broadcasting service requirements.  

Code of Federal Regulations, 2013 CFR

...2013-10-01 2013-10-01 false Digital audio broadcasting service requirements. 73.403...SERVICES RADIO BROADCAST SERVICES Digital Audio Broadcasting § 73.403 Digital audio broadcasting service requirements....

2013-10-01

159

47 CFR 73.403 - Digital audio broadcasting service requirements.  

Code of Federal Regulations, 2010 CFR

...2010-10-01 2010-10-01 false Digital audio broadcasting service requirements. 73.403...SERVICES RADIO BROADCAST SERVICES Digital Audio Broadcasting § 73.403 Digital audio broadcasting service requirements....

2010-10-01

160

Multichannel Audio Coding for Multimedia Services in Intelligent Environments  

E-print Network

5 Multichannel Audio Coding for Multimedia Services in Intelligent Environments Athanasios Summary. Audio is an integral component of multimedia services in intelligent environments. Use of multiple channels in audio capturing and rendering offers the advantage of recreating arbitrary acoustic

Tsakalides, Panagiotis

161

47 CFR 73.403 - Digital audio broadcasting service requirements.  

...2014-10-01 2014-10-01 false Digital audio broadcasting service requirements. 73.403...SERVICES RADIO BROADCAST SERVICES Digital Audio Broadcasting § 73.403 Digital audio broadcasting service requirements....

2014-10-01

162

47 CFR 73.403 - Digital audio broadcasting service requirements.  

Code of Federal Regulations, 2011 CFR

...2011-10-01 2011-10-01 false Digital audio broadcasting service requirements. 73.403...SERVICES RADIO BROADCAST SERVICES Digital Audio Broadcasting § 73.403 Digital audio broadcasting service requirements....

2011-10-01

163

47 CFR 73.403 - Digital audio broadcasting service requirements.  

Code of Federal Regulations, 2012 CFR

...2012-10-01 2012-10-01 false Digital audio broadcasting service requirements. 73.403...SERVICES RADIO BROADCAST SERVICES Digital Audio Broadcasting § 73.403 Digital audio broadcasting service requirements....

2012-10-01

164

Machine Shop Grinding Machines.  

ERIC Educational Resources Information Center

This curriculum manual is one in a series of machine shop curriculum manuals intended for use in full-time secondary and postsecondary classes, as well as part-time adult classes. The curriculum can also be adapted to open-entry, open-exit programs. Its purpose is to equip students with basic knowledge and skills that will enable them to enter the…

Dunn, James

165

A GENERIC AUDIO CLASSIFICATION AND SEGMENTATION APPROACH FOR MULTIMEDIA INDEXING AND RETRIEVAL  

E-print Network

A GENERIC AUDIO CLASSIFICATION AND SEGMENTATION APPROACH FOR MULTIMEDIA INDEXING AND RETRIEVAL the attention on the area of generic and automatic audio classification and segmentation for audio audio classification and global segmentation framework based on automatic audio analysis providing

Gabbouj, Moncef

166

Digital Audio: A Sound Design Element.  

ERIC Educational Resources Information Center

Discussion of incorporating audio into videodiscs for multimedia educational applications highlights a project developed for the Navy that used digital audio in an interactive video delivery system (IVDS) for training sonar operators. Storage constraints with videodiscs are explained, design requirements for the IVDS are described, and production…

Barron, Ann; Varnadoe, Susan

1992-01-01

167

Audio Retrieval Using Perceptually Based Structures  

Microsoft Academic Search

Despite growing interest in multimedia data management, audio retrieval has received little attention. In part, this can be attributed to existing unstructured audio representations that do not easily lend themselves to content based retrieval and especially browsing. This paper aims to address this oversight. It begins by reviewing existing techniques and the specific problems posed by unstructured representations. Some characteristics

Kathy Melih; Ruben Gonzalez

1998-01-01

168

Enhancing Manual Scan Registration Using Audio Cues  

NASA Astrophysics Data System (ADS)

Indoor mapping and modelling requires that acquired data be processed by editing, fusing, formatting the data, amongst other operations. Currently the manual interaction the user has with the point cloud (data) while processing it is visual. Visual interaction does have limitations, however. One way of dealing with these limitations is to augment audio in point cloud processing. Audio augmentation entails associating points of interest in the point cloud with audio objects. In coarse scan registration, reverberation, intensity and frequency audio cues were exploited to help the user estimate depth and occupancy of space of points of interest. Depth estimations were made reliably well when intensity and frequency were both used as depth cues. Coarse changes of depth could be estimated in this manner. The depth between surfaces can therefore be estimated with the aid of the audio objects. Sound reflections of an audio object provided reliable information of the object surroundings in some instances. For a point/area of interest in the point cloud, these reflections can be used to determine the unseen events around that point/area of interest. Other processing techniques could benefit from this while other information is estimated using other audio cues like binaural cues and Head Related Transfer Functions. These other cues could be used in position estimations of audio objects to aid in problems such as indoor navigation problems.

Ntsoko, T.; Sithole, G.

2014-04-01

169

Digital Audio Sampling for Film and Video.  

ERIC Educational Resources Information Center

Digital audio sampling is explained, and some of its implications in digital sound applications are discussed. Digital sound equipment is rapidly replacing analog recording devices as the state-of-the-art in audio technology. The philosophy of digital recording involves doing away with the continuously variable analog waveforms and turning the…

Stanton, Michael J.

170

Improving Audio Quality in Distance Learning Applications.  

ERIC Educational Resources Information Center

This paper discusses common causes of problems encountered with audio systems in distance learning networks and offers practical suggestions for correcting the problems. Problems and discussions are divided into nine categories: (1) acoustics, including reverberant classrooms leading to distorted or garbled voices, as well as one-dimensional audio

Richardson, Craig H.

171

Features for audio and music classification  

Microsoft Academic Search

Four audio feature sets are evaluated in their ability to classify five general audio classes and seven pop- ular music genres. The feature sets include low-level signal properties, mel-frequency spectral coefficients, and two new sets based on perceptual models of hear- ing. The temporal behavior of the features is ana- lyzed and parameterized and these parameters are in- cluded as

Martin F. Mckinney; Jeroen Breebaart

2003-01-01

172

Let's Hear It for Audio Mining  

NSDL National Science Digital Library

A detailed introduction is offered on the Leavitt Communications Web site. This article delves into how audio mining works by giving a basic technical understanding of the process. Approaches to audio mining are discussed, as well as how the technology works, performance, languages, and the challenges faced by designers.

Leavitt, Neal

2007-12-11

173

Inner structure computation for audio signal analysis  

Microsoft Academic Search

We present in this paper an audio signal classification method based on Zipf and inverse Zipf laws. These laws are powerful analysis tools allowing the extraction of information not available by the way of standard methods. The adaptation of Zipf and Inverse Zipf laws to audio signals requires a coding of these signals into literary texts, considered as sequences of

E. Dellandrea; P. Makris; N. Vincent

2003-01-01

174

Robust Audio Watermarking in the Time Domain  

Microsoft Academic Search

The audio watermarking method presented below offers copyright protection to an audio signal by modifying its temporal characteristics. The amount of modification embedded is limited by the necessity that the output signal must not be perceptually different from the original one. The watermarking method presented here does not require the original signal for watermark detection. The watermark key is simply

P. Bassia; I. Pitas

1999-01-01

175

Unsupervised clustering of ambulatory audio and video  

Microsoft Academic Search

A truly personal and reactive computer system should have access to the same information as its user, including the ambient sights and sounds. To this end, we have developed a system for extracting events and scenes from natural audio\\/visual input. We find our system can (without any prior labeling of data) cluster the audio\\/visual data into events, such as passing

Brian Clarkson; Alex Pentland

1999-01-01

176

Digital Audio Radio Field Tests  

NASA Technical Reports Server (NTRS)

Radio history continues to be made at the NASA Lewis Research Center with the beginning of phase two of Digital Audio Radio testing conducted by the Consumer Electronic Manufacturers Association (a sector of the Electronic Industries Association and the National Radio Systems Committee) and cosponsored by the Electronic Industries Association and the National Association of Broadcasters. The bulk of the field testing of the four systems should be complete by the end of October 1996, with results available soon thereafter. Lewis hosted phase one of the testing process, which included laboratory testing of seven proposed digital audio radio systems and modes (see the following table). Two of the proposed systems operate in two modes, thus making a total of nine systems for testing. These nine systems are divided into the following types of transmission: in-band on channel (IBOC), in-band adjacent channel (IBAC), and new bands - the L-band (1452 to 1492 MHz) and the S-band (2310 to 2360 MHz).

Hollansworth, James E.

1997-01-01

177

Supported eText: Effects of Text-to-Speech on Access and Achievement for High School Students with Disabilities  

ERIC Educational Resources Information Center

Students with disabilities often lack the skills required to access the general education curriculum and achieve success in school and postschool environments. Evidence suggests that using assistive technologies such as digital texts and translational supports enhances outcomes for these students (Anderson-Inman & Horney, 2007). The purpose of the…

Izzo, Margo Vreeburg; Yurick, Amanda; McArrell, Bianca

2009-01-01

178

AUDIO-VIDEO EVENT RECOGNITION SYSTEM FOR PUBLIC TRANSPORT SECURITY  

E-print Network

AUDIO-VIDEO EVENT RECOGNITION SYSTEM FOR PUBLIC TRANSPORT SECURITY Van-Thinh Vu Quoc-Cuong Pham Keywords: audio-video surveillance, audio-video event, behavior analysis, event recognition. Abstract This paper presents an audio-video surveillance system for the automatic surveillance in public transport

Paris-Sud XI, Université de

179

A ROBUST AUDIO FINGERPRINT EXTRACTION ALGORITHM Jerome Lebosse  

E-print Network

A ROBUST AUDIO FINGERPRINT EXTRACTION ALGORITHM Jâ??erâ??ome Lebossâ??e France Tâ??elâ??ecom R&D 32 rue des coutures 14000 Caen,France jeanclaude.pailles@orange­ft.com ABSTRACT An Audio fingerprint is a small digest of an audio file computed from its main perceptual properties. Like human fingerprints, Audio fingerprints

Brun, Chez Luc

180

AME CURRICULUM GUIDE FALL 2014 AUDIO AND MUSIC ENGINEERING  

E-print Network

choose to earn a Master of Science degree in Electrical Engineering with as little as one additional year manufacturing, audio software development, music and audio recording and production, core audio and signal processing technologies and component manufacturing, musical instruments and audio equipment manufacturing

181

Audio Segment Retrieval Using a Synthesized HMM Atulya Velivelli  

E-print Network

Audio Segment Retrieval Using a Synthesized HMM Atulya Velivelli Beckman Institute for Advanced@ifp.uiuc.edu ABSTRACT In this paper, we propose a general approach to audio segment re­ trieval using synthesized HMMs. The approach allows a user to query audio data of any length by one or more example audio seg­ ments and find

Zhai, ChengXiang

182

AUDIO COMPRESSION EXPLOITING REPETITION (ACER): CHALLENGES AND SOLUTIONS  

E-print Network

AUDIO COMPRESSION EXPLOITING REPETITION (ACER): CHALLENGES AND SOLUTIONS Stuart Cunningham and Vic of the development of an innovative audio compression system, designed to exploit repetition sequences in audio, and particularly, music. The paper briefly introduces and describes how musical content and structure within audio

Davies, John N.

183

THE AUDIO DEGRADATION TOOLBOX AND ITS APPLICATION TO ROBUSTNESS EVALUATION  

E-print Network

THE AUDIO DEGRADATION TOOLBOX AND ITS APPLICATION TO ROBUSTNESS EVALUATION Matthias Mauch Sebastian.ewert}@eecs.qmul.ac.uk ABSTRACT We introduce the Audio Degradation Toolbox (ADT) for the controlled degradation of audio signals, and propose its usage as a means of evaluating and comparing the ro- bustness of audio processing algorithms

Mauch, Matthias

184

Working with audio: integrating personal tape recorders and desktop computers  

Microsoft Academic Search

Audio data is rarely used on desktop computers today, although audio is otherwise widely used for communication tasks. This paper describes early work aimed at creating computer tools that support the ways users may want to work with audio data. User needs for the system were determined by intervieweing people already working with audio data, using existing devices such as

Leo Degen; Richard Mander; Gitta Salomon

1992-01-01

185

Audio Segment Retrieval Using a Synthesized HMM Atulya Velivelli  

E-print Network

Audio Segment Retrieval Using a Synthesized HMM Atulya Velivelli Beckman Institute for Advanced@ifp.uiuc.edu ABSTRACT In this paper, we propose a general approach to audio segment re- trieval using synthesized HMMs. The approach allows a user to query audio data of any length by one or more example audio seg- ments and find

Zhai, ChengXiang

186

Audio Imputation Using the Non-negative Hidden Markov Model  

E-print Network

Audio Imputation Using the Non-negative Hidden Markov Model Jinyu Han1, , Gautham J. Mysore2. Abstract. Missing data in corrupted audio recordings poses a challeng- ing problem for audio signal-frequency domain of audio signals. The proposed approach, based on the Non- negative Hidden Markov Model, enables

Pardo, Bryan

187

Reverse Audio Engineering: Model-Based Inversion of Dynamic Range  

E-print Network

1 Reverse Audio Engineering: Model-Based Inversion of Dynamic Range Compression Stanislaw Gorlow, Graduate Student Member, IEEE and Joshua D. Reiss, Member, IEEE Abstract--Reverse audio engineering so far, reverse audio engineering. I. INTRODUCTION SOUND or audio engineering is an established discipline

188

High-Fidelity Piezoelectric Audio Device  

NASA Technical Reports Server (NTRS)

ModalMax is a very innovative means of harnessing the vibration of a piezoelectric actuator to produce an energy efficient low-profile device with high-bandwidth high-fidelity audio response. The piezoelectric audio device outperforms many commercially available speakers made using speaker cones. The piezoelectric device weighs substantially less (4 g) than the speaker cones which use magnets (10 g). ModalMax devices have extreme fabrication simplicity. The entire audio device is fabricated by lamination. The simplicity of the design lends itself to lower cost. The piezoelectric audio device can be used without its acoustic chambers and thereby resulting in a very low thickness of 0.023 in. (0.58 mm). The piezoelectric audio device can be completely encapsulated, which makes it very attractive for use in wet environments. Encapsulation does not significantly alter the audio response. Its small size (see Figure 1) is applicable to many consumer electronic products, such as pagers, portable radios, headphones, laptop computers, computer monitors, toys, and electronic games. The audio device can also be used in automobile or aircraft sound systems.

Woodward, Stanley E.; Fox, Robert L.; Bryant, Robert G.

2003-01-01

189

Interference from audio distracters during speechreading  

NASA Astrophysics Data System (ADS)

Although many audio-visual speech experiments have focused on situations where the presence of an incongruent visual speech signal influences the perceived utterance heard by an observer, there are also documented examples of a related effect in which the presence of an incongruent audio speech signal influences the perceived utterance seen by an observer. This study examined the effects that different distracting audio signals had on performance in a color and number keyword speechreading task. When the distracting sound was noise, time-reversed speech, or continuous speech, it had no effect on speechreading. However, when the distracting audio signal consisted of speech that started at the same time as the visual stimulus, speechreading performance was substantially degraded. This degradation did not depend on the semantic similarity between the target and masker speech, but it was substantially reduced when the onset of the audio speech was shifted relative to that of the visual stimulus. Overall, these results suggest that visual speech perception is impaired by the presence of a simultaneous mismatched audio speech signal, but that other types of audio distracters have little effect on speechreading performance.

Brungart, Douglas S.; Simpson, Brian D.

2005-12-01

190

The HDTV digital audio matrix  

NASA Astrophysics Data System (ADS)

Multichannel sound systems are being studied as part of the Eureka 95 and Radio-communication Bureau TG10-1 investigations into high definition television. One emerging sound system has five channels; three at the front and two at the back. This raises some compatibility issues. The listener might have only, say, two loudspeakers or the material to be broadcast may have fewer than five channels. The problem is how best to produce a set of signals to be broadcast, which is suitable for all listeners, from those that are available. To investigate this area, a device has been designed and built which has six input channels and six output channels. Each output signal is a linear combination of the input signals. The inputs and outputs are in AES/EBU digital audio format using BBC-designed AESIC chips. The matrix operation, to produce the six outputs from the six inputs, is performed by a Motorola DSP56001. The user interface and 'housekeeping' is managed by a T222 transputer. The operator of the matrix uses a VDU to enter sets of coefficients and a rotary switch to select which set to use. A set of analog controls is also available and is used to control operations other than the simple compatibility matrixing. The matrix has been very useful for simple tasks: mixing a stereo signal into mono, creating a stereo signal from a mono signal, applying a fixed gain or attenuation to a signal, exchanging the A and B channels of an AES/EBU bitstream, and so on. These are readily achieved using simple sets of coefficients. Additions to the user interface software have led to several more sophisticated applications which still consist of a matrix operation. Different multichannel panning laws have been evaluated. The analog controls adjust the panning; the audio signals are processed digitally using a matrix operation. A digital SoundField microphone decoder has also been implemented. audio matrix is such that it can be applied to a wide variety of signal processing tasks. -The combination of a dedicated DSP chip programmed in assembly language for speed of operation and a general purpose processor for user interface tasks programmed in a high level language has been found to be extremely useful.

Mason, A. J.

191

Kid Machine  

NSDL National Science Digital Library

This activity is on page 3 (continued on page 2) of the pdf, part of the Simple Machines Discovery Box. In this fun activity, learners "create" a complex machine by simulating the parts in action. Learners move their bodies and make sounds as if they are individual parts of a moving machine. Then learners discover what happens when part of a machine is broken and problem solve ways to fix it.

2012-06-26

192

Simple Machines  

NSDL National Science Digital Library

This is an online activity about simple machines. Learners will try their hand at putting these amazing devices to work. They will use several simple machines to help "build" a tree house. This is an excellent activity to demonstrate how science - in particular, simple machines - are at work in our everyday lives.

Cosi

2000-01-01

193

Electrostatic Machines  

NSDL National Science Digital Library

This website from Antonio Carlos M. De Queiroz, an associate professor at the Federal University of Rio de Janeiro, illustrates a number of different electrostatic machines. The site includes details and images of machines built by the professor as well as many other historical machines of this type. Some information is also available in Portugese.

De Queiroz, Antonio C.

2011-07-13

194

Simple Machines  

NSDL National Science Digital Library

This is a lesson about simple machines and how they relate to robots. Learners will gain an understanding of simple machines and how they may be used in our everyday lives. Students will also have an opportunity to design a Rube Goldberg Machine of their own. This is lesson 10 of 16 in the MarsBots learning module.

195

History Channel: Audio and Video  

NSDL National Science Digital Library

ItâÂÂs perhaps a bit of a stretch of the imagination to think of a place that would include both a clip of Spiro Agnew speaking out on what he perceived to be the biases of television news coverage and some archival footage of Depression-era gangsters, but itâÂÂs all right here on the History ChannelâÂÂs Audio and Video online archive. The speech archive is quite nice, and may prove to be both edifying and entertaining. Visitors can browse the speech archive by topics (such as War & Diplomacy) or alphabetically. Some of the clips offered here include comments by the scientist Wernher von Braun after hearing that the U.S.S.R had landed a spacecraft on the moon. The video clip section is also quite well-developed, as it contains clips of the trial of Adolf Eichmann and the breaking of the sound barrier.

196

Heiko Purnhagen An Overview of MPEG4 Audio Version 2 AN OVERVIEW OF MPEG4 AUDIO VERSION 2  

E-print Network

fine granularity, i.e. embedded coding with AES 17 th International Conference on High Quality Audio ranging from 2 to 64 kbit/s and above. While the first version of MPEG­4 Audio was finalised in 1998, work­Delay Audio Coding, Small Step Scalability, Parametric Audio Coding, and Environmental Spatialisation

197

IEEE TRANS. ON AUDIO, SPEECH AND LANGUAGE PROC., 2010 1 On the Information Geometry of Audio Streams  

E-print Network

IEEE TRANS. ON AUDIO, SPEECH AND LANGUAGE PROC., 2010 1 On the Information Geometry of Audio Abstract--This paper proposes methods for information pro- cessing of audio streams using methods information as information entities, suitable for similarity and symbolic computing on audio signals

Paris-Sud XI, Université de

198

Card-sized portable audio player using high quality audio coding technology TwinVQ  

Microsoft Academic Search

This paper presents a new implementation of the solid state audio player equipped with TwinVQ decoder. TwinVQ is an advanced high quality coding technology and its basic algorithm is utilized in MPEG-4\\/Audio standardized by the ISO (International Standardization Organization). Using TwinVQ coding, 44.1 kHz sampled audio data can be compressed at a very low bitrate of 40 kbit\\/s\\/ch with better

Tetsuya Takahashi; Takashi Morita; Toshiaki Shimoda; Takao Kaneko; Takehiro Moriya

2000-01-01

199

Interaction with Machine Improvisation  

NASA Astrophysics Data System (ADS)

We describe two multi-agent architectures for an improvisation oriented musician-machine interaction systems that learn in real time from human performers. The improvisation kernel is based on sequence modeling and statistical learning. We present two frameworks of interaction with this kernel. In the first, the stylistic interaction is guided by a human operator in front of an interactive computer environment. In the second framework, the stylistic interaction is delegated to machine intelligence and therefore, knowledge propagation and decision are taken care of by the computer alone. The first framework involves a hybrid architecture using two popular composition/performance environments, Max and OpenMusic, that are put to work and communicate together, each one handling the process at a different time/memory scale. The second framework shares the same representational schemes with the first but uses an Active Learning architecture based on collaborative, competitive and memory-based learning to handle stylistic interactions. Both systems are capable of processing real-time audio/video as well as MIDI. After discussing the general cognitive background of improvisation practices, the statistical modelling tools and the concurrent agent architecture are presented. Then, an Active Learning scheme is described and considered in terms of using different improvisation regimes for improvisation planning. Finally, we provide more details about the different system implementations and describe several performances with the system.

Assayag, Gerard; Bloch, George; Cont, Arshia; Dubnov, Shlomo

200

Web Audio/Video Streaming Tool  

NASA Technical Reports Server (NTRS)

In order to promote NASA-wide educational outreach program to educate and inform the public of space exploration, NASA, at Kennedy Space Center, is seeking efficient ways to add more contents to the web by streaming audio/video files. This project proposes a high level overview of a framework for the creation, management, and scheduling of audio/video assets over the web. To support short-term goals, the prototype of a web-based tool is designed and demonstrated to automate the process of streaming audio/video files. The tool provides web-enabled users interfaces to manage video assets, create publishable schedules of video assets for streaming, and schedule the streaming events. These operations are performed on user-defined and system-derived metadata of audio/video assets stored in a relational database while the assets reside on separate repository. The prototype tool is designed using ColdFusion 5.0.

Guruvadoo, Eranna K.

2003-01-01

201

36 CFR 2.12 - Audio disturbances.  

... (a) The following are prohibited: (1) Operating motorized equipment or machinery such as an electric generating plant, motor vehicle, motorized toy, or an audio device, such as a radio, television set, tape deck or...

2014-07-01

202

36 CFR 1002.12 - Audio disturbances.  

... (a) The following are prohibited: (1) Operating motorized equipment or machinery such as an electric generating plant, motor vehicle, motorized toy, or an audio device, such as a radio, television set, tape deck or...

2014-07-01

203

Supreme Court audio file search engine  

E-print Network

Search engines have evolved from simple text indexing to indexing other forms of media, such as audio and video. I have designed and implemented a web-based system that permits people to search the transcripts of selected ...

Wang, Edward M., 1976-

2004-01-01

204

Audio Gallery: Scientists and Social Responsibility  

NSDL National Science Digital Library

This online audio gallery is from the Museum's Seminars on Science, a series of distance-learning courses designed to help educators meet the new national science standards. Scientists and Social Responsibility, part of the Frontiers in Physical Science seminar, is available in broadband and modem formats and with a printable PDF transcript. The audio discusses some of the social-responsibility issues that scientists are grappling with today.

205

Methodologies and tools for audio digital archives  

Microsoft Academic Search

In response to the proposal of digitizing the entire back-run of several European audio archives, many research projects have\\u000a been carried out in order to discover the technical issues involved in making prestigious audio documents digitally available,\\u000a which are related to the A\\/D transfer process and supervised metadata extraction. This article gives an innovative approach\\u000a to metadata extraction from such

Nicola Orio; Lauro Snidaro; Sergio Canazza; Gian Luca Foresti

2009-01-01

206

Spread-spectrum watermarking of audio signals  

Microsoft Academic Search

Watermarking has become a technology of choice for a broad range of multimedia copyright protection applications. Watermarks have also been used to embed format-independent metadata in audio\\/video signals in a way that is robust to common editing. In this paper, we present several novel mechanisms for effective encoding and detection of direct-sequence spread-spectrum watermarks in audio signals. The developed techniques

Darko Kirovski; Henrique S. Malvar

2003-01-01

207

AudioGene: Predicting Hearing Loss Genotypes from Phenotypes to Guide Genetic Screening  

PubMed Central

Autosomal Dominant Nonsyndromic Hearing Loss (ADNSHL) is a common and often progressive sensory deficit. ADNSHL displays a high degree of genetic heterogeneity, and varying rates of progression. Accurate, comprehensive and cost-effective genetic testing facilitates genetic counseling and provides valuable prognostic information to affected individuals. In this paper, we describe the algorithm underlying AudioGene, a software system employing machine-learning techniques that utilizes phenotypic information derived from audiograms to predict the genetic cause of hearing loss in persons segregating ADNSHL. Our data show that AudioGene has an accuracy of 68% in predicting the causative gene within its top three predictions, as compared to 44% for a Majority classifier. We also show that AudioGene remains effective for audiograms with high levels of clinical measurement noise. We identify audiometric outliers for each genetic locus and hypothesize that outliers may reflect modifying genetic effects. As personalized genomic medicine becomes more common, AudioGene will be increasingly useful as a phenotypic filter to assess pathogenicity of variants identified by massively parallel sequencing. PMID:23280582

Taylor, Kyle R.; DeLuca, Adam P.; Shearer, A. Eliot; Hildebrand, Michael S.; Black-Ziegelbein, E. Ann; Anand, V. Nikhil; Sloan, Christina M.; Eppsteiner, Robert W.; Scheetz, Todd E.; Huygen, Patrick L. M.; Smith, Richard J. H.; Braun, Terry A.; Casavant, Thomas L.

2013-01-01

208

: Helmholtz machine estimation .  

E-print Network

: Helmholtz machine density estimation . . : . . . (supervised learning) , (active learning) (query learning) [1, 3]. . (unsupervised learning), . , [5]. . Helmholtz machine , . Helmholtz machine : Helmholtz machine [2] . Helmholtz machine (generative network) (recognition network) . , , . Helmholtz machine (self

209

Card-sized portable audio player using high quality audio coding technology TwinVQ  

Microsoft Academic Search

This paper presents an audio player using TwinVQ, an advanced high quality coding technology, which can compress 44.1 kHz sampled audio data at a very low bit rate of 40 kbit\\/s\\/ch. The player is business-card-sized of 10 mm thickness, and has practical performance with low power consumption

T. Takahashi; T. Morita; T. Shimoda; T. Kaneko; Takehiro Moriya

2000-01-01

210

REISS ET AL. COMPRESSION FOR SUPER AUDIO CD Audio Engineering Society  

E-print Network

coding efficiency in DSD signals. The goal of this work is to explore new compression techniques which minimised and quantified. Lower bounds are established on the compression ratio of these methodsREISS ET AL. COMPRESSION FOR SUPER AUDIO CD _________________________________ Audio Engineering

Reiss, Josh

211

Online Submission ID: 0028 AudioCloning: Extracting Material Fingerprints from Example Audio Recording  

E-print Network

, computer3 animation, and feature films. Sound effects are usually produced4 from either audio recordings the intrinsinc audio quality of the original recording of18 the sounding material, while naturally varying.e. the differences between the real-world21 recording and modal-synthesized sounds, and transfer it to various22

North Carolina at Chapel Hill, University of

212

Kicking Machine  

NSDL National Science Digital Library

In this design challenge activity, learners build a machine that kicks a Ping-Pong ball into a cup lying on its side 12 inches away. Learners use a pendulum, a rubber band, or a combination of the two to do this and test the accuracy of their machines by shooting a Ping-Pong ball into a cup. Educators can use this activity to demonstrate simple machines and kinetic/potential energy. This activity can be followed up with "Extreme Kicking Machine" which adds an even greater challenge into the mix.

2012-08-20

213

Digital Multicasting of Multiple Audio Streams  

NASA Technical Reports Server (NTRS)

The Mission Control Center Voice Over Internet Protocol (MCC VOIP) system (see figure) comprises hardware and software that effect simultaneous, nearly real-time transmission of as many as 14 different audio streams to authorized listeners via the MCC intranet and/or the Internet. The original version of the MCC VOIP system was conceived to enable flight-support personnel located in offices outside a spacecraft mission control center to monitor audio loops within the mission control center. Different versions of the MCC VOIP system could be used for a variety of public and commercial purposes - for example, to enable members of the general public to monitor one or more NASA audio streams through their home computers, to enable air-traffic supervisors to monitor communication between airline pilots and air-traffic controllers in training, and to monitor conferences among brokers in a stock exchange. At the transmitting end, the audio-distribution process begins with feeding the audio signals to analog-to-digital converters. The resulting digital streams are sent through the MCC intranet, using a user datagram protocol (UDP), to a server that converts them to encrypted data packets. The encrypted data packets are then routed to the personal computers of authorized users by use of multicasting techniques. The total data-processing load on the portion of the system upstream of and including the encryption server is the total load imposed by all of the audio streams being encoded, regardless of the number of the listeners or the number of streams being monitored concurrently by the listeners. The personal computer of a user authorized to listen is equipped with special- purpose MCC audio-player software. When the user launches the program, the user is prompted to provide identification and a password. In one of two access- control provisions, the program is hard-coded to validate the user s identity and password against a list maintained on a domain-controller computer at the MCC. In the other access-control provision, the program verifies that the user is authorized to have access to the audio streams. Once both access-control checks are completed, the audio software presents a graphical display that includes audiostream-selection buttons and volume-control sliders. The user can select all or any subset of the available audio streams and can adjust the volume of each stream independently of that of the other streams. The audio-player program spawns a "read" process for the selected stream(s). The spawned process sends, to the router(s), a "multicast-join" request for the selected streams. The router(s) responds to the request by sending the encrypted multicast packets to the spawned process. The spawned process receives the encrypted multicast packets and sends a decryption packet to audio-driver software. As the volume or muting features are changed by the user, interrupts are sent to the spawned process to change the corresponding attributes sent to the audio-driver software. The total latency of this system - that is, the total time from the origination of the audio signals to generation of sound at a listener s computer - lies between four and six seconds.

Macha, Mitchell; Bullock, John

2007-01-01

214

Spatial domain entertainment audio decompression/compression  

NASA Astrophysics Data System (ADS)

The ARM7 NEON processor with 128bit SIMD hardware accelerator requires a peak performance of 13.99 Mega Cycles per Second for MP3 stereo entertainment quality decoding. For similar compression bit rate, OGG and AAC is preferred over MP3. The Patent Cooperation Treaty Application dated 28/August/2012 describes an audio decompression scheme producing a sequence of interleaving "min to Max" and "Max to min" rising and falling segments. The number of interior audio samples bound by "min to Max" or "Max to min" can be {0|1|…|N} audio samples. The magnitudes of samples, including the bounding min and Max, are distributed as normalized constants within the 0 and 1 of the bounding magnitudes. The decompressed audio is then a "sequence of static segments" on a frame by frame basis. Some of these frames needed to be post processed to elevate high frequency. The post processing is compression efficiency neutral and the additional decoding complexity is only a small fraction of the overall decoding complexity without the need of extra hardware. Compression efficiency can be speculated as very high as source audio had been decimated and converted to a set of data with only "segment length and corresponding segment magnitude" attributes. The PCT describes how these two attributes are efficiently coded by the PCT innovative coding scheme. The PCT decoding efficiency is obviously very high and decoding latency is basically zero. Both hardware requirement and run time is at least an order of magnitude better than MP3 variants. The side benefit is ultra low power consumption on mobile device. The acid test on how such a simplistic waveform representation can indeed reproduce authentic decompressed quality is benchmarked versus OGG(aoTuv Beta 6.03) by three pair of stereo audio frames and one broadcast like voice audio frame with each frame consisting 2,028 samples at 44,100KHz sampling frequency.

Chan, Y. K.; Tam, Ka Him K.

2014-02-01

215

Multimodal audio guide for museums and exhibitions  

NASA Astrophysics Data System (ADS)

In our paper we introduce a new Audio Guide concept for exploring buildings, realms and exhibitions. Actual proposed solutions work in most cases with pre-defined devices, which users have to buy or borrow. These systems often go along with complex technical installations and require a great degree of user training for device handling. Furthermore, the activation of audio commentary related to the exhibition objects is typically based on additional components like infrared, radio frequency or GPS technology. Beside the necessity of installation of specific devices for user location, these approaches often only support automatic activation with no or limited user interaction. Therefore, elaboration of alternative concepts appears worthwhile. Motivated by these aspects, we introduce a new concept based on usage of the visitor's own mobile smart phone. The advantages in our approach are twofold: firstly the Audio Guide can be used in various places without any purchase and extensive installation of additional components in or around the exhibition object. Secondly, the visitors can experience the exhibition on individual tours only by uploading the Audio Guide at a single point of entry, the Audio Guide Service Counter, and keeping it on her or his personal device. Furthermore, since the user usually is quite familiar with the interface of her or his phone and can thus interact with the application device easily. Our technical concept makes use of two general ideas for location detection and activation. Firstly, we suggest an enhanced interactive number based activation by exploiting the visual capabilities of modern smart phones and secondly we outline an active digital audio watermarking approach, where information about objects are transmitted via an analog audio channel.

Gebbensleben, Sandra; Dittmann, Jana; Vielhauer, Claus

2006-02-01

216

Simple Machines  

NSDL National Science Digital Library

Through a five-lesson series with five activities, students are introduced to six simple machines—inclined plane, wedge, screw, lever, pulley, wheel-and-axle—as well as compound machines, which are combinations of two or more simple machines. Once students understand about work (work = force x distance), they become familiar with the machines' mechanical advantages, and see how they make work easier. Through an introduction to compound machines, students begin to think critically about machine inventions and their pervasive roles in our lives. After learning about Rube Goldberg contraptions—absurd inventions that complete simple tasks in complicated ways—they evaluate the importance and usefulness of the many machines around them. Through the hands-on activities, students draw designs for contraptions that could move a circus elephant into a rail car, create a construction site ramp design by measuring different inclined planes and calculating the ideal vs. actual mechanical advantage of each, compare the theoretical and actual mechanical advantages of different pulley systems conceived to save a whale, build and test grape catapults made with popsicle sticks and rubber bands, and follow the steps of the engineering design process to design and build Rube Goldberg machines.

Integrated Teaching and Learning Program,

217

Electric machine  

SciTech Connect

An interior permanent magnet electric machine is disclosed. The interior permanent magnet electric machine comprises a rotor comprising a plurality of radially placed magnets each having a proximal end and a distal end, wherein each magnet comprises a plurality of magnetic segments and at least one magnetic segment towards the distal end comprises a high resistivity magnetic material.

El-Refaie, Ayman Mohamed Fawzi (Niskayuna, NY); Reddy, Patel Bhageerath (Madison, WI)

2012-07-17

218

Simple Machines  

NSDL National Science Digital Library

How do you get a glove and a ball up to your tree house? One answer is to use a pulley. A pulley is a simple machine. In this original KET interactive, children learn about the basic workings of three simple machines.

2010-11-03

219

Nonplanar machines  

SciTech Connect

This talk examines methods available to minimize, but never entirely eliminate, degradation of machine performance caused by terrain following. Breaking of planar machine symmetry for engineering convenience and/or monetary savings must be balanced against small performance degradation, and can only be decided on a case-by-case basis. 5 refs.

Ritson, D. (Stanford Linear Accelerator Center, Menlo Park, CA (USA))

1989-05-01

220

The Effect of Audio and Visual Aids on Task Performance in Distributed Collaborative Virtual Environments  

NASA Astrophysics Data System (ADS)

Collaborative virtual environments (CVE) has recently gained the attention of many researchers due to its numerous potential application domains. Cooperative virtual environments, where users simultaneously manipulate objects, is one of the subfields of CVEs. In this paper we present a framework that enables two users to cooperatively manipulate objects in virtual environment, while setting on two separate machines connected through local network. In addition the article presents the use of sensory feedback (audio and visual) and investigates their effects on the cooperation and user's performance. Six volunteers subject had to cooperatively perform a peg-in-hole task. Results revealed that visual and auditory aid increase users' performance. However majority of the users preferred visual feedback to audio. We hope this framework will greatly help in the development of CAD systems that allow the designers to collaboratively design while being distant. Similarly other application domains may be cooperative assembly, surgical training and rehabilitation systems.

Ullah, Sehat; Richard, Paul; Otman, Samir; Mallem, Malik

2009-03-01

221

Simple Machines  

NSDL National Science Digital Library

In this physical science mini-unit designed for grade 8 students, six groups of students each specialize in one of the six simple machines. As outlined here, the unit covers three to four class periods and deals with forces and motion, work and machines, and mechanical advantage. The teacher introduces students to these topics using the included PowerPoint presentation. Student groups do library or Internet research on their particular simple machine and then experiment with it in class. In culminating oral presentations, each group explains the function(s) of their machine, its mechanical advantage, and some of its everyday uses. The unit plan contains a worksheet for each simple machine group, a notes sheet, teacher pointers, and an assessment rubric.

Spaulding, Anna

2004-01-01

222

Audio stream classification for multimedia database search  

NASA Astrophysics Data System (ADS)

Search and retrieval of huge archives of Multimedia data is a challenging task. A classification step is often used to reduce the number of entries on which to perform the subsequent search. In particular, when new entries of the database are continuously added, a fast classification based on simple threshold evaluation is desirable. In this work we present a CART-based (Classification And Regression Tree [1]) classification framework for audio streams belonging to multimedia databases. The database considered is the Archive of Ethnography and Social History (AESS) [2], which is mainly composed of popular songs and other audio records describing the popular traditions handed down generation by generation, such as traditional fairs, and customs. The peculiarities of this database are that it is continuously updated; the audio recordings are acquired in unconstrained environment; and for the non-expert human user is difficult to create the ground truth labels. In our experiments, half of all the available audio files have been randomly extracted and used as training set. The remaining ones have been used as test set. The classifier has been trained to distinguish among three different classes: speech, music, and song. All the audio files in the dataset have been previously manually labeled into the three classes above defined by domain experts.

Artese, M.; Bianco, S.; Gagliardi, I.; Gasparini, F.

2013-03-01

223

Podscanning : audio microcontent and synchronous communication for mobile devices  

E-print Network

Over the past decade, computationally powerful audio communication devices have become commonplace. Mobile devices have high storage capacity for digital audio, and smartphones or networked PDAs can be used to stream ...

Wheeler, Patrick Sean

2010-01-01

224

Scribbling Machines  

NSDL National Science Digital Library

In this activity, learners explore electronics and motion by making a Scribbling Machine, a motorized contraption that moves in unusual ways and leaves a mark to trace its path. Itâs made from simple materials and is based on the idea of motion created by an offset motor. Try using harvested motors and switches from discarded toys and electronics to make your Scribbling Machine - this not only keeps costs down, but is a playful and inventive way to explore how everyday objects work. To take the activity further, you can also incorporate PicoCrickets to make your Scribbling Machine more intelligent and to explore computers.

Exploratorium

2013-01-30

225

Content-Based Classification, Search, and Retrieval of Audio  

Microsoft Academic Search

Many audio and multimedia applications would benefit from the ability to classify and search for audio based on its characteristics. The audio analysis, search, and classification engine described here reduces sounds to perceptual and acoustical features. This lets users search or retrieve sounds by any one feature or a combination of them, by specifying previously learned classes based on these

Erling Wold; Thom Blum; Douglas Keislar; James Wheaton

1996-01-01

226

Applying neural network on the content-based audio classification  

Microsoft Academic Search

Many audio and multimedia applications would benefit if they could interpret the content of audio rather than relying on descriptions or keywords. These applications include multimedia databases and file systems, digital libraries, automatic segmentation or indexing of video (e.g., news or sports storage), and surveillance. This paper describes a novel content-based audio classification approach based on neural network and genetic

Xi Shao; Changsheng Xun; Mohan S Kankanhalli

2003-01-01

227

Toward Optimization of Multimodal User Interfaces for Tactical Audio Applications  

Microsoft Academic Search

Tactical audio uses audio feedback to facilitate the precise and accurate positioning of an object with respect to some other object. Existing solutions in pointing and trajectory based Human Computer Interface (HCI) tasks have primarily explored visual feedback, sometimes in rather limited conditions. In this paper we have examined different sonification paradigms for tactical audio to improve the accuracy of

Zeljko Obrenovic; Dusan Starcevic; Emil Jovanov

2002-01-01

228

Automatically segmenting and clustering minimal-impact personal audio archives  

E-print Network

Automatically segmenting and clustering minimal-impact personal audio archives Daniel P.W. Ellis, and experimenting with methods to index and access the resulting data. Audio archives have several distinctive no attention from the research community. At the same time, continuous audio archives are minimally intrusive

Ellis, Dan

229

To Make a Long Story Short: Abridged Audio at 10.  

ERIC Educational Resources Information Center

Examines the history of abridged audio publishing 10 years after the formation of the Audio Publishers Association. Topics include abridged versus unabridged versions for bookstores and libraries; vendors and publishers; future possibilities for CDs and DVD (Digital Versatile Disc); and audio leasing for libraries. (LRW)

Annichiarico, Mark

1996-01-01

230

Query by humming: musical information retrieval in an audio database  

Microsoft Academic Search

The emergence of audio and video data types in databases will require new information retrieval methods adapted to the specific characteristics and needs of these data types. An ef- fective and natural way of querying a musical audio database is by humming the tune of a song. In this paper, a system for querying an audio database by humming is

Asif Ghias; Jonathan Logan; David Chamberlin; Brian C. Smith

1995-01-01

231

MUSICAL AUDIO STREAM SEPARATION BY NON-NEGATIVE MATRIX FACTORIZATION  

E-print Network

MUSICAL AUDIO STREAM SEPARATION BY NON-NEGATIVE MATRIX FACTORIZATION Beiming Wang Queen Mary audio into streams of individual sound sources, such as instruments or voice. In this paper, we show) algorithm. The system was tested on both artificially mixed audio and real musical recording. This work

Plumbley, Mark

232

THE PRESENTATION OF MULTIPLE EARCONS IN A SPATIALISED AUDIO SPACE  

E-print Network

THE PRESENTATION OF MULTIPLE EARCONS IN A SPATIALISED AUDIO SPACE David K McGookin Department://www.dcs.gla.ac.uk/~mcgookdk ABSTRACT In this paper work to improve upon the design of structured audio messages called Earcons for use in a concurrent spatialised audio environment is described. Issues involving the limitations of current Earcon

McGookin, David

233

Spatial Audio System for Surround Video Martin Morrell, 2  

E-print Network

Spatial Audio System for Surround Video 1 Martin Morrell, 2 Chris Baume, 3 Joshua D. Reiss 1 In this paper we present the design processes of a spatial audio system for Surround Video. Surround Video reproduction, immersing them in the content. The design of a spatial audio system was necessary to give 360

Reiss, Josh

234

The audio notebook: paper and pen interaction with structured speech  

Microsoft Academic Search

This paper addresses the problem that a listener experiences when attempting to capture information presented during a lecture, meeting, or interview. Listeners must divide their attention between the talker and their notetaking activity. We propose a new device-the Audio Notebook-for taking notes and interacting with a speech recording. The Audio Notebook is a combination of a digital audio recorder and

Lisa Stifelman; Barry Arons; Chris Schmandt

2001-01-01

235

Social Audio Features for Advanced Music Retrieval Michael Kuhn  

E-print Network

Social Audio Features for Advanced Music Retrieval Interfaces Michael Kuhn Computer Engineering audio- features (objective approach) or from user driven informa- tion sources, such as collaborative filtering or social tags (subjective approach). Studies show that the latter tech- niques outperform audio

236

Analysis of Audio Packet Loss in the Internet  

Microsoft Academic Search

We consider the problem of distributing audio data over networks such as the Internet that do not provide support for real-time applications. Experiments with such networks indicate that audio quality is mediocre in large part because of excessive audio packet losses. In this paper, we show using measurements over the Internet as well as analytic modeling that the number of

Jean-chrysostome Bolot; Hugues Crépin; Andrés Vega-garcía

1995-01-01

237

Optimal Audio Transmission over Error-Prone Wireless Links  

E-print Network

1 Optimal Audio Transmission over Error-Prone Wireless Links Ala' Khalifeh, Student Member IEEE for transmitting high quality audio sequences over error-prone wireless links. Our framework introduces apparatus and technique to optimally protect a stored audio sequence transmitted over a wireless link while considering

Yousefi'zadeh, Homayoun

238

Distributed audio feature extraction for music Stuart Bray  

E-print Network

Distributed audio feature extraction for music Stuart Bray Computer Science Department University@cs.uvic.ca ABSTRACT One of the important challenges facing music information retrieval (MIR) of audio signals is scaling analysis algo- rithms to large collections. Typically, analysis of audio signals utilizes

Tzanetakis, George

239

INFORMED AUDIO SOURCE SEPARATION: A COMPARATIVE STUDY Antoine Liutkus1  

E-print Network

INFORMED AUDIO SOURCE SEPARATION: A COMPARATIVE STUDY Antoine Liutkus1 Stanislaw Gorlow2 Nicolas separation algorithms is to recover the con- stituent sources, or audio objects, from their mixture. How. Informed Source Separation (ISS) is a solution to make separation robust when the audio objects are known

Paris-Sud XI, Université de

240

An Audio Watermarking Method Based On Molecular Matching Pursuit  

E-print Network

An Audio Watermarking Method Based On Molecular Matching Pursuit Mathieu Parvaix1 , Sridhar (Sri of the signal introduced by the MMP to get a precise representation of audio signals, and then by using TF representation techniques as viable watermarking schemes. Index Terms -- audio coding, time

Paris-Sud XI, Université de

241

Packet Audio Playout Delay Adjustment: Performance Bounds and Algorithms \\Lambda  

E-print Network

Packet Audio Playout Delay Adjustment: Performance Bounds and Algorithms \\Lambda Sue B. Moon, Jim 01003 fsbmoon,kurose,towsleyg@cs.umass.edu Abstract In packet audio applications, packets are buffered, given a trace of packet audio receptions at a receiver, we present efficient algorithms for computing

Massachusetts at Amherst, University of

242

47 CFR 10.520 - Common audio attention signal.  

...2014-10-01 2014-10-01 false Common audio attention signal. 10.520 Section...Equipment Requirements § 10.520 Common audio attention signal. A Participating...public use under part 10 that include an audio attention signal that meets the...

2014-10-01

243

43 CFR 8365.2-2 - Audio devices.  

...2014-10-01 2014-10-01 false Audio devices. 8365.2-2 Section 8365...SERVICES Rules of Conduct § 8365.2-2 Audio devices. On developed recreation sites...person shall: (a) Operate or use any audio device such as a radio,...

2014-10-01

244

43 CFR 8365.2-2 - Audio devices.  

Code of Federal Regulations, 2013 CFR

...2013-10-01 2013-10-01 false Audio devices. 8365.2-2 Section 8365...SERVICES Rules of Conduct § 8365.2-2 Audio devices. On developed recreation sites...person shall: (a) Operate or use any audio device such as a radio,...

2013-10-01

245

43 CFR 8365.2-2 - Audio devices.  

Code of Federal Regulations, 2012 CFR

...2012-10-01 2012-10-01 false Audio devices. 8365.2-2 Section 8365...SERVICES Rules of Conduct § 8365.2-2 Audio devices. On developed recreation sites...person shall: (a) Operate or use any audio device such as a radio,...

2012-10-01

246

43 CFR 8365.2-2 - Audio devices.  

Code of Federal Regulations, 2011 CFR

...2011-10-01 2011-10-01 false Audio devices. 8365.2-2 Section 8365...SERVICES Rules of Conduct § 8365.2-2 Audio devices. On developed recreation sites...person shall: (a) Operate or use any audio device such as a radio,...

2011-10-01

247

Audio Decision Support for Supervisory Control of Unmanned Vehicles  

E-print Network

1 Audio Decision Support for Supervisory Control of Unmanned Vehicles Literature Review C.E. NEHME sources (dissertations, conference proceedings) relevant to the use of the audio channel in a multimodal that is used often in the literature is that of "virtual audio cues". This term refers to the phenomenon

Cummings, Mary "Missy"

248

Audio Visual Classroom Consoles(SMALL) Extron Multilink panel  

E-print Network

Audio Visual Classroom Consoles(SMALL) Extron Multilink panel 1. Type in keypad code (please to have an adaptor from your computer for the VGA cable. 2. Audio: usually attached with the VGA cable into "Audio In" next to your VGA input. 3. Network: To access the internet, plug in the network cable to your

Peak, Derek

249

Audio feature extraction and analysis for scene classification  

Microsoft Academic Search

Analysis and classification of the scene content of a video sequence are very important for content-based indexing and retrieval of multimedia databases. We report our research on using the associated audio information for video scene classification. We describe several audio features that have been found effective in distinguishing audio characteristics of different scene classes. Based on these features, a neural

Zhu Liu; Jincheng Huang; Yao Wang; Tsuhan Chen

1997-01-01

250

Audio Engineering Society Convention e-Brief 151  

E-print Network

Audio Engineering Society Convention e-Brief 151 Presented at the 136th Convention 2014 April 26. Reproduction of this paper, or any portion thereof, is not permitted without direct permission from the Audio Engineering Society. APE: Audio Perceptual Evaluation toolbox for MATLAB Brecht De Man, Joshua D. Reiss Centre

Reiss, Josh

251

Speech Recognition on MPEG\\/Audio Encoded Files  

Microsoft Academic Search

A technique to peform speech recognition directly from audio files encoded using the MPEG\\/Audio coding standard is described. The technique works in the com- pressed domain and does not require the MPEG\\/Audio file to be decompressed. Only the encoded subband sam- ples are extracted and processed for training and recog- nition. The underlying speech recognition engine used is based on

Lawrence Yapp; Gregory L. Zick

1997-01-01

252

High performance MPEG-audio decoder IC  

NASA Technical Reports Server (NTRS)

The emerging digital audio and video compression technology brings both an opportunity and a new challenge to IC design. The pervasive application of compression technology to consumer electronics will require high volume, low cost IC's and fast time to market of the prototypes and production units. At the same time, the algorithms used in the compression technology result in complex VLSI IC's. The conflicting challenges of algorithm complexity, low cost, and fast time to market have an impact on device architecture and design methodology. The work presented in this paper is about the design of a dedicated, high precision, Motion Picture Expert Group (MPEG) audio decoder.

Thorn, M.; Benbassat, G.; Cyr, K.; Li, S.; Gill, M.; Kam, D.; Walker, K.; Look, P.; Eldridge, C.; Ng, P.

1993-01-01

253

May4,2009May4,2009 Audio Streaming over FM bandAudio Streaming over FM band  

E-print Network

Audio streaming between USRP1 and USRP2 over FM band GNU RadioGNU Radio ­­ Used for modulation and demodulationMay4,2009May4,2009 Audio Streaming over FM bandAudio Streaming over FM band between USRP1 and USRP2FAQwww.gnuradio.org/trac/wiki/USRP2GenFAQ #12;May4,2009May4,2009 77 Audio streaming between USRP1 and USRP2 over FM band

Yu, Chansu

254

Math Machines  

NSDL National Science Digital Library

The mission of the Math Machines organization is to "improve the quality of mathematical education, enhance the transfer of mathematical thinking into other classes, and increase students' ability to apply rigorous mathematics outside the classroom." Their website supports a National Science Foundation ATE grant-supported project designed to improve teaching in the areas of Mathematics, Science, and Technology at the high school and college levels. This improved learning results from using math, science, and technology principles to build and control various machines such as pointers and robots or "math machines", which are simple devices that provide an immediate, physical, dynamic expression to abstract mathematical equations. The website provides information links on Educational Theory, Classroom Activities, Project Workshops, Calculators & Programs, and Machine Construction Instructions for Building: Closed Circuits, Servo Motors, Controllers, Robot Boards and more. There is also contact information, an FAQ section, as well as upcoming events.

2010-05-18

255

AUDIO RECORDING CONTRACT Campus policy and federal law require that students be permitted to use an audio  

E-print Network

AUDIO RECORDING CONTRACT Campus policy and federal law require that students be permitted to use an audio recorder in an academic setting when recording of material is necessary to provide reasonable to use audio recordings or transcriptions only for study and classroom-related use. I agree

Gallo, Linda C.

256

An ESL Audio-Script Writing Workshop  

ERIC Educational Resources Information Center

The roles of dialogue, collaborative writing, and authentic communication have been explored as effective strategies in second language writing classrooms. In this article, the stages of an innovative, multi-skill writing method, which embeds students' personal voices into the writing process, are explored. A 10-step ESL Audio Script Writing Model…

Miller, Carla

2012-01-01

257

Bose Learning Center - Audio Demonstrator Technology  

NSDL National Science Digital Library

Here's information about a new audio technology created at the Bose Corporation, a large manufacturer of sound equipment. The new technology lets the building managers of large places like arenas, auditoriums and outdoor stadiums, preview how the sound system will sound before it's installed.

Bose Corporation

258

Creating meaning in audio and music signals  

E-print Network

adaption under- standing cognition Allan Turing Theory of computing 1940'es Norbert Wiener Cybernetics 1948, the social context, and the computational resources in time and space #12;08/10/201312 Cognitive Systems, DTUCreating meaning in audio and music signals Jan Larsen, Associate Professor PhD Cognitive Systems

259

BachelorofMusic DigitalAudioArts  

E-print Network

rudiments skills test for Music 2160 (Theory I). Students who achieve `good standing' on this examBachelorofMusic DigitalAudioArts N a m e requirements and academic regulations. Contact an Academic Advisor in the Faculty of Fine Arts for advising

Seldin, Jonathan P.

260

AUDIO SYSTEM FOR TECHNICAL READINGS A Dissertation  

E-print Network

. V. Raman May 1994 #12; c fl T. V. Raman 1994 ALL RIGHTS RESERVED #12; AUDIO SYSTEM FOR TECHNICAL in Math and Computer Science at the Indian Institute of Tech­ nology, Bombay. For his final­year project­ phrase the recordings to prepare his own notes before recycling the cassette tapes. This took time

Keinan, Alon

261

Digital audio broadcasting: an interactive services architecture  

Microsoft Academic Search

Digital media technologies offer enhanced multimedia signal broadcasting and description of the signal on content. Digital audio broadcasting (DAB) is a media standard with extended multimedia capabilities, offering novel services to users world-wide. Each digital broadcasting standard though, cannot be separately viewed from the development of Internet radio broadcasting. We introduce a multi-agent based system architecture, specially designed to provide

Nikos Manouselis; Pythagoras Karampiperis

2001-01-01

262

Audio-Tutorial Instruction; An Expanded Approach.  

ERIC Educational Resources Information Center

The University of Missouri-Columbia School of Medicine is developing an audio-tutorial system with several unique features. A Didactor, a device which provides most of the capabilities of computer-assisted instruction but at a fraction of the cost, is the center of the system. The Didactor is combined with tape recordings and slides to present a…

Herrick, Merlyn C.

263

Audio/Visual Ratios in Commercial Filmstrips.  

ERIC Educational Resources Information Center

Developed by the Westinghouse Electric Corporation, Video Audio Compressed (VIDAC) is a compressed time, variable rate, still picture television system. This technology made it possible for a centralized library of audiovisual materials to be transmitted over a television channel in very short periods of time. In order to establish specifications…

Gulliford, Nancy L.

264

Video / Audio Instructions: Sharp Video Data Projector;  

E-print Network

Room 259 Video / Audio Instructions: Sharp Video Data Projector; DVDIVCR; Wireless Microphone:/ /aok.lib.umbc.edu/reference/BI /roompol,php3 These instructions will allow you to operate the video and screen, standing behind the instructor's workstation; the video projector is located on the ceiling

Maryland, Baltimore County, University of

265

AUDIO/VIDEO RECORDING RELEASE VANDERBILT UNIVERSITY  

E-print Network

tape, sound recordings, still photography, CD-ROM or any other media. 4. The right to record, reproduceAUDIO/VIDEO RECORDING RELEASE VANDERBILT UNIVERSITY IMPORTANT: BEFORE AGREEING TO HAVE YOUR CONTENT RECORDED AND SHARED ELECTRONICALLY, PLEASE VERIFY THAT YOU OWN THE COPYRIGHT OF ALL MATERIALS BEING

Simaan, Nabil

266

Audio Engineering Society Convention Paper 8494  

E-print Network

modeling gunshot acoustics simply by convolving a single recording of a gunshot sound with the impulse-3780 USA rob.maher@montana.edu ABSTRACT Audio recordings of gunshots exhibit acoustical properties orientation of the firearm with respect to the recording microphone. Prior empirical studies have demonstrated

Maher, Robert C.

267

Technical advances in digital audio radio broadcasting  

Microsoft Academic Search

The move to digital is a natural progression taking place in all aspects of broadcast media applications from document processing in newspapers to video processing in television distribution. This is no less true for audio broadcasting which has taken a unique development path in the United States. This path has been heavily influenced by a combination of regulatory and migratory

CHRISTOF FALLER; Biing-Hwang Juang; PETER KROON; Hui-Ling Lou; SEAN A. RAMPRASHAD; CARL-ERIK W. SUNDBERG

2002-01-01

268

Speak Dutch; an Audio-Lingual Course.  

ERIC Educational Resources Information Center

"Speak Dutch" is a revised and augmented edition of "Modern Dutch, A First Year College Level Audio-Lingual Course for Dutch." The text is organized into 26 regular lessons and four review lessons plus indexes to grammar points and vocabulary and a list of verb paradigms. The first lesson (40 pages) is devoted to systematic presentation of the…

Lagerwey, Walter

269

Automatic Musical Genre Classification of Audio Signals  

Microsoft Academic Search

Musical genres are categorical descriptions that are used to describe music. They are c ommonly used to structure the increasing amounts of music available in digital form on the Web and are important for music information retrieval. Genre ca tegorization for audio has traditionally been performed manually. A particular musical genre is characterized by statistical properties related to the instrumentation,

George Tzanetakis

2001-01-01

270

Audio Mining: The Next Big Thing?  

NSDL National Science Digital Library

To get an idea of what audio mining is and how it can be used, people can read this article from the Cutter Consortium. It lists six broad areas that can benefit from using the technology and briefly discusses each one: technical support centers and help desks, call centers, broadcast media, conference managers, intelligence gathering, law enforcement, and security operations.

2008-01-30

271

Providing Students with Formative Audio Feedback  

ERIC Educational Resources Information Center

The provision of timely and constructive feedback is increasingly challenging for busy academics. Ensuring effective student engagement with feedback is equally difficult. Increasingly, studies have explored provision of audio recorded feedback to enhance effectiveness and engagement with feedback. Few, if any, of these focus on purely formative…

Brearley, Francis Q.; Cullen, W. Rod

2012-01-01

272

Advances in speech and audio compression  

Microsoft Academic Search

Speech and audio compression has advanced rapidly in recent years spurred on by cost-effective digital technology and diverse commercial applications. Recent activity in speech compression is dominated by research and development of a family of techniques commonly described as code-excited linear prediction (CELP) coding. These algorithms exploit models of speech production and auditory perception and offer a quality versus bit

ALLEN GERSHO

1994-01-01

273

Performance measurement in blind audio source separation  

Microsoft Academic Search

In this paper, we discuss the evaluation of blind audio source separation (BASS) algorithms. Depending on the exact application, different distortions can be allowed between an estimated source and the wanted true source. We consider four dif- ferent sets of such allowed distortions, from time-invariant gains to time-varying filters. In each case, we decompose the estimated source into a true

Emmanuel Vincent; Rémi Gribonval; Cédric Févotte

2006-01-01

274

Automated Analysis of an Audio Control Protocol  

Microsoft Academic Search

. We show how HyTech, a symbolic model checker for linear hybridsystems, can be used to analyze an audio control protocol. This protocol[BPV94] was first verified by Bosscher et al. without computer support. Inthis paper, we demonstrate that algorithmic methods can not only verify theprotocol, but can also automatically synthesize the bound on the maximumclock drift, and suggest design modification

Pei-hsin Ho; Howard Wong-toi

1995-01-01

275

Technical Advances in Digital Audio Radio Broadcasting  

E-print Network

week, radio programs reach 96% of people over 12 years old who on the average listen over 3.2 h daily. These programs are being transmitted from over 11 700 radio stations in the U.S. alone. The 20th century has beenTechnical Advances in Digital Audio Radio Broadcasting CHRISTOF FALLER, BIING-HWANG JUANG, FELLOW

Juang, Biing-Hwang "Fred"

276

Drum cutter mining machine  

Microsoft Academic Search

A drum cutter mining machine includes a machine frame with a winch having a drive wheel to engage a rack or chain which extends along the path of travel by the mining machine to propel the machine along a mine face. The mining machine is made up of discrete units which include a machine body and machine housings joined to

K. Oberste-beulmann; H. Schupphaus

1980-01-01

277

Simulating Turing Machines on Maurer Machines  

E-print Network

Simulating Turing Machines on Maurer Machines J.A. Bergstra1,2 and C.A. Middelburg3 1 Programming MB Eindhoven, the Netherlands keesm@win.tue.nl Abstract. Maurer machines are much closer to real computers than Turing machines. Computer instructions play a prominent part in Mau- rer machines. We show

van der Zwaag, Mark

278

Culture Machine  

NSDL National Science Digital Library

Culture Machine is a new, refereed, electronic journal encompassing cultural studies and cultural theory. The international editorial board of the interactive journal aims to "generat[e] research in culture and theory" by promoting and publishing "the most provocative of new work." The theme of the inaugural issue is Taking Risks with the Future. Content includes articles such as Life After Death of the Text by Johan Fornas, Cultural Studies in the Clouds: Mourning for Detail by Tadeusz Slawek, and The Future States of Politics by Kenneth Surin. Culture Machine is hosted by the University of Teesside, England.

279

Function Machine  

NSDL National Science Digital Library

This Java applet allows learners to explore simple linear functions. Students determine the algebraic form of a linear equation by entering inputs into the machine and by looking for patterns in the outputs. The function rules available are: integers from -10 to 10 are either added to, subtracted from, or multiplied by the input x to yield the output y.

2011-01-01

280

Decoding Machine  

NSDL National Science Digital Library

In this math lesson, learners explore variables and their uses. Learners pretend to be FBI agents and make a TOP SECRET tool that enables them to decode and find the values of hidden messages and words. Learners make their simple "decoding machines" out of paper and tape.

Lessonplans, Utah

2012-10-22

281

Leonardo's Machines  

NSDL National Science Digital Library

This Science NetLinks lesson will highlight some of Leonardo da Vinci's futuristic inventions, introducing the elements of machines. Through the use of student interactives, this lesson provides an interesting context in which students can begin to develop their ideas about the uniqueness of humans in the natural world.

Science Netlinks;

2003-03-05

282

BOOSTED BINARY AUDIO FINGERPRINT BASED ON SPECTRAL SUBBAND MOMENTS Sungwoong Kim and Chang D. Yoo  

E-print Network

BOOSTED BINARY AUDIO FINGERPRINT BASED ON SPECTRAL SUBBAND MOMENTS Sungwoong Kim and Chang D. Yoo@kaist.ac.kr, cdyoo@ee.kaist.ac.kr ABSTRACT An audio fingerprinting system identifies an audio based on a unique feature vector called the audio fingerprint. The performance of an audio fingerprinting system is directly

Yoo, Chang D.

283

AUDIO TEXTURES Lie Lu, Stan Li, Liu Wenyin, Hong-Jiang Zhang  

E-print Network

AUDIO TEXTURES Lie Lu, Stan Li, Liu Wenyin, Hong-Jiang Zhang Microsoft Research China No.49 Zhichun a new audio medium, called audio texture, as a means of synthesizing long audio stream according to a given short example audio clip. The example clip is analyzed, and basic building patterns are extracted

Liu, Wenyin

284

Augmenting real-world objects: a paper-based audio notebook  

Microsoft Academic Search

The Audio Notebook allows a user to capture and access an audio recording of a lecture or meeting in conjunction with notes written on paper. The audio recording is synchronized with the user's handwritten notes and page turns. As a user flips through physical pages of notes, the audio scans to the start of each page. Audio is also accessed

Lisa J. Stifelman

1996-01-01

285

Interactions audio-tactiles et perception de la parole : Comparaisons entre sujets aveugles et voyants  

E-print Network

Interactions audio-tactiles et perception de la parole : Comparaisons entre sujets aveugles et the speaker's face modulates the decoding of speech. Audio-tactile perception was compared to audio performed a phonemic decision task in three conditions: audio-only, congruent audio-tactile, and incongruent

Paris-Sud XI, Université de

286

RTP Payload for Redundant Audio Data  

Microsoft Academic Search

This document describes a payload format for use with the real-time transportprotocol (RTP), version 2, for encoding redundant audio data. Theprimary motivation for the scheme described herein is the development ofaudio conferencing tools for use with lossy packet networks such as theInternet Mbone, although this scheme is not limited to such applications.Perkins et al INTERNET-DRAFT 25 July 19971 IntroductionIf multimedia

Andres Vega-garcia; Colin Perkins; Isidor Kouvelas; Jean-chrysostome Bolot; Orion Hodson; Sacha Fosse-parisis; Vicky Hardman

1997-01-01

287

Building an Audio Visualizer in Flex  

Microsoft Academic Search

Prior to Flash Player 9, Flash developers had to rely on third-party applications to create equalizer-type displays or audio\\u000a visualizers. Now, thanks to improvements to the ActionScript language, you can create these experience enhancers natively.\\u000a Combine that with the relative ease of use of the Flex framework, and you can come up with some pretty amazing visualizations\\u000a with minimal effort.

Hasan Otuome

288

Perceptually controlled doping for audio source separation  

NASA Astrophysics Data System (ADS)

The separation of an underdetermined audio mixture can be performed through sparse component analysis (SCA) that relies however on the strong hypothesis that source signals are sparse in some domain. To overcome this difficulty in the case where the original sources are available before the mixing process, the informed source separation (ISS) embeds in the mixture a watermark, which information can help a further separation. Though powerful, this technique is generally specific to a particular mixing setup and may be compromised by an additional bitrate compression stage. Thus, instead of watermarking, we propose a `doping' method that makes the time-frequency representation of each source more sparse, while preserving its audio quality. This method is based on an iterative decrease of the distance between the distribution of the signal and a target sparse distribution, under a perceptual constraint. We aim to show that the proposed approach is robust to audio coding and that the use of the sparsified signals improves the source separation, in comparison with the original sources. In this work, the analysis is made only in instantaneous mixtures and focused on voice sources.

Mahé, Gaël; Nadalin, Everton Z.; Suyama, Ricardo; Romano, João MT

2014-12-01

289

A digital audio/video interleaving system. [for Shuttle Orbiter  

NASA Technical Reports Server (NTRS)

A method of interleaving an audio signal with its associated video signal for simultaneous transmission or recording, and the subsequent separation of the two signals, is described. Comparisons are made between the new audio signal interleaving system and the Skylab Pam audio/video interleaving system, pointing out improvements gained by using the digital audio/video interleaving system. It was found that the digital technique is the simplest, most effective and most reliable method for interleaving audio and/or other types of data into the video signal for the Shuttle Orbiter application. Details of the design of a multiplexer capable of accommodating two basic data channels, each consisting of a single 31.5-kb/s digital bit stream are given. An adaptive slope delta modulation system is introduced to digitize audio signals, producing a high immunity of work intelligibility to channel errors, primarily due to the robust nature of the delta-modulation algorithm.

Richards, R. W.

1978-01-01

290

Machine Learning  

NASA Astrophysics Data System (ADS)

The purpose of this chapter is to present fundamental ideas and techniques of machine learning suitable for the field of this book, i.e., for automated scientific discovery. The chapter focuses on those symbolic machine learning methods, which produce results that are suitable to be interpreted and understood by humans. This is particularly important in the context of automated scientific discovery as the scientific theories to be produced by machines are usually meant to be interpreted by humans. This chapter contains some of the most influential ideas and concepts in machine learning research to give the reader a basic insight into the field. After the introduction in Sect. 1, general ideas of how learning problems can be framed are given in Sect. 2. The section provides useful perspectives to better understand what learning algorithms actually do. Section 3 presents the Version space model which is an early learning algorithm as well as a conceptual framework, that provides important insight into the general mechanisms behind most learning algorithms. In section 4, a family of learning algorithms, the AQ family for learning classification rules is presented. The AQ family belongs to the early approaches in machine learning. The next, Sect. 5 presents the basic principles of decision tree learners. Decision tree learners belong to the most influential class of inductive learning algorithms today. Finally, a more recent group of learning systems are presented in Sect. 6, which learn relational concepts within the framework of logic programming. This is a particularly interesting group of learning systems since the framework allows also to incorporate background knowledge which may assist in generalisation. Section 7 discusses Association Rules - a technique that comes from the related field of Data mining. Section 8 presents the basic idea of the Naive Bayesian Classifier. While this is a very popular learning technique, the learning result is not well suited for human comprehension as it is essentially a large collection of probability values. In Sect. 9, we present a generic method for improving accuracy of a given learner by generatingmultiple classifiers using variations of the training data. While this works well in most cases, the resulting classifiers have significantly increased complexity and, hence, tend to destroy the human readability of the learning result that a single learner may produce. Section 10 contains a summary, mentions briefly other techniques not discussed in this chapter and presents outlook on the potential of machine learning in the future.

Hoffmann, Achim; Mahidadia, Ashesh

291

WEIGHTING SCHEMES FOR AUDIO-VISUAL FUSION IN SPEECH RECOGNITION  

Microsoft Academic Search

In this work we demonstrate an improvement in the state-of-the- art large vocabulary continuous speech recognition (LVCSR) per- formance, under clean and noisy conditions, by the use of visual information, in addition to the traditional audio one. We take a decision fusion approach for the audio-visual information, where the single-modality (audio- and visual- only) HMM classifiers are combined to recognize

Dimitra Vergyri; Gerasimos Potamianos; Juergen Luettin

292

State-Machine Replication  

E-print Network

State-Machine Replication #12;The Problem Clients Server #12;The Problem Clients Server #12;The (state machine) #12;The Solution 1. Make server deterministic (state machine) State machine #12;The Solution 1. Make server deterministic (state machine) 2. Replicate server State machines #12;The Solution 1

Venkataramani, Arun

293

USING 3D AUDIO GUIDANCE TO LOCATE INDOOR STATIC OBJECTS  

E-print Network

, the music industry, visualization applications and computer games. Three dimensional audio signals can are basically impulse responses of two particular ears sampled from different distances and different angles

Tsigas, Philippas

294

Simple Machines  

NSDL National Science Digital Library

Can you identify the six types of simple machines? 1. What do you know about Inclined Planes? Draw an example on your graphic organizer and state one fact.Inclined Plane 2. What do you know about levers? Draw an example on your graphic organizer and state one fact.Lever. 3. What do you know about pulleys? Draw an example on your graphic organizer and ...

Stewart, Miss

2010-03-24

295

Induction machine  

DOEpatents

A polyphase rotary induction machine for use as a motor or generator utilizing a single rotor assembly having two series connected sets of rotor windings, a first stator winding disposed around the first rotor winding and means for controlling the current induced in one set of the rotor windings compared to the current induced in the other set of the rotor windings. The rotor windings may be wound rotor windings or squirrel cage windings.

Owen, Whitney H. (Ogden, UT)

1980-01-01

296

78 FR 38093 - Seventh Meeting: RTCA Special Committee 226, Audio Systems and Equipment  

Federal Register 2010, 2011, 2012, 2013

...Meeting: RTCA Special Committee 226, Audio Systems and Equipment AGENCY: Federal...Notice of RTCA Special Committee 226, Audio Systems and Equipment...meeting of the RTCA Special Committee 226, Audio Systems and Equipment [[Page...

2013-06-25

297

77 FR 58209 - Fourth Meeting: RTCA Special Committee 226, Audio Systems and Equipment  

Federal Register 2010, 2011, 2012, 2013

...Meeting: RTCA Special Committee 226, Audio Systems and Equipment AGENCY: Federal...notice of RTCA Special Committee 226, Audio Systems and Equipment...meeting of the RTCA Special Committee 226, Audio Systems and Equipment. DATES: The...

2012-09-19

298

78 FR 57673 - Eighth Meeting: RTCA Special Committee 226, Audio Systems and Equipment  

Federal Register 2010, 2011, 2012, 2013

...Meeting: RTCA Special Committee 226, Audio Systems and Equipment AGENCY: Federal...Notice of RTCA Special Committee 226, Audio Systems and Equipment...meeting of the RTCA Special Committee 226, Audio Systems and Equipment. DATES: The...

2013-09-19

299

47 CFR 73.9005 - Compliance requirements for covered demodulator products: Audio.  

Code of Federal Regulations, 2010 CFR

...requirements for covered demodulator products: Audio. 73.9005 Section 73.9005 Telecommunication...requirements for covered demodulator products: Audio. Except as otherwise provided in...demodulator products shall not output the audio portions of unscreened content or of...

2010-10-01

300

76 FR 79755 - First Meeting: RTCA Special Committee 226 Audio Systems and Equipment  

Federal Register 2010, 2011, 2012, 2013

...First Meeting: RTCA Special Committee 226 Audio Systems and Equipment AGENCY: Federal...Notice of RTCA Special Committee 226, Audio Systems and Equipment...meeting of RTCA Special Committee 226, Audio Systems and Equipment, for the first...

2011-12-22

301

Optimal Audio Transmission over Wireless Tandem Channels Ala' Khalifeh Homayoun Yousefi'zadeh  

E-print Network

1 Optimal Audio Transmission over Wireless Tandem Channels Ala' Khalifeh Homayoun Yousefi propose a statistical optimization framework for transmitting audio sequences over wireless links. Our proposed framework protects audio frames against both temporally cor- related random bit errors introduced

Yousefi'zadeh, Homayoun

302

77 FR 37732 - Fourteenth Meeting: RTCA Special Committee 224, Audio Systems and Equipment  

Federal Register 2010, 2011, 2012, 2013

...Meeting: RTCA Special Committee 224, Audio Systems and Equipment AGENCY: Federal...Notice of RTCA Special Committee 224, Audio Systems and Equipment...meeting of RTCA Special Committee 224, Audio Systems and Equipment. DATES: The...

2012-06-22

303

Proceedings of ICAD 05-Eleventh Meeting of the International Conference on Auditory Display, Limerick, Ireland, July 6-9, 2005 MOBILE AUDIO DESIGNS MONKEY: AN AUDIO AUGMENTED  

E-print Network

, Limerick, Ireland, July 6-9, 2005 ICAD05-1 MOBILE AUDIO DESIGNS MONKEY: AN AUDIO AUGMENTED REALITY DESIGNER in mobile, wearable, and ubiquitous computing, as well as augmented reality applications. Currently, users@cc.gatech.edu ABSTRACT Audio Augmented Reality (AR) design is currently a very difficult task. To develop audio for an AR

304

T-SA-00582-2004.R1 A Generic Audio Classification and Segmentation Approach for Multimedia Indexing and Retrieval 1 A Generic Audio Classification and  

E-print Network

T-SA-00582-2004.R1 A Generic Audio Classification and Segmentation Approach for Multimedia Indexing and Retrieval 1 A Generic Audio Classification and Segmentation Approach for Multimedia Indexing and Retrieval of generic and automatic audio classification and segmentation for audio-based multimedia indexing

Gabbouj, Moncef

305

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 16, NO. 8, NOVEMBER 2008 1361 Union of MDCT Bases for Audio Coding  

E-print Network

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 16, NO. 8, NOVEMBER 2008 1361 Union of MDCT Bases for Audio Coding Emmanuel Ravelli, Student Member, IEEE, Gaël Richard, Senior Member decompositions for audio coding. Audio signals are decomposed over a redundant union of modified discrete cosine

Paris 7 - Denis Diderot, Université

306

PREPRINT FOR AN OVERVIEW ON PERCEPTUALLY MOTIVATED AUDIO INDEXING AND CLASSIFICATION, PROC. OF THE IEEE, 2013 1 An overview on Perceptually Motivated Audio  

E-print Network

PREPRINT FOR AN OVERVIEW ON PERCEPTUALLY MOTIVATED AUDIO INDEXING AND CLASSIFICATION, PROC. OF THE IEEE, 2013 1 An overview on Perceptually Motivated Audio Indexing and Classification Ga¨el Richard An audio indexing system aims at describing audio content by identifying, labeling or categorizing

Richard, Gaël

307

Elicitation of attributes for the evaluation of audio-on-audio interference.  

PubMed

An experiment to determine the perceptual attributes of the experience of listening to a target audio program in the presence of an audio interferer was performed. The first stage was a free elicitation task in which a total of 572 phrases were produced. In the second stage, a consensus vocabulary procedure was used to reduce these phrases into a comprehensive set of attributes. Groups of experienced and inexperienced listeners determined nine and eight attributes, respectively. These attribute sets were combined by the listeners to produce a final set of 12 attributes: masking, calming, distraction, separation, confusion, annoyance, environment, chaotic, balance and blend, imagery, response to stimuli over time, and short-term response to stimuli. In the third stage, a simplified ranking procedure was used to select only the most useful and relevant attributes. Four attributes were selected: distraction, annoyance, balance and blend, and confusion. Ratings using these attributes were collected in the fourth stage, and a principal component analysis performed. This suggested two dimensions underlying the perception of an audio-on-audio interference situation: The first dimension was labeled "distraction" and accounted for 89% of the variance; the second dimension, accounting for 10% of the variance, was labeled "balance and blend." PMID:25373964

Francombe, Jon; Mason, Russell; Dewhirst, Martin; Bech, Søren

2014-11-01

308

Instructional Audio Guidelines: Four Design Principles to Consider for Every Instructional Audio Design Effort  

ERIC Educational Resources Information Center

This article contends that instructional designers and developers should attend to four particular design principles when creating instructional audio. Support for this view is presented by referencing the limited research that has been done in this area, and by indicating how and why each of the four principles is important to the design process.…

Carter, Curtis W.

2012-01-01

309

Investigating the impact of audio instruction and audio-visual biofeedback for lung cancer radiation therapy  

NASA Astrophysics Data System (ADS)

Lung cancer accounts for 13% of all cancers in the Unites States and is the leading cause of deaths among both men and women. The five-year survival for lung cancer patients is approximately 15%.(ACS facts & figures) Respiratory motion decreases accuracy of thoracic radiotherapy during imaging and delivery. To account for respiration, generally margins are added during radiation treatment planning, which may cause a substantial dose delivery to normal tissues and increase the normal tissue toxicity. To alleviate the above-mentioned effects of respiratory motion, several motion management techniques are available which can reduce the doses to normal tissues, thereby reducing treatment toxicity and allowing dose escalation to the tumor. This may increase the survival probability of patients who have lung cancer and are receiving radiation therapy. However the accuracy of these motion management techniques are inhibited by respiration irregularity. The rationale of this thesis was to study the improvement in regularity of respiratory motion by breathing coaching for lung cancer patients using audio instructions and audio-visual biofeedback. A total of 331 patient respiratory motion traces, each four minutes in length, were collected from 24 lung cancer patients enrolled in an IRB-approved breathing-training protocol. It was determined that audio-visual biofeedback significantly improved the regularity of respiratory motion compared to free breathing and audio instruction, thus improving the accuracy of respiratory gated radiotherapy. It was also observed that duty cycles below 30% showed insignificant reduction in residual motion while above 50% there was a sharp increase in residual motion. The reproducibility of exhale based gating was higher than that of inhale base gating. Modeling the respiratory cycles it was found that cosine and cosine 4 models had the best correlation with individual respiratory cycles. The overall respiratory motion probability distribution function could be approximated to a normal distribution function. A statistical analysis was also performed to investigate if a patient's physical, tumor or general characteristics played a role in identifying whether he/she responded positively to the coaching type---signified by a reduction in the variability of respiratory motion. The analysis demonstrated that, although there were some characteristics like disease type and dose per fraction that were significant with respect to time-independent analysis, there were no significant time trends observed for the inter-session or intra-session analysis. Based on patient feedback with the existing audio-visual biofeedback system used for the study and research performed on other feedback systems, an improved audio-visual biofeedback system was designed. It is hoped the widespread clinical implementation of audio-visual biofeedback for radiotherapy will improve the accuracy of lung cancer radiotherapy.

George, Rohini

310

Machine Design  

NSDL National Science Digital Library

This website, the homepage of Machine Design.com, contains resources on a variety of information for engineers and technicians related to devices, components, design applications, products, and systems in the manufacturing technology sector. The site also features a CAD library, eBooks, audiovisual aids, webinars, whitepapers and a reference center. Some of the resources require a free login. The page offers an RSS feed to keep users up to date on new resources. A free login may be required to access some of these items.

311

On Steganography in Lost Audio Packets  

E-print Network

The paper presents a new hidden data insertion procedure based on estimated probability of the remaining time of the call for steganographic method called LACK (Lost Audio PaCKets steganography). LACK provides hidden communication for real-time services like Voice over IP. The analytical results presented in this paper concern the influence of LACK's hidden data insertion procedures on the method's impact on quality of voice transmission and its resistance to steganalysis. The proposed hidden data insertion procedure is also compared to previous steganogram insertion approach based on estimated remaining average call duration.

Mazurczyk, Wojciech; Szczypiorski, Krzysztof

2011-01-01

312

ABC News: Video and Audio Newsclips  

NSDL National Science Digital Library

ABC News has added a section of video and audio newsclips to its news service at the GO Network, InfoSeek Corporation's Internet portal. Users can see and listen to national headline news, such as a clip from Warren Beatty's speech at an awards dinner Wednesday night (sounding rather presidential). They can also search for additional video files using Videosearch, by Virage. Beatty as a search term turned up a clip about the Clinton family's summer vacation on Martha's Vineyard that included a mention of Beatty's presidential aspirations and opinions on the Democratic Party, but no additional pictures of Beatty.

313

An MDCT Hardware Accelerator for MP3 Audio Xingdong Dai  

E-print Network

-III (MP3) standard is widely employed in music industry because of its efficient audio compression. A key enabler in MP3 coding is the prefect reconstruction (PR) cosine modulated filter bank based on the concept audio players are battery operated [2]­[4]. The N point MDCT of a sequence {x(i)} is defined as X(k) = N

Wagh, Meghanad

314

Video Segmentation with the Assistance of Audio Content Analysis  

Microsoft Academic Search

Video structure extraction is essential to automatic content-based organization, retrieval and browsing of video. However, while many robust shot segmentation algorithms have been developed, it is still difficult to extract scene structures or group shots into scenes. In this paper, we present a novel audio assisted video segmentation scheme, in which audio and color information is integrated in video scene

Hao Jiang; Tong Lin; Hong-Jiang Zhang

2000-01-01

315

The Practical Audio-Visual Handbook for Teachers.  

ERIC Educational Resources Information Center

The use of audio/visual media as an aid to instruction is a common practice in today's classroom. Most teachers, however, have little or no formal training in this field and rarely a knowledgeable coordinator to help them. "The Practical Audio-Visual Handbook for Teachers" discusses the types and mechanics of many of these media forms and proposes…

Scuorzo, Herbert E.

316

Audio Teleconferencing and Distance Education: Towards a Conceptual Synthesis.  

ERIC Educational Resources Information Center

To provide a basis for the conceptual development of audio teleconferencing, key audio teleconferencing and distance education variables extracted from the literature are related into a representative model whose major interdependent components include interaction, technical design, learner characteristics, attitudes, and evaluation, learning…

Rothe, J. Peter

1985-01-01

317

Augmented Reality Audio for Mobile and Wearable Appliances  

Microsoft Academic Search

The concept of augmented reality audio characterizes techniques where a real sound environment is extended with virtual auditory environments and communications scenarios. A framework is introduced for mobile augmented reality audio (MARA) based on a specific headset configuration where binaural microphone elements are integrated into stereo ear- phones. When microphone signals are routed directly to the earphones, a user is

Aki Ha Rma; Julia Jakka; Miikka Tikander; Matti Karjalainen; Aes Fellow; Tapio Lokki; Jarmo Hiipakka; Gae Tan Lorho

2004-01-01

318

An Audio Stream Redirector for the Ethernet Speaker  

ERIC Educational Resources Information Center

The authors have developed the "Ethernet Speaker" (ES), a network-enabled single board computer embedded into a conventional audio speaker. Audio streams are transmitted in the local area network using multicast packets, and the ES can select any one of them and play it back. A key requirement for the ES is that it must be capable of playing any…

Mandrekar, Ishan; Prevelakis, Vassilis; Turner, David Michael

2004-01-01

319

Audio enhanced 3D interfaces for visually impaired users  

Microsoft Academic Search

Three dimensional computer applications such as CAD packages are often difficult to use because of inadequate depth feedback to the user. It has, however, been shown that audio feedback can help improve a user's sense of depth perception. This paper describes an experiment which evaluates the use of three different audio environments in a 3D task undertakeh by visually impaired

Stephen W. Mereu; Rick Kazman

1996-01-01

320

Redundancy Control in Real-Time Internet Audio Conferencing  

Microsoft Academic Search

The use of redundant audio encoding has been advocated for lossy networks like the Internet(1, 2) as a way of reducing the impact of loss in audio- conferences. We present a model of loss and determine howtheamountofredundancy shouldbe variedwiththe loss rate. In addition, we make loss measurements and make a preliminary investigation of the position of re- dundantencodings relative to

Isidor Kouvelas; Orion Hodson; Vicky Hardman; Jon Crowcroft

1997-01-01

321

A Case Study on Audio Feedback with Geography Undergraduates  

ERIC Educational Resources Information Center

Several small-scale studies have suggested that audio feedback can help students to reflect on their learning and to develop deep learning approaches that are associated with higher attainment in assessments. For this case study, Geography undergraduates were given audio feedback on a written essay assignment, alongside traditional written…

Rodway-Dyer, Sue; Knight, Jasper; Dunne, Elizabeth

2011-01-01

322

User-driven recognition of audio events in news videos  

Microsoft Academic Search

We propose a method for user-driven recognition of events in audio streams, aiming to assist journalists towards easily annotate unedited audiovisual content. Nonlocal information provided by the user, as for example that the sound of applause exists within the video, is used for adapting the audio event classifiers so as to detect the exact position of these events in the

Theodoros Giannakopoulos; Sergios Petridis; Stavros Perantonis

2010-01-01

323

AUDIO FINGERPRINT IDENTIFICATION BY APPROXIMATE STRING Jerome Lebosse  

E-print Network

AUDIO FINGERPRINT IDENTIFICATION BY APPROXIMATE STRING MATCHING Jerome Lebosse France Telecom R boulevard du Marchal Juin 14050 Caen, FRANCE luc.brun@greyc.ensicaen.fr ABSTRACT An audio fingerprint first presents a fingerprint extraction algorithm. The identification task is performed by a new

Paris-Sud XI, Université de

324

Representation of sound fields for audio recording and reproduction  

E-print Network

Representation of sound fields for audio recording and reproduction F. M. Fazia , M. Noisternigb of sound fields for audio recording and reproduction F. M. Fazia , M. Noisternigb and O. Warusfelb and loudspeaker arrays are often used for the recording and reproduction of a given sound field. A number

Paris-Sud XI, Université de

325

The Audio-Visual Equipment Directory. Seventeenth Edition.  

ERIC Educational Resources Information Center

The following types of audiovisual equipment are catalogued: 8 mm. and 16 mm. motion picture projectors, filmstrip and sound filmstrip projectors, slide projectors, random access projection equipment, opaque, overhead, and micro-projectors, record players, special purpose projection equipment, audio tape recorders and players, audio tape…

Herickes, Sally, Ed.

326

MUSTAJUURI - AN APPLICATION AND TOOLKIT FOR INTERACTIVE AUDIO PROCESSING  

Microsoft Academic Search

Mustajuuri is a freeware application and toolkit for audio sig- nal processing. It is designed for quick prototyping, testing and combination of audio or MIDI processing modules. Its main fo- cus is on efficient and low-latency real-time operation. Mustaju- uri offers an extremely flexible plugin architecture. By creating new plugins programmmers can extend Mustajuuri to meet new needs. The C++

Tommi Ilmonen

2001-01-01

327

Content-Based Retrieval of Music and Audio  

Microsoft Academic Search

Though many systems exist for content-based retrieval of images, little work has been done on the audio portionof the multimedia stream. This paper presents a system to retrieve audio documents by acoustic similarity. Thesimilarity measure is based on statistics derived from a supervised vector quantizer, rather than matching simple pitchor spectral characteristics. The system is thus able to learn distinguishing

Jonathan T. Foote

1997-01-01

328

Visualizing music and audio using self-similarity  

Microsoft Academic Search

This paper presents a novel approach to visualizing the time structure of music and audio. The acoustic similarity between any two instants of an audio recording is displayed in a 2D representation, allowing identification of structural and rhythmic characteristics. Examples are presented for classical and popular music. Applications include content-based analysis and segmentation, as well as tempo and structure extraction.

Jonathan Foote

1999-01-01

329

Queen Mary's `Media & Arts Technology Studios' Audio System Design  

E-print Network

next to the listening room in a space that used to be a large open plan computing lab area). The control room is provides facilities to record and monitor audio from the other rooms. All the cablingQueen Mary's `Media & Arts Technology Studios' Audio System Design Martin J. Morrell1 , Christopher

Reiss, Josh

330

Comparison of Audio Signal Codings for Zipf Analysis  

Microsoft Academic Search

We present in this paper a comparison of audio signal codings that we have developed in order to study xiphoidal sounds. These sounds are produced by the lower oesophageal sphincter whose dysfonctionnement can be responsible for the gastro-eosophageal reflux phenomenon. Our goal is to extract pertinent information from audio signals in order to characterize the pathology of the pa- tients

E. DELLANDREA; P. MAKRIS; N. VINCENT; M. BOIRON

331

2DEQ: An Intuitive Audio Equalizer Andrew T. Sabin  

E-print Network

2DEQ: An Intuitive Audio Equalizer Andrew T. Sabin Northwestern University a of a simple, intuitive audio equalizer with the idea that our approach could also be applied to other types of music production tools. First, users generate a large set of equalization curves representative

Bustamante, Fabián E.

332

Effective Use of Audio Media in Multimedia Presentations.  

ERIC Educational Resources Information Center

This paper emphasizes research-based reasons for adding audio to multimedia presentations. The first section summarizes suggestions from a review of research on the effectiveness of audio media when accompanied by other forms of media; types of research studies (e.g., evaluation, intra-medium, and aptitude treatment interaction studies) are also…

Kerr, Brenda

333

RTP Profile for Audio and Video Conferences with Minimal Control  

Microsoft Academic Search

This note describes a profile for the use of the real-time transport protocol (RTP) andthe associated control protocol, RTCP, within audio and video multiparticipant conferenceswith minimal control. It provides interpretations of generic fields within the RTP specificationsuitable for audio and video conferences. In particular, this document defines a set of defaultmappings from payload type numbers to encodings.The document also describes

Henning Schulzrinne

1995-01-01

334

Effect of Audio vs. Video on Aural Discrimination of Vowels  

ERIC Educational Resources Information Center

Despite the growing use of media in the classroom, the effects of using of audio versus video in pronunciation teaching has been largely ignored. To analyze the impact of the use of audio or video training on aural discrimination of vowels, 61 participants (all students at a large American university) took a pre-test followed by two training…

McCrocklin, Shannon

2012-01-01

335

Control Mechanisms for Packet Audio in the Internet  

Microsoft Academic Search

The Internet provides a single class best effort service. From an application's point of view, this service amounts in practice to providing channels with time-varying characteristics such as delay and loss distributions. One way to support real time applications such as interactive audio given this service is to use control mechanisms that adapt the audio coding and decoding processes based

Jean-chrysostome Bolot; Andrés Vega-garcía

1996-01-01

336

Single source noise reduction of received HF audio: experimental study  

NASA Astrophysics Data System (ADS)

This paper visits the application of single-source noise reduction on received audio over a HF channel. The noise reduction algorithm is typically used in vocoder noise processing at the transmitter before encoding. This study presents the results of the algorithm effects by objectively measuring audio quality through the use of industry standard PESQ analysis.

Campbell, Eric C.; Alva, Carlos O.

2014-05-01

337

RTP Profile for Audio and Video Conferences with Minimal Control  

Microsoft Academic Search

This memo describes a profile for the use of the real-time transport protocol (RTP), version 2, and the associated control protocol, RTCP, within audio and video multiparticipant conferences with minimal control. It provides interpretations of generic fields within the RTP specification suitable for audio and video conferences. In particular, this document defines a set of default mappings from payload type

H. Schulzrinne

1996-01-01

338

Beyond Podcasting: Creative Approaches to Designing Educational Audio  

ERIC Educational Resources Information Center

This paper discusses a university-wide pilot designed to encourage academics to creatively explore learner-centred applications for digital audio. Participation in the pilot was diverse in terms of technical competence, confidence and contextual requirements and there was little prior experience of working with digital audio. Many innovative…

Middleton, Andrew

2009-01-01

339

Classification of general audio data for content-based retrieval  

Microsoft Academic Search

In this paper, we address the problem of classification of continuous general audio data (GAD) for content-based retrieval, and describe a scheme that is able to classify audio segments into seven categories consisting of silence, single speaker speech, music, environmental noise, multiple speakers' speech, simultaneous speech and music, and speech and noise. We studied a total of 143 classification features

Dongge Li; Ishwar K. Sethi; Nevenka Dimitrova; Thomas Mcgee

2001-01-01

340

Using Audio Books to Improve Reading and Academic Performance  

ERIC Educational Resources Information Center

This article highlights significant research about what below grade-level reading means in middle school classrooms and suggests a tested approach to improve reading comprehension levels significantly by using audio books. The use of these audio books can improve reading and academic performance for both English language learners (ELLs) and for…

Montgomery, Joel R.

2009-01-01

341

Tunneling machine  

SciTech Connect

A diametrically compact tunneling machine for boring tunnels is disclosed. The machine includes a tubular support frame having a hollow piston mounted therein which is movable from a retracted position in the support frame to an extended position. A drive shaft is rotatably mounted in the hollow piston and carries a cutter head at one end. The hollow piston is restrained against rotational movement relative to the support frame and the drive shaft is constrained against longitudinal movement relative to the hollow piston. A plurality of radially extendible feet project from the support frame to the tunnel wall to grip the tunnel wall during a tunneling operation wherein the hollow piston is driven forwardly so that the cutter head works on the tunnel face. When the hollow piston is fully extended, a plurality of extendible support feet, which are fixed to the rearward and forward ends of the hollow piston, are extended, the radially extendible feet are retracted and the support frame is shifted forwardly by the piston so that a further tunneling operation may be initiated.

Snyder, L.L.

1980-02-19

342

Course info Machine Learning  

E-print Network

Course info Machine Learning Real life problems Lecture 1: Machine Learning Problem Qinfeng (Javen) Shi 28 July 2014 Intro. to Stats. Machine Learning COMP SCI 4401/7401 Qinfeng (Javen) Shi Lecture 1: Machine Learning Problem #12;Course info Machine Learning Real life problems Table of Contents I 1 Course

Shi, Qinfeng "Javen"

343

Socially guided machine learning  

E-print Network

Social interaction will be key to enabling robots and machines in general to learn new tasks from ordinary people (not experts in robotics or machine learning). Everyday people who need to teach their machines new things ...

Thomaz, Andrea Lockerd

2006-01-01

344

Single Chip Audio Processor For Second Generation 6 Chip MUSE (HDTV) Decoder  

Microsoft Academic Search

A singlechip audio processor has been developed to decode audio signals multiplexed into the MUSE signal. With only a single peripheral memory device required as opposed to several needed previously, thii IC offers additional userconvenient features, including Schannel digital audio outputs and 4 to 2 Channel conversion. The appearance of this IC should make possible further downsizing of the audio

Masanori Kurita; Kouji Yoshitomi; Yukio Otobe; Kiyoshi Kohiyama; F. Asami; N. Tahara; K. Sugihara

1992-01-01

345

AUDIO-VISUAL CONTENT-BASED MULTIMEDIA INDEXING AND RETRIEVAL THE MUVIS FRAMEWORK  

E-print Network

AUDIO-VISUAL CONTENT-BASED MULTIMEDIA INDEXING AND RETRIEVAL ­ THE MUVIS FRAMEWORK Moncef Gabbouj and Serkan Kiranyaz Institute of Signal Processing Tampere University of Technology Tampere, Finland moncef multimedia types such as audio, video, audio/video interlaced in several formats. It allows real-time audio

Gabbouj, Moncef

346

Automatic audio and manual transcripts alignment, time-code transfer and selection of exact transcripts  

E-print Network

Automatic audio and manual transcripts alignment, time-code transfer and selection of exact focuses on automatic processing of sibling resources of audio and written documents, such as available in audio archives or for parliament debates: written texts are close but not exact audio transcripts

Boula de Mareüil, Philippe

347

Lossless Audio Compression: A Case Study CISE Technical Report 08-415  

E-print Network

Lossless Audio Compression: A Case Study CISE Technical Report 08-415 Department of Computer Audio compression is used everywhere ­ largely due to the rise of the internet, computers, and embedded to compress large audio signals into small, high-quality, convenient formats. While lossy audio formats, like

Mishra, Prabhat

348

Audio Engineering Society Convention PaperPresented at the 111th Convention  

E-print Network

Audio Engineering Society Convention PaperPresented at the 111th Convention 2001 September 21 for the contents. Additional papers may be obtained by sending request and remittance to Audio Engineering Society of the Audio Engineering Society. Bayesian Harmonic Analysis for Audio Testing and Measurement Thomas J. Loredo

Loredo, Thomas J.

349

Andersen Efficient Audio Power Amplification -Challenges International Conference, Copenhagen, Denmark, 2005 September 24 1  

E-print Network

Andersen Efficient Audio Power Amplification - Challenges AES 27th International Conference, Copenhagen, Denmark, 2005 September 2­4 1 EFFICIENT AUDIO POWER AMPLIFICATION - CHALLENGES MICHAEL A. E a decade efficient audio power amplification has evolved and today switch-mode audio power amplification

350

Audio-visual Convolutive Blind Source Separation Qingju Liu, Wenwu Wang, Philip Jackson  

E-print Network

Audio-visual Convolutive Blind Source Separation Qingju Liu, Wenwu Wang, Philip Jackson Centre.Liu,W.Wang,P.Jackson}@surrey.ac.uk Abstract--We present a novel method for speech separation from their audio mixtures using the audio to characterise statistically the audio- visual coherence with features obtained from the training set

Jackson, Philip JB

351

AUDIO ENCODING BASED ON THE EMPIRICAL MODE DECOMPOSITION Kais Khaldi1  

E-print Network

AUDIO ENCODING BASED ON THE EMPIRICAL MODE DECOMPOSITION Kais Khaldi1 , Abdel-Ouahab Boudraa23. The pro- posed approach is applied to several audio signals compres- sion. Results are compared to wavelet and audio quality. 1. INTRODUCTION Audio coding at low bit rate and high fidelity is an impor- tant task

Paris-Sud XI, Université de

352

Denoising of Audio Data by Nonlinear Diffusion Martin Welk, Achim Bergmeister, and Joachim Weickert  

E-print Network

Denoising of Audio Data by Nonlinear Diffusion Martin Welk, Achim Bergmeister, and Joachim Weickert of audio sig- nals. An important difference between image and audio signals is which parts of the signal, they encode essential information in audio data. To adapt diffusion to this setting, we apply

353

Audio Structuring and Personalized Retrieval Using Ontologies Latifur Khan and Dennis McLeod  

E-print Network

Audio Structuring and Personalized Retrieval Using Ontologies Latifur Khan and Dennis Mc. Specifically, the focus here is on multimedia audio databases: we are developing a Personal AudioCast system at the USC Integrated Media Systems Center which creates and delivers to an end-user personalized audio

Texas at Dallas, University of

354

Effects of aging on audio-visual speech integration.  

PubMed

This study investigated the impact of aging on audio-visual speech integration. A syllable identification task was presented in auditory-only, visual-only, and audio-visual congruent and incongruent conditions. Visual cues were either degraded or unmodified. Stimuli were embedded in stationary noise alternating with modulated noise. Fifteen young adults and 15 older adults participated in this study. Results showed that older adults had preserved lipreading abilities when the visual input was clear but not when it was degraded. The impact of aging on audio-visual integration also depended on the quality of the visual cues. In the visual clear condition, the audio-visual gain was similar in both groups and analyses in the framework of the fuzzy-logical model of perception confirmed that older adults did not differ from younger adults in their audio-visual integration abilities. In the visual reduction condition, the audio-visual gain was reduced in the older group, but only when the noise was stationary, suggesting that older participants could compensate for the loss of lipreading abilities by using the auditory information available in the valleys of the noise. The fuzzy-logical model of perception confirmed the significant impact of aging on audio-visual integration by showing an increased weight of audition in the older group. PMID:25324091

Huyse, Aurélie; Leybaert, Jacqueline; Berthommier, Frédéric

2014-10-01

355

Space Shuttle Orbiter audio subsystem. [to communication and tracking system  

NASA Technical Reports Server (NTRS)

The selection of the audio multiplex control configuration for the Space Shuttle Orbiter audio subsystem is discussed and special attention is given to the evaluation criteria of cost, weight and complexity. The specifications and design of the subsystem are described and detail is given to configurations of the audio terminal and audio central control unit (ATU, ACCU). The audio input from the ACCU, at a signal level of -12.2 to 14.8 dBV, nominal range, at 1 kHz, was found to have balanced source impedance and a balanced local impedance of 6000 + or - 600 ohms at 1 kHz, dc isolated. The Lyndon B. Johnson Space Center (JSC) electroacoustic test laboratory, an audio engineering facility consisting of a collection of acoustic test chambers, analyzed problems of speaker and headset performance, multiplexed control data coupled with audio channels, and the Orbiter cabin acoustic effects on the operational performance of voice communications. This system allows technical management and project engineering to address key constraining issues, such as identifying design deficiencies of the headset interface unit and the assessment of the Orbiter cabin performance of voice communications, which affect the subsystem development.

Stewart, C. H.

1978-01-01

356

Turing Machines A Turing machine M is  

E-print Network

to perform computations. A function f is Turing-computable iff there exists some TM M such that f(x) = y iff1 Turing Machines A Turing machine M is: Q, a set of internal states. , the input alphabet. , tape of Turing machine M on input w Initial state q0, tape w, blanks around w, read-write head at first symbol

Bylander, Tom

357

Robust audio-visual speech recognition under noisy audio-video conditions.  

PubMed

This paper presents the maximum weighted stream posterior (MWSP) model as a robust and efficient stream integration method for audio-visual speech recognition in environments, where the audio or video streams may be subjected to unknown and time-varying corruption. A significant advantage of MWSP is that it does not require any specific measurements of the signal in either stream to calculate appropriate stream weights during recognition, and as such it is modality-independent. This also means that MWSP complements and can be used alongside many of the other approaches that have been proposed in the literature for this problem. For evaluation we used the large XM2VTS database for speaker-independent audio-visual speech recognition. The extensive tests include both clean and corrupted utterances with corruption added in either/both the video and audio streams using a variety of types (e.g., MPEG-4 video compression) and levels of noise. The experiments show that this approach gives excellent performance in comparison to another well-known dynamic stream weighting approach and also compared to any fixed-weighted integration approach in both clean conditions or when noise is added to either stream. Furthermore, our experiments show that the MWSP approach dynamically selects suitable integration weights on a frame-by-frame basis according to the level of noise in the streams and also according to the naturally fluctuating relative reliability of the modalities even in clean conditions. The MWSP approach is shown to maintain robust recognition performance in all tested conditions, while requiring no prior knowledge about the type or level of noise. PMID:23757540

Stewart, Darryl; Seymour, Rowan; Pass, Adrian; Ming, Ji

2014-02-01

358

The Fields Institute: Lecture Audio and Slides  

NSDL National Science Digital Library

The Fields Institute for Research in Mathematical Sciences aims to "enhance mathematical activity in Canada by bringing together mathematicians from Canada and abroad, and by promoting contact and collaboration between professional mathematicians and the increasing numbers of users of mathematics." They support research in pure and applied mathematics, statistics and computer science, as well as collaborative projects between mathematicians and those applying mathematics in areas such as engineering, the physical and biological sciences, medicine, economics and finance, telecommunications and information systems. They offer this website with audio files and slides from events and lectures at the Fields Institute. The lectures, given by scientists from around the world, address such topics as Quantitative Finance, String Theory, Homological Algebra, Combinatorics, and much more. The files are organized by academic year and series title. In cases where the files are not available to download, they provide information on how to obtain the files.

359

A direct broadcast satellite-audio experiment  

NASA Technical Reports Server (NTRS)

System studies have been carried out over the past three years at the Jet Propulsion Laboratory (JPL) on digital audio broadcasting (DAB) via satellite. The thrust of the work to date has been on designing power and bandwidth efficient systems capable of providing reliable service to fixed, mobile, and portable radios. It is very difficult to predict performance in an environment which produces random periods of signal blockage, such as encountered in mobile reception where a vehicle can quickly move from one type of terrain to another. For this reason, some signal blockage mitigation techniques were built into an experimental DAB system and a satellite experiment was conducted to obtain both qualitative and quantitative measures of performance in a range of reception environments. This paper presents results from the experiment and some conclusions on the effectiveness of these blockage mitigation techniques.

Vaisnys, Arvydas; Abbe, Brian; Motamedi, Masoud

1992-01-01

360

Sony's Data Discman: A Look at These New Portable Information Machines and What They Mean for CD-ROM Developers.  

ERIC Educational Resources Information Center

Describes a portable CD-ROM machine intended for the mass market that provides access to searchable text, graphics, and audio through a user-friendly interface. Six search modes and other system features are reviewed, and electronic texts for the unit are introduced. A table compares features of the two available models. (NRP)

Bonime, Andrew

1992-01-01

361

Prosodic Phrasing: Machine and Human Evaluation  

Microsoft Academic Search

This paper describes a set of experiments aiming at the construction and evaluation of a new phrasing module for European Portuguese text-to-speech synthesis, using classification and regression trees learned from hand-labelled texts. Using the assessment criteria of matching boundary predictions against the corresponding labelled ones, the best solution achieves an overall performance of 91.9%, with 86.3% of correctly assigned breaks

M. Céu Viana; Luís C. Oliveira; Ana I. Mata

2003-01-01

362

Noise-Canceling Helmet Audio System  

NASA Technical Reports Server (NTRS)

A prototype helmet audio system has been developed to improve voice communication for the wearer in a noisy environment. The system was originally intended to be used in a space suit, wherein noise generated by airflow of the spacesuit life-support system can make it difficult for remote listeners to understand the astronaut s speech and can interfere with the astronaut s attempt to issue vocal commands to a voice-controlled robot. The system could be adapted to terrestrial use in helmets of protective suits that are typically worn in noisy settings: examples include biohazard, fire, rescue, and diving suits. The system (see figure) includes an array of microphones and small loudspeakers mounted at fixed positions in a helmet, amplifiers and signal-routing circuitry, and a commercial digital signal processor (DSP). Notwithstanding the fixed positions of the microphones and loudspeakers, the system can accommodate itself to any normal motion of the wearer s head within the helmet. The system operates in conjunction with a radio transceiver. An audio signal arriving via the transceiver intended to be heard by the wearer is adjusted in volume and otherwise conditioned and sent to the loudspeakers. The wearer s speech is collected by the microphones, the outputs of which are logically combined (phased) so as to form a microphone- array directional sensitivity pattern that discriminates in favor of sounds coming from vicinity of the wearer s mouth and against sounds coming from elsewhere. In the DSP, digitized samples of the microphone outputs are processed to filter out airflow noise and to eliminate feedback from the loudspeakers to the microphones. The resulting conditioned version of the wearer s speech signal is sent to the transceiver.

Seibert, Marc A.; Culotta, Anthony J.

2007-01-01

363

DC Machine  

NSDL National Science Digital Library

The effect of the commutator is to produce a fixed spatial distribution of current directions in the armature conductors (shown as blue & green circles) independent of shaft rotation. The field created by these currents (armature reaction) is vertically directed along the quadrature axis. The field established by the excitation of the stator poles is directed along the horizontal direct axis. This package presents computer models of electric machines based on the space vector formulation leading to the assessment of the dynamic performance of open- and closed-loopac and dc drive systems. The Simulink/Matlab implementation is adopted because of its inherent integration of vectorized system representations in block diagram form, of numerical analysis methods, of graphical portrayal of time evolutions of signals combined with the simple realization of the functionality of controllers and power electronic excitations. The development of Simulink models of drive assemblies is a relatively simple task consisting of combining input-output block representation of the various components making up the system; these blocks can readily be reused (utilizing standard copy-and-paste techniques) to modify the system components or the configuration of the system. This approach provides a powerful design tool because of the ease of observing the effects of parameters modifications and of differing system configurations and control strategies.

Riaz, Mahmoud

2012-08-29

364

The power of digital audio in interactive instruction: An unexploited medium  

SciTech Connect

Widespread use of audio in computer-based training (CBT) occurred with the advent of the interactive videodisc technology. This paper discusses the alternative of digital audio, which, unlike videodisc audio, enables one to rapidly revise the audio used in the CBT and which may be used in nonvideo CBT applications as well. We also discuss techniques used in audio script writing, editing, and production. Results from evaluations indicate a high degree of user satisfaction. 4 refs.

Pratt, J.; Trainor, M.

1989-01-01

365

Text-Like Segmentation of General Audio for Content-Based Retrieval  

Microsoft Academic Search

Automatic detection of (semantically) meaningful audio segments, or audio scenes, is an important step in high-level semantic inference from general audio signals, and can benefit various content-based applications involving both audio and multimodal (multimedia) data sets. Motivated by the known limitations of traditional low-level feature-based approaches, we propose in this paper a novel approach to discover audio scenes, based on

Lie Lu; Alan Hanjalic

2009-01-01

366

Iowa Virtual Literacy Protocol: A Pre-Experimental Design Using Kurzweil 3000 Text-to-Speech Software with Incarcerated Adult Learners  

ERIC Educational Resources Information Center

The problem: The increasingly competitive global economy demands literate, educated workers. Both men and women experience the effects of education on employment rates and income. Racial and ethnic minorities, English language learners, and especially those with prison records are most deeply affected by the economic consequences of dropping out…

McCulley, Yvette K.

2012-01-01

367

Direct broadcast satellite-audio, portable and mobile reception tradeoffs  

NASA Technical Reports Server (NTRS)

This paper reports on the findings of a systems tradeoffs study on direct broadcast satellite-radio (DBS-R). Based on emerging advanced subband and transform audio coding systems, four ranges of bit rates: 16-32 kbps, 48-64 kbps, 96-128 kbps and 196-256 kbps are identified for DBS-R. The corresponding grades of audio quality will be subjectively comparable to AM broadcasting, monophonic FM, stereophonic FM, and CD quality audio, respectively. The satellite EIRP's needed for mobile DBS-R reception in suburban areas are sufficient for portable reception in most single family houses when allowance is made for the higher G/T of portable table-top receivers. As an example, the variation of the space segment cost as a function of frequency, audio quality, coverage capacity, and beam size is explored for a typical DBS-R system.

Golshan, Nasser

1992-01-01

368

HHT-based audio coding Kais Khaldi1,2  

E-print Network

signal by superposition of the demodulated IMFs. The proposed approach is applied to audio signals-mean with symmetric envelopes AM-FM components. Basis functions of EMD are derived from the signal itself and hence

Paris-Sud XI, Université de

369

Companding techniques for high dynamic range audio CODEC receiver path  

E-print Network

In this thesis, an audio CODEC receiver path has been modified by the addition of companding techniques. Companding compresses the input signal and expands the output signal according to the input power strength such that ...

Ma, Yunjie, M. Eng. Massachusetts Institute of Technology

2009-01-01

370

A Monolithic Integrated Digital Signal Processor for Hifi Audio Applications  

Microsoft Academic Search

The following text describes a new programmable singlechip digital signal processor for top-of-the-range digital audio applications whose architecture and features fulfil exacting requirements. The new digital signal processor (DSP) has been developed for hifi digital audio systems with an analog interface IC (AIC). It is fabricated in 2 micron CMOS technology, and a bond-out version is also available for development

P. Anders; P. Aslett

1987-01-01

371

Stream confidence estimation for audio-visual speech recognition  

Microsoft Academic Search

We investigate the use of single modality confidence measures as a means of estimating adaptive, local weights for improved au- dio-visual automatic speech recognition. We limit our work to the toy problem of audio-visual phonetic classification by means of a two-stream Gaussian mixture model (GMM), where each stream models the class conditional audio- or visual-only obser- vation probability, raised to

Gerasimos Potamianos; Chalapathy Neti

2000-01-01

372

Hierarchical discriminant features for audio-visual LVCSR  

Microsoft Academic Search

We propose the use of a hierarchical, two-stage discriminant transformation for obtaining audio-visual features that improve automatic speech recognition. Linear discriminant analysis (LDA), followed by a maximum likelihood linear transform (MLLT) is first applied to MFCC based audio-only features, as well as on visual only features, obtained by a discrete cosine transform of the video region of interest. Subsequently, a

Gerasimos Potamianos; Juergen Luettiiz; Chalapathy Neti

2001-01-01

373

Speaker Independent Audio-Visual Database For Bimodal Asr  

Microsoft Academic Search

This paper describes the audio-visual database collected at AT&T Labs--Research for the study of bimodal speech recognition. To date, this database consists of two multiple speaker parts, namely isolated confusable words and connected letters, thus allowing the study of some popular and relatively simple speaker independent audio-visual recognition tasks. In addition, a single speaker connected digits database is collected to

Gerasimos Potamianos; Eric Cosatto; Hans Peter Graf; David B. Roe

1997-01-01

374

Asynchrony modeling for audio-visual speech recognition  

Microsoft Academic Search

We investigate the use of multi-stream HMMs in the automatic recognition of audio-visual speech. Multi-stream HMMs allow the modeling of asynchrony between the audio and visual state sequences at a variety of levels (phone, syllable, word, etc.) and are equivalent to product, or composite, HMMs. In this paper, we consider such models synchronized at the phone boundary level, allowing various

Guillaume Gravier; Gerasimos Potamianos; Chalapathy Neti

2002-01-01

375

Audio watermarking method robust against time- and frequency-fluctuation  

Microsoft Academic Search

In this paper, we describe an audio watermarking algorithm that can embed a multiple-bit message which is robust against wow-and-flutter, cropping, noise-addition, pitch-shift, and audio compressions such as MP3. The algorithm calculates and manipulates the magnitudes of segmented areas in the time-frequency plane of the content using short-term DFTs. The detection algorithm correlates the magnitudes with a pseudo-random array that

Ryuki Tachibana; Shuichi Shimizu; Taiga Nakamura; Seiji Kobayashi

2001-01-01

376

A single-chip audio signal processor for HDTV receiver  

Microsoft Academic Search

A single-chip LSI has been developed to decode audio signals which are multiplexed into MUSE (multiple sub-Nyquist sampling encoding) signals for transmitting HDTV (high-definition television) signals. Because the LSI requires just a few peripheral parts (such as two RAMs and one voltage-controlled oscillator circuit), use of the LSI can reduce the size of the MUSE audio signal decoding digital unit.

Kazuo Naganawa; Yoshihiro Hori; Shuji Yanase; Nobuo Itoh; Yoshikazu Asano

1991-01-01

377

The Future of Audio Reproduction - Technology - Formats - Applications  

Microsoft Academic Search

\\u000a The introduction of new techniques for audio reproduction such as binaural technology, Wave Field Synthesis and Higher Order Ambisonics is accompanied by a paradigm shift from channel-based to object-based transmission and storage of spatial audio. The separate coding of source signal and source location is not only more efficient\\u000a considering the number of channels used for reproduction by large loudspeaker

Matthias Geier; Sascha Spors; Stefan Weinzierl

2008-01-01

378

Frank Baumgarte Application of a physiological ear model to irrelevance reduction in audio coding AES 17 th International conference on High Quality Audio Coding 1  

E-print Network

Frank Baumgarte Application of a physiological ear model to irrelevance reduction in audio coding AES 17 th International conference on High Quality Audio Coding 1 APPLICATION OF A PHYSIOLOGICAL EAR@tnt.uni--hannover.de A previously published physiological ear model is applied as perceptual model to an audio coder complying

379

Proc. of the 9th Int. Conference on Digital Audio Effects (DAFX'06), Montreal, Canada, September 18-20, 2006 PARAMETRIC CODING OF STEREO AUDIO  

E-print Network

Proc. of the 9th Int. Conference on Digital Audio Effects (DAFX'06), Montreal, Canada, September 18-20, 2006 DAFX-1 PARAMETRIC CODING OF STEREO AUDIO BASED ON PRINCIPAL COMPONENT ANALYSIS Manuel Briand parametric coding of multichannel audio is mainly based on Binaural Cue Coding (BCC). Another multichannel

Paris-Sud XI, Université de

380

Proc. 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, Oct. 1720, 1999 ADVANCES IN PARAMETRIC AUDIO CODING  

E-print Network

Proc. 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, Oct. 17­20, 1999 ADVANCES IN PARAMETRIC AUDIO CODING Heiko Purnhagen University of Hannover@tnt.uni­hannover.de ABSTRACT Parametric modelling provides an efficient representation of gen­ eral audio signals

381

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 20, NO. 10, DECEMBER 2012 2657 Real-Time Perception-Based Clipping of Audio  

E-print Network

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 20, NO. 10, DECEMBER 2012 2657 Real-Time Perception-Based Clipping of Audio Signals Using Convex Optimization Bruno Defraene, Student-time audio applications, yet the use of existing clipping techniques generally has a detrimental effect

382

IEEE TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, VOL.V, NO.N, MMMM 2013 1 Parametric Audio Coding with Exponentially  

E-print Network

IEEE TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, VOL.V, NO.N, MMMM 2013 1 Parametric Audio Coding with Exponentially Damped Sinusoids Olivier Derrien, Member, IEEE, Roland Badeau, Senior popular techniques for low bitrate audio coding. Usually, the sinusoidal parameters (amplitude, pulsation

Paris-Sud XI, Université de

383

1890 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 6, NOVEMBER 2006 Objective Assessment of Speech and Audio  

E-print Network

1890 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 6, NOVEMBER 2006 Objective Assessment of Speech and Audio Quality--Technology and Applications Antony W. Rix, John G or monitoring speech and audio quality. By measuring perceived quality on an easily-understood subjective scale

384

Can Machines BeCan Machines Be Polite?Polite?  

E-print Network

Can Machines BeCan Machines Be Polite?Polite? Michael A. CovingtonMichael A. Covington Artificial Intelligence CenterArtificial Intelligence Center #12;Can machines be polite?Can machines be polite? Machines have beenMachines have been annoying peopleannoying people ever since humanever since human--machine

Covington, Michael A.

385

Hybrid Quantum Cloning Machine  

E-print Network

In this work, we introduce a special kind of quantum cloning machine called Hybrid quantum cloning machine. The introduced Hybrid quantum cloning machine or transformation is nothing but a combination of pre-existing quantum cloning transformations. In this sense it creates its own identity in the field of quantum cloners. Hybrid quantum cloning machine can be of two types: (i) State dependent and (ii) State independent or Universal. We study here the above two types of Hybrid quantum cloning machines. Later we will show that the state dependent hybrid quantum-cloning machine can be applied on only four input states. We will also find in this paper another asymmetric universal quantum cloning machine constructed from the combination of optimal universal B-H quantum cloning machine and universal anti-cloning machine. The fidelities of the two outputs are different and their values lie in the neighborhood of ${5/6} $

Satyabrata Adhikari; A. K. Pati; Indranil Chakrabarty; B. S. Choudhury

2007-06-14

386

Personal audio with a planar bright zone.  

PubMed

Reproduction of multiple sound zones, in which personal audio programs may be consumed without the need for headphones, is an active topic in acoustical signal processing. Many approaches to sound zone reproduction do not consider control of the bright zone phase, which may lead to self-cancellation problems if the loudspeakers surround the zones. Conversely, control of the phase in a least-squares sense comes at a cost of decreased level difference between the zones and frequency range of cancellation. Single-zone approaches have considered plane wave reproduction by focusing the sound energy in to a point in the wavenumber domain. In this article, a planar bright zone is reproduced via planarity control, which constrains the bright zone energy to impinge from a narrow range of angles via projection in to a spatial domain. Simulation results using a circular array surrounding two zones show the method to produce superior contrast to the least-squares approach, and superior planarity to the contrast maximization approach. Practical performance measurements obtained in an acoustically treated room verify the conclusions drawn under free-field conditions. PMID:25324075

Coleman, Philip; Jackson, Philip J B; Olik, Marek; Pedersen, Jan Abildgaard

2014-10-01

387

Extreme Kicking Machine  

NSDL National Science Digital Library

In this design challenge activity follow up to "Kicking Machine", learners add a hands-free feature to their Kicking Machine. Learners modify their kicking machine to have it either release the pendulum or rubber band while standing three feet away or have it automatically feed balls into the kicking machine, one after another. They must figure out how to integrate the new feature into the existing structure and use the design process to make sure the modified kicking machine works properly. Educators can use this activity to demonstrate simple machines and kinetic/potential energy.

2012-08-20

388

Digital Audio Radio Broadcast Systems Laboratory Testing Nearly Complete  

NASA Technical Reports Server (NTRS)

Radio history continues to be made at the NASA Lewis Research Center with the completion of phase one of the digital audio radio (DAR) testing conducted by the Consumer Electronics Group of the Electronic Industries Association. This satellite, satellite/terrestrial, and terrestrial digital technology will open up new audio broadcasting opportunities both domestically and worldwide. It will significantly improve the current quality of amplitude-modulated/frequency-modulated (AM/FM) radio with a new digitally modulated radio signal and will introduce true compact-disc-quality (CD-quality) sound for the first time. Lewis is hosting the laboratory testing of seven proposed digital audio radio systems and modes. Two of the proposed systems operate in two modes each, making a total of nine systems being tested. The nine systems are divided into the following types of transmission: in-band on-channel (IBOC), in-band adjacent-channel (IBAC), and new bands. The laboratory testing was conducted by the Consumer Electronics Group of the Electronic Industries Association. Subjective assessments of the audio recordings for each of the nine systems was conducted by the Communications Research Center in Ottawa, Canada, under contract to the Electronic Industries Association. The Communications Research Center has the only CCIR-qualified (Consultative Committee for International Radio) audio testing facility in North America. The main goals of the U.S. testing process are to (1) provide technical data to the Federal Communication Commission (FCC) so that it can establish a standard for digital audio receivers and transmitters and (2) provide the receiver and transmitter industries with the proper standards upon which to build their equipment. In addition, the data will be forwarded to the International Telecommunications Union to help in the establishment of international standards for digital audio receivers and transmitters, thus allowing U.S. manufacturers to compete in the world market.

2005-01-01

389

15 CFR 700.31 - Metalworking machines.  

Code of Federal Regulations, 2013 CFR

...section include: Bending and forming machines Boring machines Broaching machines Drilling and tapping machines Electrical discharge, ultrasonic and chemical erosion machines Forging machinery and hammers Gear cutting and...

2013-01-01

390

Interactive Gumball Machine  

NSDL National Science Digital Library

In this activity, learners review the history of gumball machines and explore potential and kinetic energy, while working in teams to build a gumball slide. Teams then design and build their own interactive gumball machine.

2013-08-30

391

Infinite Time Turing Machines  

Microsoft Academic Search

Infinite time Turing machines extend the operation of ordinary Turing machines into transfinite ordinal time. By doing so, they provide a natural model of infinitary computability, a theoretical setting for the analysis of the power and limitations of supertask algorithms.

Joel David Hamkins

2002-01-01

392

LISP Machine Progress Report  

E-print Network

This informal paper introduces the LISP Machine, describes the goals and current status of the project, and explicates some of the key ideas. It covers the LISP machine implementation, LISP as a system language, ...

Bawden, Alan

1977-08-01

393

Stirling machine operating experience  

NASA Technical Reports Server (NTRS)

Numerous Stirling machines have been built and operated, but the operating experience of these machines is not well known. It is important to examine this operating experience in detail, because it largely substantiates the claim that Stirling machines are capable of reliable and lengthy lives. The amount of data that exists is impressive, considering that many of the machines that have been built are developmental machines intended to show proof of concept, and were not expected to operate for any lengthy period of time. Some Stirling machines (typically free-piston machines) achieve long life through non-contact bearings, while other Stirling machines (typically kinematic) have achieved long operating lives through regular seal and bearing replacements. In addition to engine and system testing, life testing of critical components is also considered.

Ross, Brad; Dudenhoefer, James E.

1991-01-01

394

Stirling machine operating experience  

NASA Astrophysics Data System (ADS)

Numerous Stirling machines have been built and operated, but the operating experience of these machines is not well known. It is important to examine this operating experience in detail, because it largely substantiates the claim that Stirling machines are capable of reliable and lengthy lives. The amount of data that exists is impressive, considering that many of the machines that have been built are developmental machines intended to show proof of concept, and were not expected to operate for any lengthy period of time. Some Stirling machines (typically free-piston machines) achieve long life through non-contact bearings, while other Stirling machines (typically kinematic) have achieved long operating lives through regular seal and bearing replacements. In addition to engine and system testing, life testing of critical components is also considered.

Ross, Brad A.; Dudenhoefer, James E.

395

The Advantage of Machines  

NSDL National Science Digital Library

In this lesson, students learn about work as defined by physical science and see that work is made easier through the use of simple machines. Already encountering simple machines everyday, students will be learn about their widespread uses in improving everyday life. This lesson serves as the starting point for the Simple Machines Unit.

Integrated Teaching And Learning Program

396

Millikelvin Lab Machine Shop  

E-print Network

Millikelvin Lab OP105­112 Machine Shop OP132 Resistive Magnet Shop CICC Winding Area Transformers This building is home to the Millikelvin lab, the control room, the resistive magnet and machine shops, the CICC@magnet.fsu.edu (850) 644-4378 (850) 644-0534 2 MACHINE SHOP OP132 Vaughan Williams (A114*) williams

McQuade, D. Tyler

397

Talking Vending Machines  

E-print Network

're in the central Tokyo Station and you're thirsty so you go to the, you know, vending machine to buy a drink. Only this is not your average vending machine. Uh uh. This vending machine has opinions on what beverage you want. Sensors built into large touch panels...

Hacker, Randi

2011-02-02

398

Cable-Twisting Machine  

NASA Technical Reports Server (NTRS)

New cable-twisting machine is smaller and faster than many production units. Is useful mainly in production of short-run special cables. Already-twisted cable can be fed along axis of machine. Faster operation than typical industrial cable-twisting machines possible by using smaller spools of wire.

Kurnett, S.

1982-01-01

399

Automatic Inspection During Machining  

NASA Technical Reports Server (NTRS)

In experimental manufacturing process, numerically-controlled machine tool temporarily converts into inspection machine by installing electronic touch probes and specially-developed numerical-control software. Software drives probes in paths to and on newly machined parts and collects data on dimensions of parts.

Ransom, Clyde L.

1988-01-01

400

Find the Simple Machines  

NSDL National Science Digital Library

This is a web activity about simple machines. Learners will explore a lawn mower and identify six different simple machines which work together to help make our lives easier. This is an excellent activity for exploring how simple machines, and science in general, apply to learners' everyday lives.

Cosi

2000-01-01

401

Your Sewing Machine.  

ERIC Educational Resources Information Center

The programed instruction manual is designed to aid the student in learning the parts, uses, and operation of the sewing machine. Drawings of sewing machine parts are presented, and space is provided for the student's written responses. Following an introductory section identifying sewing machine parts, the manual deals with each part and its…

Peacock, Marion E.

402

What is Machine Learning? About the Course Example Machine Learning  

E-print Network

What is Machine Learning? About the Course Example Machine Learning DD2431 ¨Orjan Ekeberg Oct­Dec, 2007 What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition A Hypothetical Project What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition

Kjellström, Hedvig

403

What is Machine Learning? About the Course Example Machine Learning  

E-print Network

What is Machine Learning? About the Course Example Machine Learning DD2431 ¨Orjan Ekeberg Oct­Dec, 2007 #12;What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition A Hypothetical Project #12;What is Machine Learning? About the Course Example 1 What is Machine Learning

Kjellström, Hedvig

404

What is Machine Learning? About the Course Example Machine Learning  

E-print Network

What is Machine Learning? About the Course Example Machine Learning DD2431 ¨Orjan Ekeberg Oct­Dec, 2008 #12;What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition A Hypothetical Project #12;What is Machine Learning? About the Course Example 1 What is Machine Learning

Kjellström, Hedvig

405

What is Machine Learning? About the Course Example Machine Learning  

E-print Network

What is Machine Learning? About the Course Example Machine Learning DD2431 ¨Orjan Ekeberg Oct­Dec, 2008 What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition A Hypothetical Project What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition

Kjellström, Hedvig

406

Why Machine Learning and Games? Machine Learning in Video Games  

E-print Network

#12;Why Machine Learning and Games? Machine Learning in Video Games DrivatarsTM Reinforcement entertainment valueWhat is the best AI? #12;Why Machine Learning and Games? Machine Learning in Video Games;#12;Why Machine Learning and Games? Machine Learning in Video Games DrivatarsTM Reinforcement Learning

Hunt, Galen

407

Circular tests for HSM machine tools: Bore machining application  

E-print Network

operations lies in understanding the impact of machine tool behaviour on machining time and part qualityCircular tests for HSM machine tools: Bore machining application Abstract: Today's High-Speed Machining (HSM) machine tool combines productivity and part quality. The difficulty inherent in HSM

Paris-Sud XI, Université de

408

Primary Masters in Machine Learning  

E-print Network

Primary Masters in Machine Learning Student Handbook #12;#12;Page 1 Masters in Machine Learning:.......................................................................................8 Machine Learning Journal Club ..................................................................12 #12;Page 3 Introduction The field of machine learning is concerned with the question of how

409

Highlight summarization in golf videos using audio signals  

NASA Astrophysics Data System (ADS)

In this paper, we present an automatic summarization of highlights in golf videos based on audio information alone without video information. The proposed highlight summarization system is carried out based on semantic audio segmentation and detection on action units from audio signals. Studio speech, field speech, music, and applause are segmented by means of sound classification. Swing is detected by the methods of impulse onset detection. Sounds like swing and applause form a complete action unit, while studio speech and music parts are used to anchor the program structure. With the advantage of highly precise detection of applause, highlights are extracted effectively. Our experimental results obtain high classification precision on 18 golf games. It proves that the proposed system is very effective and computationally efficient to apply the technology to embedded consumer electronic devices.

Kim, Hyoung-Gook; Kim, Jin Young

2008-01-01

410

Say What? The Role of Audio in Multimedia Video  

NASA Astrophysics Data System (ADS)

Audio, including interviews, ambient sounds, and music, is a critical-yet often overlooked-part of an effective multimedia video. In February 2010, Linder joined scientists working on the Global Rivers Observatory Project for two weeks of intensive fieldwork in the Congo River watershed. The team's goal was to learn more about how climate change and deforestation are impacting the river system and coastal ocean. Using stills and video shot with a lightweight digital SLR outfit and audio recorded with a pocket-sized sound recorder, Linder documented the trials and triumphs of working in the heart of Africa. Using excerpts from the six-minute Congo multimedia video, this presentation will illustrate how to record and edit an engaging audio track. Topics include interview technique, collecting ambient sounds, choosing and using music, and editing it all together to educate and entertain the viewer.

Linder, C. A.; Holmes, R. M.

2011-12-01

411

Multi-channel spatialization systems for audio signals  

NASA Technical Reports Server (NTRS)

Synthetic head related transfer functions (HRTF's) for imposing reprogrammable spatial cues to a plurality of audio input signals included, for example, in multiple narrow-band audio communications signals received simultaneously are generated and stored in interchangeable programmable read only memories (PROM's) which store both head related transfer function impulse response data and source positional information for a plurality of desired virtual source locations. The analog inputs of the audio signals are filtered and converted to digital signals from which synthetic head related transfer functions are generated in the form of linear phase finite impulse response filters. The outputs of the impulse response filters are subsequently reconverted to analog signals, filtered, mixed, and fed to a pair of headphones.

Begault, Durand R. (inventor)

1993-01-01

412

Multi-channel spatialization system for audio signals  

NASA Technical Reports Server (NTRS)

Synthetic head related transfer functions (HRTF's) for imposing reprogramable spatial cues to a plurality of audio input signals included, for example, in multiple narrow-band audio communications signals received simultaneously are generated and stored in interchangeable programmable read only memories (PROM's) which store both head related transfer function impulse response data and source positional information for a plurality of desired virtual source locations. The analog inputs of the audio signals are filtered and converted to digital signals from which synthetic head related transfer functions are generated in the form of linear phase finite impulse response filters. The outputs of the impulse response filters are subsequently reconverted to analog signals, filtered, mixed and fed to a pair of headphones.

Begault, Durand R. (inventor)

1995-01-01

413

Virtual environment display for a 3D audio room simulation  

NASA Technical Reports Server (NTRS)

The development of a virtual environment simulation system integrating a 3D acoustic audio model with an immersive 3D visual scene is discussed. The system complements the acoustic model and is specified to: allow the listener to freely move about the space, a room of manipulable size, shape, and audio character, while interactively relocating the sound sources; reinforce the listener's feeling of telepresence in the acoustical environment with visual and proprioceptive sensations; enhance the audio with the graphic and interactive components, rather than overwhelm or reduce it; and serve as a research testbed and technology transfer demonstration. The hardware/software design of two demonstration systems, one installed and one portable, are discussed through the development of four iterative configurations.

Chapin, William L.; Foster, Scott H.

1992-01-01

414

Perception-based authentication watermarking for digital audio data  

NASA Astrophysics Data System (ADS)

Current systems and protocols based on cryptographic methods for integrity and authenticity verification of media data do not distinguish between legitimate signal transformation and malicious tampering that manipulates the content. Furthermore, they usually provide no localization or assessment of the relevance of such manipulations with respect to human perception or semantics. We present an algorithm for a authentication audio watermarking that uses a perception-based robust hash function in combination with robust watermarking to verify the integrity of audio recordings. Experimental results show that the proposed system provides both a high level of distinction between perceptually different audio data and a high robustness against signal transformations that do not change the perceived information.

Zmudzinski, Sascha; Steinebach, Martin

2009-02-01

415

Using MPEG-7 audio descriptors for music querying  

NASA Astrophysics Data System (ADS)

Due to the growing amount of digital audio an increasing need to automatically categorize music and to create self-controlled and suitable playlists has been emerged. A few approaches to this task relying on low-level features have been published so far. Unfortunately the results utilizing those technologies are not sufficient yet. This paper gives an introduction how to enhance the results with regard to the perceptual similarity using different high-level descriptors and a powerful interaction between the algorithm and the user to consider his preferences. A successful interaction between server and client requires a powerful standardized query language. This paper describes the tools of the MPEG-7 Audio standard in detail and gives examples of already established query languages. Furthermore the requirements of a multimedia query language are identified and its application is exemplified by an automatic audio creation system using a query language.

Gruhne, M.; Dittmar, C.

2006-08-01

416

Analysis of the Data Quality of Audio Descriptions of Environmental Sounds  

Microsoft Academic Search

In this paper we perform statistical data analysis of a broad set of state-of-the-art audio features and low-level MPEG-7 audio descriptors. The investigation comprises data analysis to reveal redundancies between state-of-the-art audio features and MPEG-7 audio descriptors. We introduce a novel measure to evaluate the information content of a descriptor in terms of variance. Statistical data analysis reveals the amount

Dalibor Mitrovic; Matthias Zeppelzauer; Horst Eidenberger

2007-01-01

417

Audio-Visual Emotion Recognition Based on a DBN Model with Constrained Asynchrony  

Microsoft Academic Search

This paper presents an audio visual multi-stream DBN model (Asy_DBN) for emotion recognition with constraint asynchrony, in which audio state and visual state transit individually in their corresponding stream but the transition is constrained by the allowed maximum audio visual asynchrony. Emotion recognition experiments of Asy_DBN with different asynchrony constraints are carried out on an audio visual speech database of

Danqi Chen; Dongmei Jiang; Ilse Ravyse; Hichem Sahli

2009-01-01

418

Applying Machine Learning and Audio Analysis Techniques to Insect Recognition in Intelligent Traps  

E-print Network

of millions sickened. It is estimated that dengue, a disease transmitted by mosquitoes of the genus Aedes cases in the last decade [5]. Due to the lack of vaccines and specific and effective medicines, dengue and filariasis have shown that the vectors' resistance to insecticides has steadily increased

Batista, Gustavo

419

Self-oscillating modulators for direct energy conversion audio power amplifiers  

E-print Network

Self-oscillating modulators for direct energy conversion audio power amplifiers Petar Ljusev1 conversion audio power amplifier represents total integration of switching-mode power supply and Class D audio power amplifier into one compact stage, achieving high efficiency, high level of integration, low

420

Audio Keyword Generation for Sports Video Analysis , Ling-Yu Duan  

E-print Network

. To facilitate event detection using audio information, we have introduced the concept of audio keyword (e voice, music, and various kinds of environmental sounds is an important type of media and a significant only [1, 2, and 3]. We have introduced the concept of audio keyword which refers to a set of game

Chia, Liang-Tien

421

Interactive Music with Active Audio CDs Sylvain Marchand, Boris Mansencal, and Laurent Girin  

E-print Network

Active CD Player We present the proof of concept of the active audio CD, as a player that can read anyInteractive Music with Active Audio CDs Sylvain Marchand, Boris Mansencal, and Laurent Girin La compact disc (CD) audio player, the only possibility for the user is to listen to the recorded track

Paris-Sud XI, Université de

422

Automatic Long Audio Alignment and Confidence Scoring for Conversational Arabic Speech  

E-print Network

Automatic Long Audio Alignment and Confidence Scoring for Conversational Arabic Speech Mohamed In this paper, a framework for long audio alignment for conversational Arabic speech is proposed. Accurate) training, audio summarizing and retrieving, etc. We have collected more than 1,400 hours of conversational

Hasegawa-Johnson, Mark

423

Indexing, Browsing and Searching of Digital Video and Digital Audio Information  

E-print Network

Indexing, Browsing and Searching of Digital Video and Digital Audio Information Alan F. Smeaton medium, namely digital audio and digital video. Our coverage of audio is centered around post of content-based access to digital video information we sketch out at a high level, the approaches taken

Smeaton, Alan F.

424

Hearing You Loud and Clear: Student Perspectives of Audio Feedback in Higher Education  

ERIC Educational Resources Information Center

The use of audio feedback for students in a full-time community nursing degree course is appraised. The aim of this mixed methods study was to examine student views on audio feedback for written assignments. Questionnaires and a focus group were used to capture student opinion of this pilot project. The majority of students valued audio feedback…

Gould, Jill; Day, Pat

2013-01-01

425

IEEE Transactions on Consumer Electronics Visible Light Communication for Audio Systems  

E-print Network

, visible light LEDs can reach a luminous efficacy beyond 15 lm/W, which is typical for incandescent lamps an audio system that is well- suited for use in a small confined area with many audio transmitters LEDs, in which current fed to the LEDs is modulated and encoded with audio information or messages

Pang, Grantham

426

Toward bandwidth-efficient and error-robust audio streaming over lossy packet networks  

Microsoft Academic Search

Bandwidth efficiency and error robustness are essential issues for different multimedia streaming applications. This paper presents strategies for high-quality audio streaming based on fragmenting perceptually coded audio frames and shuffling the data components among multiple packets for transportation. This is done to increase robustness against packet loss. We also address the delivery of audio data consisting of components with different

Jari Korhonen; Ye Wang; David Isherwood

2005-01-01

427

Robust AAM-Based Audio-Visual Speech Recognition against Face Direction Changes  

E-print Network

Robust AAM-Based Audio-Visual Speech Recognition against Face Direction Changes Yuto Komai komai, Experimentation Keywords audio-visual, speech recognition, face direction 1. INTRODUCTION In recent years, audio speech recognition (ASR) software for PCs and mobile phones has become widely used and at- tracts

Takiguchi, Tetsuya

428

Audio-Dateien enthalten die digitalisierte Form eines Klangs, also die diskrete Darstellung des zeitlichen  

E-print Network

, glissando) Rechnerschnittstelle: Soundkarte Datenaufnahme über CD-Audio, Line- oder Micro-Input EDV-gestütztes Publizieren #12; Unterscheidungskriterien: · Umfang nur Video Video und Audio gemischt · Auflösung und mehrere Systeme · programmspezifisches Format #12;· AVI (Audio Video Interleaved), Windows95+-Standard

Ott, Albrecht

429

Responding Effectively to Composition Students: Comparing Student Perceptions of Written and Audio Feedback  

ERIC Educational Resources Information Center

The authors compared student perceptions of audio and written feedback in order to assess what types of students may benefit from receiving audio feedback on their essays rather than written feedback. Many instructors previously have reported the advantages they see in audio feedback, but little quantitative research has been done on how the…

Bilbro, J.; Iluzada, C.; Clark, D. E.

2013-01-01

430

Perceptual Audio Rendering of Complex Virtual Environments Nicolas Tsingos, Emmanuel Gallo and George Drettakis  

E-print Network

Perceptual Audio Rendering of Complex Virtual Environments Nicolas Tsingos, Emmanuel Gallo and spatial clustering allows us to render such complex audio-visual scenes in real-time. Abstract We propose a real-time 3D audio rendering pipeline for complex virtual scenes containing hundreds of moving sound

Paris-Sud XI, Université de

431

Audio-video integration for background Marco Cristani, Manuele Bicego, and Vittorio Murino  

E-print Network

Audio-video integration for background modelling Marco Cristani, Manuele Bicego, and Vittorio of surveillance, namely, audio-visual data integration for background modelling. Actually, visual data acquired by a fixed camera can be easily supported by audio infor- mation allowing a more complete analysis

Cristani, Marco

432

GUIDAGE: A FAST AUDIO QUERY GUIDED ASSEMBLAGE Arshia Cont Shlomo Dubnov Gerard Assayag  

E-print Network

GUIDAGE: A FAST AUDIO QUERY GUIDED ASSEMBLAGE Arshia Cont Shlomo Dubnov G´erard Assayag UCSD, Music for fast and auto- matic retrieval of factors of audio content in a large au- dio database based on user's audio query. The proposed method, unlike most existing systems, takes explicit con- siderations

Paris-Sud XI, Université de

433

Searching in Audio: The Utility of Transcripts, Dichotic Presentation, and Time-compression  

E-print Network

Searching in Audio: The Utility of Transcripts, Dichotic Presentation, and Time University of Toronto www.mie.toronto.edu chignel @mie.utoronto.ca ABSTRACT Searching audio data can 100% accurate transcripts, additional techniques for fluid browsing and searching of the audio itself

Balakrishnan, Ravin

434

37 CFR 201.27 - Initial notice of distribution of digital audio recording devices or media.  

...Initial notice of distribution of digital audio recording devices or media. 201.27...Initial notice of distribution of digital audio recording devices or media. (a) General...as required by section 1003(b) of the Audio Home Recording Act of 1992,...

2014-07-01

435

ROBUST AUDIO SPEAKER SEGMENTATION USING ONE CLASS SVMS Hachem Kadri1  

E-print Network

ROBUST AUDIO SPEAKER SEGMENTATION USING ONE CLASS SVMS Hachem Kadri1 , Manuel Davy2 , Asma Rabaoui1.kadri@gmail.com, Manuel.Davy@inria.fr ABSTRACT This paper presents a new technique for segmenting an audio stream in various tasks such as automatic transcription of radio broadcast news and audio indexing. The segmentation

Paris-Sud XI, Université de

436

Design and Usability Testing of an Audio Platform Game for Players with Visual Impairments  

ERIC Educational Resources Information Center

This article reports on the evaluation of a novel audio platform game that creates a spatial, interactive experience via audio cues. A pilot study with players with visual impairments, and usability testing comparing the visual and audio game versions using both sighted players and players with visual impairments, revealed that all the…

Oren, Michael; Harding, Chris; Bonebright, Terri L.

2008-01-01

437

BUILDING AUDIO CLASSIFIERS FOR BROADCAST NEWS RETRIEVAL George Tzanetakis, Ming-Yu Chen  

E-print Network

BUILDING AUDIO CLASSIFIERS FOR BROADCAST NEWS RETRIEVAL George Tzanetakis, Ming-Yu Chen School@cs.cmu.edu ABSTRACT The process of building audio classifiers for high-level content descriptors, especially in large datasets, is not trivial. In this paper we describe the design and development of audio classification

438

The Case for FEC-based Error Control for Packet Audio in the Internet  

Microsoft Academic Search

We consider the problem of distributing real-time packet audio overnetworks such as the Internet which do not provide support for real-timeapplications. Experiments with such networks indicate that audio qualityis mediocre in large part because of excessive audio packet losses. In thispaper, we show using measurements over the Internet as well as analyticmodeling that most loss periods involve a small number

Andrs Vega-garca; Jean-chrysostome Bolot

1997-01-01

439

A FUZZY APPROACH TOWARDS PERCEPTUAL CLASSIFICATION AND SEGMENTATION OF MP3/AAC AUDIO  

E-print Network

A FUZZY APPROACH TOWARDS PERCEPTUAL CLASSIFICATION AND SEGMENTATION OF MP3/AAC AUDIO Serkan a novel perceptual based fuzzy approach towards classification and segmentation for MP3 and AAC audio in the compressed domain. The input audio is split into segments, which are classified as speech, music, fuzzy

Gabbouj, Moncef

440

REALTIME AUDIO SOURCE SEPARATION BY DELAY AND ATTENUATION COMPENSATION IN THE TIME DOMAIN  

E-print Network

REAL­TIME AUDIO SOURCE SEPARATION BY DELAY AND ATTENUATION COMPENSATION IN THE TIME DOMAIN.rosca,ningping.fan,radu.balang@scr.siemens.com ABSTRACT There is increased interest in using microphone arrays in a variety of audio source separation to a strongly echoic office with reverber­ ation time of 500 msec. 1. INTRODUCTION Speech enhancement and audio

Balan, Radu V.

441

Preprint No. 4825 A Real Time DSP Kernel for Concurrent Audio Tasks  

E-print Network

Preprint No. 4825 A Real Time DSP Kernel for Concurrent Audio Tasks David Reinhardt and Robert C audio tasks, but this poses a variety of practical problems. In this paper the features of general-purpose operating systems are contrasted with the needs of typical audio DSP processes, andthe architecture

Maher, Robert C.

442

Hear-Through and Mic-Through Augmented Reality: Using Bone Conduction to Display Spatialized Audio  

E-print Network

Hear-Through and Mic-Through Augmented Reality: Using Bone Conduction to Display Spatialized Audio-generated audio for augmented reality (AR) applications. Analogous to optical-see-through and video-see-through techniques in the visual domain, we present Hear-Through and Mic-Through audio AR. Hear-Through AR uses

Lindeman, Robert W.

443

Audio Events Detection in Public Transport Vehicle Jean-Luc Rouas, Jrme Louradour, Sbastien Ambellouis  

E-print Network

Audio Events Detection in Public Transport Vehicle Jean-Luc Rouas, Jérôme Louradour, Sébastien Ambellouis Abstract-- This paper addresses the problem of automatic audio analysis for aided surveillance segments. The problem is quite similar to the "`audio indexing"' framework, nevertheless the environment

Paris-Sud XI, Université de

444

LOSSLESS AUDIO COMPRESSION USING INTEGER MODIFIED DISCRETE COSINE Yoshikazu Yokotani and Soontorn Oraintara  

E-print Network

LOSSLESS AUDIO COMPRESSION USING INTEGER MODIFIED DISCRETE COSINE TRANSFORM Yoshikazu Yokotani:yoshi@msp.uta.edu, oraintar@uta.edu ABSTRACT Recently, an MPEG2 AAC [1] based lossless audio codec with the Integer MDCT (Int coeffi- cients in the MPEG2 AAC based lossless audio coding. The results indicate that there exists

Oraintara, Soontorn

445

Comparison of level discrimination, increment detection, and comodulation masking release in the audio-and  

E-print Network

in the audio- and envelope-frequency domains Paul C. Nelson Department of Biomedical and Chemical Engineering for certain observations made in detection and masking experiments in the audio-frequency domain. Two; these variations are referred to as fine structure and are determined by the instantaneous audio fre- quency

Carney, Laurel H.

446

Audio keyword extraction by unsupervised word discovery Armando Muscariello, Guillaume Gravier, Frederic Bimbot  

E-print Network

Audio keyword extraction by unsupervised word discovery Armando Muscariello, Guillaume Gravier, Fr.surname@irisa.fr Abstract In real audio data, frequently occurring patterns often convey relevant information on the overall such key patterns, can be exploited for providing audio summaries and speeding up the access to relevant

Paris-Sud XI, Université de

447

WATERMARKING OF AUDIO LEARNING OBJECTS WITH QUALITY OF SERVICE Hani Jabbour, Shervin Shirmohammadi, Jiying Zhao  

E-print Network

WATERMARKING OF AUDIO LEARNING OBJECTS WITH QUALITY OF SERVICE Hani Jabbour, Shervin Shirmohammadi. Audio streaming is a major part of e-learning, Copyright protection and Quality of Service (QoS) monitoring for audio streams are two of the major issues encountered. While many studies have tacked

Ottawa, University of

448

AUDIO ENCODING USING HUANG AND HILBERT TRANSFORMS K. Khaldi 1, 2, 4  

E-print Network

AUDIO ENCODING USING HUANG AND HILBERT TRANSFORMS K. Khaldi 1, 2, 4 , A.O. Boudraa 2 , B. Torr an audio coding scheme based on the Empirical Mode Decomposition (EMD) in association with the Hilbert transform is presented. The audio signal is decomposed adap- tively into intrinsic oscillatory components

Paris-Sud XI, Université de

449

Short-Term Audio-Visual Atoms for Generic Video Concept Classification  

E-print Network

Short-Term Audio-Visual Atoms for Generic Video Concept Classification Wei Jiang1 Courtenay Cotton1 the challenging issue of joint audio-visual analysis of generic videos targeting at semantic concept de- tection. We propose to extract a novel representation, the Short-term Audio-Visual Atom (S-AVA), for improved

Ellis, Dan

450

Perceptual Audio Coding using Adaptive Pre-and Post-Filters and Lossless Compression  

E-print Network

1 Perceptual Audio Coding using Adaptive Pre- and Post-Filters and Lossless Compression Gerald perceptual audio coding method that achieves high compression ratios and is capable of low encoding units. This contrasts traditional audio coding where both are inte- grated within the same subband

Yu, Bin

451

Content-based Audio Classification using Collective Network of Binary Classifiers  

E-print Network

Content-based Audio Classification using Collective Network of Binary Classifiers Toni Mäkinen) framework is presented for content-based audio classification. The topic has been studied in several and the number of classes, as well as to create a scalable network design, which allows introducing new audio

Gabbouj, Moncef

452

Audio Use in E-Learning: What, Why, When, and How?  

ERIC Educational Resources Information Center

Decisions related to the implementation of audio in e-learning are perplexing for many instructional designers, and deciphering theory and principles related to audio use can be difficult for practitioners. Yet, as bandwidth on the Internet increases, digital audio is becoming more common in online courses. This article provides a review of…

Calandra, Brendan; Barron, Ann E.; Thompson-Sellers, Ingrid

2008-01-01

453

Spatial audio quality in regard to 3D video S. Moulin, R. Nicol and L. Gros  

E-print Network

Spatial audio quality in regard to 3D video S. Moulin, R. Nicol and L. Gros France T an improved immersion in terms of visual perception. As for the associate audio channels, most of them there are various ways of listening to 5.1 audio content, either over loudspeaker arrays (for instance ITU standard

Paris-Sud XI, Université de

454

AUDIO FORENSIC AUTHENTICATION BASED ON MOCC BETWEEN ENF AND REFERENCE SIGNALS  

E-print Network

AUDIO FORENSIC AUTHENTICATION BASED ON MOCC BETWEEN ENF AND REFERENCE SIGNALS Zhisheng Lv1 of Warwick, Coventry CV4 7AL, UK ABSTRACT This paper proposes a new audio authenticity detection algorithm Frequency) signal and the reference signal. We first extract the ENF signal from a query audio signal

Li, Chang-Tsun

455

Combined video and audio watermarking: embedding content information in multimedia data  

Microsoft Academic Search

Audio and video watermarking enable the copyright protection with owner or customer authentication and the detection of media manipulations. The available watermarking technology concentrates on single media like audio or video. But the typical multimedia stream consists of both video and audio data. Our goal is to provide a solution with robust and fragile aspects to guarantee authentication and integrity

Jana Dittmann; Martin Steinebach; Ivica Rimac; Stephan Fischer; Ralf Steinmetz

2000-01-01

456

Evaluation of robustness and transparency of multiple audio watermark embedding  

NASA Astrophysics Data System (ADS)

As digital watermarking becomes an accepted and widely applied technology, a number of concerns regarding its reliability in typical application scenarios come up. One important and often discussed question is the robustness of digital watermarks against multiple embedding. This means that one cover is marked several times by various users with by same watermarking algorithm but with different keys and different watermark messages. In our paper we discuss the behavior of our PCM audio watermarking algorithm when applying multiple watermark embedding. This includes evaluation of robustness and transparency. Test results for multiple hours of audio content ranging from spoken words to music are provided.

Steinebach, Martin; Zmudzinski, Sascha

2008-02-01

457

Three dimensional audio versus head down TCAS displays  

NASA Technical Reports Server (NTRS)

The advantage of a head up auditory display was evaluated in an experiment designed to measure and compare the acquisition time for capturing visual targets under two conditions: Standard head down traffic collision avoidance system (TCAS) display, and three-dimensional (3-D) audio TCAS presentation. Ten commercial airline crews were tested under full mission simulation conditions at the NASA Ames Crew-Vehicle Systems Research Facility Advanced Concepts Flight Simulator. Scenario software generated targets corresponding to aircraft which activated a 3-D aural advisory or a TCAS advisory. Results showed a significant difference in target acquisition time between the two conditions, favoring the 3-D audio TCAS condition by 500 ms.

Begault, Durand R.; Pittman, Marc T.

1994-01-01

458

Influence of audio triggered emotional attention on video perception  

NASA Astrophysics Data System (ADS)

Perceptual video coding methods attempt to improve compression efficiency by discarding visual information not perceived by end users. Most of the current approaches for perceptual video coding only use visual features ignoring the auditory component. Many psychophysical studies have demonstrated that auditory stimuli affects our visual perception. In this paper we present our study of audio triggered emotional attention and it's applicability to perceptual video coding. Experiments with movie clips show that the reaction time to detect video compression artifacts was longer when video was presented with the audio information. The results reported are statistically significant with p=0.024.

Torres, Freddy; Kalva, Hari

2014-02-01

459

Perspex machine: VII. The universal perspex machine  

NASA Astrophysics Data System (ADS)

The perspex machine arose from the unification of projective geometry with the Turing machine. It uses a total arithmetic, called transreal arithmetic, that contains real arithmetic and allows division by zero. Transreal arithmetic is redefined here. The new arithmetic has both a positive and a negative infinity which lie at the extremes of the number line, and a number nullity that lies off the number line. We prove that nullity, 0/0, is a number. Hence a number may have one of four signs: negative, zero, positive, or nullity. It is, therefore, impossible to encode the sign of a number in one bit, as floating-point arithmetic attempts to do, resulting in the difficulty of having both positive and negative zeros and NaNs. Transrational arithmetic is consistent with Cantor arithmetic. In an extension to real arithmetic, the product of zero, an infinity, or nullity with its reciprocal is nullity, not unity. This avoids the usual contradictions that follow from allowing division by zero. Transreal arithmetic has a fixed algebraic structure and does not admit options as IEEE, floating-point arithmetic does. Most significantly, nullity has a simple semantics that is related to zero. Zero means "no value" and nullity means "no information." We argue that nullity is as useful to a manufactured computer as zero is to a human computer. The perspex machine is intended to offer one solution to the mind-body problem by showing how the computable aspects of mind and, perhaps, the whole of mind relates to the geometrical aspects of body and, perhaps, the whole of body. We review some of Turing's writings and show that he held the view that his machine has spatial properties. In particular, that it has the property of being a 7D lattice of compact spaces. Thus, we read Turing as believing that his machine relates computation to geometrical bodies. We simplify the perspex machine by substituting an augmented Euclidean geometry for projective geometry. This leads to a general-linear perspex-machine which is very much easier to program than the original perspex-machine. We then show how to map the whole of perspex space into a unit cube. This allows us to construct a fractal of perspex machines with the cardinality of a real-numbered line or space. This fractal is the universal perspex machine. It can solve, in unit time, the halting problem for itself and for all perspex machines instantiated in real-numbered space, including all Turing machines. We cite an experiment that has been proposed to test the physical reality of the perspex machine's model of time, but we make no claim that the physical universe works this way or that it has the cardinality of the perspex machine. We leave it that the perspex machine provides an upper bound on the computational properties of physical things, including manufactured computers and biological organisms, that have a cardinality no greater than the real-number line.

Anderson, James A. D. W.

2006-01-01

460

Coping with Contamination  

NSDL National Science Digital Library

This magazine article features an interview with Mars Analog Research and Technology Experiment (MARTE) scientist Carol Stoker. In this third session of the four-part series, Stoker describes how the MARTE team avoids contaminating their drill-core samples. Her team is drilling into the pyrite subsurface of Spain's Rio Tinto in search for microbes existing in an iron-sulfur-based energy system, similar to that of Mars. She describes the technical challenges to be faced in the waterless environment of other-world drilling. The resource includes images from the Mars rover project, links to related web sites, and an MP3 Audio Machine text-to-speech option.

Bortman, Henry; Stoker, Carol; Magazine, Astrobiology

461

Drilling for Weird Life  

NSDL National Science Digital Library

This magazine article introduces the Mars Analog Research and Technology Experiment (MARTE). Featuring an interview with NASA scientist Carol Stoker, the article describes Rio Tinto, a river in Spain with highly acidic water the color or red wine, and explains why scientists are looking to the subsurface pyrite deposits near this river's edge for signs of microbial life. Stoker describes the field site and discusses some of the research team's early results. This is the first of a four-part interview series. The resource includes images from Rio Tinto and the Mars project, links to related web sites, and an MP3 Audio Machine text-to-speech option.

Bortman, Henry; Stoker, Carol; Magazine, Astrobiology

462

Living on Fools Gold  

NSDL National Science Digital Library

This magazine article explores the extreme environment of the Rio Tinto, a river in Spain with highly acidic and metal-rich water. The article features the work of Ricardo Amils, a scientist studying the microbial population of the Rio Tinto. Along with others, Amils pays particular attention to chemolithotrophic bacteria that survive by getting their energy by oxidizing pyrite, also known as "fool's gold". He postulates about how these processes might have played out in the ancient earth and draws parallels to the possibility of life on Mars. The resource includes images from the Rio Tinto, links to related web sites, and a MP3 Audio Machine text-to-speech option.

Mullen, Leslie; Magazine, Astrobiology

463

STEREOPHONIC PERSONAL AUDIO REPRODUCTION USING PLANARITY CONTROL OPTIMIZATION  

E-print Network

STEREOPHONIC PERSONAL AUDIO REPRODUCTION USING PLANARITY CONTROL OPTIMIZATION Philip Coleman levels and complex phase patterns within the target zone. Furthermore, multichannel reproduction systems in an arbitrarily-placed target zone. In this study, the application of planarity control for stereo reproduction

Jackson, Philip JB

464

Inferring the Structure of a Tennis Game Using Audio Information  

Microsoft Academic Search

We describe a novel framework for inferring the low-level structure of a sports game (tennis) using only the infor- mation available on the audio track of a video recording of the game. Our goal is to segment the games into a sequence of points, the natural unit for describing a tennis match. The framework is hierarchical, consisting of, at the

Qiang Huang; Stephen Cox

2011-01-01

465

Character animation from audio: speech articulation and beyond  

Microsoft Academic Search

We present ongoing work on character animation in the field of message entertainment. Our aim is to extract as much information from audio signals as possible and to create a customized animation clips. Goal of the work is not to reproduce ground truth data as accurately as possible but to create a plausibly and personalized animated avatar. However, in order

Sascha Fagel

2010-01-01

466

Audio and Video Reflections to Promote Social Justice  

ERIC Educational Resources Information Center

Purpose: The purpose of this paper is to examine how 15 graduate students enrolled in a US school leadership preparation program understand issues of social justice and equity through a reflective process utilizing audio and/or video software. Design/methodology/approach: The study is based on the tradition of grounded theory. The researcher…

Boske, Christa

2011-01-01

467

Spatial Error Concealment in Ad-hoc Audio Conferencing Systems  

Microsoft Academic Search

Abstract In this work we consider an ad-hoc audio conferensing system based on VoIP services in which the participants connect to the conference using mo- bile communication devices with wireless connectivity. To overcome possible quality problems in the wireless link in this configuration, we propose im- provements to the existing conferencing systems. Some networking modifications are suggested to increase the

Reza Parhizkar

2009-01-01

468

Infant Perception of Audio-Visual Speech Synchrony  

ERIC Educational Resources Information Center

Three experiments investigated perception of audio-visual (A-V) speech synchrony in 4- to 10-month-old infants. Experiments 1 and 2 used a convergent-operations approach by habituating infants to an audiovisually synchronous syllable (Experiment 1) and then testing for detection of increasing degrees of A-V asynchrony (366, 500, and 666 ms) or by…

Lewkowicz, David J.

2010-01-01

469

High quality audio for multimedia: key technologies and MPEG standards  

Microsoft Academic Search

With today's speech and audio compression techniques an excellent coding quality can be obtained with bit rates in the range of 0.5 to 2 bits per sample and channel. We cover key technologies, in particular auditory masking and perceptual coding, and dynamic bit allocation. We then focus on the MPEG standardization work, which has provided international standards for a wide

P. Noll

1999-01-01

470

Indexing of multilingual news telecast using audio-visual keywords  

Microsoft Academic Search

Indexing of news video streams with semantic keywords is of interest to agencies that regularly monitor many news channels. In this paper, we describe a new method for indexing news video in different languages, for which there are inadequate language tools. Our approach involves combining multimodal inputs, namely audio and visual, and spotting of a handful of keywords with higher

H. Ghosh; A. Khare; A. Gorai; S. K. Kopparapu; M. Pandharipande

2010-01-01

471

Recognition of blue movies by fusion of audio and video  

Microsoft Academic Search

Along with the explosive growth of the Internet, comes the proliferation of pornography. Compared with the pornographic texts and images, blue movies can do much harm to children, due to the greater realism and voyeurism of blue movies. In this paper, a framework for recognizing blue movies by fusing the audio and video information is described. A one-class Gaussian mixture

Haiqiang Zuo; Ou Wu; Weiming Hu; Bo Xu

2008-01-01

472

STREAMCATCHER: INTEGRATED VISUALIZATION OF MUSIC CLIPS AND ONLINE AUDIO STREAMS  

E-print Network

of resorting to predefined taxonomies for music cat- egorization (e.g., by Genre), an alternative is to presentSTREAMCATCHER: INTEGRATED VISUALIZATION OF MUSIC CLIPS AND ONLINE AUDIO STREAMS Martin Gasser.g., web radio streams). The visualization space is defined by prototypical instances of musical concepts

Widmer, Gerhard

473

Chemical News Via Audio Tapes: Chemical Industry News  

ERIC Educational Resources Information Center

Tape coverage of internal R&D news now has a broader scope with improved features. A new tape series covering external news of broad interest has been initiated. The use of tape in a Continuing Education Program is discussed as the future plans for expanding the audio tape program. (1 reference) (Author)

Hanford, W. E.; And Others

1972-01-01

474

West Virginia Palliative Care Network 2011 Audio Conference Series  

E-print Network

West Virginia Palliative Care Network 2011 Audio Conference Series Target Audience - These courses care clinicians who wish to expand their knowledge of end-of-life care. Course Description - Hospital palliative care consultation teams have been shown to improve care for patients with serious illness

Mohaghegh, Shahab

475

Audio sparse decompositions in parallel Let the greed be shared !  

E-print Network

, and an efficient indexing of their parameters, these coders achieve a high compression ratio at virtually no loss-of-the-art multimedia compression schemes, such as still image coding (for instance JPEG-2000 [1]), and audio coding coefficients naturally leads to an increased coding efficiency. For instance, in the JPEG-2000 image coder

Paris 7 - Denis Diderot, Université

476

Multimedia content analysis-using both audio and visual clues  

Microsoft Academic Search

Multimedia content analysis refers to the computerized understanding of the semantic meanings of a multimedia document, such as a video sequence with an accompanying audio track. With a multimedia document, its semantics are embedded in multiple forms that are usually complimentary of each other, Therefore, it is necessary to analyze all types of data: image frames, sound tracks, texts that

Yao Wang; Zhu Liu; Jin-Cheng Huang

2000-01-01

477

A survey of packet loss recovery techniques for streaming audio  

Microsoft Academic Search

We survey a number of packet loss recovery techniques for streaming audio applications operating using IP multicast. We begin with a discussion of the loss and delay characteristics of an IP multicast channel, and from this show the need for packet loss recovery. Recovery techniques may be divided into two classes: sender- and receiver-based. We compare and contrast several sender-based

Colin Perkins; Orion Hodson; Vicky Hardman

1998-01-01

478

Real Time Video and Audio in the World Wide Web  

Microsoft Academic Search

The architecture of World Wide Web (WWW) browsers and servers support full file transfer fordocument retrieval. TCP is used for data transfers by Web browsers and their associated HypertextTransfer Protocol (HTTP) servers. Full file transfer and TCP are unsuitable for continuous media, suchas real time audio and video. In order for the WWW to support continuous media, we require thetransmission

Zhigang Chen

1995-01-01

479

Audio signal representations for indexing in the transform domain  

E-print Network

and music is now widely stored and diffused in digital form. This revolution is mainly due to the spread successor of MP3, This research was supported in part by the French GIP ANR under contract ANR-06-JCJC-0027 as the state-of-the-art standard for (near-)transparent audio coding. More recently, the digital revolution

Richard, Gaël

480

Audio signal representations for indexing in the transform domain  

E-print Network

and diffused in digital form. This revolution is mainly due to the spread of audio coding technologies, which Coding (AAC), was first introduced in This research was supported in part by the French GIP ANR under coding. More recently, the digital revolution gave birth to another research domain known as automatic

Paris 7 - Denis Diderot, Université

481

Spatial Interaction with Sonic Objects in Audio Augmented Environments  

Microsoft Academic Search

This STSM aimed at developing a set of basic test cases for the evaluation of enactive inter- action patterns with spatially arranged sonic objects. Audio augmented environment technology with binaural rendering was used as a display for sonic objects. An accurate, low-latency, three- dimensional tracking system served as the main interface for embodied user interaction. The basic structure of a

Martin Rumori

482

Secure spread spectrum watermarking for images, audio and video  

Microsoft Academic Search

We describe a digital watermarking method for use in audio, image, video and multimedia data. We argue that a watermark must be placed in perceptually significant components of a signal if it is to be robust to common signal distortions and malicious attack. However, it is well known that modification of these components can lead to perceptual degradation of the

I. J. Cox; J. Kilian; T. Leighton; T. Shamoon

1996-01-01

483

Compresion de audio y video Practica 2: Compresion de Video  

E-print Network

Compresi´on de audio y video Pr´actica 2: Compresi´on de Video Ej 1. MPEG-1 Video Codificar una de las secuencias de video. Fijar 12 cuadros para cada grupo de cuadros (GOP) y M=3 (M es la distancia

Figueira, Santiago

484

Exploratory Evaluation of Audio Email Technology in Formative Assessment Feedback  

ERIC Educational Resources Information Center

Formative assessment generates feedback on students' performance, thereby accelerating and improving student learning. Anecdotal evidence gathered by a number of evaluations has hypothesised that audio feedback may be capable of enhancing student learning more than other approaches. In this paper we report on the preliminary findings of a…

Macgregor, George; Spiers, Alex; Taylor, Chris

2011-01-01

485

Survey of the State of Audio Collections in Academic Libraries  

ERIC Educational Resources Information Center

The goal of this survey was to collect and analyze baseline information about the status of audio collections held by a set of research institutions. This information can help shape the national preservation plan now being developed by the National Recording Preservation Board (NRPB) and the Library of Congress to preserve "sound recordings that…

Smith, Abby; Allen, David Randal; Allen, Karen

2004-01-01

486

INTELLIGENT SYSTEMS FOR MIXING MULTICHANNEL AUDIO Joshua D. Reiss  

E-print Network

on Sound magazine wrote, "There's no reason why a band recording using reasonably conventional the sound engineer refine his choices to produce an aesthetically pleasing mix which best captures the intended sound. In order to address this challenge, a new form of multichannel audio signal processing has

Reiss, Josh

487

Normalized Auditory Attention Levels for Automatic Audio Surveillance  

E-print Network

Normalized Auditory Attention Levels for Automatic Audio Surveillance L. Couvreur, F. Bettens, J of auditory attention on a normalized scale, i.e. between 0 and 1. The proposed features are derived from loudness, temporal and frequency contrasts. Normalized auditory attention levels can be used to detect

Dupont, Stéphane

488

Speaker Tracking Using an Audio-visual Particle Filter  

Microsoft Academic Search

We present an approach for tracking a lecturer during the course of his speech. We use features from multiple cameras and micro- phones, and process them in a joint particle filter framework. The filter performs sampled projections of 3D location hypotheses and scores them using features from both audio and video. On the video side, the features are based on

Kai Nickel; Tobias Gehrig; Hazim K. Ekenel; Rainer Stiefelhagen

489

An Audio Glanmce at Syntactic Structure Based on Spoken Form  

Microsoft Academic Search

An example of where and how to use non-speech sounds in an interface is described. The role of the sound is to provide an overview or glance for a blind person reading mathematics by listening. The type of sound used to provide the audio glance is the earcon. The algebra earcons were designed using prosodic cues from spoken algebra combined

R. D. Stevens; P. C. Wright; A. D. N. Edwards; S. A. Brewster

490

Understanding the Impact of Abstracted Audio Preview of SMS  

E-print Network

Understanding the Impact of Abstracted Audio Preview of SMS Alireza Sahami Shirazi1 , Ari the availability of other mobile messaging applica- tions, SMS has kept its position as a heavily used commu of SMS. Based on a real-time analysis of the content of a message, we provide auditory cues in addition

491

Training Ircam's score follower [audio to musical score alignment system  

Microsoft Academic Search

This paper describes our attempt to make the hidden Markov model (HMM) score following system, developed at Ircam, sensible to past experiences in order to obtain better audio to score real-time alignment for musical applications. A new observation modeling based on Gaussian mixture models is developed which is trainable using a learning algorithm we would call automatic discriminative training. The

Arshia Cont; Diemo Schwarz; Norbert Schnell

2005-01-01

492

Verification of an Audio Protocol with Bus Collision Using UPPAAL  

Microsoft Academic Search

In this paper we apply the tool Uppaal1 to an automatic analysis of a version of the Philips Audio Control Protocol with two senders and bus collision handling. This case study is significantly larger than the real-time\\/hybrid systems previously analysed by automatic tools. During the case study the tool Uppaal was extended with a new feature, committed locations, allowing efficient

Johan Bengtsson; W. O. David Griffioen; Kåre J. Kristoffersen; Kim Guldstrand Larsen; Fredrik Larsson; Paul Pettersson; Wang Yi

1996-01-01

493

Imaging concert hall acoustics using visual and audio cameras  

Microsoft Academic Search

Using a recently developed real time audio camera, that uses the output of a spherical microphone array beamformer steered in all directions to create central projection to create acoustic intensity images, we present a technique to measure the acoustics of rooms and halls. A panoramic mosaiced visual image of the space is also create. Since both the visual and the

Adam O'Donovan; Ramani Duraiswami; Dmitry N. Zotkin

2008-01-01

494

Learning from Animated Concept Maps with Concurrent Audio Narration  

ERIC Educational Resources Information Center

An animated concept map is a presentation of a network diagram in which nodes and links are sequentially added or modified. An experiment compared learning from animated concept maps and text by randomly assigning 133 undergraduates to study 1 of 4 narrated animations presenting semantically equivalent information accompanied by identical audio

Nesbit, John C.; Adesope, Olusola O.

2011-01-01

495

Modulated pulse audio power amplifiers for integrated circuits  

Microsoft Academic Search

Silicon monolithic circuits are best suited for applications with low power dissipation. Conventional approaches to audio power amplifiers have a rather limited efficiency, and are therefore not necessarily the best choice for integrated circuits. Pulse-width modulation, however, promises an efficiency of up to 100 percent and has several other advantages. Three approaches to amplitude to pulse-width conversion are discussed and

H. Camenzind

1966-01-01

496

Decaptcha: Breaking 75% of eBay Audio CAPTCHAs  

Microsoft Academic Search

CAPTCHA tests aim at preventing attackers from per- forming automatic website registration. In this paper we show that our prototype Decaptcha is able to success- fully break 75% of eBay audio captchas. We compare its performance with the state of the art, readily available speech recognition system Sphinx and discuss the impli- cations for eBay security.

Elie Bursztein; Steven Bethard

497

Economic evaluation of Interactive Audio Media for securing Internet services  

E-print Network

, is expected to become a serious threat in the near future. Audio CAPTCHA (Completely Automated Public Turing with the acronym CAPTCHA. A CAPTCHA is a type of challenge-response #12;test trying to ensure that the response to solve. In effect, any user entering a correct solution is presumed to be human. A num- ber of CAPTCHA

Katsaros, Panagiotis

498

Packet Audio Playout Delay Adjustment: Performance Bounds and Algorithms  

Microsoft Academic Search

In packet audio applications, packets are buffered at a receiving site and their playout delayed in order to com- pensate for variable network delays. In this paper, we con- sider the problem of adaptively adjusting the playout delay in order to keep this delay as small as possible, while at the same time avoiding excessive \\

Sue B. Moon; James F. Kurose; Donald F. Towsley

1998-01-01

499

IKLAX : A NEW MUSICAL AUDIO FORMAT FOR INTERACTIVE MUSIC  

E-print Network

composers. 1.INTRODUCTION iKlax is an innovative music project for the development of active listening software products: a music player for listeners and a music editor software for composers. 2.1.iKlax FormatIKLAX : A NEW MUSICAL AUDIO FORMAT FOR INTERACTIVE MUSIC Fabien Gallot Owen Lagadec Myriam Desainte

Paris-Sud XI, Université de

500

Developing a Framework for Effective Audio Feedback: A Case Study  

ERIC Educational Resources Information Center

The increase in the use of technology-enhanced learning in higher education has included a growing interest in new approaches to enhance the quality of feedback given to students. Audio feedback is one method that has become more popular, yet evaluating its role in feedback delivery is still an emerging area for research. This paper is based on a…

Hennessy, Claire; Forrester, Gillian

2014-01-01