These are representative sample records from Science.gov related to your search topic.
For comprehensive and current results, perform a real-time search at Science.gov.
1

Building a Prototype Text to Speech for Sanskrit  

NASA Astrophysics Data System (ADS)

This paper describes about the work done in building a prototype text to speech system for Sanskrit. A basic prototype text-to-speech is built using a simplified Sanskrit phone set, and employing a unit selection technique, where prerecorded sub-word units are concatenated to synthesize a sentence. We also discuss the issues involved in building a full-fledged text-to-speech for Sanskrit.

Mahananda, Baiju; Raju, C. M. S.; Patil, Ramalinga Reddy; Jha, Narayana; Varakhedi, Shrinivasa; Kishore, Prahallad

2

BIBLIOGRAPHY Text-to-speech in Vocabulary Acquisition and Student  

E-print Network

BIBLIOGRAPHY Text-to-speech in Vocabulary Acquisition and Student Knowledge Models: a Classroom comprehension." Proceedings of the Ninth International Conference on Spoken Language Processing. Pittsburgh, U-assisted vocabulary acquisition in the ESL class

3

Prosodic phrasing in a Polish text-to-speech system  

Microsoft Academic Search

This contribution presents the linguistic research underlying the implementation of prosodic phrasing in a Polish text-to-speech system 1. While in the past few years concatenative text-to - speech technology dramatically improved the acoustic quality of the synthesized voices, yet the naturalness and expressivity of present text-to-spe ech systems are still unsatisfactory. In particular , these systems usually read with a

Morena Danieli; Beata Dobrzy; Alberto Pacchiotti; Elena Cabrio

4

Choosing and Using Text-to-Speech Software  

ERIC Educational Resources Information Center

This article describes a computer-based technology for generating speech called text-to-speech (TTS). This software is ready for widespread use by libraries, other organizations, and individual users. It offers the affordable ability to turn just about any electronic text that is not image-based into an artificially spoken communication. The…

Peters, Tom; Bell, Lori

2007-01-01

5

Approaches for adaptive database reduction for Text-To-Speech synthesis Aleksandra Krul 1,2  

E-print Network

Approaches for adaptive database reduction for Text-To-Speech synthesis Aleksandra Krul 1,2 , G This paper raises the issue of speech database reduction adapted to a specific domain for Text-To-Speech (TTS measures. Index Terms: text-to-speech synthesis, corpus reduction, database adaptation, KL divergence 1

Yvon, François

6

"Look What I Did!": Student Conferences with Text-to-Speech Software  

ERIC Educational Resources Information Center

The authors describe a strategy that empowers students to edit and revise their own writing. Students input their writing in to text-to-speech software that rereads the text aloud. While listening, students make necessary revisions and edits.

Young, Chase; Stover, Katie

2014-01-01

7

AUDIO MUSIC MOOD CLASSIFICATION USING SUPPORT VECTOR MACHINE  

Microsoft Academic Search

The system submitted to the MIREX Audio Music Mood Classification task is described here. It uses a set of 133 descriptors and a Support Vector Machine classifier to pre- dict the mood cluster. The features are spectral, temporal, tonal but also describe loudness and danceability. The fea- tures were selected previously according to experiments on our annotated databases. The SVM

Cyril Laurier; Perfecto Herrera

8

Faking it: Synthetic text-to-speech synthesis for u nder-resourced languages - Experimental design  

Microsoft Academic Search

Speech synthesis or text-to-speech (TTS) systems are currently available for a number of the world's major languages, but for thousands of the world's 'minor' languages no such technology is available. While awaiting the development of such technology, we would like to try the stop-gap solution of using an existing TTS system for a major language (the base language) to 'fake'

Harold Somers

9

Machine learning an audio taxonomy: Quantifying biodiversity and habitat recovery through rainforest audio recordings.  

PubMed

We present a set of tools for semi-supervised classification of ecosystem health in Meso-American tropical dry forest, one of the most highly endangered habitats on Earth. Audio recordings were collected from 15-year-old, 30-year-old and old growth tropical dry forest plots in the Guanacaste Conservation Area, Costa Rica, on both nutrient rich and nutrient poor soils. The goals of this project were to classify the overall health of the regenerating forests using markers of biodiversity. Semi-supervised machine learning and digital signal processing techniques were explored and tested for their ability to detect species and events in the audio recordings. Furthermore, multi-recorder setups within the same vicinity were able to improve detection rates and accuracy by enabling localization of audio events. Variations in species' and rainforest ambient noise detection rates over time were hypothesized to correlate to biodiversity and hence the health of the rainforest. By comparing levels of biodiversity measured in this manner between old growth and young dry forest plots, we hope to determine the effectiveness of reforestation techniques and identify key environmental factors shaping the recovery of forest ecosystems. PMID:25235867

Treuer, Tim; Altosaar, Jaan; Hartnett, Andrew; Twomey, Colin; Dobson, Andy; Wilcove, David; Couzin, Iain

2014-04-01

10

Advancements in text-to-speech technology and implications for AAC applications  

NASA Astrophysics Data System (ADS)

Intelligibility was the initial focus in text-to-speech (TTS) research, since it is clearly a necessary condition for the application of the technology. Sufficiently high intelligibility (approximating human speech) has been achieved in the last decade by the better formant-based and concatenative TTS systems. This led to commercially available TTS systems for highly motivated users, particularly the blind and vocally impaired. Some unnatural qualities of TTS were exploited by these users, such as very fast speaking rates and altered pitch ranges for flagging relevant information. Recently, the focus in TTS research has turned to improving naturalness, so that synthetic speech sounds more human and less robotic. Unit selection approaches to concatenative synthesis have dramatically improved TTS quality, although at the cost of larger and more complex systems. This advancement in naturalness has made TTS technology more acceptable to the general public. The vocally impaired appreciate a more natural voice with which to represent themselves when communicating with others. Unit selection TTS does not achieve such high speaking rates as the earlier TTS systems, however, which is a disadvantage to some AAC device users. An important new research emphasis is to improve and increase the range of emotional expressiveness of TTS.

Syrdal, Ann K.

2003-10-01

11

THE STUDY AND IMPLEMENTATION OF TEXT-TO-SPEECH SYSTEM FOR  

E-print Network

hebei province, which has conversed more than 8000 audio files as programming materials for the local year's development, China has made the amazing progress in the development face the farmer for farmers by the media of TV, phone and network, such as integrated service pattern in Hebei province (Gao

Paris-Sud XI, Université de

12

Expressive Text-to-Speech: A user-centred approach to sound design in voice-enabled mobile applications  

Microsoft Academic Search

Recent improvements in speech technology are expected to change the way we communicate, facilitating access to web services and applications (voice portals, multimodal email clients or games) in various mobile situations. In order to be attractive for users, however, speech output needs to be more expressive to increase naturalness, meaningfulness and ease of listening. Conventional Text-to-Speech (TTS) systems do not

Peter Froehlich; Florian Hammer

2004-01-01

13

Event detection in field sports video using audio-visual features and a support vector Machine  

Microsoft Academic Search

In this paper, we propose a novel audio-visual feature-based framework for event detection in broadcast video of multiple different field sports. Features indicating significant events are selected and robust detectors built. These features are rooted in characteristics common to all genres of field sports. The evidence gathered by the feature detectors is combined by means of a support vector machine,

David A. Sadlier; Noel E. O'connor

2005-01-01

14

The effects of word prediction and text-to-speech technologies on the narrative writing skills of students with specific learning disabilities  

Microsoft Academic Search

This study investigated the effects of word prediction and text-to-speech on the narrative composition writing skills of 6, fifth-grade Hispanic boys with specific learning disabilities (SLD). A multiple baseline design across subjects was used to explore the efficacy of word prediction and text-to-speech alone and in combination on four dependent variables: writing fluency (words per minute), syntax (T-units), spelling accuracy,

Monica C Silio

2008-01-01

15

The Effects of Word Prediction and Text-to-Speech Technologies on the Narrative Writing Skills of Hispanic Students with Specific Learning Disabilities  

ERIC Educational Resources Information Center

A multiple-baseline design across subjects was used to investigate the effects of word prediction and text-to-speech alone and in combination on four narrative composition-writing skills (writing fluency, syntax, spelling accuracy, and overall organization) of six fifth-grade Hispanic boys with specific learning disabilities (SLD). Participants…

Silio, Monica C.; Barbetta, Patricia M.

2010-01-01

16

Study of an Audio Playback Machine Storage, Distribution, and Repair System. Options for Machine Operation. Study II, Part 1, Phase 2, Final Report.  

ERIC Educational Resources Information Center

This report presents the results of a management study of audio playback equipment operations conducted by the National Library Service, Library of Congress, its associated network of state and local machine lending agencies (MLA), and other parties that play a role in current operations. The objectives were to document current operations,…

ManTech Technical Services Corp., Fairfax, VA.

17

An investigation into the effects of Text-To-Speech voice and 3D avatars on the perception of presence and flow of live help in electronic commerce  

Microsoft Academic Search

Expansion and growth of online shopping has led many companies to provide real-time communications on their Web sites to facilitate human-to-human interaction between service representatives and customers. The current study analyzes the interface design of such Live Help functions. More specifically, it attempts to understand whether or not the implementation of Text-To-Speech (TTS) voice communication and 3D avatars in the

Lingyun Qiu; Izak Benbasat

2005-01-01

18

MIXED TYPE AUDIO CLASSIFICATION WITH SUPPORT VECTOR MACHINE Department of Computer Science  

E-print Network

the classification of audio into single type such as music, speech, environmen- tal sound and silence is well studied Figure 1We use HZCRR [6]. 1, we can see that the ZCR values of speech with music background and of music

Ã?zsu, M. Tamer

19

BATS: The Blind Audio Tactile Mapping System Peter Parente  

E-print Network

BATS: The Blind Audio Tactile Mapping System Peter Parente Department of Computer Science students with visual impairments access and explore spatial information using standard computer hardware to the blind such as text-to-speech synthesis, auditory icons, and tactile feedback. We add spatial sound

Bishop, Gary

20

Audio 2008: Audio Fixation  

ERIC Educational Resources Information Center

Take a look around the bus or subway and see just how many people are bumping along to an iPod or an MP3 player. What they are listening to is their secret, but the many signature earbuds in sight should give one a real sense of just how pervasive digital audio has become. This article describes how that popularity is mirrored in library audio

Kaye, Alan L.

2008-01-01

21

Picture My Voice: Audio to Visual Speech Synthesis using Artificial Neural Networks  

E-print Network

Picture My Voice: Audio to Visual Speech Synthesis using Artificial Neural Networks Dominic W face at the receiving station of the participant. A standard text-to-speech engine would translate. The downside of this technology is that the voice would not correspond to the voice of the sender

Massaro, Dominic

22

Audio Ticker  

Microsoft Academic Search

Audio ticker is a novel user interface that employs audio to highlight frequently updated information in Webcasting or Internet push systems. It can be used to present a wide variety of content channels such as stock quote, sports score, traffic report, and weather forecast.

Qi Lu; Lev Stesin

1998-01-01

23

Fast transcription of unstructured audio recordings  

E-print Network

We introduce a new method for human-machine collaborative speech transcription that is significantly faster than existing transcription methods. In this approach, automatic audio processing algorithms are used to robustly ...

Roy, Brandon Cain

24

2005 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 16-19, 2005, New Paltz, NY LEARNING AUDITORY MODELS OF MACHINE VOICES  

E-print Network

to traditional therapy akin to art therapy and music therapy, utilizes the sounds of machines as relational@ee.columbia.edu ABSTRACT Vocal imitation is often found useful in Machine Therapy ses- sions as it creates an emphatic by our work in Machine Therapy in which hu- mans try to vocally imitate machines, but this task also

Ellis, Dan

25

Video salient event classification using audio features  

NASA Astrophysics Data System (ADS)

The aim of this work is to detect the events in video sequences that are salient with respect to the audio signal. In particular, we focus on the audio analysis of a video, with the goal of finding which are the significant features to detect audio-salient events. In our work we have extracted the audio tracks from videos of different sport events. For each video, we have manually labeled the salient audio-events using the binary markings. On each frame, features in both time and frequency domains have been considered. These features have been used to train different classifiers: Classification and Regression Trees, Support Vector Machine, and k-Nearest Neighbor. The classification performances are reported in terms of confusion matrices.

Corchs, Silvia; Ciocca, Gianluigi; Fiori, Massimiliano; Gasparini, Francesca

2014-03-01

26

Eavesdropping: audience interaction in networked audio performance  

Microsoft Academic Search

Eavesdropping is an internet-based, interactive audio system that explores network mediated, musical performance in shared public spaces. The project aims to develop an environment which increases audience interaction and connectedness in a localized, computer-controlled performance. The system is a client-server architecture made of three components: (1) an audio preparation interface, (2) an interactive performance interface, and (3) a machine learning-based

Jack Stockholm; Philippe Pasquier

2008-01-01

27

Designing help topics for use with text-to-speech  

Microsoft Academic Search

Speech technology can be used to provide online help to users in situations where visual display of online help is not possible, or has some display-related limitations. Presenting help material in this manner can also complement traditional online help systems. To date, most online help material has been developed with the assumption that the material will be read. This paper

Aidan Kehoe; Ian J. Pitt

2006-01-01

28

RECENT ADVANCES IN MULTILINGUAL TEXT-TO-SPEECH SYNTHESIS  

Microsoft Academic Search

this paper we will discuss recent advances in multilingualtext-to-speech (TTS) synthesis research atAT&T Bell Laboratories. The TTS system developedat AT&T Bell Laboratories generates syntheticspeech by concatenating segments of natural speech.The architecture of the system is designed as a modularpipeline where each module handles one particularstep in the process of converting text into speech. Besidesconceptual and computational advantages, themodular structure has

Bernd M; Juergen Schroeter; Jan van Santen; Richard Sproat; Joseph Olive

1996-01-01

29

Streaming Audio Recorder  

NSDL National Science Digital Library

The Streaming Audio Recorder application allows users to record any type of streaming audio via their computers' speakers or microphone. It's a simple way to record audio from sites such as Grooveshark, YouTube, BBC, and others. The program is compatible with computers running Windows 2000 and newer.

2012-11-02

30

Audio Engineering Society  

NSDL National Science Digital Library

The Audio Engineering Society (AES), now in its fifth decade, is the only professional society devoted exclusively to audio technology. Its membership consists of leading engineers, scientists and other authorities throughout the world. The Web site has links to information about audio education, events, careers and more.

Inc.., Audio E.

1997-01-01

31

Audio?visual aids  

Microsoft Academic Search

FILMSJUST WHAT IS GENERAL SEMANTICS? NET Film Service, Audio?Visual Center, Indiana University, 1956. go minutes. Sound. Black and white. Sale: $100. Rental: Apply.DO YOU KNOW HOW TO MAKE A STATEMENT OF FACT? NET Film Service, Audio?Visual Center, Indiana University, 1956. 30 minutes. Sound. Black and white. Sale: $100. Rental: Apply.WHY DO PEOPLE MISUNDERSTAND EACH OTHER? NET Film Service, Audio?Visual Center,

Thurston M. Reeves; Kenneth D. Bryson; Thomas R. McManus; Janet G. Smith; Carl A. Dallinger

1957-01-01

32

Audio?visual aids  

Microsoft Academic Search

AUDIO BOOKS. Audio Book Company, St. Joseph, Michigan. 1956. 16 rpm disc recordings. About 60 minutes per record.GREAT TALES AND POEMS OF EDGAR ALLEN POE. 4 records. Sale: $4.95.THE TRIAL OF SOCRATES. 3 records. Sale: $3.95.THE BEST OF MARK TWAIN. 4 records. Sale: $4.95.THE AUTOBIOGRAPHY OF BENJAMIN FRANKLIN. 8 records. Sale: $8.95.ALICE IN WONDERLAND. 3 records. Sale: &3.95.THE AUDIO BOOKS

Earle E. Curtis; A. C. LaFollette; Helen Harton; Georgia Bowman; Wally Schneider

1958-01-01

33

Topic in Depth - Audio Mining  

NSDL National Science Digital Library

Occasionally referred to as audio indexing, audio mining is a computerized task involving the processing of an audio file, extracting the dialog and creating a textual transcript, and searching the transcript for certain words or phrases. Considering the amount of audio content on the Internet and other sources, it is clear that audio mining is a growing technology of growing importance.

2010-09-15

34

Audio signal management techniques  

NASA Astrophysics Data System (ADS)

The objective of the Audio Signal Management technical program was to design and develop an Exploratory Development Model Audio Signal Management System (ASMS). This system is to be used to test and evaluate present and future voice data entry algorithms, processing techniques, and hardware modules. The ASMS consists of internal functions implemented on the RADC PDP 11/70 computer, external functions implemented in stand-alone hardware devices, an Audio Distribution Network (ADN) for shaping and routing audio signals, and an ADP Data entry communication interface/keyboard translator with HP 2645A terminal for function control and transcription.

Anderson, A. P.; Lane, J. K.; Pudliner, B. K.

1983-02-01

35

AUDIO ANALYSIS IN PWGLSYNTH  

Microsoft Academic Search

In this paper, we present an incremental improvement of a known fundamental frequency estimation algorithm for monophonic sig- nals. This is viewed as a case study of using our signal graph based synthesis language, PWGLSynth, for audio analysis. The roles of audio and control signals are discussed in both analysis and syn- thesis contexts. The suitability of the PWGLSynth system

Vesa Norilo; Mikael Laurson

2008-01-01

36

Modem/Audio IntegrationModem/Audio Integration Concurrent Audio AndConcurrent Audio And  

E-print Network

/synthesisSpeech recognition/synthesis uu Advanced audio codecsAdvanced audio codecs A programmable acceleratorA programmable of features #12;Performance ImpactPerformance Impact (Ref: PC 98 section 17)(Ref: PC 98 section 17) uu GameTotal for concurrent operation: 90 MhzMhz,, 30% of a minimum PC 99 PC30% of a minimum PC 99 PC #12

Maher, Robert C.

37

Applications of virtual audio  

Microsoft Academic Search

Technology for electronically simulating spatial sound over loudspeakers and headphones has matured in the past few decades to facilitate many new applications of virtual audio. Electronic simulation of directional and distance auditory cues has greatly expanded the areas of application of virtual audio. Some potential aerospace applications include monitoring spatially separated speech communication signals to increase understanding, navigating by an

Mark Ericson; William D'Angelo; Eric Scarborough; Steve Rogers; Philip Amburn; Dennis Ruck

1993-01-01

38

Unsupervised Audio Scene Analysis  

Microsoft Academic Search

little attention is being paid to general audio and long-term modeling of audio in general. Even an ASR system which could give a complete transcription of the words heard in an environment would lack vital information. E. g., who was talking, when they were talking, what was the tone of the conversation, did someone slam the door, did someone use

Chris Stauffer; Eric Grimson; L. Grimson

39

Audio Engineers: Sound Weavers  

NSDL National Science Digital Library

Students are introduced to audio engineers, discovering the type of environment in which they work and exactly what they do on a day-to-day basis. Students come to realize that audio engineers help produce their favorite music and movies.

Integrated Teaching And Learning Program

40

An introduction to super audio CD and DVD-Audio  

Microsoft Academic Search

Highlights the latest developments in consumer audio and specifically in DVD-Audio and SACD. The DVD-Audio specification allows for up to 24-b PCM data and uses the Meridian lossless packing (MLP) algorithm to provide up to six channels of high-quality, multichannel audio at sampling rates of up to 96 kHz for six channels or 192 kHz for two channels. Super-audio CD

K. Konstantinides

2003-01-01

41

Perceptual coding of digital audio  

Microsoft Academic Search

During the last decade, CD-quality digital audio has essentially replaced analog audio. Emerging digital audio applications for network, wireless, and multimedia computing systems face a series of constraints such as reduced channel bandwidth, limited storage capacity, and low cost. These new applications have created a demand for high-quality digital audio delivery at low bit rates. In response to this need,

Ted Painter; Andreas Spanias

2000-01-01

42

The Audio Adventurer: Design of a Portable Audio Adventure Game  

Microsoft Academic Search

In this paper we describe the design of a portable device for playing audio adventure games. This device enables the player\\u000a to explore an audio world, interact with it, and solve challenges while a narrative evolves. To avoid the difficulties that\\u000a can arise when freely navigating open spaces in audio-only worlds, we structured our audio world as a network of

Philip Mendels; Joep W. Frens

2008-01-01

43

3D Audio System  

NASA Technical Reports Server (NTRS)

Ames Research Center research into virtual reality led to the development of the Convolvotron, a high speed digital audio processing system that delivers three-dimensional sound over headphones. It consists of a two-card set designed for use with a personal computer. The Convolvotron's primary application is presentation of 3D audio signals over headphones. Four independent sound sources are filtered with large time-varying filters that compensate for motion. The perceived location of the sound remains constant. Possible applications are in air traffic control towers or airplane cockpits, hearing and perception research and virtual reality development.

1992-01-01

44

Real World Audio  

NASA Technical Reports Server (NTRS)

Crystal River Engineering was originally featured in Spinoff 1992 with the Convolvotron, a high speed digital audio processing system that delivers three-dimensional sound over headphones. The Convolvotron was developed for Ames' research on virtual acoustic displays. Crystal River is a now a subsidiary of Aureal Semiconductor, Inc. and they together develop and market the technology, which is a 3-D (three dimensional) audio technology known commercially today as Aureal 3D (A-3D). The technology has been incorporated into video games, surround sound systems, and sound cards.

1998-01-01

45

Duplicate Detection and Audio Thumbnails with Audio Fingerprinting  

Microsoft Academic Search

Audio fingerprinting is a powerful tool for identifying either streaming or file-based audio, using a database of fingerprints. This paper presents two new applications: duplicate detection, whose goal is to identify duplicate audio clips in a set, even if they differ in compression quality or duration, and thumbnail generation, which aims at providing a representative short clip of a music

Christopher J. C. Burges; Daniel Plastina; John C. Platt; Erin Renshaw; Henrique S. Malvar

2004-01-01

46

audio-transcript  

Cancer.gov

Audio Clips of Telephone Calls to NCI’s Cancer Information Service Inquiries from Callers Caller #1: I have a…a girlfriend that has liver cancer, and she’s going for chemo, and I was wondering if you have any information on liver cancer? Caller #

47

Women's Audio Mission  

NSDL National Science Digital Library

Get the inside scoop on the recording industry. The Women's Audio Mission is dedicated to helping women and girls in the field. Not only can you learn more about the industry on these pages, you can see the way woman have made their mark in it.

Women\\'s Audio Mission

48

AHA: Audio HTML Access  

Microsoft Academic Search

This report discusses the “AHA” system for presenting HTML in audio for blind users and others who wish to access the WWW non-visually. AHA is a framework and set of suggestions for HTML presentation based on an initial experiment. Further experimentation and further revisions will be performed with the system.

Frankie James

1997-01-01

49

Acoustic chase : designing an interactive audio environment to stimulate human body movement  

E-print Network

An immersive audio environment was created that explores how humans react to commands imposed by a machine generating its acoustic stimuli on the basis of tracked body movement. In this environment, different states of ...

Schiessl, Simon Karl Josef, 1972-

2004-01-01

50

Semantic Learning for Audio Applications: A Computer Vision Approach  

Microsoft Academic Search

Recent work in machine learning has significantly benefited semantic extraction tasks in computer vision, particularly for object recognition and image retrieval. We argue that the computer vision techniques that have been successfully applied in those settings can effectively be translated to other domains, such as audio. This claim is supported by recent results in music vs. speech classification, structure from

Rahul Sukthankar; Yan Ke; Derek Hoiem

2006-01-01

51

Audio Feature Engineering for Automatic Music Genre Classification  

E-print Network

Audio Feature Engineering for Automatic Music Genre Classification Paolo Annesi, Roberto Basili learning algorithm, i.e. Support Vector Machines, to design an automatic classifier of music genres Introduction Music genres are difficult to describe as there is no complete agreement on their definition

Moschitti, Alessandro

52

A ROBUST AUDIO FINGERPRINT'S BASED IDENTIFICATION METHOD An audio fingerprint is a small digest of an audio file  

E-print Network

A ROBUST AUDIO FINGERPRINT'S BASED IDENTIFICATION METHOD ABSTRACT An audio fingerprint is a small digest of an audio file computed from its main perceptual properties. Like human fingerprints, audio fingerprints allow to identify an audio file among a set of candidates but does not allow to re­ trieve any

Brun, Chez Luc

53

The ISO audio coding standard  

Microsoft Academic Search

An ISO audio coding standard is being developed that will provide an audio quality comparable to that of a compact disc using a reduced bit rate of about 2×128 kb\\/s for a stereo sound signal instead of 2×706 kb\\/s. Four coding algorithms have been considered in order to develop the audio coding standard. Two of these coding algorithms have been

H. G. Musmann

1990-01-01

54

Audio Engineering Society Convention Paper  

E-print Network

modulation technique has been proposed and applied in analog to digital conversion for many years [1 of the Audio Engineering Society. Fuzzy Impulsive Control of High Order Interpolative Lowpass Sigma Delta.reiss@elec.qmul.ac.uk ABSTRACT In this paper, high order interpolative sigma delta modulators (SDMs) in audio applications

Reiss, Josh

55

Audio Engineering Society Convention Paper  

E-print Network

josh.reiss@elec.qmul.ac.uk ABSTRACT Sigma delta modulation is a popular form of audio analogue-to-digital popular methods for analog to digital (and digital to analog) conversion for audio applications. Yet of Sigma Delta Modulators Georgi Tsenov1 , Valeri Mladenov1 , and Joshua D. Reiss2 1 Dept of Theoretical

Reiss, Josh

56

UNICEF Video/Audio  

NSDL National Science Digital Library

UNICEF is known throughout the world for their focus on the health, education, equality and protection of children. They produce a number of helpful research reports and policy briefs, and as visitors to this site will find out, a good deal of audio and visual material in the form of podcasts, video news reports, and radio programs. Visitors to the UNICEF Radio area will find a wide range of radio reports on topics such as Nigeria's efforts to contain outbreaks of avian influenza and the effects of floods in Mozambique on children. Visitors interested in podcasts will be impressed with the offerings here, as they include over one hundred total archived programs, and visitors can also sign up to receive each new addition to this collection.

57

BATS: The Blind Audio Tactile Mapping System  

Microsoft Academic Search

The BATS project focuses on helping students with visual impairments access and explore spatial information using standard computer hardware and open source software. Our work is largely based on prior techniques used in presenting maps to the blind such as text-to-speech synthesis, auditory icons, and tactile feedback. We add spatial sound to position auditory icons and speech callouts in three

Peter Parente; Gary Bishop

58

A Tutorial on MPEG\\/Audio Compression  

Microsoft Academic Search

ABSTRACT This tutorial covers the theory behind MPEG\\/audio compression This algorithm was developed by the Motion Picture Experts Group (MPEG), as an International Organization for Standardization (ISO) standard for the high fidelity compression of digital audio The MPEG\\/audio compression standard is one part of a multiple part standard that addresses the compression of video - 2), the compression of audio

Davis Pan

1995-01-01

59

Modeling Audio Fingerprints: Structure, Distortion, Capacity  

Microsoft Academic Search

An audio fingerprint is a compact low-level representation of a multimedia signal. An audio fingerprint can be used to identify audio files or fragments in a reliable way. The use of audio fingerprints for identification consists of two phases. In the enrollment phase known content is fingerprinted, and ingested into a database, together with all relevant metadata. In the identification

P. J. O. Doets

2010-01-01

60

A centralized audio presentation manager  

SciTech Connect

The centralized audio presentation manager addresses the problems which occur when multiple programs running simultaneously attempt to use the audio output of a computer system. Time dependence of sound means that certain auditory messages must be scheduled simultaneously, which can lead to perceptual problems due to psychoacoustic phenomena. Furthermore, the combination of speech and nonspeech audio is examined; each presents its own problems of perceptibility in an acoustic environment composed of multiple auditory streams. The centralized audio presentation manager receives abstract parameterized message requests from the currently running programs, and attempts to create and present a sonic representation in the most perceptible manner through the use of a theoretically and empirically designed rule set.

Papp, A.L. III; Blattner, M.M.

1994-05-16

61

The Audio-Tutorial System  

ERIC Educational Resources Information Center

Describes the audio-tutorial program in Botany at Purdue University. Advantages include adaptability to individual stduent needs, integration of laboratory activities and information giving, aid flexibility in use of media and means of presentation. (EB)

Postlethwait, S. N.

1970-01-01

62

Text to Speech: A 4-H Model of Accessibility and Inclusion  

ERIC Educational Resources Information Center

4-H project manuals play an integral part in a youth's ability to achieve mastery in a specific project area. For youth who struggle with reading, written 4-H materials prove inadequate in addressing the needs of the learner. This article proposes a new delivery method of 4-H educational material designed to create a more inclusive and…

Green, Jeremy W.

2012-01-01

63

Using Prosodic Structure to Improve Pitch Range Variation in Text to Speech Synthesis  

E-print Network

range variation is most appar- ent at a tone group level of prosodic structure, and how phrase ini- tial, consisting of phrases, consisting of tone groups. The terms used here for phrase structure units have been structure, as one high level of phrasing consists of exactly one phrase unit from the level below it

Edinburgh, University of

64

Using Prosodic Structure to Improve Pitch Range Variation in Text to Speech Synthesis  

E-print Network

range variation is most appar­ ent at a tone group level of prosodic structure, and how phrase ini­ tial, consisting of phrases, consisting of tone groups. The terms used here for phrase structure units have been structure, as one high level of phrasing consists of exactly one phrase unit from the level below it

Edinburgh, University of

65

The derivation of prosody for text-to-speech from prosodic sentence structure  

Microsoft Academic Search

Suprasegmental phenomena in synthetic speech should reflect the linguistic structure of the input text. An algorithm is described, which establishes the prosodic sentence structure (PSS). This can be achieved without exhaustive syntactic parsing, using a dictionary of 550 function words. Subsequently, phrase and accent locations are derived from the PPS; accentuation is also affected by some semantic and contextual information.

H. Quené; R. W. J. Kager

1992-01-01

66

WORD AND SYLLABLE MODELS FOR GERMAN TEXT-TO-SPEECH SYNTHESIS  

E-print Network

of the productivity of word forming affixes. One important sub- component of the word model is a phonotactic syllable- ery natural language has productive word formation processes, and the community of speakers and innovative processes as regular words are. Thus, in unlimited vocabulary scenarios we are not facing a memory

Möbius, Bernd

67

47 CFR 73.403 - Digital audio broadcasting service requirements.  

Code of Federal Regulations, 2010 CFR

...2010-10-01 false Digital audio broadcasting service...Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED...RADIO BROADCAST SERVICES Digital Audio Broadcasting § 73.403 Digital audio broadcasting...

2010-10-01

68

The Timbre Toolbox: extracting audio descriptors from musical signals.  

PubMed

The analysis of musical signals to extract audio descriptors that can potentially characterize their timbre has been disparate and often too focused on a particular small set of sounds. The Timbre Toolbox provides a comprehensive set of descriptors that can be useful in perceptual research, as well as in music information retrieval and machine-learning approaches to content-based retrieval in large sound databases. Sound events are first analyzed in terms of various input representations (short-term Fourier transform, harmonic sinusoidal components, an auditory model based on the equivalent rectangular bandwidth concept, the energy envelope). A large number of audio descriptors are then derived from each of these representations to capture temporal, spectral, spectrotemporal, and energetic properties of the sound events. Some descriptors are global, providing a single value for the whole sound event, whereas others are time-varying. Robust descriptive statistics are used to characterize the time-varying descriptors. To examine the information redundancy across audio descriptors, correlational analysis followed by hierarchical clustering is performed. This analysis suggests ten classes of relatively independent audio descriptors, showing that the Timbre Toolbox is a multidimensional instrument for the measurement of the acoustical structure of complex sound signals. PMID:22087919

Peeters, Geoffroy; Giordano, Bruno L; Susini, Patrick; Misdariis, Nicolas; McAdams, Stephen

2011-11-01

69

Audio feedback system for engineering test satellite VII  

NASA Astrophysics Data System (ADS)

An efficient and reliable interface to manage telemetry information is most important in the teleoperation of space robots. Operators need to be able to recognize and verify large amounts of telemetry information quickly and accurately. Visual information around the workspace of space robots is very limited, and the detailed position of work is uncertain. These difficulties raise the load on operators. We have been running experiments on assembling antennas using the Engineering Test Satellite VII (ETS-VII), so we are very much aware of the need for an effective man-machine interface to handle telemetry information. We have developed an audio interface system for the efficient operation of ETS-VII. Unlike a visual interface, this audio interface allows an operator to (1) perceive information even if pay small attention for it, and (2) easily identify trends and changes. The system analyzes telemetry information in real- time, and converts changes in the status of information into voice data, and changes in the magnitude of forces into the frequency of motor noise. The effectiveness of this audio interface was verified in operations of ETS-VII by monitoring eye movements over time. Time is measured by the mean interval between status changes and command submissions. An eye mark recorder records eye movements. The data suggests significant effects of the audio interface system.

Nagai, Yasufumi; Kimura, Shinichi; Tsuchiya, Shigeru; Iida, Takashi

1999-11-01

70

Kenneth S. Goldstein Audio Recordings  

NSDL National Science Digital Library

This remarkable collection consists of over 850 audio reels recorded primarily by Dr. Kenneth S. Goldstein. He was a folklorist, record producer, and teacher who happened to also find time to serve as chairman of the department of folklore and folklife at the University of Pennsylvania. These audio tapes include interviews with musicians and storytellers, recitations of folktales from Newfoundland and Labrador, Pennsylvania, and Scotland. First-time visitors might do well to look over the English Language Folktale reels and then move on to perform their own detailed search across the entire archive. Visitors can also elect to receive updates on the collection via their RSS feed.

71

Emphasizing the Audio in the Audio-Lingual Approach.  

National Technical Information Service (NTIS)

The neglect of listening comprehension in the audio-lingual approach to the teaching of foreign languages is pointed out. The need to develop listening comprehension, not only as a foundation for speaking, but as a skill in its own right, is stressed. The...

G. Newmark, E. Diller

1964-01-01

72

Audio-visual integration in multimodal communication  

Microsoft Academic Search

We review recent research that examines audio-visual integration in multimodal communication. The topics include bimodality in human speech, human and automated lip reading, facial animation, lip synchronization, joint audio-video coding, and bimodal speaker verification. We also study the enabling technologies for these research topics, including automatic facial-feature tracking and audio-to-visual mapping. Recent progress in audio-visual research shows that joint processing

Tsuhan Chen; Ram R. Rao

1998-01-01

73

Audio, Video and Audio-Visual Signatures for Short Video Clip Detection: Experiments on Trecvid2003  

E-print Network

Audio, Video and Audio-Visual Signatures for Short Video Clip Detection: Experiments on Trecvid2003.Senechal,Denis.Pellerin}@lis.inpg.fr, Laurent.Besacier@imag.fr ABSTRACT In this paper, we present the association of audio and video signatures for short video clip detection. First, we present an audio signature based on the spectral flatness measure

Paris-Sud XI, Université de

74

AUDIO GRAFFITI: A LOCATION BASED AUDIO-TAGGING AND REMIXING ENVIRONMENT  

E-print Network

AUDIO GRAFFITI: A LOCATION BASED AUDIO-TAGGING AND REMIXING ENVIRONMENT Zack Settel University demonstration of Audio Graffiti, we explore novel modes of interaction with sound and space. Set in an outdoor and personal remixing space. 1. INTRODUCTION The conception and motivation for Audio Graffiti comes from

Cooperstock, Jeremy R.

75

Conducting Audio Files via Computer Vision  

Microsoft Academic Search

This paper presents a system to control the playback of audio flles by means of the standard classical conducting technique. Computer vision techniques are developed to track a conductor's baton, and the gesture is subsequently analysed. Audio parameters are extracted from the sound-flle and are further processed for audio beat tracking. The sound-flle playback speed is adjusted in order to

Declan Murphy; Tue Haste Andersen; Kristoffer Jensen

2003-01-01

76

Audio steganography using bit modification  

Microsoft Academic Search

A method of embedding a covert audio message in a cover utterance for secure communication is presented. The covert message is represented in a compressed form with possibly encryption and\\/or encoding for added security. One bit in each of the samples of a given cover utterance is altered in accordance with the data bits and a key. The same key

K. Gopalan

2003-01-01

77

Quality aspects of audio communication  

E-print Network

· Contributions and conclusions 2 My vision The Internet will carry a significant proportion of the world for simple audio coding e.g G.711 PCM A loss rate of 1% with no packet loss concealment A loss rate of 10% with packet loss concealment · Jitter ­ Smoothing buffers used to deliver uninterrupted speech, therefore add

78

Audio-Visual Materials Catalog.  

ERIC Educational Resources Information Center

This catalog lists 27 audiovisual programs produced by the Department of Medical Communications of the University of Texas M. D. Anderson Hospital and Tumor Institute for public distribution. Video tapes, 16 mm. motion pictures and slide/audio series are presented dealing mostly with cancer and related subjects. The programs are intended for…

Anderson (M.D.) Hospital and Tumor Inst., Houston, TX.

79

Audio Engineering Society Convention Paper  

E-print Network

of the Audio Engineering Society. Acoustical Modeling Using a Russian Roulette Strategy Bill Kapralos1,3 , Michael R. Jenkin1,3 , Evangelos Milios2,3 1 Dept. of Computer Science and Engineering, York University. 4700 Keele St. Toronto, Canada. M3J 1P3 2 Faculty of Computer Science, Dalhousie University. 6050

Milios, Evangelos E.

80

Audio Engineering Society Convention Paper  

E-print Network

Sigma delta modulation is a popular technique for high-resolution analog-to-digital conversion and digital-to- analog-conversion. It has been considered as a new format for recording and storage of audio signals, Direct Stream Digital, employing 1-bit oversampling sigma-delta modulation, has recently been

Reiss, Josh

81

Audio Engineering Society Convention Paper  

E-print Network

York 10165-2520, USA; also see www.aes.org. All rights reserved. Reproduction of this paper, or any occurring phenomenon that shifts the pitch of sound if the emitting objects distance to the listener, velocity and movement direction. In this paper we investigate spatial audio reproduction methods

Reiss, Josh

82

Audio Engineering Society Convention Paper  

E-print Network

for impaired sound-field Reference reproduction system Mapping of test signals Reference test signals Impaired York 10165-2520, USA; also see www.aes.org. All rights reserved. Reproduction of this paper, or any of a spatial audio reproduction to a reference reproduction. Test signals designed to identify distortions

Jackson, Philip JB

83

Audio Engineering Society Convention Paper  

E-print Network

York 10165-2520, USA; also see www.aes.org. All rights reserved. Reproduction of this paper, or any for Vision, Speech & Signal Processing, University of Surrey, UK 2 Institute of Sound Recording, University of a spatial audio reproduction against a reference reproduction. This paper presents implementation details

Jackson, Philip JB

84

Radioactive Decay: Audio Data Collection  

ERIC Educational Resources Information Center

Many phenomena generate interesting audible time series. This data can be collected and processed using audio software. The free software package "Audacity" is used to demonstrate the process by recording, processing, and extracting click times from an inexpensive radiation detector. The high quality of the data is demonstrated with a simple…

Struthers, Allan

2009-01-01

85

Audio Engineering Society Convention Paper  

E-print Network

of the system's basic concepts for music therapy applications. The use of the Wii Controller as means, construction and analysis of an interactive gesture-controlled audio mixing system by means of a wireless video game controller. The concept is based on the idea that the mixing engineer can step away from

Reiss, Josh

86

Aeronautical audio broadcasting via satellite  

NASA Astrophysics Data System (ADS)

A system design for aeronautical audio broadcasting, with C-band uplink and L-band downlink, via Inmarsat space segments is presented. Near-transparent-quality compression of 5-kHz bandwidth audio at 20.5 kbit/s is achieved based on a hybrid technique employing linear predictive modeling and transform-domain residual quantization. Concatenated Reed-Solomon/convolutional codes with quadrature phase shift keying are selected for bandwidth and power efficiency. RF bandwidth at 25 kHz per channel, and a decoded bit error rate at 10(exp -6) with E(sub b)/N(sub o) at 3.75 dB are obtained. An interleaver, scrambler, modem synchronization, and frame format were designed, and frequency-division multiple access was selected over code-division multiple access. A link budget computation based on a worst-case scenario indicates sufficient system power margins. Transponder occupancy analysis for 72 audio channels demonstrates ample remaining capacity to accommodate emerging aeronautical services.

Tzeng, Forrest F.

87

VISUAL AND AUDIO PRESENTATION IN MACHINE PROGRAMED INSTRUCTION. FINAL REPORT.  

ERIC Educational Resources Information Center

THIS STUDY WAS PART OF A LARGER RESEARCH PROGRAM AIMED TOWARD DEVELOPMENT OF PARADIGMS OF MESSAGE DESIGN. OBJECTIVES OF THREE PARALLEL EXPERIMENTS WERE TO EVALUATE INTERACTIONS OF PRESENTATION MODE, PROGRAM TYPE, AND CONTENT AS THEY AFFECT LEARNER CHARACTERISTICS. EACH EXPERIMENT USED 18 TREATMENTS IN A FACTORIAL DESIGN WITH RANDOMLY SELECTED…

ALLEN, WILLIAM H.

88

Working with HTML5 Audio and Video  

Microsoft Academic Search

\\u000a In this chapter, we’ll explore what you can do with two important HTML5 elements—audio and video— and we’ll show you how they can be used to create compelling applications. The audio and video elements add new media options\\u000a to HTML5 applications that allow you to use audio and video without plugins while providing a common, integrated, and scriptable\\u000a API.

Peter Lubbers; Brian Albers; Frank Salim

89

Parametric multichannel audio coding: synthesis of coherence cues  

Microsoft Academic Search

Parametric multichannel audio coding represents an audio signal as one single audio channel plus side information. The side information contains estimates of perceptually relevant differences between the original audio channels. Usually, time difference, level difference, and coherence cues are considered. These cues determine, to a large degree, the auditory spatial image that is perceived when playing back multichannel audio signals.

Christof Faller

2006-01-01

90

Audio quality assessment techniques - A review, and recent developments  

Microsoft Academic Search

Assessing the perceptual quality of wideband audio signals is an important considera- tion in many audio and multimedia networks and devices. Examples of such multimedia technologies are: streaming audio over the Internet, Digital Radio Mondiale (DRM), Digital Audio Broadcasting (DAB), VoIP (Voice over Internet Protocol), mobile phones, as well as compression algorithms for digital audio. The International Telecommunications Union (ITU)

Dermot Campbell; Edward Jones; Martin Glavin

2009-01-01

91

Three-Dimensional Audio Client Library  

NASA Technical Reports Server (NTRS)

The Three-Dimensional Audio Client Library (3DAudio library) is a group of software routines written to facilitate development of both stand-alone (audio only) and immersive virtual-reality application programs that utilize three-dimensional audio displays. The library is intended to enable the development of three-dimensional audio client application programs by use of a code base common to multiple audio server computers. The 3DAudio library calls vendor-specific audio client libraries and currently supports the AuSIM Gold-Server and Lake Huron audio servers. 3DAudio library routines contain common functions for (1) initiation and termination of a client/audio server session, (2) configuration-file input, (3) positioning functions, (4) coordinate transformations, (5) audio transport functions, (6) rendering functions, (7) debugging functions, and (8) event-list-sequencing functions. The 3DAudio software is written in the C++ programming language and currently operates under the Linux, IRIX, and Windows operating systems.

Rizzi, Stephen A.

2005-01-01

92

Audio-visual gender recognition  

NASA Astrophysics Data System (ADS)

Combining different modalities for pattern recognition task is a very promising field. Basically, human always fuse information from different modalities to recognize object and perform inference, etc. Audio-Visual gender recognition is one of the most common task in human social communication. Human can identify the gender by facial appearance, by speech and also by body gait. Indeed, human gender recognition is a multi-modal data acquisition and processing procedure. However, computational multimodal gender recognition has not been extensively investigated in the literature. In this paper, speech and facial image are fused to perform a mutli-modal gender recognition for exploring the improvement of combining different modalities.

Liu, Ming; Xu, Xun; Huang, Thomas S.

2007-11-01

93

Reverse Audio Engineering: Model-Based Inversion of Dynamic Range  

E-print Network

and entertainment. But not many know how the audio was produced. If we take sound recording and reproduction, reverse audio engineering. I. INTRODUCTION SOUND or audio engineering is an established discipline

94

Cluster: Metals. Course: Machine Shop. Research Project.  

ERIC Educational Resources Information Center

The set of 13 units is designed for use with an instructor in actual machine shop practice and is also keyed to audio visual and textual materials. Each unit contains a series of task packages which: specify prerequisites within the series (minimum is Unit 1); provide a narrative rationale for learning; list both general and specific objectives in…

Sanford - Lee County Schools, NC.

95

Digital Audio Compression By Davis Yen Pan  

E-print Network

). Figure 1 shows the digital audio process. ANALOG-TO-DIGITAL CONVERSION DIGITAL SIGNAL PROCESSING DIGITAL-TO-ANALOG functions (e.g., mixing, filtering, and equalization) through the digital computer. The conversion from the analog to the digital do- main begins by sampling the audio input in regular, discrete intervals of time

Lazar, Aurel A.

96

Features for audio and music classification  

Microsoft Academic Search

Four audio feature sets are evaluated in their ability to classify five general audio classes and seven pop- ular music genres. The feature sets include low-level signal properties, mel-frequency spectral coefficients, and two new sets based on perceptual models of hear- ing. The temporal behavior of the features is ana- lyzed and parameterized and these parameters are in- cluded as

Martin F. Mckinney; Jeroen Breebaart

2003-01-01

97

Realistic Human Action Recognition with Audio Context  

Microsoft Academic Search

Recognizing human actions in realistic scenes has emerged as a challenging topic due to various aspects such as dynamic backgrounds. In this paper, we present a novel approach to taking audio context into account for better action recognition performance, since audio can provide strong evidence to certain actions such as phone-ringing to answer-phone. At first, classifiers are established for visual

Qiuxia Wu; Zhiyong Wang; Feiqi Deng; David Dagan Feng

2010-01-01

98

Video Genre Categorization Using Audio Wavelet Coefficients  

Microsoft Academic Search

Abstract In this paper, we investigate the use of a wavelet transform-based analysis of audio tracks accompanying videos for the problem of automatic program genre detec- tion. We compare the classification performancebased on wavelet-based audio features to that using conventional features derived from Fourier and time analysis for the task of discriminating TV programs such as news, com- mercials, music

P. Q. Dinh; C. Dorai; S. Venkatesh

2002-01-01

99

Let's Hear It for Audio Mining  

NSDL National Science Digital Library

A detailed introduction is offered on the Leavitt Communications Web site. This article delves into how audio mining works by giving a basic technical understanding of the process. Approaches to audio mining are discussed, as well as how the technology works, performance, languages, and the challenges faced by designers.

Leavitt, Neal

2007-12-11

100

Dual Audio TV Instruction: A Broadcast Experiment.  

ERIC Educational Resources Information Center

An experiment assessed the potential effectiveness of "dual audio television instruction" (DATI) as a mass education medium. The DATI consisted of a radio program heard by children while they watched television shows. The audio instructor did not talk when the television characters spoke, but used the "quiet" times to help with reading, define…

Borton, Terry; And Others

101

Dual Audio Television: The First Public Broadcast.  

ERIC Educational Resources Information Center

A study was conducted in conjunction with the first public announced broadcast of dual audio television--a new method of combining simultaneous radio instruction and commercial entertainment TV. The study was designed to determine the audience which would be attracted to dual audio, the practicality of such instruction, and the correlation between…

Borton, Terry; And Others

102

The Space Shuttle audio distribution system  

NASA Astrophysics Data System (ADS)

The Space Shuttle Orbiter's Audio Distribution System (ADS) provides voice communication among crew members and an interface to various radio access functions and hardline functions associated with a given orbital mission's payloads. The ADS encompasses facilities for audio processing, mixing, amplification, volume control, isolation, switching, and distribution. Attention is given to ADS design features and interface possibilities.

Lee, R. K. P.

103

Collusion-resistant audio fingerprinting system in the modulated complex lapped transform domain.  

PubMed

Collusion-resistant fingerprinting paradigm seems to be a practical solution to the piracy problem as it allows media owners to detect any unauthorized copy and trace it back to the dishonest users. Despite the billionaire losses in the music industry, most of the collusion-resistant fingerprinting systems are devoted to digital images and very few to audio signals. In this paper, state-of-the-art collusion-resistant fingerprinting ideas are extended to audio signals and the corresponding parameters and operation conditions are proposed. Moreover, in order to carry out fingerprint detection using just a fraction of the pirate audio clip, block-based embedding and its corresponding detector is proposed. Extensive simulations show the robustness of the proposed system against average collusion attack. Moreover, by using an efficient Fast Fourier Transform core and standard computer machines it is shown that the proposed system is suitable for real-world scenarios. PMID:23762455

Garcia-Hernandez, Jose Juan; Feregrino-Uribe, Claudia; Cumplido, Rene

2013-01-01

104

History Channel: Audio and Video  

NSDL National Science Digital Library

ItâÂÂs perhaps a bit of a stretch of the imagination to think of a place that would include both a clip of Spiro Agnew speaking out on what he perceived to be the biases of television news coverage and some archival footage of Depression-era gangsters, but itâÂÂs all right here on the History ChannelâÂÂs Audio and Video online archive. The speech archive is quite nice, and may prove to be both edifying and entertaining. Visitors can browse the speech archive by topics (such as War & Diplomacy) or alphabetically. Some of the clips offered here include comments by the scientist Wernher von Braun after hearing that the U.S.S.R had landed a spacecraft on the moon. The video clip section is also quite well-developed, as it contains clips of the trial of Adolf Eichmann and the breaking of the sound barrier.

105

Interactive Learning of Spoken Words and Their Meanings Through an Audio-Visual Interface  

NASA Astrophysics Data System (ADS)

This paper presents a new interactive learning method for spoken word acquisition through human-machine audio-visual interfaces. During the course of learning, the machine makes a decision about whether an orally input word is a word in the lexicon the machine has learned, using both speech and visual cues. Learning is carried out on-line, incrementally, based on a combination of active and unsupervised learning principles. If the machine judges with a high degree of confidence that its decision is correct, it learns the statistical models of the word and a corresponding image category as its meaning in an unsupervised way. Otherwise, it asks the user a question in an active way. The function used to estimate the degree of confidence is also learned adaptively on-line. Experimental results show that the combination of active and unsupervised learning principles enables the machine and the user to adapt to each other, which makes the learning process more efficient.

Iwahashi, Naoto

106

Digital audio for satellite network radio  

NASA Astrophysics Data System (ADS)

The paper presents the time-division-multiplexed (TDM) digital system supplied by Scientific-Atlanta for ABC, CBS, and NBC. The system has a transmission rate of 8.78 Mbps, and can demodulate, decode and demultiplex the data into the desired audio and data channels, supporting data rates equivalent to twenty 15 kHz audio channels at 384 kbps each. Digital transmission is used for data and channel use flexibility, and efficient usage of the satellite transponder. This TDM digital earth terminal configuration provides very high quality audio reception, built-in expansion capability for future services, and relative immunity to terrestrial interference.

McBride, A. L.

1982-04-01

107

How to Make an Audio Tape Bow  

NSDL National Science Digital Library

From this How To slide show, you create an Audio Tape Bow that can play distorted audio sounds by running it across a tape head. Learners will open up cassette tapes and used tape players to see how they work. Then, they will dismantle some of the parts in order to create and design a new instrument. The How To includes a video of the sound the Audio Tape Bow makes when run across a tape head. This activity is a great for exploring the way sounds are recorded and the technology used to play it back.

Minnesota, Science M.

2012-06-26

108

A ROBUST AUDIO FINGERPRINT EXTRACTION ALGORITHM Jerome Lebosse  

E-print Network

A ROBUST AUDIO FINGERPRINT EXTRACTION ALGORITHM Jâ??erâ??ome Lebossâ??e France Tâ??elâ??ecom R&D 32 rue des coutures 14000 Caen,France jeanclaude.pailles@orange­ft.com ABSTRACT An Audio fingerprint is a small digest of an audio file computed from its main perceptual properties. Like human fingerprints, Audio fingerprints

Brun, Chez Luc

109

AUDIO-VIDEO EVENT RECOGNITION SYSTEM FOR PUBLIC TRANSPORT SECURITY  

E-print Network

AUDIO-VIDEO EVENT RECOGNITION SYSTEM FOR PUBLIC TRANSPORT SECURITY Van-Thinh Vu Quoc-Cuong Pham Keywords: audio-video surveillance, audio-video event, behavior analysis, event recognition. Abstract This paper presents an audio-video surveillance system for the automatic surveillance in public transport

Paris-Sud XI, Université de

110

REISS ET AL. COMPRESSION FOR SUPER AUDIO CD Audio Engineering Society  

E-print Network

, Philips and Sony have devised and implemented a new audio storage format known as Super Audio Compact Disc compact discs use 16 bit PCM encoding at 44.1kHz, DSD uses 1-bit sampling of audio at 64x44.1kHz. Thus in practical situations allow this amount of DSD to be stored on a single disk [1]. Compact Discs, on the other

Reiss, Josh

111

Audio Engineering Society Convention Paper 7884  

E-print Network

reflections can reduce front-back confusion for audio presented with non- individualized HRTFs via headphones will then be presented via headphones with these spatialized stimuli to determine the effect of early-order reflections

Maher, Robert C.

112

36 CFR 1002.12 - Audio disturbances.  

Code of Federal Regulations, 2010 CFR

...prohibited: (1) Operating motorized equipment or machinery such as an electric generating plant, motor vehicle, motorized toy, or an audio device, such as a radio, television set, tape deck or musical instrument, in a manner that exceeds...

2010-07-01

113

36 CFR 2.12 - Audio disturbances.  

Code of Federal Regulations, 2010 CFR

...prohibited: (1) Operating motorized equipment or machinery such as an electric generating plant, motor vehicle, motorized toy, or an audio device, such as a radio, television set, tape deck or musical instrument, in a manner: (i) That...

2010-07-01

114

Automatic Musical Genre Classification Of Audio Signals  

E-print Network

... describe music. They are commonly used to structure the increasing amounts of music available in digital form on the Web and are important for music information retrieval. Genre categorization for audio has traditionally been performed manually. A particular musical genre is characterized by statistical properties related to the instrumentation, rhythmic structure and form of its members. In this work, algorithms for the automatic genre categorization of audio signals are described. More specifically, we propose a set of features for representing texture and instrumentation. In addition a novel set of features for representing rhythmic structure and strength is proposed. The performance of those feature sets has been evaluated by training statistical pattern recognition classifiers using real world audio collections. Based on the automatic hierarchical genre classification two graphical user interfaces for browsing and interacting with large audio collections have been developed.

George Tzanetakis; Georg Essl; Perry Cook

2002-01-01

115

Automatic Musical Genre Classification Of Audio Signals  

E-print Network

Automatic Musical Genre Classification Of Audio Signals George Tzanetakis Computer Science Musical genres are categorical descriptions that are used to describe music. They are commonly used musical genre is characterized by statistical properties related to the instrumentation, rhythmic

Tzanetakis, George

116

Development of Audio Transducer Helmet Assembly.  

National Technical Information Service (NTIS)

The program objective is the development of an Audio Transducer Helmet Assembly to be used in the Army Air Crewman Helmet. The Assembly includes earphones mounted in attenuating earcups along with auxiliary hardware, wiring and ancillary devices. A furthe...

A. J. Rosenheck, A. L. DiMattia

1967-01-01

117

Digital audio for satellite network radio  

Microsoft Academic Search

The paper presents the time-division-multiplexed (TDM) digital system supplied by Scientific-Atlanta for ABC, CBS, and NBC. The system has a transmission rate of 8.78 Mbps, and can demodulate, decode and demultiplex the data into the desired audio and data channels, supporting data rates equivalent to twenty 15 kHz audio channels at 384 kbps each. Digital transmission is used for data

A. L. McBride

1982-01-01

118

Digital Multicasting of Multiple Audio Streams  

NASA Technical Reports Server (NTRS)

The Mission Control Center Voice Over Internet Protocol (MCC VOIP) system (see figure) comprises hardware and software that effect simultaneous, nearly real-time transmission of as many as 14 different audio streams to authorized listeners via the MCC intranet and/or the Internet. The original version of the MCC VOIP system was conceived to enable flight-support personnel located in offices outside a spacecraft mission control center to monitor audio loops within the mission control center. Different versions of the MCC VOIP system could be used for a variety of public and commercial purposes - for example, to enable members of the general public to monitor one or more NASA audio streams through their home computers, to enable air-traffic supervisors to monitor communication between airline pilots and air-traffic controllers in training, and to monitor conferences among brokers in a stock exchange. At the transmitting end, the audio-distribution process begins with feeding the audio signals to analog-to-digital converters. The resulting digital streams are sent through the MCC intranet, using a user datagram protocol (UDP), to a server that converts them to encrypted data packets. The encrypted data packets are then routed to the personal computers of authorized users by use of multicasting techniques. The total data-processing load on the portion of the system upstream of and including the encryption server is the total load imposed by all of the audio streams being encoded, regardless of the number of the listeners or the number of streams being monitored concurrently by the listeners. The personal computer of a user authorized to listen is equipped with special- purpose MCC audio-player software. When the user launches the program, the user is prompted to provide identification and a password. In one of two access- control provisions, the program is hard-coded to validate the user s identity and password against a list maintained on a domain-controller computer at the MCC. In the other access-control provision, the program verifies that the user is authorized to have access to the audio streams. Once both access-control checks are completed, the audio software presents a graphical display that includes audiostream-selection buttons and volume-control sliders. The user can select all or any subset of the available audio streams and can adjust the volume of each stream independently of that of the other streams. The audio-player program spawns a "read" process for the selected stream(s). The spawned process sends, to the router(s), a "multicast-join" request for the selected streams. The router(s) responds to the request by sending the encrypted multicast packets to the spawned process. The spawned process receives the encrypted multicast packets and sends a decryption packet to audio-driver software. As the volume or muting features are changed by the user, interrupts are sent to the spawned process to change the corresponding attributes sent to the audio-driver software. The total latency of this system - that is, the total time from the origination of the audio signals to generation of sound at a listener s computer - lies between four and six seconds.

Macha, Mitchell; Bullock, John

2007-01-01

119

Could Audio-Described Films Benefit from Audio Introductions? An Audience Response Study  

ERIC Educational Resources Information Center

Introduction: Time constraints limit the quantity and type of information conveyed in audio description (AD) for films, in particular the cinematic aspects. Inspired by introductory notes for theatre AD, this study developed audio introductions (AIs) for "Slumdog Millionaire" and "Man on Wire." Each AI comprised 10 minutes of…

Romero-Fresco, Pablo; Fryer, Louise

2013-01-01

120

A SENTENCE-PITCH-CONTOUR GENERATION METHOD USING VQ\\/HMM FOR MANDARIN TEXT-TO-SPEECH  

Microsoft Academic Search

In this paper, a method with sentence-wide optimization consideration is proposed to generate a Mandarin sentence's pitch-contour. The developed model is called the sentence pitch-contour HMM (SPC-HMM) due to its use of VQ (vector quantization) and HMM (hidden Markov model). To construct an SPC-HMM, the pitch-contours of the syllables from each training sentence are normalized on both time and pitch-height

Hung-Yan GU; Chung-Chieh YANG

121

Kid Machine  

NSDL National Science Digital Library

This activity is on page 3 (continued on page 2) of the pdf, part of the Simple Machines Discovery Box. In this fun activity, learners "create" a complex machine by simulating the parts in action. Learners move their bodies and make sounds as if they are individual parts of a moving machine. Then learners discover what happens when part of a machine is broken and problem solve ways to fix it.

Omsi

2004-01-01

122

Quantum Machines  

E-print Network

We discuss quantum information processing machines. We start with single purpose machines that either redistribute quantum information or identify quantum states. We then move on to machines that can perform a number of functions, with the function they perform being determined by a program, which is itself a quantum state. Examples of both deterministic and probabilistic programmable machines are given, and we conclude with a discussion of the utility of quantum programs.

Mark Hillery; Vladimir Buzek

2009-03-24

123

Mining machine  

Microsoft Academic Search

A haulage system for a mining machine comprises a mining machine mounted on and\\/or guided by a conveyor and reciprocable with respect thereto, the conveyor being provided with a rack having plural rows of teeth of identical pitch, with the teeth of one row staggered with respect to an adjacent row(s), and the machine being provided with at least one

Parrott

1985-01-01

124

Podscanning : audio microcontent and synchronous communication for mobile devices  

E-print Network

Over the past decade, computationally powerful audio communication devices have become commonplace. Mobile devices have high storage capacity for digital audio, and smartphones or networked PDAs can be used to stream ...

Wheeler, Patrick Sean

2010-01-01

125

47 CFR 10.520 - Common audio attention signal.  

Code of Federal Regulations, 2013 CFR

...FEDERAL COMMUNICATIONS COMMISSION GENERAL WIRELESS EMERGENCY ALERTS Equipment Requirements § 10.520 Common audio attention...d) The audio attention signal must be restricted to use for Alert Messages under part 10. (e) A device may include the...

2013-10-01

126

47 CFR 10.520 - Common audio attention signal.  

Code of Federal Regulations, 2011 CFR

...FEDERAL COMMUNICATIONS COMMISSION GENERAL COMMERCIAL MOBILE ALERT SYSTEM Equipment Requirements § 10.520 Common audio...d) The audio attention signal must be restricted to use for Alert Messages under part 10. (e) A device may include the...

2011-10-01

127

47 CFR 10.520 - Common audio attention signal.  

Code of Federal Regulations, 2010 CFR

...FEDERAL COMMUNICATIONS COMMISSION GENERAL COMMERCIAL MOBILE ALERT SYSTEM Equipment Requirements § 10.520 Common audio...d) The audio attention signal must be restricted to use for Alert Messages under part 10. (e) A device may include the...

2010-10-01

128

47 CFR 10.520 - Common audio attention signal.  

Code of Federal Regulations, 2012 CFR

...FEDERAL COMMUNICATIONS COMMISSION GENERAL COMMERCIAL MOBILE ALERT SYSTEM Equipment Requirements § 10.520 Common audio...d) The audio attention signal must be restricted to use for Alert Messages under part 10. (e) A device may include the...

2012-10-01

129

Overview of the MPEG\\/audio compression algorithm  

Microsoft Academic Search

This paper gives a summary of the MPEG\\/audio compression algorithm. This algorithm was developed by the Motion Picture Experts Group (MPEG), as an International Organization for Standardization standard for the high fidelity compression of digital audio. The MPEG\\/audio compression standard is one part of a multiple part standard that addresses the compression of video (11172-2), the compression of audio (11172-3),

Davis Y. Pan

1994-01-01

130

A real-time IMT2000 audio transmission system  

Microsoft Academic Search

For real-time audio streaming services over next-generation mobile phone systems, such as live concert broadcasting and radio relaying, we have developed a realtime IMT-2000 audio transmission system. This system consists of a real-time encoder and a portable real-time decoder, which are both based on fixed-point DSP chips. TWINVQ technology is used for audio compression, and a scheme to protect audio

Takeshi Mori; Takao Kaneko; Takehiro Moriya; K. Ikeda

2001-01-01

131

Interaction with Machine Improvisation  

NASA Astrophysics Data System (ADS)

We describe two multi-agent architectures for an improvisation oriented musician-machine interaction systems that learn in real time from human performers. The improvisation kernel is based on sequence modeling and statistical learning. We present two frameworks of interaction with this kernel. In the first, the stylistic interaction is guided by a human operator in front of an interactive computer environment. In the second framework, the stylistic interaction is delegated to machine intelligence and therefore, knowledge propagation and decision are taken care of by the computer alone. The first framework involves a hybrid architecture using two popular composition/performance environments, Max and OpenMusic, that are put to work and communicate together, each one handling the process at a different time/memory scale. The second framework shares the same representational schemes with the first but uses an Active Learning architecture based on collaborative, competitive and memory-based learning to handle stylistic interactions. Both systems are capable of processing real-time audio/video as well as MIDI. After discussing the general cognitive background of improvisation practices, the statistical modelling tools and the concurrent agent architecture are presented. Then, an Active Learning scheme is described and considered in terms of using different improvisation regimes for improvisation planning. Finally, we provide more details about the different system implementations and describe several performances with the system.

Assayag, Gerard; Bloch, George; Cont, Arshia; Dubnov, Shlomo

132

A new audio device driver abstraction Dickon Reed  

E-print Network

-- an MPEG audio decoder playing music, a game, user interface sound effects and a video conference games, play several effects at once and so have to perform their own mixing on the various audio� ence ``Networks and Operating Systems Support for distributed audio and video'', 1998 Traditional audio

Hand, Steven

133

Highly realistic audio spatialization for multiparty conferencing using headphones  

Microsoft Academic Search

It is known that during multi-party conferencing spatialized audio which maps remote participants' voices to distinct virtual locations improves the listening experience. In this paper, we consider the case when the audio is rendered through headphones due to e.g. privacy reasons. Although existing headphone spatial audio techniques abound, most lack the desired realism dictated by listeners' expectation of naturalness in

Wei-Ge Chen; Zhengyou Zhang

2009-01-01

134

How text and audio chat change the online video experience  

Microsoft Academic Search

Many online video sites provide a text chat feature so viewers can chat with others while watching videos. How does chatting affect their experience? Would audio chat be more fun or would it be too distracting? The richer medium of audio may more closely approximate the living room or club experience, but human factors research suggests that audio chat could

Justin D. Weisz; Sara B. Kiesler

2008-01-01

135

Query by humming: musical information retrieval in an audio database  

Microsoft Academic Search

The emergence of audio and video data types in databases will require new information retrieval methods adapted to the specific characteristics and needs of these data types. An ef- fective and natural way of querying a musical audio database is by humming the tune of a song. In this paper, a system for querying an audio database by humming is

Asif Ghias; Jonathan Logan; David Chamberlin; Brian C. Smith

1995-01-01

136

A Novel Steganalysis Algorithm of Phase Coding in Audio Signal  

Microsoft Academic Search

Audio steganalysis has attracted more attentions recently. Phase steganalysis is one of the most challenging research fields. In this paper, a novel algorithm to detect phase coding steganography in audio signal is proposed. It is based on analysis of the phase discontinuities, and can be described as follows. Firstly, it takes FFT transform of special segment of audio and unwraps

Wei Zeng; Haojun Ai; Ruimin Hu

2007-01-01

137

Live, Audio-Visual Communication Systems for Distance Learning  

E-print Network

of the experience of managers, tutors and learners who have used live, audio-visual communication systemsLive, Audio-Visual Communication Systems for Distance Learning: Experience, Heuristics and ISDN, WC1H 0AP, U.K. tel: (44) (0)71 387 7050 ext. 5315 fax: (44) (0)71 580 1100 #12;Submission Live, Audio-Visual

Finkelstein, Anthony

138

Music and audio - oh how they can stress your network  

Microsoft Academic Search

Nearly ten years ago a paper written by the Audio Engineering Society (AES)[1] made a number of interesting statements: 1. 2. The current Internet is inadequate for transmitting music and professional audio. Performance and collaboration across a distance stress beyond acceptable bounds the quality of service Audio and music provide test cases in which the bounds of the network are

R. Fletcher

2007-01-01

139

Dynamic Soundscape: mapping time to space for audio browsing  

Microsoft Academic Search

Browsing audio data is not as easy as browsing printed documents because of the temporal nature of sound. This paper presents a browsing environment that provides a spatial interface for temporal navigation of audio data, taking advantage of human abilities of simultaneous listening and memory of spatial location. Instead of fast-forwarding or rewinding, users browse the audio data by switching

Minoru Kobayashi; Chris Schmandt

1997-01-01

140

Compression Algorithms for Audio-Video Streaming  

Microsoft Academic Search

Live audio-video streaming has become popular over the years especially when it comes to communication between people living huge geographical areas apart and at a very cheap rate or even free. However, when it comes to places where bandwidth is low or limited, smooth streaming has become a serious challenge because after all bandwidth is a natural resource and is

Tarif Riyad Rahman; Miftahur Rahman

2010-01-01

141

Audio/Visual Ratios in Commercial Filmstrips.  

ERIC Educational Resources Information Center

Developed by the Westinghouse Electric Corporation, Video Audio Compressed (VIDAC) is a compressed time, variable rate, still picture television system. This technology made it possible for a centralized library of audiovisual materials to be transmitted over a television channel in very short periods of time. In order to establish specifications…

Gulliford, Nancy L.

142

Acceptance Inspection for Audio Cassette Recorders.  

ERIC Educational Resources Information Center

A series of inspections for cassette recorders that can be performed to assure that the devices are acceptable is described. The inspections can be completed in 20 minutes and can be performed by instructional personnel. The series of inspection procedures includes tests of the intelligibility of audio, physical condition, tape speed, impulse…

Smith, Edgar A.

143

Bose Learning Center - Audio Demonstrator Technology  

NSDL National Science Digital Library

Here's information about a new audio technology created at the Bose Corporation, a large manufacturer of sound equipment. The new technology lets the building managers of large places like arenas, auditoriums and outdoor stadiums, preview how the sound system will sound before it's installed.

Corporation, Bose

144

Spectral Frequency Tracking for Classifying Audio Signals  

Microsoft Academic Search

Taniguchi et al. proposed a sinusoidal decomposition framework for classifying audio sounds. In this framework, spectral tracking is important, yet still presents an unsolved problem, although it has been investigated for the purpose of sound synthesis or sound modification. Conventional methods developed for these purposes are either ad hoc and less computationally complex or not ad hoc but more computationally

Toru Taniguchi; Mikio Tohyama; Katsuhiko Shirai

2006-01-01

145

Estimation of recording location using audio watermarking  

Microsoft Academic Search

In this paper, we propose a novel application of audio watermarking, estimation of recording location. The purpose of the paper is to determine the seat location in a theater at which a bootleg recording was made by using a digital video camera. In the proposed application, we embed different watermarks in the channels of the multi-channel sound of the movie.

Yuta Nakashima; Ryuki Tachibana; Masafumi Nishimura; Noboru Babaguchi

2006-01-01

146

Automatic Musical Genre Classification of Audio Signals  

Microsoft Academic Search

Musical genres are categorical descriptions that are used to describe music. They are c ommonly used to structure the increasing amounts of music available in digital form on the Web and are important for music information retrieval. Genre ca tegorization for audio has traditionally been performed manually. A particular musical genre is characterized by statistical properties related to the instrumentation,

George Tzanetakis

2001-01-01

147

Audio Mining: The Next Big Thing?  

NSDL National Science Digital Library

To get an idea of what audio mining is and how it can be used, people can read this article from the Cutter Consortium. It lists six broad areas that can benefit from using the technology and briefly discusses each one: technical support centers and help desks, call centers, broadcast media, conference managers, intelligence gathering, law enforcement, and security operations.

2008-01-30

148

Lessons from Developing Audio HTML Interfaces  

Microsoft Academic Search

In this paper, we discuss our previous research on the estab- lishment of guidelines and principles for choosing sounds to use in an audio interface to HTML, called the AHA frame- work. These principles, along with issues related to the tar- get audience such as user tasks, goals, and interests are factors that can help us to choose specific sounds

Frankie James

149

Joint audio visual retrieval for tennis broadcasts  

Microsoft Academic Search

In recent years, there has been increasing work in the area of content retrieval for sports. The idea is generally to extract important events or create summaries to allow personalisation of the media stream. While previous work in sports analysis has employed either the audio or video stream to achieve some goal, there is little work that explores how much

Rozenn Dahyot; A. Kokaram; N. Rea; Hugh Denman

2003-01-01

150

Marble Track Audio Manipulator (MTAM): A Tangible User Interface for Audio Composition  

E-print Network

children to create musical compositions through constructive play. Our Marble Track Audio Manipulator (MTAM sound effects. To create musical compositions, children collaboratively build a marble tower system provides children with a creative, playful, and engaging encounter with music. Author Keywords

Jacob, Robert J.K.

151

Kicking Machine  

NSDL National Science Digital Library

In this design challenge activity, learners build a machine that kicks a Ping-Pong ball into a cup lying on its side 12 inches away. Learners use a pendulum, a rubber band, or a combination of the two to do this and test the accuracy of their machines by shooting a Ping-Pong ball into a cup. Educators can use this activity to demonstrate simple machines and kinetic/potential energy. This activity can be followed up with "Extreme Kicking Machine" which adds an even greater challenge into the mix.

Wgbh

2010-01-01

152

: Helmholtz machine estimation .  

E-print Network

: Helmholtz machine density estimation . . : . . . (supervised learning) , (active learning) (query learning) [1, 3]. . (unsupervised learning), . , [5]. . Helmholtz machine , . Helmholtz machine : Helmholtz machine [2] . Helmholtz machine (generative network) (recognition network) . , , . Helmholtz machine (self

153

Nonplanar machines  

SciTech Connect

This talk examines methods available to minimize, but never entirely eliminate, degradation of machine performance caused by terrain following. Breaking of planar machine symmetry for engineering convenience and/or monetary savings must be balanced against small performance degradation, and can only be decided on a case-by-case basis. 5 refs.

Ritson, D. (Stanford Linear Accelerator Center, Menlo Park, CA (USA))

1989-05-01

154

Electric machine  

DOEpatents

An interior permanent magnet electric machine is disclosed. The interior permanent magnet electric machine comprises a rotor comprising a plurality of radially placed magnets each having a proximal end and a distal end, wherein each magnet comprises a plurality of magnetic segments and at least one magnetic segment towards the distal end comprises a high resistivity magnetic material.

El-Refaie, Ayman Mohamed Fawzi (Niskayuna, NY); Reddy, Patel Bhageerath (Madison, WI)

2012-07-17

155

Simple Machines  

NSDL National Science Digital Library

How do you get a glove and a ball up to your tree house? One answer is to use a pulley. A pulley is a simple machine. In this original KET interactive, children learn about the basic workings of three simple machines.

Ket

2010-11-16

156

Excavating machines  

SciTech Connect

The excavating machine has a cutter carrying boom carried by a boom support member which can be swung about an axis extending in the direction of the roadway. The machine includes a cutter unit and a stay unit each of which is releasably anchorable in the roadway and each of which can be advanced relative to the other unit.

Plummer, D.

1980-10-21

157

Capacity-optimized mp2 audio watermarking  

NASA Astrophysics Data System (ADS)

Today a number of audio watermarking algorithms have been proposed, some of them at a quality making them suitable for commercial applications. The focus of most of these algorithms is copyright protection. Therefore, transparency and robustness are the most discussed and optimised parameters. But other applications for audio watermarking can also be identified stressing other parameters like complexity or payload. In our paper, we introduce a new mp2 audio watermarking algorithm optimised for high payload. Our algorithm uses the scale factors of an mp2 file for watermark embedding. They are grouped and masked based on a pseudo-random pattern generated from a secret key. In each group, we embed one bit. Depending on the bit to embed, we change the scale factors by adding 1 where necessary until it includes either more even or uneven scale factors. An uneven group has a 1 embedded, an even group a 0. The same rule is later applied to detect the watermark. The group size can be increased or decreased for transparency/payload trade-off. We embed 160 bits or more in an mp2 file per second without reducing perceived quality. As an application example, we introduce a prototypic Karaoke system displaying song lyrics embedded as a watermark.

Steinebach, Martin; Dittmann, Jana

2003-06-01

158

Synthecology: sound use of audio in teleimmersion  

NASA Astrophysics Data System (ADS)

This paper examines historical audio applications used to provide real-time immersive sound for CAVE TM environments and discusses their relative strengths and weaknesses. We examine and explain issues of providing spatialized sound immersion in real-time virtual environments (VEs), some problems with currently used sound servers, and a set of requirements for an 'ideal' sound server. We present the initial configuration of a new cross-platform sound server solution using open source software and the Open Sound Control (OSC) specification for the creation of real-time spatialized audio with CAVE applications, specifically Ygdrasil (Yg) environments. The application, aNother Sound Server (NSS) establishes an application interface (API) using OSC, a logical server layer implemented in Python, and an audio engine using SuperCollider (SC). We discuss spatialization implementation and other features. Finally, we document the Synthecology project which premiered at WIRED NEXTFEST 2005 and was the first VE to use NSS. We also discuss various techniques that enhance presence in networked VEs, as well as possible and planned extensions of NSS.

Baum, Geoffrey; Gotsis, Marientina; Chang, Benjamin; Drinkwater, Robb; St. Clair, Dan

2006-02-01

159

Scribbling Machines  

NSDL National Science Digital Library

In this activity, learners explore electronics and motion by making a Scribbling Machine, a motorized contraption that moves in unusual ways and leaves a mark to trace its path. Itâs made from simple materials and is based on the idea of motion created by an offset motor. Try using harvested motors and switches from discarded toys and electronics to make your Scribbling Machine - this not only keeps costs down, but is a playful and inventive way to explore how everyday objects work. To take the activity further, you can also incorporate PicoCrickets to make your Scribbling Machine more intelligent and to explore computers.

Exploratorium

2013-01-30

160

Machine therapy  

E-print Network

Machine Therapy is a new practice combining art, design, psychoanalysis, and engineering work in ways that access and reveal the vital, though often unnoticed, relevance of people's interactions and relationships with ...

Dobson, Kelly E. (Kelly Elizabeth), 1970-

2007-01-01

161

Math Machines  

NSDL National Science Digital Library

The mission of the Math Machines organization is to "improve the quality of mathematical education, enhance the transfer of mathematical thinking into other classes, and increase students' ability to apply rigorous mathematics outside the classroom." Their website supports a National Science Foundation ATE grant-supported project designed to improve teaching in the areas of Mathematics, Science, and Technology at the high school and college levels. This improved learning results from using math, science, and technology principles to build and control various machines such as pointers and robots or "math machines", which are simple devices that provide an immediate, physical, dynamic expression to abstract mathematical equations. The website provides information links on Educational Theory, Classroom Activities, Project Workshops, Calculators & Programs, and Machine Construction Instructions for Building: Closed Circuits, Servo Motors, Controllers, Robot Boards and more. There is also contact information, an FAQ section, as well as upcoming events.

2010-05-18

162

Monel Machining  

NASA Technical Reports Server (NTRS)

Castle Industries, Inc. is a small machine shop manufacturing replacement plumbing repair parts, such as faucet, tub and ballcock seats. Therese Castley, president of Castle decided to introduce Monel because it offered a chance to improve competitiveness and expand the product line. Before expanding, Castley sought NERAC assistance on Monel technology. NERAC (New England Research Application Center) provided an information package which proved very helpful. The NASA database was included in NERAC's search and yielded a wealth of information on machining Monel.

1983-01-01

163

Learning bimodal structure in audio-visual data.  

PubMed

A novel model is presented to learn bimodally informative structures from audio-visual signals. The signal is represented as a sparse sum of audio-visual kernels. Each kernel is a bimodal function consisting of synchronous snippets of an audio waveform and a spatio-temporal visual basis function. To represent an audio-visual signal, the kernels can be positioned independently and arbitrarily in space and time. The proposed algorithm uses unsupervised learning to form dictionaries of bimodal kernels from audio-visual material. The basis functions that emerge during learning capture salient audio-visual data structures. In addition, it is demonstrated that the learned dictionary can be used to locate sources of sound in the movie frame. Specifically, in sequences containing two speakers, the algorithm can robustly localize a speaker even in the presence of severe acoustic and visual distracters. PMID:19963447

Monaci, Gianluca; Vandergheynst, Pierre; Sommer, Friedrich T

2009-12-01

164

The Digital Audio Editor as a Teaching and Laboratory Tool  

NASA Astrophysics Data System (ADS)

Digital audio editors such as Software Audio Workshop and Cool Edit Pro are powerful tools used in the radio and audio recording fields for editing digital audio. However, they are also powerful tools in the physics classroom and laboratory. During this presentation the author will show how a digital audio editor, combined with a library of audio .wav files produced by the author as part of sabbatical work, can be used to: 1. demonstrate quantitatively and qualitatively the relationship between the decibel, sound intensity, and loudness perception, 2. demonstrate quantitatively and qualitatively the relationship between frequency and pitch perception, 3. perform additive and subtractive sound synthesis, 4. demonstrate comb filtering, 5. demonstrate constructive and destructive interference, and 6. turn the computer into an accurate signal generator (sine wave, square wave, etc.) with a frequency resolution of 1Hz. Availability of the required software and .wav file library will also be discussed.

Latta, Gregory

2001-10-01

165

Flexible Scheduling for DataFlow Audio Processing  

Microsoft Academic Search

The notions of audio and control rate have been a pervasive feature of audio programming languages and environments. Real-time computer music systems depend on schedulers to coordinate and order the execution of many tasks over the course of time. In this paper we describe the scheduling in- frastructure of Marsyas-0.2, an open source framework for audio analysis and synthesis. We

Neil Burroughs; Adam Parkin; George Tzanetakis

166

A general audio classifier based on human perception motivated model  

Microsoft Academic Search

The audio channel conveys rich clues for content-based multimedia indexing. Interesting audio analysis includes, besides widely\\u000a known speech recognition and speaker identification problems, speech\\/music segmentation, speaker gender detection, special\\u000a effect recognition such as gun shots or car pursuit, and so on. All these problems can be considered as an audio classification\\u000a problem which needs to generate a label from low

Hadi Harb; Liming Chen

2007-01-01

167

Structured audio: creation, transmission, and rendering of parametric sound representations  

Microsoft Academic Search

Structured audio representations are semantic and symbolic descriptions that are useful for ultralow-bit-rate transmission, flexible synthesis, and perceptually based manipulation and retrieval of sound. We present an overview of techniques for transmitting and synthesizing sound represented in structured format, and for creating structured representations from audio waveforms. We discuss applications for structured audio in virtual environments, music synthesis, gaming, content-based

BARRY L. VERCOE; WILLIAM G. GARDNER; ERIC D. SCHEIRER

1998-01-01

168

BOOSTED BINARY AUDIO FINGERPRINT BASED ON SPECTRAL SUBBAND MOMENTS Sungwoong Kim and Chang D. Yoo  

E-print Network

BOOSTED BINARY AUDIO FINGERPRINT BASED ON SPECTRAL SUBBAND MOMENTS Sungwoong Kim and Chang D. Yoo@kaist.ac.kr, cdyoo@ee.kaist.ac.kr ABSTRACT An audio fingerprinting system identifies an audio based on a unique feature vector called the audio fingerprint. The performance of an audio fingerprinting system is directly

Yoo, Chang D.

169

Audio scene segmentation for video with generic content  

NASA Astrophysics Data System (ADS)

In this paper, we present a content-adaptive audio texture based method to segment video into audio scenes. The audio scene is modeled as a semantically consistent chunk of audio data. Our algorithm is based on "semantic audio texture analysis." At first, we train GMM models for basic audio classes such as speech, music, etc. Then we define the semantic audio texture based on those classes. We study and present two types of scene changes, those corresponding to an overall audio texture change and those corresponding to a special "transition marker" used by the content creator, such as a short stretch of music in a sitcom or silence in dramatic content. Unlike prior work using genre specific heuristics, such as some methods presented for detecting commercials, we adaptively find out if such special transition markers are being used and if so, which of the base classes are being used as markers without any prior knowledge about the content. Our experimental results show that our proposed audio scene segmentation works well across a wide variety of broadcast content genres.

Niu, Feng; Goela, Naveen; Divakaran, Ajay; Abdel-Mottaleb, Mohamed

2008-01-01

170

On Steganography in Lost Audio Packets  

E-print Network

The paper presents a new hidden data insertion procedure based on estimated probability of the remaining time of the call for steganographic method called LACK (Lost Audio PaCKets steganography). LACK provides hidden communication for real-time services like Voice over IP. The analytical results presented in this paper concern the influence of LACK's hidden data insertion procedures on the method's impact on quality of voice transmission and its resistance to steganalysis. The proposed hidden data insertion procedure is also compared to previous steganogram insertion approach based on estimated remaining average call duration.

Mazurczyk, Wojciech; Szczypiorski, Krzysztof

2011-01-01

171

ABC News: Video and Audio Newsclips  

NSDL National Science Digital Library

ABC News has added a section of video and audio newsclips to its news service at the GO Network, InfoSeek Corporation's Internet portal. Users can see and listen to national headline news, such as a clip from Warren Beatty's speech at an awards dinner Wednesday night (sounding rather presidential). They can also search for additional video files using Videosearch, by Virage. Beatty as a search term turned up a clip about the Clinton family's summer vacation on Martha's Vineyard that included a mention of Beatty's presidential aspirations and opinions on the Democratic Party, but no additional pictures of Beatty.

172

Audio-visual speech perception is special.  

PubMed

In face-to-face conversation speech is perceived by ear and eye. We studied the prerequisites of audio-visual speech perception by using perceptually ambiguous sine wave replicas of natural speech as auditory stimuli. When the subjects were not aware that the auditory stimuli were speech, they showed only negligible integration of auditory and visual stimuli. When the same subjects learned to perceive the same auditory stimuli as speech, they integrated the auditory and visual stimuli in a similar manner as natural speech. These results demonstrate the existence of a multisensory speech-specific mode of perception. PMID:15833302

Tuomainen, Jyrki; Andersen, Tobias S; Tiippana, Kaisa; Sams, Mikko

2005-05-01

173

Audio and Podcasts: The Poetry Foundation  

NSDL National Science Digital Library

The Poetry Foundation has a myriad of wonderful resources for the lover of quatrains, hyperbole, or iambic pentameter. This corner of its site houses audio and podcasts in one convenient locale. The Poetry Off the Shelf section contains recent conversations with poets Edward Hirsch, Nathaniel Mackey, Robert Duncan, and others. Moving on, the Poem of the Day features a number of lovely works, such as "Horseflies" and "I go back to May 1937.â There are six other sections here, including the Poetry Radio Project and Avant-garde All the Time. Additionally, users can sign up to receive updates when new works are added to the site.

174

Quantization and psychoacoustic model in audio coding in advanced audio coding  

NASA Astrophysics Data System (ADS)

This paper presents complete optimized architecture of Advanced Audio Coder quantization with Huffman coding. After that psychoacoustic model theory is presented and few algorithms described: standard Two Loop Search, its modifications, Genetic, Just Noticeable Level Difference, Trellis-Based and its modification: Cascaded Trellis-Based Algorithm.

Brzuchalski, Grzegorz

2011-10-01

175

Instructional Audio Guidelines: Four Design Principles to Consider for Every Instructional Audio Design Effort  

ERIC Educational Resources Information Center

This article contends that instructional designers and developers should attend to four particular design principles when creating instructional audio. Support for this view is presented by referencing the limited research that has been done in this area, and by indicating how and why each of the four principles is important to the design process.…

Carter, Curtis W.

2012-01-01

176

Audio Adapted Assessment Data: Does the Addition of Audio to Written Items Modify the Item Calibration?  

ERIC Educational Resources Information Center

This dissertation research examined the changes in item RIT calibration that occurred when adding audio to a set of currently calibrated RIT items and then placing these new items as field test items in the modified assessments on the NWEA MAP test platform. The researcher used test results from over 600 students in the Poway School District in…

Snyder, James

2010-01-01

177

Investigating the impact of audio instruction and audio-visual biofeedback for lung cancer radiation therapy  

NASA Astrophysics Data System (ADS)

Lung cancer accounts for 13% of all cancers in the Unites States and is the leading cause of deaths among both men and women. The five-year survival for lung cancer patients is approximately 15%.(ACS facts & figures) Respiratory motion decreases accuracy of thoracic radiotherapy during imaging and delivery. To account for respiration, generally margins are added during radiation treatment planning, which may cause a substantial dose delivery to normal tissues and increase the normal tissue toxicity. To alleviate the above-mentioned effects of respiratory motion, several motion management techniques are available which can reduce the doses to normal tissues, thereby reducing treatment toxicity and allowing dose escalation to the tumor. This may increase the survival probability of patients who have lung cancer and are receiving radiation therapy. However the accuracy of these motion management techniques are inhibited by respiration irregularity. The rationale of this thesis was to study the improvement in regularity of respiratory motion by breathing coaching for lung cancer patients using audio instructions and audio-visual biofeedback. A total of 331 patient respiratory motion traces, each four minutes in length, were collected from 24 lung cancer patients enrolled in an IRB-approved breathing-training protocol. It was determined that audio-visual biofeedback significantly improved the regularity of respiratory motion compared to free breathing and audio instruction, thus improving the accuracy of respiratory gated radiotherapy. It was also observed that duty cycles below 30% showed insignificant reduction in residual motion while above 50% there was a sharp increase in residual motion. The reproducibility of exhale based gating was higher than that of inhale base gating. Modeling the respiratory cycles it was found that cosine and cosine 4 models had the best correlation with individual respiratory cycles. The overall respiratory motion probability distribution function could be approximated to a normal distribution function. A statistical analysis was also performed to investigate if a patient's physical, tumor or general characteristics played a role in identifying whether he/she responded positively to the coaching type---signified by a reduction in the variability of respiratory motion. The analysis demonstrated that, although there were some characteristics like disease type and dose per fraction that were significant with respect to time-independent analysis, there were no significant time trends observed for the inter-session or intra-session analysis. Based on patient feedback with the existing audio-visual biofeedback system used for the study and research performed on other feedback systems, an improved audio-visual biofeedback system was designed. It is hoped the widespread clinical implementation of audio-visual biofeedback for radiotherapy will improve the accuracy of lung cancer radiotherapy.

George, Rohini

178

17 CFR 232.304 - Graphic, image, audio and video material.  

Code of Federal Regulations, 2011 CFR

... false Graphic, image, audio and video material. 232.304 Section 232...232.304 Graphic, image, audio and video material. (a) If a filer includes graphic, image, audio or video material in a document...

2011-04-01

179

17 CFR 232.304 - Graphic, image, audio and video material.  

Code of Federal Regulations, 2012 CFR

... false Graphic, image, audio and video material. 232.304 Section 232...232.304 Graphic, image, audio and video material. (a) If a filer includes graphic, image, audio or video material in a document...

2012-04-01

180

17 CFR 232.304 - Graphic, image, audio and video material.  

Code of Federal Regulations, 2013 CFR

... false Graphic, image, audio and video material. 232.304 Section 232...232.304 Graphic, image, audio and video material. (a) If a filer includes graphic, image, audio or video material in a document...

2013-04-01

181

17 CFR 232.304 - Graphic, image, audio and video material.  

... false Graphic, image, audio and video material. 232.304 Section 232...232.304 Graphic, image, audio and video material. (a) If a filer includes graphic, image, audio or video material in a document...

2014-04-01

182

17 CFR 232.304 - Graphic, image, audio and video material.  

Code of Federal Regulations, 2010 CFR

... false Graphic, image, audio and video material. 232.304 Section 232...232.304 Graphic, image, audio and video material. (a) If a filer includes graphic, image, audio or video material in a document...

2010-04-01

183

37 CFR 201.28 - Statements of Account for digital audio recording devices or media.  

Code of Federal Regulations, 2013 CFR

...Account for digital audio recording devices or media. 201.28 Section 201.28 Patents...Account for digital audio recording devices or media. (a) General. This section...recording device or digital audio recording medium. (b) Definitions. For...

2013-07-01

184

37 CFR 201.28 - Statements of Account for digital audio recording devices or media.  

Code of Federal Regulations, 2012 CFR

...Account for digital audio recording devices or media. 201.28 Section 201.28 Patents...Account for digital audio recording devices or media. (a) General. This section...recording device or digital audio recording medium. (b) Definitions. For...

2012-07-01

185

37 CFR 201.28 - Statements of Account for digital audio recording devices or media.  

Code of Federal Regulations, 2011 CFR

...Account for digital audio recording devices or media. 201.28 Section 201.28 Patents...Account for digital audio recording devices or media. (a) General. This section...recording device or digital audio recording medium. (b) Definitions. For...

2011-07-01

186

Workout Machine  

NASA Technical Reports Server (NTRS)

The Orbotron is a tri-axle exercise machine patterned after a NASA training simulator for astronaut orientation in the microgravity of space. It has three orbiting rings corresponding to roll, pitch and yaw. The user is in the middle of the inner ring with the stomach remaining in the center of all axes, eliminating dizziness. Human power starts the rings spinning, unlike the NASA air-powered system. Marketed by Fantasy Factory (formerly Orbotron, Inc.), the machine can improve aerobic capacity, strength and endurance in five to seven minute workouts.

1995-01-01

187

Culture Machine  

NSDL National Science Digital Library

Culture Machine is a new, refereed, electronic journal encompassing cultural studies and cultural theory. The international editorial board of the interactive journal aims to "generat[e] research in culture and theory" by promoting and publishing "the most provocative of new work." The theme of the inaugural issue is Taking Risks with the Future. Content includes articles such as Life After Death of the Text by Johan Fornas, Cultural Studies in the Clouds: Mourning for Detail by Tadeusz Slawek, and The Future States of Politics by Kenneth Surin. Culture Machine is hosted by the University of Teesside, England.

188

Audio-Assisted Video Browsing for DVD Recorders  

Microsoft Academic Search

We present an audio-assisted video browsing system for a Hard Disk Drive (HDD) enhanced DVD recorder. We focus on our sports highlights extraction based on audio classification. We have systematically established that sports highlights are indicated by the presence of audience reaction such as cheering, applause and the commentators excited speech. That enables us to develop a common highlights extraction

Ajay Divakaran; Isao Otsuka; Regunathan Radhakrishnan; Kazuhiko Nakane; Masaharu Ogawa

2004-01-01

189

SOLAR: Sound Object Localization and Retrieval in Complex Audio Environments  

Microsoft Academic Search

The ability to identify sounds in complex audio environ- ments is highly useful for multimedia retrieval, security, and many mobile robotic applications, but very little work has been done in this area. We present the SOLAR sys- tem, a system capable of finding sound objects, such as dog barks or car horns, in complex audio data extracted from movies. SOLAR

Derek Hoiem; Yan Ke; Rahul Sukthankar

190

SOLAR: sound object localization and retrieval in complex audio environments  

Microsoft Academic Search

The ability to identify sounds in complex audio environments is highly useful for multimedia retrieval, security, and many mobile robotic applications, but very little work has been done in this area. We present the SOLAR system, a system capable of finding sound objects, such as dog barks or car horns, in complex audio data extracted from movies. SOLAR avoids the

Derek Hoiem; Yan Ke; Rahul Sukthankar

2005-01-01

191

AME CURRICULUM GUIDE SPRING 2014 AUDIO AND MUSIC ENGINEERING  

E-print Network

engineering extends far beyond the traditional music recording industry. Career paths for program graduatesAME CURRICULUM GUIDE SPRING 2014 AUDIO AND MUSIC ENGINEERING THE HAJIM SCHOOL OF ENGINEERING & Curricula AT A GLANCE Programs 5-10 Audio and Music Engineering Admission 4 Declaring an AME major Courses

Mahon, Bradford Z.

192

How Text and Audio Chat Change the Online Video Experience  

E-print Network

with others while watching videos. How does chatting affect their experience? Would audio chat be more fun living rooms, and in bars and clubs, we can enjoy watching television shows and movies in the noisyHow Text and Audio Chat Change the Online Video Experience Justin D. Weisz Computer Science

Kiesler, Sara

193

Content-Based Retrieval of Music and Audio  

Microsoft Academic Search

Though many systems exist for content-based retrieval of images, little work has been done on the audio portionof the multimedia stream. This paper presents a system to retrieve audio documents by acoustic similarity. Thesimilarity measure is based on statistics derived from a supervised vector quantizer, rather than matching simple pitchor spectral characteristics. The system is thus able to learn distinguishing

Jonathan T. Foote

1997-01-01

194

Visualizing music and audio using self-similarity  

Microsoft Academic Search

This paper presents a novel approach to visualizing the time structure of music and audio. The acoustic similarity between any two instants of an audio recording is displayed in a 2D representation, allowing identification of structural and rhythmic characteristics. Examples are presented for classical and popular music. Applications include content-based analysis and segmentation, as well as tempo and structure extraction.

Jonathan Foote

1999-01-01

195

A Case Study on Audio Feedback with Geography Undergraduates  

ERIC Educational Resources Information Center

Several small-scale studies have suggested that audio feedback can help students to reflect on their learning and to develop deep learning approaches that are associated with higher attainment in assessments. For this case study, Geography undergraduates were given audio feedback on a written essay assignment, alongside traditional written…

Rodway-Dyer, Sue; Knight, Jasper; Dunne, Elizabeth

2011-01-01

196

Audio Interface as a Device for Physical Computing  

Microsoft Academic Search

In this paper, we would like to describe the employment of audio interface as a device for physical computing. We compare the audio interface with other devices and describe its characteristics. We also present examples of the employment with three different art works, Monalisa \\

Kazuhiro Jo

2008-01-01

197

Packet Audio Playout Delay Adjustment: Performance Bounds and Algorithms \\Lambda  

E-print Network

compute upper and lower bounds (which are shown to be tight for the range of loss and delay values and performs close to the theoretical optimum over a range of parameter values of interest. Keywords is typically divided into ``talkspurts'' (periods of audio activity) and ``silence periods'' (periods of audio

Massachusetts at Amherst, University of

198

Automatic Commercial Monitoring for TV Broadcasting Using Audio Fingerprinting  

E-print Network

Automatic Commercial Monitoring for TV Broadcasting Using Audio Fingerprinting Dalwon Jang1, an automatic commercial monitoring system using audio fingerprinting is proposed. The goal of the commercial monitoring system is to identify the title and the exact duration of commercials in real-time. To achieve

Yoo, Chang D.

199

Exposing audio data to the web: an API and prototype  

Microsoft Academic Search

The HTML5 specification introduces the audio and video media elements, and with them the opportunity to change the way media is integrated on the web. The current HTML5 media API provides ways to play and get limited information about audio and video, but no way to programatically access or create such media. In this paper we present an enhanced API

David Humphrey; Corban Brook; Alistair MacDonald

2010-01-01

200

AUDIO FINGERPRINT IDENTIFICATION BY APPROXIMATE STRING Jerome Lebosse  

E-print Network

AUDIO FINGERPRINT IDENTIFICATION BY APPROXIMATE STRING MATCHING Jerome Lebosse France Telecom R boulevard du Marchal Juin 14050 Caen, FRANCE luc.brun@greyc.ensicaen.fr ABSTRACT An audio fingerprint first presents a fingerprint extraction algorithm. The identification task is performed by a new

Paris-Sud XI, Université de

201

Loss Concealment for Multi-Channel Streaming Audio Rishi Sinha  

E-print Network

such as Internet2, high quality uncompressed transmission of multi-channel audio streams has become possible Loss concealment, immersive audio, streaming, real-time, multi- channel. 1. INTRODUCTION The Internet in the commodity Internet, such experiments abound in Internet2, a high performance network linking together

Papadopoulos, Christos

202

Overcoming Asynchrony in Audio-Visual Speech Recognition  

E-print Network

. INTRODUCTION Visual information can improve the performance of audio speech recognition systems, specially, as the visual information can help discern sounds easily confusable by ear but distinguishable by eye of visible articulatory features as lips, teeth and tongue [2]. The statistical models commonly used in Audio

Soatto, Stefano

203

Audio Watermarking Quality Evaluation: Robustness to DA\\/AD Processes  

Microsoft Academic Search

Audio watermarking has become an important technology for recording and advertisement industry. Today's embedding and detection strategies often rely on digital or at least high quality channels, like CD or mp3. However for numerous applications, watermarks surviving noisy analogue environments are better suited or even necessary. Application of such schemes allows to identify audio signals with monitoring devices without being

Martin Steinebach; Andreas Lang; Jana Dittmann; Christian Neubauer

2002-01-01

204

MUSICAL GENRE CLASSIFICATION OF AUDIO DATA USING SOURCE SEPARATION TECHNIQUES  

Microsoft Academic Search

We propose a two-step, audio feature-based musical genre classification methodology. First, we identify and separate the various musical instrument sources in the audio signal, using the convolutive sparse coding algorithm. Next, we extract classification features from the separated signals that correspond to distinct musical instrument sources. The methodology is evaluated and its performance is assessed.

P. S. Lampropoulou; A. S. Lampropoulos; G. A. Tsihrintzis

205

Representation of sound fields for audio recording and reproduction  

E-print Network

Representation of sound fields for audio recording and reproduction F. M. Fazia , M. Noisternigb of sound fields for audio recording and reproduction F. M. Fazia , M. Noisternigb and O. Warusfelb and loudspeaker arrays are often used for the recording and reproduction of a given sound field. A number

Paris-Sud XI, Université de

206

Reinforcement Learning of Listener Response for Mood Classification of Audio  

Microsoft Academic Search

This paper describes a method of applying a reinforcement learning artificial intelligence to categorize audio files by mood based on listener response during a performance. The system discussed is implemented in a performance art environment designed to present the moods of multiple participants simultaneously in a room via a diffusion of representative audio samples.

Jack Stockholm; Philippe Pasquier

2009-01-01

207

Selected Audio-Visual Materials for Consumer Education. [New Version.  

ERIC Educational Resources Information Center

Ninety-two films, filmstrips, multi-media kits, slides, and audio cassettes, produced between 1964 and 1974, are listed in this selective annotated bibliography on consumer education. The major portion of the bibliography is devoted to films and filmstrips. The main topics of the audio-visual materials include purchasing, advertising, money…

Johnston, William L.

208

Audio Design: Creating Multi-sensory Images for the Mind.  

ERIC Educational Resources Information Center

Explores the concept of "theater of the mind" and discusses design factors in creating audio works that effectively stimulate mental pictures, including: narrative format in audio scripting; qualities of voice; use of concrete language; music; noise versus silence; and the creation of the illusion of space using monaural, stereophonic, and…

Ferrington, Gary

1994-01-01

209

Effect of Audio vs. Video on Aural Discrimination of Vowels  

ERIC Educational Resources Information Center

Despite the growing use of media in the classroom, the effects of using of audio versus video in pronunciation teaching has been largely ignored. To analyze the impact of the use of audio or video training on aural discrimination of vowels, 61 participants (all students at a large American university) took a pre-test followed by two training…

McCrocklin, Shannon

2012-01-01

210

Getting Started with CD Audio in HyperCard.  

ERIC Educational Resources Information Center

This article examines the use of the Voyager Compact Disk (CD) AudioStack to provide HyperCard stacks designed to promote language learning with the ability to play on common precisely specified portions of off-the-shelf audio compact disks in a CD-ROM drive. Four German and Russian HyperCard stacks are described and their construction outlined.…

Decker, Donald A.

1992-01-01

211

The Effect of Audio and Animation in Multimedia Instruction  

ERIC Educational Resources Information Center

This study investigated the effects of audio, animation, and spatial ability in a multimedia computer program for high school biology. Participants completed a multimedia program that presented content by way of text or audio with lean text. In addition, several instructional sequences were presented either with static illustrations or animations.…

Koroghlanian, Carol; Klein, James D.

2004-01-01

212

The Use of Audio and Animation in Computer Based Instruction.  

ERIC Educational Resources Information Center

This study investigated the effects of audio, animation, and spatial ability in a computer-based instructional program for biology. The program presented instructional material via test or audio with lean text and included eight instructional sequences presented either via static illustrations or animations. High school students enrolled in a…

Koroghlanian, Carol; Klein, James D.

213

History Calls: Delivering Automated Audio Tours to Visitors' Cell Phones  

Microsoft Academic Search

Many museums around the world rent audio players to their visitors to provide automated tours delivering pre-recorded information about their exhibits. Though generally pleased with their patrons' responses to automated audio tours museum administrators find that hosting them can be expensive, time consuming, and frustrating. Ongoing advances in mobile wireless technology provide an alternative to the current cumbersome method of

Matthew Nickerson

2005-01-01

214

Simulating Turing Machines on Maurer Machines  

E-print Network

Simulating Turing Machines on Maurer Machines J.A. Bergstra1,2 and C.A. Middelburg3 1 Programming MB Eindhoven, the Netherlands keesm@win.tue.nl Abstract. Maurer machines are much closer to real computers than Turing machines. Computer instructions play a prominent part in Mau- rer machines. We show

van der Zwaag, Mark

215

Decoding Machine  

NSDL National Science Digital Library

In this math lesson, learners explore variables and their uses. Learners pretend to be FBI agents and make a TOP SECRET tool that enables them to decode and find the values of hidden messages and words. Learners make their simple "decoding machines" out of paper and tape.

Lessonplans, Utah

2012-10-22

216

Leonardo's Machines  

NSDL National Science Digital Library

This Science NetLinks lesson will highlight some of Leonardo da Vinci's futuristic inventions, introducing the elements of machines. Through the use of student interactives, this lesson provides an interesting context in which students can begin to develop their ideas about the uniqueness of humans in the natural world.

Science Netlinks;

2003-03-05

217

Function Machine  

NSDL National Science Digital Library

This Java applet allows learners to explore simple linear functions. Students determine the algebraic form of a linear equation by entering inputs into the machine and by looking for patterns in the outputs. The function rules available are: integers from -10 to 10 are either added to, subtracted from, or multiplied by the input x to yield the output y.

2011-01-01

218

From Edison toFrom Edison to ViterbiViterbi Audio Players and Recorders:Audio Players and Recorders  

E-print Network

From Edison toFrom Edison to ViterbiViterbi Audio Players and Recorders:Audio Players and Recorders at Work #12;Mechanics of Phonograph #12;Recording Sessions #12;Recording Sessions Note recording horn #12 Martinville, Paris, France · In 1857 marked sound waves on a blackened drum. · Could record sound

Southern California, University of

219

Horatio Audio-Describes Shakespeare's "Hamlet": Blind and Low-Vision Theatre-Goers Evaluate an Unconventional Audio Description Strategy  

ERIC Educational Resources Information Center

Audio description (AD) has been introduced as one solution for providing people who are blind or have low vision with access to live theatre, film and television content. However, there is little research to inform the process, user preferences and presentation style. We present a study of a single live audio-described performance of Hart House…

Udo, J. P.; Acevedo, B.; Fels, D. I.

2010-01-01

220

Drilling Machines: Vocational Machine Shop.  

ERIC Educational Resources Information Center

The lessons and supportive information in this field tested instructional block provide a guide for teachers in developing a machine shop course of study in drilling. The document is comprised of operation sheets, information sheets, and transparency masters for 23 lessons. Each lesson plan includes a performance objective, material and tools,…

Thomas, John C.

221

Machine Learning  

NASA Astrophysics Data System (ADS)

The purpose of this chapter is to present fundamental ideas and techniques of machine learning suitable for the field of this book, i.e., for automated scientific discovery. The chapter focuses on those symbolic machine learning methods, which produce results that are suitable to be interpreted and understood by humans. This is particularly important in the context of automated scientific discovery as the scientific theories to be produced by machines are usually meant to be interpreted by humans. This chapter contains some of the most influential ideas and concepts in machine learning research to give the reader a basic insight into the field. After the introduction in Sect. 1, general ideas of how learning problems can be framed are given in Sect. 2. The section provides useful perspectives to better understand what learning algorithms actually do. Section 3 presents the Version space model which is an early learning algorithm as well as a conceptual framework, that provides important insight into the general mechanisms behind most learning algorithms. In section 4, a family of learning algorithms, the AQ family for learning classification rules is presented. The AQ family belongs to the early approaches in machine learning. The next, Sect. 5 presents the basic principles of decision tree learners. Decision tree learners belong to the most influential class of inductive learning algorithms today. Finally, a more recent group of learning systems are presented in Sect. 6, which learn relational concepts within the framework of logic programming. This is a particularly interesting group of learning systems since the framework allows also to incorporate background knowledge which may assist in generalisation. Section 7 discusses Association Rules - a technique that comes from the related field of Data mining. Section 8 presents the basic idea of the Naive Bayesian Classifier. While this is a very popular learning technique, the learning result is not well suited for human comprehension as it is essentially a large collection of probability values. In Sect. 9, we present a generic method for improving accuracy of a given learner by generatingmultiple classifiers using variations of the training data. While this works well in most cases, the resulting classifiers have significantly increased complexity and, hence, tend to destroy the human readability of the learning result that a single learner may produce. Section 10 contains a summary, mentions briefly other techniques not discussed in this chapter and presents outlook on the potential of machine learning in the future.

Hoffmann, Achim; Mahidadia, Ashesh

222

The Fields Institute: Lecture Audio and Slides  

NSDL National Science Digital Library

The Fields Institute for Research in Mathematical Sciences aims to "enhance mathematical activity in Canada by bringing together mathematicians from Canada and abroad, and by promoting contact and collaboration between professional mathematicians and the increasing numbers of users of mathematics." They support research in pure and applied mathematics, statistics and computer science, as well as collaborative projects between mathematicians and those applying mathematics in areas such as engineering, the physical and biological sciences, medicine, economics and finance, telecommunications and information systems. They offer this website with audio files and slides from events and lectures at the Fields Institute. The lectures, given by scientists from around the world, address such topics as Quantitative Finance, String Theory, Homological Algebra, Combinatorics, and much more. The files are organized by academic year and series title. In cases where the files are not available to download, they provide information on how to obtain the files.

223

Illinois State Museum: Audio-Video Barn  

NSDL National Science Digital Library

If you want to explore the world of agriculture in Illinois, you should make a beeline for this interesting and thoughtful website. The Audio-Video Barn is a collaborative project designed by the Illinois State Museum, working in partnership with other local institutions and with funding from the Institute of Museum and Library Services. The project is a logical outgrowth of the Museum's "longstanding interest in human interactions with the natural world." So step right into the "barn" and listen to oral history interviews from the 1950s to the 1990s, locate interviews from a state-wide map, or look over the "User's Guide" for navigation tips. Visitors shouldn't miss the "Sit-Down Interviews" area, as they can just scan through photos and select an interviewee who looks interesting. To get started, visitors should check out some of the "Stories from the Barn", such as "My Father the Great Reader" and "Making Rails".

224

Spatial audio through a bone conduction interface.  

PubMed

Headphones are the standard presentation device for radio communication in the military. Although bone conduction devices possess several advantages over headphones for some military applications, they are generally considered inappropriate for inclusion in a multi-channel system. The current study tested the feasibility of a multi-channel bone conduction system by measuring the localizability of spatialized auditory stimuli presented through a pair of bone conduction vibrators. Listeners localized a Gaussian noise stimulus spatialized with individualized head-related transfer functions (HRTFs). The sounds were presented from eight virtual locations on the horizontal plane (0, +/-45, +/-90, +/-135, and 180 degrees ) through either stereo headphones or a stereo bone conduction system. Localization performance was found to be nearly identical for both audio systems, indicating that bone conduction systems can be effectively used for displaying spatial information. PMID:17062501

MacDonald, Justin A; Henry, Paula P; Letowski, Tomasz R

2006-10-01

225

Audio-visual event detection based on mining of semantic audio-visual labels  

Microsoft Academic Search

Removing commercials from television programs is a much sought-after feature for a personal video recorder. In this paper, we employ an unsupervised clustering scheme (CM Detect) to detect commercials in television programs. Each program is first divided into Ws-minute chunks, and we extract audio and visual features from each of these chunks. Next, we apply k-means clustering to assign each

King-Shy Goh; Koji Miyahara; Regunathan Radhakrishnan; Ziyou Xiong; Ajay Divakaran

2004-01-01

226

Fullerene Machines  

NASA Technical Reports Server (NTRS)

Fullerenes possess remarkable properties and many investigators have examined the mechanical, electronic and other characteristics of carbon SP2 systems in some detail. In addition, C-60 can be functionalized with many classes of molecular fragments and we may expect the caps of carbon nanotubes to have a similar chemistry. Finally, carbon nanotubes have been attached to t he end of scanning probe microscope (Spill) tips. Spills can be manipulated with sub-angstrom accuracy. Together, these investigations suggest that complex molecular machines made of fullerenes may someday be created and manipulated with very high accuracy. We have studied some such systems computationally (primarily functionalized carbon nanotube gears and computer components). If such machines can be combined appropriately, a class of materials may be created that can sense their environment, calculate a response, and act. The implications of such hypothetical materials are substantial.

Globus, Al; Saini, Subhash (Technical Monitor)

1998-01-01

227

Fullerene Machines  

NASA Technical Reports Server (NTRS)

Recent computational efforts at NASA Ames Research Center and computation and experiment elsewhere suggest that a nanotechnology of machine phase functionalized fullerenes may be synthetically accessible and of great interest. We have computationally demonstrated that molecular gears fashioned from (14,0) single-walled carbon nanotubes and benzyne teeth should operate well at 50-100 gigahertz. Preliminary results suggest that these gears can be cooled by a helium atmosphere and a laser motor can power fullerene gears if a positive and negative charge have been added to form a dipole. In addition, we have unproven concepts based on experimental and computational evidence for support structures, computer control, a system architecture, a variety of components, and manufacture. Combining fullerene machines with the remarkable mechanical properties of carbon nanotubes, there is some reason to believe that a focused effort to develop fullerene nanotechnology could yield materials with tremendous properties.

Globus, Al; Saini, Subhash

1998-01-01

228

Simple Machines  

NSDL National Science Digital Library

Can you identify the six types of simple machines? 1. What do you know about Inclined Planes? Draw an example on your graphic organizer and state one fact.Inclined Plane 2. What do you know about levers? Draw an example on your graphic organizer and state one fact.Lever. 3. What do you know about pulleys? Draw an example on your graphic organizer and ...

Stewart, Miss

2010-03-24

229

Induction machine  

DOEpatents

A polyphase rotary induction machine for use as a motor or generator utilizing a single rotor assembly having two series connected sets of rotor windings, a first stator winding disposed around the first rotor winding and means for controlling the current induced in one set of the rotor windings compared to the current induced in the other set of the rotor windings. The rotor windings may be wound rotor windings or squirrel cage windings.

Owen, Whitney H. (Ogden, UT)

1980-01-01

230

A Generalized Deletion Machine  

E-print Network

In this work we prescribe a more generalized quantum-deleting machine (input state dependent). The fidelity of deletion is dependent on some machine parameters such that on alteration of machine parameters we get back to standard deleting machines. We also carried out a various comparative study of various kinds of quantum deleting machines. We also plotted graphs, making a comparative study of fidelity of deletion of the deletion machines, obtained as particular cases on changing the machine parameters of our machine.

Indranil Chakrabarty; Satyabrata Adhikari

2005-11-22

231

Effects of aging on audio-visual speech integration.  

PubMed

This study investigated the impact of aging on audio-visual speech integration. A syllable identification task was presented in auditory-only, visual-only, and audio-visual congruent and incongruent conditions. Visual cues were either degraded or unmodified. Stimuli were embedded in stationary noise alternating with modulated noise. Fifteen young adults and 15 older adults participated in this study. Results showed that older adults had preserved lipreading abilities when the visual input was clear but not when it was degraded. The impact of aging on audio-visual integration also depended on the quality of the visual cues. In the visual clear condition, the audio-visual gain was similar in both groups and analyses in the framework of the fuzzy-logical model of perception confirmed that older adults did not differ from younger adults in their audio-visual integration abilities. In the visual reduction condition, the audio-visual gain was reduced in the older group, but only when the noise was stationary, suggesting that older participants could compensate for the loss of lipreading abilities by using the auditory information available in the valleys of the noise. The fuzzy-logical model of perception confirmed the significant impact of aging on audio-visual integration by showing an increased weight of audition in the older group. PMID:25324091

Huyse, Aurélie; Leybaert, Jacqueline; Berthommier, Frédéric

2014-10-01

232

State-Machine Replication  

E-print Network

State-Machine Replication #12;The Problem Clients Server #12;The Problem Clients Server #12;The (state machine) #12;The Solution 1. Make server deterministic (state machine) State machine #12;The Solution 1. Make server deterministic (state machine) 2. Replicate server State machines #12;The Solution 1

Venkataramani, Arun

233

The effects of output transformers on distortion in audio amplifiers  

E-print Network

THE EFZECTS OP OUTPUT TRARSPORMERS OM DISTORTION IK AUDIO AMPLIFIERS A Thesis By ROSS EDWIiV IJQTIER AUGUST 1949 Approved as to style and content by C irasLn o Committee THE EFFECTS OF OUTPUT TRANSFORMERS OE DISTORTIOE I J AUDIO A;8PLIFI... THE EFFECTS OF OUTPUT TRANSFORMERS ON DISTORTION IN AUDIO ANPLIFIZRS A Thesis Ross Edwin Lanier August 1949 IO CA CO Ol CL CC PEKZATQHY NOTE The writer wishes to acknowledge the help of Professor H. C. Dillingham, who is responsi- ble...

Lanier, Ross Edwin

2012-06-07

234

Audio-visual sports highlights extraction using Coupled Hidden Markov Models  

Microsoft Academic Search

We present our studies on the application of Coupled Hidden Markov Models(CHMMs) to sports highlights extraction from broadcast video using both audio and video information. First, we generate audio labels using audio classification via Gaussian mixture models, and video labels using quantization of the average motion vector magnitudes. Then, we model sports highlights using discrete-observations CHMMs on audio and video

Ziyou Xiong

2005-01-01

235

Machine Design  

NSDL National Science Digital Library

This website, the homepage of Machine Design.com, contains resources on a variety of information for engineers and technicians related to devices, components, design applications, products, and systems in the manufacturing technology sector. The site also features a CAD library, eBooks, audiovisual aids, webinars, whitepapers and a reference center. Some of the resources require a free login. The page offers an RSS feed to keep users up to date on new resources. A free login may be required to access some of these items.

2009-11-09

236

Function Machine  

NSDL National Science Digital Library

This Java activity for the interactive white board (free access with registration) provides learners an opportunity to explore number functions and to predict outcomes. Teachers set the desired function from the drop-down list (double it, halve it, add 1, subtract 1, add 10, or subtract 10) and choose the start number from 1-20. Students are asked to predict the outcome before asking the machine to go. There is an option to hide the function so the teacher can ask students to work out what operation is being used to get from the starting number to the number generated.

2012-07-19

237

MODIS: an audio motif discovery software Laurence Catanese1  

E-print Network

and the velocity of digital media creation and diffusion. Recent work have already studied the motif discovery in Figure 1. The software only requires an audio features file input. The feature type choice depends

Paris-Sud XI, Université de

238

Object-Coding for Resolution-Free Musical Audio  

E-print Network

into the high-resolution workflow. To maximize the benefits of these libraries, they need to be upsampled be made as flexible as possible. Most of the current formats for audio have addressed the compression

Plumbley, Mark

239

Sampling Function of Degree 2 for DVD-Audio  

Microsoft Academic Search

Authors have been studying Fluency Information Theory that generalizes Shannon's sampling theorem and its applications. Among the practical application of the research, the Fluency DAC that is developed as the Digital-to-analog converter for CD audio could have received objective valuation including receipt Golden Sound Award in 1988. In recent, DVD-Audio that deal with maximum sampling rate of 192 kHz has

Kazuo Toraichi; Koji Nakamura

2003-01-01

240

SAOL: The MPEG4 Structured Audio Orchestra Language  

Microsoft Academic Search

The MPEG-4 standard, which will be published in October 1998, contains extensive provisions for sound synthesis as well as tradition al methods of audio compression. At the heart of MPEG-4 Structured Audio, the sound-synthesis fra mework, is a new music-synthesis language called SAOL. This language, based on the Music-N model, is specified and defined fully in the MPEG-4 International Standard;

Eric D. Scheirer; Barry L. Vercoe

1999-01-01

241

Frequency of plumbing fixture use through audio sampling  

E-print Network

FREQUENCY OF PLUMBING FIXTURE USE THROUGH AUDIO SAMPLING A Senior Honors Thesis By KEVIN BRUCE SHEA Submitted to the Office of Honors Programs & Academic Scholarships Texas ARM University In partial fulfillment of the requirements... of the UNIVERSITY UNDERGRADUATE RESEARCH FELLOWS April 2000 Group. Physical Sciences FREQUENCY OF PLUMBING FIXTURE USE THROUGH AUDIO SAMPLING A Senior Honors Thesis By KEVIN BRUCE SHEA Submitted to the Office of Honors Programs & Academic Scholarships...

Shea, Kevin Bruce

2013-02-22

242

Audio watermarking method robust against time- and frequency-fluctuation  

Microsoft Academic Search

In this paper, we describe an audio watermarking algorithm that can embed a multiple-bit message which is robust against wow-and-flutter, cropping, noise-addition, pitch-shift, and audio compressions such as MP3. The algorithm calculates and manipulates the magnitudes of segmented areas in the time-frequency plane of the content using short-term DFTs. The detection algorithm correlates the magnitudes with a pseudo-random array that

Ryuki Tachibana; Shuichi Shimizu; Taiga Nakamura; Seiji Kobayashi

2001-01-01

243

Personal audio with a planar bright zone.  

PubMed

Reproduction of multiple sound zones, in which personal audio programs may be consumed without the need for headphones, is an active topic in acoustical signal processing. Many approaches to sound zone reproduction do not consider control of the bright zone phase, which may lead to self-cancellation problems if the loudspeakers surround the zones. Conversely, control of the phase in a least-squares sense comes at a cost of decreased level difference between the zones and frequency range of cancellation. Single-zone approaches have considered plane wave reproduction by focusing the sound energy in to a point in the wavenumber domain. In this article, a planar bright zone is reproduced via planarity control, which constrains the bright zone energy to impinge from a narrow range of angles via projection in to a spatial domain. Simulation results using a circular array surrounding two zones show the method to produce superior contrast to the least-squares approach, and superior planarity to the contrast maximization approach. Practical performance measurements obtained in an acoustically treated room verify the conclusions drawn under free-field conditions. PMID:25324075

Coleman, Philip; Jackson, Philip J B; Olik, Marek; Abildgaard Pedersen, Jan

2014-10-01

244

Audio-video synchronization management in embedded multimedia applications  

NASA Astrophysics Data System (ADS)

Multimedia systems are required to provide proper synchronization of various components for intelligible presentation. However, it is challenging to accommodate the heterogeneity of different media characteristics. Audio-video synchronization is, for instance, required for presenting video chunks with audio frames where video chunk size is generally large and variable, but audio frame size is small and fixed. Such audio-video synchronization problem has been widely studied in the literature. The problem involves proper definition and preservation of temporal relationship between audio and video. Moreover, it is also important to take into account the processing complexity, since the computational resources and processing power on embedded platforms, such as cell phones and other handheld devices, are very limited. In this paper, we present the implementation of three audio-video synchronization methods on an embedded system. We discuss the performance as well as the advantages and disadvantages of each of these techniques. Based on our evaluation, we reason why one of the presented techniques is superior to the other two.

Rehman, Hamood-Ur; Kim, Taehyun; Avadhanam, Niranjan; Subramanian, Sridharan

2008-02-01

245

1014 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 13, NO. 5, SEPTEMBER 2005 Robust Low-Delay Audio Coding  

E-print Network

TECHNOLOGICAL progress has made the public Internet infrastructure faster and has given more users high with high quality for both the video and the audio, musicians playing together remotely, wireless speakers

Kova?evi?, Jelena

246

Mind & Machine  

NSDL National Science Digital Library

Mind & Machine is a weekly column provided by Ashley Dunn for the New York Times Cybertimes that discusses topics related to computing, technology, and the Internet. Recent columns have addressed the topics of the development of Internet telephony, possible futures of user interfaces, the history of technology and standards, and the Internet as a vehicle for community. Articles are well written, opinionated, and thought provoking. Mr. Dunn is a free lance writer who has written for such papers as the New York Times, the Los Angeles Times, the Seattle Post-Intelligencer, and the South China Morning Post. Note that the site is available only upon registration and is free of charge only in the US.

Dunn, Ashley.

1996-01-01

247

Course info Machine Learning  

E-print Network

Course info Machine Learning Real life problems Lecture 1: Machine Learning Problem Qinfeng (Javen) Shi 28 July 2014 Intro. to Stats. Machine Learning COMP SCI 4401/7401 Qinfeng (Javen) Shi Lecture 1: Machine Learning Problem #12;Course info Machine Learning Real life problems Table of Contents I 1 Course

Shi, Qinfeng "Javen"

248

Digital Audio Radio Broadcast Systems Laboratory Testing Nearly Complete  

NASA Technical Reports Server (NTRS)

Radio history continues to be made at the NASA Lewis Research Center with the completion of phase one of the digital audio radio (DAR) testing conducted by the Consumer Electronics Group of the Electronic Industries Association. This satellite, satellite/terrestrial, and terrestrial digital technology will open up new audio broadcasting opportunities both domestically and worldwide. It will significantly improve the current quality of amplitude-modulated/frequency-modulated (AM/FM) radio with a new digitally modulated radio signal and will introduce true compact-disc-quality (CD-quality) sound for the first time. Lewis is hosting the laboratory testing of seven proposed digital audio radio systems and modes. Two of the proposed systems operate in two modes each, making a total of nine systems being tested. The nine systems are divided into the following types of transmission: in-band on-channel (IBOC), in-band adjacent-channel (IBAC), and new bands. The laboratory testing was conducted by the Consumer Electronics Group of the Electronic Industries Association. Subjective assessments of the audio recordings for each of the nine systems was conducted by the Communications Research Center in Ottawa, Canada, under contract to the Electronic Industries Association. The Communications Research Center has the only CCIR-qualified (Consultative Committee for International Radio) audio testing facility in North America. The main goals of the U.S. testing process are to (1) provide technical data to the Federal Communication Commission (FCC) so that it can establish a standard for digital audio receivers and transmitters and (2) provide the receiver and transmitter industries with the proper standards upon which to build their equipment. In addition, the data will be forwarded to the International Telecommunications Union to help in the establishment of international standards for digital audio receivers and transmitters, thus allowing U.S. manufacturers to compete in the world market.

2005-01-01

249

Socially guided machine learning  

E-print Network

Social interaction will be key to enabling robots and machines in general to learn new tasks from ordinary people (not experts in robotics or machine learning). Everyday people who need to teach their machines new things ...

Thomaz, Andrea Lockerd

2006-01-01

250

Machine musicianship  

NASA Astrophysics Data System (ADS)

The training of musicians begins by teaching basic musical concepts, a collection of knowledge commonly known as musicianship. Computer programs designed to implement musical skills (e.g., to make sense of what they hear, perform music expressively, or compose convincing pieces) can similarly benefit from access to a fundamental level of musicianship. Recent research in music cognition, artificial intelligence, and music theory has produced a repertoire of techniques that can make the behavior of computer programs more musical. Many of these were presented in a recently published book/CD-ROM entitled Machine Musicianship. For use in interactive music systems, we are interested in those which are fast enough to run in real time and that need only make reference to the material as it appears in sequence. This talk will review several applications that are able to identify the tonal center of musical material during performance. Beyond this specific task, the design of real-time algorithmic listening through the concurrent operation of several connected analyzers is examined. The presentation includes discussion of a library of C++ objects that can be combined to perform interactive listening and a demonstration of their capability.

Rowe, Robert

2002-05-01

251

Audio-video feature correlation: faces and speech  

NASA Astrophysics Data System (ADS)

This paper presents a study of the correlation of features automatically extracted from the audio stream and the video stream of audiovisual documents. In particular, we were interested in finding out whether speech analysis tools could be combined with face detection methods, and to what extend they should be combined. A generic audio signal partitioning algorithm as first used to detect Silence/Noise/Music/Speech segments in a full length movie. A generic object detection method was applied to the keyframes extracted from the movie in order to detect the presence or absence of faces. The correlation between the presence of a face in the keyframes and of the corresponding voice in the audio stream was studied. A third stream, which is the script of the movie, is warped on the speech channel in order to automatically label faces appearing in the keyframes with the name of the corresponding character. We naturally found that extracted audio and video features were related in many cases, and that significant benefits can be obtained from the joint use of audio and video analysis methods.

Durand, Gwenael; Montacie, Claude; Caraty, Marie-Jose; Faudemay, Pascal

1999-08-01

252

Audio-visual event detection based on mining of semantic audio-visual labels  

NASA Astrophysics Data System (ADS)

Removing commercials from television programs is a much sought-after feature for a personal video recorder. In this paper, we employ an unsupervised clustering scheme (CM_Detect) to detect commercials in television programs. Each program is first divided into W8-minute chunks, and we extract audio and visual features from each of these chunks. Next, we apply k-means clustering to assign each chunk with a commercial/program label. In contrast to other methods, we do not make any assumptions regarding the program content. Thus, our method is highly content-adaptive and computationally inexpensive. Through empirical studies on various content, including American news, Japanese news, and sports programs, we demonstrate that our method is able to filter out most of the commercials without falsely removing the regular program.

Goh, King-Shy; Miyahara, Koji; Radhakrishnan, Regunathan; Xiong, Ziyou; Divakaran, Ajay

2003-12-01

253

Perception-based authentication watermarking for digital audio data  

NASA Astrophysics Data System (ADS)

Current systems and protocols based on cryptographic methods for integrity and authenticity verification of media data do not distinguish between legitimate signal transformation and malicious tampering that manipulates the content. Furthermore, they usually provide no localization or assessment of the relevance of such manipulations with respect to human perception or semantics. We present an algorithm for a authentication audio watermarking that uses a perception-based robust hash function in combination with robust watermarking to verify the integrity of audio recordings. Experimental results show that the proposed system provides both a high level of distinction between perceptually different audio data and a high robustness against signal transformations that do not change the perceived information.

Zmudzinski, Sascha; Steinebach, Martin

2009-02-01

254

Say What? The Role of Audio in Multimedia Video  

NASA Astrophysics Data System (ADS)

Audio, including interviews, ambient sounds, and music, is a critical-yet often overlooked-part of an effective multimedia video. In February 2010, Linder joined scientists working on the Global Rivers Observatory Project for two weeks of intensive fieldwork in the Congo River watershed. The team's goal was to learn more about how climate change and deforestation are impacting the river system and coastal ocean. Using stills and video shot with a lightweight digital SLR outfit and audio recorded with a pocket-sized sound recorder, Linder documented the trials and triumphs of working in the heart of Africa. Using excerpts from the six-minute Congo multimedia video, this presentation will illustrate how to record and edit an engaging audio track. Topics include interview technique, collecting ambient sounds, choosing and using music, and editing it all together to educate and entertain the viewer.

Linder, C. A.; Holmes, R. M.

2011-12-01

255

Music identification system using MPEG-7 audio signature descriptors.  

PubMed

This paper describes a multiresolution system based on MPEG-7 audio signature descriptors for music identification. Such an identification system may be used to detect illegally copied music circulated over the Internet. In the proposed system, low-resolution descriptors are used to search likely candidates, and then full-resolution descriptors are used to identify the unknown (query) audio. With this arrangement, the proposed system achieves both high speed and high accuracy. To deal with the problem that a piece of query audio may not be inside the system's database, we suggest two different methods to find the decision threshold. Simulation results show that the proposed method II can achieve an accuracy of 99.4% for query inputs both inside and outside the database. Overall, it is highly possible to use the proposed system for copyright control. PMID:23533359

You, Shingchern D; Chen, Wei-Hwa; Chen, Woei-Kae

2013-01-01

256

Virtual environment display for a 3D audio room simulation  

NASA Technical Reports Server (NTRS)

The development of a virtual environment simulation system integrating a 3D acoustic audio model with an immersive 3D visual scene is discussed. The system complements the acoustic model and is specified to: allow the listener to freely move about the space, a room of manipulable size, shape, and audio character, while interactively relocating the sound sources; reinforce the listener's feeling of telepresence in the acoustical environment with visual and proprioceptive sensations; enhance the audio with the graphic and interactive components, rather than overwhelm or reduce it; and serve as a research testbed and technology transfer demonstration. The hardware/software design of two demonstration systems, one installed and one portable, are discussed through the development of four iterative configurations.

Chapin, William L.; Foster, Scott H.

1992-01-01

257

Music Identification System Using MPEG-7 Audio Signature Descriptors  

PubMed Central

This paper describes a multiresolution system based on MPEG-7 audio signature descriptors for music identification. Such an identification system may be used to detect illegally copied music circulated over the Internet. In the proposed system, low-resolution descriptors are used to search likely candidates, and then full-resolution descriptors are used to identify the unknown (query) audio. With this arrangement, the proposed system achieves both high speed and high accuracy. To deal with the problem that a piece of query audio may not be inside the system's database, we suggest two different methods to find the decision threshold. Simulation results show that the proposed method II can achieve an accuracy of 99.4% for query inputs both inside and outside the database. Overall, it is highly possible to use the proposed system for copyright control. PMID:23533359

You, Shingchern D.; Chen, Wei-Hwa; Chen, Woei-Kae

2013-01-01

258

Highlight summarization in golf videos using audio signals  

NASA Astrophysics Data System (ADS)

In this paper, we present an automatic summarization of highlights in golf videos based on audio information alone without video information. The proposed highlight summarization system is carried out based on semantic audio segmentation and detection on action units from audio signals. Studio speech, field speech, music, and applause are segmented by means of sound classification. Swing is detected by the methods of impulse onset detection. Sounds like swing and applause form a complete action unit, while studio speech and music parts are used to anchor the program structure. With the advantage of highly precise detection of applause, highlights are extracted effectively. Our experimental results obtain high classification precision on 18 golf games. It proves that the proposed system is very effective and computationally efficient to apply the technology to embedded consumer electronic devices.

Kim, Hyoung-Gook; Kim, Jin Young

2008-01-01

259

Three dimensional audio versus head down TCAS displays  

NASA Technical Reports Server (NTRS)

The advantage of a head up auditory display was evaluated in an experiment designed to measure and compare the acquisition time for capturing visual targets under two conditions: Standard head down traffic collision avoidance system (TCAS) display, and three-dimensional (3-D) audio TCAS presentation. Ten commercial airline crews were tested under full mission simulation conditions at the NASA Ames Crew-Vehicle Systems Research Facility Advanced Concepts Flight Simulator. Scenario software generated targets corresponding to aircraft which activated a 3-D aural advisory or a TCAS advisory. Results showed a significant difference in target acquisition time between the two conditions, favoring the 3-D audio TCAS condition by 500 ms.

Begault, Durand R.; Pittman, Marc T.

1994-01-01

260

Influence of audio triggered emotional attention on video perception  

NASA Astrophysics Data System (ADS)

Perceptual video coding methods attempt to improve compression efficiency by discarding visual information not perceived by end users. Most of the current approaches for perceptual video coding only use visual features ignoring the auditory component. Many psychophysical studies have demonstrated that auditory stimuli affects our visual perception. In this paper we present our study of audio triggered emotional attention and it's applicability to perceptual video coding. Experiments with movie clips show that the reaction time to detect video compression artifacts was longer when video was presented with the audio information. The results reported are statistically significant with p=0.024.

Torres, Freddy; Kalva, Hari

2014-02-01

261

Machines registres Yves Lafont  

E-print Network

Machines à registres Yves Lafont Université d'Aix-Marseille Institut de Mathématiques de Luminy) Pascal Turing Von Neumann #12;Quelle opération calcule cette machine ? Exemple 1 2 registres : (+ 1 Quelle opération calcule cette machine ? #12;Quelle propriété teste cette machine ? Exemple 3 1 seul

Lafont, Yves

262

Abstract--We describe the design and testing of a human machine interface to use surface electromyography (sEMG)  

E-print Network

Abstract--We describe the design and testing of a human machine interface to use surface electromyography (sEMG) collected from a covert location in response audio-visual feedback. Using sEMG collected. The surface electromyography (sEMG) signal is several orders of magnitude larger in amplitude, and thus well

Stepp, Cara E.

263

Frame Splitting Scheme for Error-Robust Audio Streaming over Packet-Switching Networks  

NASA Astrophysics Data System (ADS)

This letter presents a novel frame splitting scheme for an error-robust audio streaming over packet-switching networks. In our approach to perceptual audio coding, an audio frame is split into several subframes based on the network configuration such that each packet can be decoded independently at the receiver. Through a subjective comparison category rating (CCR) test, it is discovered that our approach enhances the quality of the decoded audio signal under the lossy packet-switching networks environment.

Kim, Jong Kyu; Kim, Jung Su; Yun, Hwan Sik; Chang, Joon-Hyuk; Kimt, Nam Soo

264

HMM-Based Audio Keyword Generation , Ling-Yu Duan2  

E-print Network

in the production creation of multimedia data, there is an increasing need for video semantic anal- ysis. Audio, as a significant part of video, provides important cues to human perception when humans are browsing and understanding video contents. To detect semantic content by useful audio information, we in- troduce audio

Chia, Liang-Tien

265

EASIER Sampling for Audio Event Identification Surong Wang Min Xu Liang-Tien Chia Manonranjan Dash  

E-print Network

a significant part of video. Recently people have begun to realize the importance of effective audio content in sports video, some game-specific audio sounds (e.g., excited audience sounds, excited commenta- tor to some specific audio sound which plays important role for video content analysis. In our pre- vious work

Chia, Liang-Tien

266

The state of audio description in the United Kingdom – from description to narration  

Microsoft Academic Search

Audio description and audio narration – this article focuses on the problematic relationship between description and narration, re-examining the pronouncements of objectivity in broadcast guidelines for audio describers in the United Kingdom. Using narratological theory and works by members of the various schools of continental philosophy by way of explication, the article calls for a more subjective and cultural-historical reference

Steve Finbow

2010-01-01

267

Towards robust features for classifying audio in the CueVideo system  

Microsoft Academic Search

The role of audio in the context of multimedia applications involving video is becoming increasingly important. Many efforts in this area focus on audio data that contains some built-in semantic information structure such as in broadcast news, or focus on classification of audio that contains a single type of sound such as cleaar speech or clear music only. In the

Savitha Srinivasan; Dragutin Petkovic; Dulce B. Ponceleon

1999-01-01

268

Hearing You Loud and Clear: Student Perspectives of Audio Feedback in Higher Education  

ERIC Educational Resources Information Center

The use of audio feedback for students in a full-time community nursing degree course is appraised. The aim of this mixed methods study was to examine student views on audio feedback for written assignments. Questionnaires and a focus group were used to capture student opinion of this pilot project. The majority of students valued audio feedback…

Gould, Jill; Day, Pat

2013-01-01

269

Designing a Narrative-Based Audio Only 3D Game Engine Timothy Roden *, Ian Parberry  

E-print Network

, making audio only games practical for mobile computing. Novel features of our system include real environments is a common goal for many interactive computer games. While PC-based audio only games set of research into audio only applications for sight impaired users, augmented reality systems and human-computer

Parberry, Ian

270

Responding Effectively to Composition Students: Comparing Student Perceptions of Written and Audio Feedback  

ERIC Educational Resources Information Center

The authors compared student perceptions of audio and written feedback in order to assess what types of students may benefit from receiving audio feedback on their essays rather than written feedback. Many instructors previously have reported the advantages they see in audio feedback, but little quantitative research has been done on how the…

Bilbro, J.; Iluzada, C.; Clark, D. E.

2013-01-01

271

Building ensembles of audio and lyrics features to improve musical genre classification  

Microsoft Academic Search

Digital audio has become an almost ubiquitously spread medium, and for many consumers, digital audio is the major distribution and storage form of music. Numerous on-line music stores account for a growing share of record sales. The widespread adoption of digital audio on home computers and es- pecially mobile devices, and numerous on-line music stores show the size of this

Rudolf Mayer; Andreas Rauber

2010-01-01

272

A flexible framework for key audio effects detection and auditory context inference  

Microsoft Academic Search

Key audio effects are those special effects that play critical roles in human's perception of an auditory context in audiovisual materials. Based on key audio effects, high-level semantic inference can be carried out to facilitate various con- tent-based analysis applications, such as highlight extraction and video summarization. In this paper, a flexible framework is pro- posed for key audio effect

Rui Cai; Lie Lu; Alan Hanjalic; Hong-Jiang Zhang; Lian-hong Cai

2006-01-01

273

Integrating Additional Chord Information Into HMM-Based Lyrics-to-Audio Alignment  

Microsoft Academic Search

Aligning lyrics to audio has a wide range of ap- plications such as the automatic generation of karaoke scores, song-browsing by lyrics, and the generation of audio thumbnails. Existing methods are restricted to using only lyrics and match them to phoneme features extracted from the audio (usually mel-frequency cepstral coefficients). Our novel idea is to integrate the textual chord information

Matthias Mauch; Hiromasa Fujihara; Masataka Goto

2012-01-01

274

SOLAR: Sound Object Localization and Retrieval in Complex Audio Environments Derek Hoiem1  

E-print Network

SOLAR: Sound Object Localization and Retrieval in Complex Audio Environments Derek Hoiem1 , Yan Ke1 to identify sounds in complex audio environ- ments is highly useful for multimedia retrieval, security sys- tem, a system capable of finding sound objects, such as dog barks or car horns, in complex audio

Guestrin, Carlos

275

FOR FRIENDS AND SUPPORTERS OF THE KANSAS AUDIO-READER NETWORK SPRING 2014  

E-print Network

volunteer, but did you know she's still reading? Her name may not appear in record books, nor has books for a brand-new service called Audio-Reader. Eleanor had a busy life working full-Winston Managing Editor / Art Director The Audio-Reader Mission The Kansas Audio-Reader Network exists to provide

276

Effects of Reproduction Equipment on Interaction with a Spatial Audio Interface  

E-print Network

Effects of Reproduction Equipment on Interaction with a Spatial Audio Interface Georgios N their real world audio environment. In this paper we study the effects of three types of audio reproduction selected a target sound emitting from one of four different locations in the presence of distracters whilst

Williamson, John

277

Bimodal perception of audio-visual material properties for virtual environments  

E-print Network

contribute to signicant savings in computation time in an interactive audio-visual rendering system. To our. INTRODUCTION Interactive audio-visual virtual environments are now commonplace, ranging from com- puter games, therapy etc. In such environments, synthetic objects have audio-visual material properties, which

Paris-Sud XI, Université de

278

Guided by Voices: An Audio Augmented Reality System Kent Lyons, Maribeth Gandy, Thad Starner  

E-print Network

Guided by Voices: An Audio Augmented Reality System Kent Lyons, Maribeth Gandy, Thad Starner, lightweight audio-only augmented reality infrastructure. The system uses a simple wearable computer and a RF in creating audio-only augmented reality games and show how our location infrastructure is generalizable

Starner, Thad E.

279

Optimal Crosstalk Cancellation for Binaural Audio with Two Loudspeakers Edgar Y. Choueiri  

E-print Network

Optimal Crosstalk Cancellation for Binaural Audio with Two Loudspeakers Edgar Y. Choueiri Princeton of binaural audio through loudspeakers allowing a listener to perceive an accurate 3-D image of a recorded to the sound and that has been an impediment to the wide adoption of loudspeaker-based binaural audio

Bou-Zeid, Elie

280

Mixed structural modeling of head-related transfer functions for customized binaural audio delivery  

E-print Network

Mixed structural modeling of head-related transfer functions for customized binaural audio delivery to the modeling of head-related transfer functions (HRTFs) for binaural audio rendering is for- malized to the flexibility of the MSM approach, an exponential number of solutions for building custom binaural audio

Avanzini, Federico

281

Binaural cue coding: a novel and efficient representation of spatial audio  

Microsoft Academic Search

We present a novel concept for representing multi-channel audio signals: Binaural Cue Coding (BCC). BCC aims at separating the basic audio content and the information relevant for spatial perception. A multi-channel audio signal is represented as a mono signal and BCC parameters. We present two types of applications of BCC. Firstly, a number of separate sound source signals are reduced

Christof Faller; Frank Baumgarte

2002-01-01

282

37 CFR 201.27 - Initial notice of distribution of digital audio recording devices or media.  

Code of Federal Regulations, 2013 CFR

...distribution of digital audio recording devices or media. 201.27 Section 201.27 Patents...distribution of digital audio recording devices or media. (a) General. This section...recording device or digital audio recording medium in the United States. (b)...

2013-07-01

283

37 CFR 201.27 - Initial notice of distribution of digital audio recording devices or media.  

Code of Federal Regulations, 2012 CFR

...distribution of digital audio recording devices or media. 201.27 Section 201.27 Patents...distribution of digital audio recording devices or media. (a) General. This section...recording device or digital audio recording medium in the United States. (b)...

2012-07-01

284

Design and Usability Testing of an Audio Platform Game for Players with Visual Impairments  

ERIC Educational Resources Information Center

This article reports on the evaluation of a novel audio platform game that creates a spatial, interactive experience via audio cues. A pilot study with players with visual impairments, and usability testing comparing the visual and audio game versions using both sighted players and players with visual impairments, revealed that all the…

Oren, Michael; Harding, Chris; Bonebright, Terri L.

2008-01-01

285

Can Machines BeCan Machines Be Polite?Polite?  

E-print Network

Can Machines BeCan Machines Be Polite?Polite? Michael A. CovingtonMichael A. Covington Artificial Intelligence CenterArtificial Intelligence Center #12;Can machines be polite?Can machines be polite? Machines have beenMachines have been annoying peopleannoying people ever since humanever since human--machine

Covington, Michael A.

286

Increasing the capacity of LSB-based audio steganography  

Microsoft Academic Search

Conventionally, a perceptual limit of three bits per sample is imposed to the basic LSB audio steganography method. In this paper, we present a novel modification to standard LSB algorithm that is able to embed four bits per sample, thus improving the capacity of data hiding channel by 33%. The proposed algorithm makes use of minimum error replacement method for

Nedeljko Cvejic; Tapio Seppänen

2002-01-01

287

AUDIO-BASED GENDER IDENTIFICATION USING BOOTSTRAPPING George Tzanetakis  

E-print Network

AUDIO-BASED GENDER IDENTIFICATION USING BOOTSTRAPPING George Tzanetakis gtzan@cs.uvic.ca Department gender identification is used for video indexing and can improve speech recognition results by using gender- specific classifiers. Gender identification in large datasets is difficult because of the large

Tzanetakis, George

288

An FET Audio Peak Limiter W. Marshall Leach, Jr., Professor  

E-print Network

signal levels is desired. A peak limiter is a circuit which monitors the peak level of a signal. If the peak exceeds a preset limit threshold, the gain of the circuit is reduced so as to prevent the peak of broadcast transmitters, to prevent distortion caused by overload of audio recorders, to prevent overdrive

Leach Jr.,W. Marshall

289

Informed Audio Source Separation from Compressed Linear Stereo Mixtures  

E-print Network

phenomenon gets generalized from voice to instruments in musical video games such as Rock Band and Guitar recording in the video game. Indeed, multitrack formats (i.e. concatenation of the separate instruments, or adding individual audio effects, leading to a new "active" mode of music listening. Several multitrack

Paris-Sud XI, Université de

290

Indeterminate Adaptive Digital Audio for Games on Mobiles  

Microsoft Academic Search

1 Audio and Games on Mobiles A mobile game is a video game played on a mobile phone. The game market for mobiles is clearly regarded as a market with a future (11), as the multiple investments carried out by the large world editors on this segment testify. The mobiles are true platforms of large and general public games: mobile

Agnès Guerraz; Jacques Lemordant

291

Simultaneous Estimation of Chords and Musical Context From Audio  

Microsoft Academic Search

Chord labels provide a concise description of musical harmony. In pop and jazz music, a sequence of chord labels is often the only written record of a song, and forms the basis of so-called lead sheets. We devise a fully automatic method to simultaneously estimate from an audio waveform the chord sequence including bass notes, the metric positions of chords,

Matthias Mauch; Simon Dixon

2010-01-01

292

HRIR~: modulating range in headphone-reproduced spatial audio  

Microsoft Academic Search

HRIR~, a new software audio filter for Head-Related Impulse Response (HRIR) convolution is presented. The filter, implemented as a Pure-Data object, allows dynamic modification of a sound source's apparent location by modulating its virtual azimuth, elevation, and range in realtime, the last attribute being missing in surveyed similar applications. With hrir~ users can virtually localize monophonic sources around a listener's

Julián Villegas; Michael Cohen

2010-01-01

293

Indeterminate Adaptive Digital Audio for Games on Mobiles  

E-print Network

complaint of many in the industry is the lack of standards. It is necessary to adapt each game established as a credible digital music player, mobile music-related applications are becoming an increasingly and the music experience on the phone is then much more than just playing music tracks. Audio technologies

Paris-Sud XI, Université de

294

Video genre categorization and representation using audio-visual information  

NASA Astrophysics Data System (ADS)

We propose an audio-visual approach to video genre classification using content descriptors that exploit audio, color, temporal, and contour information. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At the temporal structure level, we consider action content in relation to human perception. Color perception is quantified using statistics of color distribution, elementary hues, color properties, and relationships between colors. Further, we compute statistics of contour geometry and relationships. The main contribution of our work lies in harnessing the descriptive power of the combination of these descriptors in genre classification. Validation was carried out on over 91 h of video footage encompassing 7 common video genres, yielding average precision and recall ratios of 87% to 100% and 77% to 100%, respectively, and an overall average correct classification of up to 97%. Also, experimental comparison as part of the MediaEval 2011 benchmarking campaign demonstrated the efficiency of the proposed audio-visual descriptors over other existing approaches. Finally, we discuss a 3-D video browsing platform that displays movies using feature-based coordinates and thus regroups them according to genre.

Ionescu, Bogdan; Seyerlehner, Klaus; Rasche, Christoph; Vertan, Constantin; Lambert, Patrick

2012-04-01

295

Adaptive audio-video streaming solution over IP mobile environments  

Microsoft Academic Search

Audio and video streaming over the Internet is becoming very attractive and popular. Thus, there is a need to provide efficient delivery of such applications to the growing number of demanding users. In addition, new types of devices, transportable and mobile are emerging rapidly. Now, common users are expecting to access their favorite Internet services anywhere, and through any equipment

Daniel Negru; Toufik Ahmed; Ahmed Mehaoua

2006-01-01

296

National Center for Audio Tapes 1971 Catalog Supplement.  

ERIC Educational Resources Information Center

About 600 audio tapes are cataloged in this supplement to the 1970-71 catalog (ED 038 854). The catalog is organized in three sections. The subject index, based on the Library of Congress subject area classification scheme, includes these categories: art, education, languages and literature, mathematics, physical education and recreation, science,…

Colorado Univ., Boulder. National Center for Audio Tapes.

297

How is audio description shaped by the functions it performs?  

E-print Network

#12;QUESTIONS How is audio description shaped by the functions it performs? What kinds, stares, puts, watches, opens, looking, runs, sitting, comes, picks, sees, holds, wearing, smile, nods, standing, leans, glances, gives, holding, watch, beat, grabs, leaves, falls, reaches, watching, drops

Bradstock, Burton

298

Packet Audio Playout Delay Adjustment: Performance Bounds and Algorithms  

Microsoft Academic Search

In packet audio applications, packets are buffered at a receiving site and their playout delayed in order to com- pensate for variable network delays. In this paper, we con- sider the problem of adaptively adjusting the playout delay in order to keep this delay as small as possible, while at the same time avoiding excessive \\

Sue B. Moon; James F. Kurose; Donald F. Towsley

1998-01-01

299

An Evaluation of the Audio Workbook System. R & D Report.  

ERIC Educational Resources Information Center

The Cassette Review Program (CRP), developed by The American College of Life Underwriters, is organized into 10 sections corresponding to the 10 courses of the American College C.L.U. diploma program. It includes both audio tapes and notebooks. The formative evaluation of the CRP carried out in 1971 resulted in a restructuring of both the tapes…

Andrulis, Richard S.

300

Recognition of blue movies by fusion of audio and video  

Microsoft Academic Search

Along with the explosive growth of the Internet, comes the proliferation of pornography. Compared with the pornographic texts and images, blue movies can do much harm to children, due to the greater realism and voyeurism of blue movies. In this paper, a framework for recognizing blue movies by fusing the audio and video information is described. A one-class Gaussian mixture

Haiqiang Zuo; Ou Wu; Weiming Hu; Bo Xu

2008-01-01

301

West Virginia Palliative Care Network 2011 Audio Conference Series  

E-print Network

West Virginia Palliative Care Network 2011 Audio Conference Series Target Audience - These courses care clinicians who wish to expand their knowledge of end-of-life care. Course Description - Hospital palliative care consultation teams have been shown to improve care for patients with serious illness

Mohaghegh, Shahab

302

Multimodal Music Mood Classification Using Audio and Lyrics  

Microsoft Academic Search

In this paper we present a study on music mood classi- fication using audio and lyrics information. The mood of a song is expressed by means of musical features but a rele- vant part also seems to be conveyed by the lyrics. We eval- uate each factor independently and explore the possibility to combine both, using Natural Language Processing and

Cyril Laurier; Jens Grivolla; Perfecto Herrera

2008-01-01

303

Modeling of Emotions expressed in Music using Audio features  

E-print Network

and the resulting ratings is made. Furthermore the influence of participants' musical experience, their mood beforeModeling of Emotions expressed in Music using Audio features Jens Madsen Kongens Lyngby 2011 MSC and rating music, using the emotions expressed in music. This measure can serve as a standalone pa- rameter

304

Improving Personal Diaries Using Social Audio Features Michael Kuhn  

E-print Network

shuffle play mode designed to select music fitting a user's mood, and a tag cloud to select music based that relates one specific sort of media items ­ music files ­ to each other by investigating their usage in a large community. The proposed social audio features are not restricted to relate music to music. Rather

305

What Makes Preschoolers Listen to Narrative Audio Tapes?  

Microsoft Academic Search

Most communication studies on children and media have focused solely on television. Other popular media products such as narrative audio tapes have been neglected. The present article addresses factors that influence preschoolers' selective exposure to these tapes. In line with past research, the emotional attractiveness of a story's protagonist and some formal design elements of the product are regarded as

Peter Vorderer; Saskia Böcking; Christoph Klimmt; Ute Ritterfeld

2006-01-01

306

Military applications for Digital Audio Radio Service (DARS)  

Microsoft Academic Search

This paper will discuss the three commercial Digital Audio Radio Service (DARS) systems that are currently, or soon to be, available, and the potential military applications of the different systems. XM Satellite Radio Inc., Sirius Satellite Inc., WorldSpace Corp, and potential future systems from Mobile Broadcasting Corp. and the European Space Agency system are DARS systems that each will provide

A. Hale; D. Ballinger

2002-01-01

307

FM Analysis/Synthesis Based Audio Coding Bondhan Winduratna  

E-print Network

, the frequency of a sine wave (carrier) is modulated by another sine wave (modulator) to generate a complex/s. The achieved sound quality, however, decreases at lower bit rates, in particular for speech and single was in musical sound synthesis. This FM synthesis offers the advantage, that it is capable to model audio signals

308

A framework for immersive spatial audio performance Mike Wozniewski  

E-print Network

´et´e Des Arts Technologiques 1195 Saint-Laurent boulevard Montreal, Quebec zack@sat.qc.ca Jeremy R, cognitive perception, and medical therapy. Despite contin- ued improvements in the quality of VE audio rendered identically for every source. The demands of musical composition and performance, however, re

Cooperstock, Jeremy R.

309

Normalized Auditory Attention Levels for Automatic Audio Surveillance  

E-print Network

Normalized Auditory Attention Levels for Automatic Audio Surveillance L. Couvreur, F. Bettens, J of auditory attention on a normalized scale, i.e. between 0 and 1. The proposed features are derived from loudness, temporal and frequency contrasts. Normalized auditory attention levels can be used to detect

Dupont, Stéphane

310

USING 3D AUDIO GUIDANCE TO LOCATE INDOOR STATIC OBJECTS  

E-print Network

, insufficient, or unavailable. The study also suggests that special cues should be added to the pure spatial environment does not permit visual navigation aids (due to smoke, darkness, mud, etc). Other people that may with a 3D audio cue. This study will however test if previous studies can be applied to a whole building

Tsigas, Philippas

311

MANIPULATION, ANALYSIS AND RETRIEVAL SYSTEMS FOR AUDIO SIGNALS  

E-print Network

RECOMMENDED FOR ACCEPTANCE BY THE DEPARTMENT OF COMPUTER SCIENCE JUNE 2002 #12;c Copyright by George of sound effects are also used by the movie and animation industry. Research areas that utilize large audio collections in- clude: Auditory Display, Bioacoustics, Computer Music, Forensics, and Music Cognition

Tzanetakis, George

312

Use of 3D Head Shape for Personalized Binaural Audio  

E-print Network

to Philip Jackson (p.jackson@surrey.ac.uk) ABSTRACT Natural-sounding reproduction of sound over headphones. Audio recordings were filtered by the output HRIRs to generate samples for a comparative listening test reproduction for immersive gaming, an individual requires accurate estimates of his/her own HRIRs over

Jackson, Philip JB

313

Recognition of Instrument Timbres in Real Polytimbral Audio Recordings  

E-print Network

], [29]. Huge repositories of audio recordings available from the Internet and private sets offer music with sweet violin sound. More advanced person (a musician) might need scores for the piece of music found in the Internet, to play it by himself or herself. All these issues are of interest

Ras, Zbigniew W.

314

page 1 of 8 Audio Streams Merging Over ALMI  

E-print Network

/audio applications based on multicast have been developed to facilitate group communications over the Internet. One. Introduction Today's Internet is a collection of a large number of end hosts connected to each other via a variety of networks. The simplest form of data transfer over the Internet consists of one

Fei, Zongming

315

IMPROVEMENTS OF AUDIO-BASED MUSIC SIMILARITY AND GENRE CLASSIFICATON  

E-print Network

IMPROVEMENTS OF AUDIO-BASED MUSIC SIMILARITY AND GENRE CLASSIFICATON Elias Pampalk1 , Arthur Flexer of experiments on four music col- lections. The evaluations are based on genre classifica- tion, assuming that evaluating sim- ilarity through genre classification is biased by the music collection (and genre taxonomy

316

Pitch Histograms in Audio and Symbolic Music Information Retrieval  

Microsoft Academic Search

In order to represent musical content, pitch and timing information is utilized in the majority of existing work in Symbolic Music Information Retrieval (MIR). Symbolic representations such as MIDI allow the easy calculation of such information and its manipulation. In contrast, most of the existing work in Audio MIR uses timbral and beat information, which can be calculated using automatic

George Tzanetakis; Andrey Ermolinskiy; Perry R. Cook

2002-01-01

317

Improvements of Audio-Based Music Similarity and Genre Classificaton  

Microsoft Academic Search

Audio-based music similarity measures can be applied to automatically generate playlists or recommendations. In this paper spectral similarity is combined with comple- mentary information from fluctuation patterns including two new descriptors derived thereof. The performance is evaluated in a series of experiments on four music col- lections. The evaluations are based on genre classifica- tion, assuming that very similar tracks

Elias Pampalk; Arthur Flexer; Gerhard Widmer

2005-01-01

318

Music Genre Classification Using MIDI and Audio Features  

E-print Network

Music Genre Classification Using MIDI and Audio Features Zehra Cataltepe, Yusuf Yaslan, Abdullah features from MIDI, separately and combined together, for MIDI music genre classification. We use McKay's 3 are applied on the feature vectors to infer genre [6]. MIDI format developed as a standard to play music

Cataltepe, Zehra

319

IMPROVEMENTS OF AUDIO-BASED MUSIC SIMILARITY AND GENRE CLASSIFICATON  

E-print Network

IMPROVEMENTS OF AUDIO-BASED MUSIC SIMILARITY AND GENRE CLASSIFICATON Elias Pampalk1 , Arthur Flexer Similarity, Fluctuation Patterns, Descriptors, Music Similarity, Genre Classification 1 INTRODUCTION Music that music genre taxonomies have several limitations (see e.g. [15]). An obvious issue is that many artists

Widmer, Gerhard

320

Stockholm, Dec 2002 An Audio Augmented Reality System  

E-print Network

sound in three dimensions the reproduction of spatial aspects of audio is essential to digitally create performance of synthesis and reproduction of 3D soundscapes in real-time, also on inexpensive hardware and of the environmental acoustics is done through the DirectSound3D and the EAX API's. A head-tracking device makes

Maguire Jr., Gerald Q.

321

Implementation issues of 3D audio in a virtual room  

Microsoft Academic Search

This paper presents the audio system built for the virtual room at Helsinki University of Technology. First we discuss the general problems for multichannel sound reproduction caused by the construction of, and the equipment in virtual rooms. We also describe the acoustics of the room in question, and the effect of the back-projected screens and reverberation to the sound. Compensation

Jarmo Hiipakka; Tommi Ilmonen; Tapio Lokki; Matti Groehn; Lauri Savioja

2001-01-01

322

STEREOPHONIC PERSONAL AUDIO REPRODUCTION USING PLANARITY CONTROL OPTIMIZATION  

E-print Network

Approaches to sound zone reproduction may be broadly categorized as energy cancellation approaches or sound may be applied to sound zone reproduction by means of coefficient translation between the local (zoneSTEREOPHONIC PERSONAL AUDIO REPRODUCTION USING PLANARITY CONTROL OPTIMIZATION Philip Coleman

Jackson, Philip JB

323

Spatial Audio System for Surround Video Martin Morrell, 2  

E-print Network

reproduction, immersing them in the content. The design of a spatial audio system was necessary to give 360 discuss the design process and decisions made that concluded in using a mixed reproduction system for recreating the effect of distance, in-head panning, sound scene rotations, reverberation and movement

Reiss, Josh

324

Effects of Tactile and Audio Cues on Reducing Vestibular Illusions.  

National Technical Information Service (NTIS)

The effect of multisensory cues (3-D, audio, tactile belt) to overcome a vestibular illusion in a rotating Barany Chair was investigated. Seated subjects were rotated about their spinal axis (Z axis) from a standing stop to a predetermined velocity. The a...

D. Bowden, L. Guzy, W. Albery

2006-01-01

325

Automatic Loudness Adjusting Circuits for Automotive Audio Systems  

Microsoft Academic Search

An audio system in an automobile suffers from its ambient noise. If a driver, in a car on express way or gravel road, is listening to a car radio or a car stereo, the sound of them will probably be masked by the vehicle-noise. Therefore, it is frequently required to adjust the loudness of sound. It is, however, dangerous for

Takashi Taniyama

1981-01-01

326

Chemical News Via Audio Tapes: Chemical Industry News  

ERIC Educational Resources Information Center

Tape coverage of internal R&D news now has a broader scope with improved features. A new tape series covering external news of broad interest has been initiated. The use of tape in a Continuing Education Program is discussed as the future plans for expanding the audio tape program. (1 reference) (Author)

Hanford, W. E.; And Others

1972-01-01

327

Power Dissipation in Linear Audio Power Amplifiers Paul Nossaman  

E-print Network

, and is also dissipated in the IC itself in the form of heat, or in other words, lost power. Again, dependingPower Dissipation in Linear Audio Power Amplifiers Paul Nossaman Analog Field Specialist Texas and sales teams usually require that the music be louder and better sounding. The power supply designer

Paderborn, Universität

328

COMBINING FEATURES REDUCES HUBNESS IN AUDIO SIMILARITY Arthur Flexer,1  

E-print Network

Department of Computational Perception Johannes Kepler University Linz, Austria arthur.flexer@ofai.at, dominik.schnitzer@ofai.at martin.gasser@ofai.at, tim.pohle@jku.at ABSTRACT In audio based music similarity, a well known effect is the existence of hubs, i.e. songs which appear similar to many other songs without

Widmer, Gerhard

329

Broadband Networked Audio: Entering the Era of Multisensory Data Distribution  

E-print Network

- pairing perception of the original signal. As music is one of the most demanding of applica- tions, John Roston, Wieslaw Woszczyk Centre for Interdisciplinary Research in Music Media and Technology Mc, Internet audio, once relegated to the second class status of AM radio, is now recognized as an effective

Cooperstock, Jeremy R.

330

A minicomputer system for audio-animatronics show data generation  

Microsoft Academic Search

Audio-Animatronics® shows have been produced by Disney since the introduction of several attractions at the 1964 New York World's Fair. Since then a variety of shows have been permanently installed at both Disneyland and Walt Disney World. They typically consist of a stage, or some enclosed show area, and a variety of special lighting effects, mechanized characters and other movable

Philip C. Stover; R. David Snyder

1980-01-01

331

AUDIO SOURCE SEPARATION WITH ONE SENSOR FOR ROBUST SPEECH RECOGNITION  

E-print Network

AUDIO SOURCE SEPARATION WITH ONE SENSOR FOR ROBUST SPEECH RECOGNITION L. Benaroya, F. Bimbot, G of noise compensa- tion in speech signals for robust speech recognition. Sev- eral classical denoising- perimposed to the voice of the speaker(s). While automatic speech recognition is a rather mature technology

Paris-Sud XI, Université de

332

AUDIO-VISUAL ISOLATED DIGIT RECOGNITION FOR WHISPERED SPEECH  

E-print Network

AUDIO-VISUAL ISOLATED DIGIT RECOGNITION FOR WHISPERED SPEECH Xing Fan, Carlos Busso, and John H. Therefore, performance of speech recognition systems trained with high energy voiced phonemes, degrades digit recognition of whispered speech. A small digit corpus with one subject speaking both whisper

Busso, Carlos

333

THE EFFECT OF SPEECH AND AUDIO COMPRESSION ON SPEECH RECOGNITION  

E-print Network

THE EFFECT OF SPEECH AND AUDIO COMPRESSION ON SPEECH RECOGNITION PERFORMANCE L. Besacier, C on the performance of our continuous speech recognition engine. GSM full rate, G711, G723.1 and MPEG coders are investigated. It is shown that MPEG transcoding degrades the speech recognition performance for low bitrates

Boyer, Edmond

334

Adding Audio Description: Does It Make a Difference?  

ERIC Educational Resources Information Center

A study involving 111 adults with blindness examined the impact of watching television science programs with and without audio description. Results indicate respondents gained and retained more information from watching programs with description. They reported that the description makes the program more enjoyable, interesting, and informative.…

Schmeidler, Emilie; Kirchner, Corinne

2001-01-01

335

Dynomite: a dynamically organized ink and audio notebook  

Microsoft Academic Search

Dynomite is a portable electronic notebook for the capture and retrieval of handwritten and audio notes. The goal of Dynomite is to merge the organization, search, and data acquisition capabilities of a computer with the benefits of a paper-based notebook. Dynomite provides novel solutions in four key problem areas. First, Dynomite uses a casual, low cognitive overhead interface. Second, for

Lynn D. Wilcox; Bill N. Schilit

1997-01-01

336

A Collaborative Interface for Multimodal Ink and Audio Documents  

Microsoft Academic Search

With the increased availability of pen-based devices, it becomes interesting to conduct and to archive multi-party communication sessions that involve audio and digital ink on a shared canvas. Collaborative whiteboards do exist today but typically use complex or closed protocols for communication. As a rule, existing whiteboards are not interoperable across multiple platforms and do not sup- port archival of

Amit Regmi; Stephen M. Watt

2009-01-01

337

Infant Perception of Audio-Visual Speech Synchrony  

ERIC Educational Resources Information Center

Three experiments investigated perception of audio-visual (A-V) speech synchrony in 4- to 10-month-old infants. Experiments 1 and 2 used a convergent-operations approach by habituating infants to an audiovisually synchronous syllable (Experiment 1) and then testing for detection of increasing degrees of A-V asynchrony (366, 500, and 666 ms) or by…

Lewkowicz, David J.

2010-01-01

338

EFMThe Modulation Method for the Compact Disc Digital Audio System  

Microsoft Academic Search

The modulation method employed in the Compact Disc (CD) digital audio system codeveloped by Philips N.V. (Eindhoven, The Netherlands) and Sony Corporation (Tokyo, Japan) is described. This method, called eight-to-fourteen (EFM), is an 8 (data bit) ---> 14 (channel bit) conversion block code with a space of 3 channel bits for every converted 14 channel bits which is used to

HIROSHI OGAWA; KEES A. SCHOUHAMER IMMINK

339

SHORT COMMUNICATION An audio/video surveillance system for wildlife  

E-print Network

widely used to document the presence and behaviour of animals at nests, trap sites and wildlife passagesSHORT COMMUNICATION An audio/video surveillance system for wildlife Roman Gula & Jörn Theuerkauf frequently visited by wildlife. Keywords Methods . Video surveillance . Nest monitoring . Digital video

Paris-Sud XI, Université de

340

Audio and Video Reflections to Promote Social Justice  

ERIC Educational Resources Information Center

Purpose: The purpose of this paper is to examine how 15 graduate students enrolled in a US school leadership preparation program understand issues of social justice and equity through a reflective process utilizing audio and/or video software. Design/methodology/approach: The study is based on the tradition of grounded theory. The researcher…

Boske, Christa

2011-01-01

341

Sounds in CD-ROM--Integrating Audio in Multimedia Products.  

ERIC Educational Resources Information Center

Describes how audio technology is being integrated into CD-ROMs to create multimedia products. Computer hardware and software are discussed, including the use of HyperCard to combine still pictures, moving video pictures, and sound; and specific new multimedia products produced by the Voyager Company are described. (LRW)

Rosebush, Judson

1992-01-01

342

HMM Based Falling Person Detection Using Both Audio and Video  

Microsoft Academic Search

Automatic detection of a falling person in video is an important problem with applications in security and safety areas including supportive home environments and CCTV surveillance systems. Human motion in video is modeled using Hidden Markov Models (HMM) in this paper. In addition, the audio track of the video is also used to distinguish a person simply sitting on a

B. Ugur Töreyin; Yigithan Dedeoglu; A. Enis Çetin

2005-01-01

343

Subword-based spoken term detection in audio course lectures  

Microsoft Academic Search

This paper investigates spoken term detection (STD) from audio recordings of course lectures obtained from an existing media repository. STD is performed from word lattices generated offline using an automatic speech recognition (ASR) system configured from a meetings domain. An efficient STD approach is presented where lattice paths which are likely to contain search terms are identified and an efficient

Richard C. Rose; Atta Norouzian; Aarthi Reddy; André Coy; Vishwa Gupta; Martin Karafiát

2010-01-01

344

Designing for persistent audio conversations in the enterprise  

Microsoft Academic Search

Social media websites like flickr and del.icio.us enable collaboration by allowing users to easily share content on the web through tagging. To provide a similar advantage to the enterprise, we have designed a tagging system for audio conversations. We are developing telephonic interfaces, where participants of a spoken conversation can opt to archive and share it. We have also developed

Adithya Renduchintala; Shreeharsh Kelkar; Ajita John; Dorée Duncan Seligmann

2007-01-01

345

Developing a Framework for Effective Audio Feedback: A Case Study  

ERIC Educational Resources Information Center

The increase in the use of technology-enhanced learning in higher education has included a growing interest in new approaches to enhance the quality of feedback given to students. Audio feedback is one method that has become more popular, yet evaluating its role in feedback delivery is still an emerging area for research. This paper is based on a…

Hennessy, Claire; Forrester, Gillian

2014-01-01

346

Hybrid Quantum Cloning Machine  

E-print Network

In this work, we introduce a special kind of quantum cloning machine called Hybrid quantum cloning machine. The introduced Hybrid quantum cloning machine or transformation is nothing but a combination of pre-existing quantum cloning transformations. In this sense it creates its own identity in the field of quantum cloners. Hybrid quantum cloning machine can be of two types: (i) State dependent and (ii) State independent or Universal. We study here the above two types of Hybrid quantum cloning machines. Later we will show that the state dependent hybrid quantum-cloning machine can be applied on only four input states. We will also find in this paper another asymmetric universal quantum cloning machine constructed from the combination of optimal universal B-H quantum cloning machine and universal anti-cloning machine. The fidelities of the two outputs are different and their values lie in the neighborhood of ${5/6} $

Satyabrata Adhikari; A. K. Pati; Indranil Chakrabarty; B. S. Choudhury

2007-05-04

347

Extreme Kicking Machine  

NSDL National Science Digital Library

In this design challenge activity follow up to "Kicking Machine", learners add a hands-free feature to their Kicking Machine. Learners modify their kicking machine to have it either release the pendulum or rubber band while standing three feet away or have it automatically feed balls into the kicking machine, one after another. They must figure out how to integrate the new feature into the existing structure and use the design process to make sure the modified kicking machine works properly. Educators can use this activity to demonstrate simple machines and kinetic/potential energy.

Wgbh

2010-01-01

348

Transcript of Audio Narrative Portion of: Scandinavian Heritage. A Set of Five Audio-Visual Film Strip/Cassette Presentations.  

ERIC Educational Resources Information Center

The document presents the transcript of the audio narrative portion of approximately 100 interviews with first and second generation Scandinavian immigrants to the United States. The document is intended for use by secondary school classroom teachers as they develop and implement educational programs related to the Scandinavian heritage in…

Anderson, Gerald D.; Olson, David B.

349

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 10, NO. 5, JULY 2002 293 Musical Genre Classification of Audio Signals  

E-print Network

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 10, NO. 5, JULY 2002 293 Musical Genre genres are categorical labels created by hu- mans to characterize pieces of music. A musical genre are related to the instrumentation, rhythmic structure, and harmonic content of the music. Genre hi- erarchies

Tzanetakis, George

350

Machine Learning for Global Optimization  

E-print Network

global optimization, machine learning, support vector machines, space trajectory design. Introduction ...... LIBSVM: a library for support vector machines. (Software ... Boston, Dordrecht, London: Kluwer Academic Publishers. Mangasarian, O. L. ...

schoen,,,

351

Interactive Gumball Machine  

NSDL National Science Digital Library

In this activity, learners review the history of gumball machines and explore potential and kinetic energy, while working in teams to build a gumball slide. Teams then design and build their own interactive gumball machine.

Ieee

2013-08-30

352

The method of narrow-band audio classification based on universal noise background model  

NASA Astrophysics Data System (ADS)

Audio classification is the basis of content-based audio analysis and retrieval. The conventional classification methods mainly depend on feature extraction of audio clip, which certainly increase the time requirement for classification. An approach for classifying the narrow-band audio stream based on feature extraction of audio frame-level is presented in this paper. The audio signals are divided into speech, instrumental music, song with accompaniment and noise using the Gaussian mixture model (GMM). In order to satisfy the demand of actual environment changing, a universal noise background model (UNBM) for white noise, street noise, factory noise and car interior noise is built. In addition, three feature schemes are considered to optimize feature selection. The experimental results show that the proposed algorithm achieves a high accuracy for audio classification, especially under each noise background we used and keep the classification time less than one second.

Rui, Rui; Bao, Chang-chun

2013-03-01

353

Your Sewing Machine.  

ERIC Educational Resources Information Center

The programed instruction manual is designed to aid the student in learning the parts, uses, and operation of the sewing machine. Drawings of sewing machine parts are presented, and space is provided for the student's written responses. Following an introductory section identifying sewing machine parts, the manual deals with each part and its…

Peacock, Marion E.

354

Talking Vending Machines  

E-print Network

're in the central Tokyo Station and you're thirsty so you go to the, you know, vending machine to buy a drink. Only this is not your average vending machine. Uh uh. This vending machine has opinions on what beverage you want. Sensors built into large touch panels...

Hacker, Randi

2011-02-02

355

Millikelvin Lab Machine Shop  

E-print Network

Millikelvin Lab OP105­112 Machine Shop OP132 Resistive Magnet Shop CICC Winding Area Transformers This building is home to the Millikelvin lab, the control room, the resistive magnet and machine shops, the CICC@magnet.fsu.edu (850) 644-4378 (850) 644-0534 2 MACHINE SHOP OP132 Vaughan Williams (A114*) williams

McQuade, D. Tyler

356

Cable-Twisting Machine  

NASA Technical Reports Server (NTRS)

New cable-twisting machine is smaller and faster than many production units. Is useful mainly in production of short-run special cables. Already-twisted cable can be fed along axis of machine. Faster operation than typical industrial cable-twisting machines possible by using smaller spools of wire.

Kurnett, S.

1982-01-01

357

Circular tests for HSM machine tools: Bore machining application  

Microsoft Academic Search

Today's High-Speed Machining (HSM) machine tool combines productivity and part quality. The difficulty inherent in HSM operations lies in understanding the impact of machine tool behaviour on machining time and part quality. Analysis of some of the relevant ISO standards [230. Acceptance code for machine tools. Part 4, Circular tests for numerically controlled machine-tools, April 1998, 10791. Test conditions for

Laurent Tapie; Kwamiwi Mawussi; Bernard Anselmetti

2007-01-01

358

What is Machine Learning? About the Course Example Machine Learning  

E-print Network

What is Machine Learning? About the Course Example Machine Learning DD2431 ¨Orjan Ekeberg Oct­Dec, 2008 #12;What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition A Hypothetical Project #12;What is Machine Learning? About the Course Example 1 What is Machine Learning

Kjellström, Hedvig

359

What is Machine Learning? About the Course Example Machine Learning  

E-print Network

What is Machine Learning? About the Course Example Machine Learning DD2431 ¨Orjan Ekeberg Oct­Dec, 2007 What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition A Hypothetical Project What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition

Kjellström, Hedvig

360

What is Machine Learning? About the Course Example Machine Learning  

E-print Network

What is Machine Learning? About the Course Example Machine Learning DD2431 ¨Orjan Ekeberg Oct­Dec, 2008 What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition A Hypothetical Project What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition

Kjellström, Hedvig

361

What is Machine Learning? About the Course Example Machine Learning  

E-print Network

What is Machine Learning? About the Course Example Machine Learning DD2431 ¨Orjan Ekeberg Oct­Dec, 2007 #12;What is Machine Learning? About the Course Example 1 What is Machine Learning? Definition A Hypothetical Project #12;What is Machine Learning? About the Course Example 1 What is Machine Learning

Kjellström, Hedvig

362

Benchmarking the Connection Machine  

SciTech Connect

The purpose of this memo is to present the results of work done recently on the Connection Machine during a one-week visit by the authors to Thinking Machines Corporation. The main objectives of the trip were to benchmark the Connection Machine and to assess the software environment, in order to determine the suitability of the Connection Machine for solving computationally-intensive problems and to compare the Connection Machine with other highly-parallel computers. This report presents preliminary results, and describes the computer architecture.

Baillie, C.; Felten, E.; Walker, D.

1987-01-01

363

Edheads: The Compound Machine  

NSDL National Science Digital Library

This resource is a collection of interactive animations designed to help kids learn how forces and simple machines can work together to create the compound machine. Child-centered animated activities enhance understanding of how compound machines function and how they are differentiated from simple machines. Additionally the site includes a glossary of important terms, lesson plans, a teacher's guide and information about professionals who work with compound machines. This page is part of a larger collection of animated education resources for the elementary level.

2007-09-18

364

Using Audio Time Scale Modification for Video Browsing  

Microsoft Academic Search

In the IBM CueVideo TM project we study various aspects of fully automated video indexing, browsing and retrieval. The technical aspects include audio processing, speech recognition, image processing and information retrieval. Equally important, however, is exploring user expectations and conducting user studies. We focus on the field of video for Training and Education, including Distributed Learning, Remote Education, and Just-in-Time

Arnon Amir; Dulce B. Ponceleon; Brian Blanchard; Dragutin Petkovic; Savitha Srinivasan; G. Cohen

2000-01-01

365

Audio-Assisted Scene Segmentation for Story Browsing  

Microsoft Academic Search

Content-based video retrieval requires an eectiv e scene seg- mentation technique to divide a long video le into meaningful high-level aggregates of shots called scenes. Each scene is part of a story. Brows- ing these scenes unfolds the entire story of a lm. In this paper, we rst investigate recent scene segmentation techniques that belong to the visual-audio alignment approach.

Yu Cao; Wallapak Tavanapong; Kihwan Kim; Oh Jung-hwan

2003-01-01

366

Audio Signal Processing Based on Sinusoidal Analysis\\/Synthesis  

Microsoft Academic Search

Based on a sinusoidal model, an analysis\\/synthesis technique is developed that characterizes audio signals, such as speech\\u000a and music, in terms of the amplitudes, frequencies, and phases of the component sine waves. These parameters are estimated\\u000a by applying a peak-picking algorithm to the short-time Fourier transform of the input waveform. Rapid changes in the highly\\u000a resolved spectral components are tracked

T. F. Quatieri; R. J. Mcaulay

367

Improved phase vocoder time-scale modification of audio  

Microsoft Academic Search

The phase vocoder is a well established tool for time scaling and pitch shifting speech and audio signals via modification of their short-time Fourier transforms (STFTs). In contrast to time-domain time-scaling and pitch-shifting techniques, the phase vocoder is generally considered to yield high quality results, especially for large modification factors and\\/or polyphonic signals. However, the phase vocoder is also known

Jean Laroche; Mark Dolson

1999-01-01

368

An active development environment for structured audio performance and composition  

Microsoft Academic Search

Catnip Audio is a small group of musicians and coders from the UK, US, and who have joined to work on a new music program project titled ReTrack. ReTrack is a music entry system based initially on the trackers of the Internet music scene, but expanded to include a comprehensive set of MIDI functions as well as incorporate MPEG-4 Structured

M. Alchin

2000-01-01

369

Tactical audio and acoustic rendering in biomedical applications  

Microsoft Academic Search

Complexity of biomedical data requires novel so- phisticated analysis and presentation methods. Sonification is used as a new information display in augmented reality systems to overcome problems of existing human-computer interface (e.g., opaque or heavy head-mounted displays, slow computer graphics, etc.). A novel taxonomy of sonification methods and techniques is introduced. We present our experience with tactical audio and acoustic

Emil Jovanov; Kristen Wegner; Vlada Radivojevic; Dusan Starcevic; M. S. Quinn; Daniel B. Karron

1999-01-01

370

Market potential for interactive audio-visual media  

Microsoft Academic Search

NM2 (New Media for a New Millennium) develops tools for interactive, personalised and non-linear audio-visual content that will be tested in seven pilot productions. This paper looks at the market potential for these productions from a technological, a business and a users' perspective. It shows that digital broadcast networks and broadband Internet are rapidly growing in Europe, but will have

Andra Leurdijk; Sander Limonard

2005-01-01

371

An acoustic human-machine interface with multi-channel sound reproduction  

Microsoft Academic Search

For hands-free man-machine audio interfaces with multichannel sound reproduction and automatic speech recognition (ASR), sometimes both an acoustic echo canceller (AEC) and a beamforming (BF) microphone array are necessary for sufficient recognition rates. In the context of multimedia systems, multi-channel sound reproduction (e.g. stereo or 5.1 channel-surround systems) typically requires multi-channel AEC (M-C AEC). With M-C AEC being known to

H. Buchner; W. Kellermann

2001-01-01

372

Robust video and audio-based synchronization of multimedia files  

NASA Astrophysics Data System (ADS)

This paper addresses the problem of robust and automated synchronization of multiple audio and video signals. The input signals are from a set of independent multimedia recordings coming from several camcorders and microphones. While the camcorders are static, the microphones are mobile as they are attached to people. The motivation for synchronization of all signals is to support studies and understanding of human interaction in a decision support environment that have been limited so far due to the difficulties in automated processing of any observations during the decision making sessions. The application of our work is to environments supporting decisions. The data sets for this work have been acquired during training exercises of response teams, rescue workers, and fire fighters at multiple locations. The developed synchronization methodology for a set of independent multimedia recordings is based on introducing aural and visual landmarks with a bell and room light switches. Our approach to synchronization is based on detecting the landmarks in audio and video signals per camcorder and per microphone, and then fusing the results to increase robustness and accuracy of the synchronization. We report synchronization results that demonstrate accuracy of synchronization based on video and audio.

Raichel, Benjamin A.; Bajcsy, Peter

2010-02-01

373

Temporal Structure and Complexity Affect Audio-Visual Correspondence Detection  

PubMed Central

Synchrony between events in different senses has long been considered the critical temporal cue for multisensory integration. Here, using rapid streams of auditory and visual events, we demonstrate how humans can use temporal structure (rather than mere temporal coincidence) to detect multisensory relatedness. We find psychophysically that participants can detect matching auditory and visual streams via shared temporal structure for crossmodal lags of up to 200?ms. Performance on this task reproduced features of past findings based on explicit timing judgments but did not show any special advantage for perfectly synchronous streams. Importantly, the complexity of temporal patterns influences sensitivity to correspondence. Stochastic, irregular streams – with richer temporal pattern information – led to higher audio-visual matching sensitivity than predictable, rhythmic streams. Our results reveal that temporal structure and its complexity are key determinants for human detection of audio-visual correspondence. The distinctive emphasis of our new paradigms on temporal patterning could be useful for studying special populations with suspected abnormalities in audio-visual temporal perception and multisensory integration. PMID:23346067

Denison, Rachel N.; Driver, Jon; Ruff, Christian C.

2013-01-01

374

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 20, NO. 10, DECEMBER 2012 2657 Real-Time Perception-Based Clipping of Audio  

E-print Network

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 20, NO. 10, DECEMBER 2012 2657 which aims to explicitly minimize the perceptible clipping-induced distortion by embedding a convex), but could pos- sibly also damage the audio equipment. Thirdly, in music pro- duction applicati

375

Chapter 39. Piezoelectric fibers for sensing and actuation at ultrasonic and audio frequencies Piezoelectric fibers for sensing and actuation at ultrasonic and audio  

E-print Network

Chapter 39. Piezoelectric fibers for sensing and actuation at ultrasonic and audio frequencies 39-1 Piezoelectric fibers for sensing and actuation at ultrasonic and audio frequencies Project Staff Zheng Wang-based piezoelectric element that can be interfaced with external circuits in both transmitter and receiver

376

Perspex machine: VII. The universal perspex machine  

NASA Astrophysics Data System (ADS)

The perspex machine arose from the unification of projective geometry with the Turing machine. It uses a total arithmetic, called transreal arithmetic, that contains real arithmetic and allows division by zero. Transreal arithmetic is redefined here. The new arithmetic has both a positive and a negative infinity which lie at the extremes of the number line, and a number nullity that lies off the number line. We prove that nullity, 0/0, is a number. Hence a number may have one of four signs: negative, zero, positive, or nullity. It is, therefore, impossible to encode the sign of a number in one bit, as floating-point arithmetic attempts to do, resulting in the difficulty of having both positive and negative zeros and NaNs. Transrational arithmetic is consistent with Cantor arithmetic. In an extension to real arithmetic, the product of zero, an infinity, or nullity with its reciprocal is nullity, not unity. This avoids the usual contradictions that follow from allowing division by zero. Transreal arithmetic has a fixed algebraic structure and does not admit options as IEEE, floating-point arithmetic does. Most significantly, nullity has a simple semantics that is related to zero. Zero means "no value" and nullity means "no information." We argue that nullity is as useful to a manufactured computer as zero is to a human computer. The perspex machine is intended to offer one solution to the mind-body problem by showing how the computable aspects of mind and, perhaps, the whole of mind relates to the geometrical aspects of body and, perhaps, the whole of body. We review some of Turing's writings and show that he held the view that his machine has spatial properties. In particular, that it has the property of being a 7D lattice of compact spaces. Thus, we read Turing as believing that his machine relates computation to geometrical bodies. We simplify the perspex machine by substituting an augmented Euclidean geometry for projective geometry. This leads to a general-linear perspex-machine which is very much easier to program than the original perspex-machine. We then show how to map the whole of perspex space into a unit cube. This allows us to construct a fractal of perspex machines with the cardinality of a real-numbered line or space. This fractal is the universal perspex machine. It can solve, in unit time, the halting problem for itself and for all perspex machines instantiated in real-numbered space, including all Turing machines. We cite an experiment that has been proposed to test the physical reality of the perspex machine's model of time, but we make no claim that the physical universe works this way or that it has the cardinality of the perspex machine. We leave it that the perspex machine provides an upper bound on the computational properties of physical things, including manufactured computers and biological organisms, that have a cardinality no greater than the real-number line.

Anderson, James A. D. W.

2006-01-01

377

Machine Learning Group Machine Learning Group Machine Learning Group  

E-print Network

The deep Boltzmann machine is a powerful model that extracts the hierarchical structure of observed data. While inference is typically slow due to its undirected nature, we argue that the emerging feature hierarchy is still explicit enough to be traversed in a feedforward fashion. The claim is corroborated by training a set of deep neural networks on real data and measuring the evolution of the representation layer after layer. The analysis reveals that the deep Boltzmann machine produces a feed-forward hierarchy of increasingly invariant representations that clearly surpasses the layer-wise approach. 1

Grégoire Montavon; Mikio L. Braun; Klaus-robert Müller; Tu Berlin; Tu Berlin; Tu Berlin

378

Omnidirectional Audio-Visual Talker Localization Based on Dynamic Fusion of Audio-Visual Features Using Validity and Reliability Criteria  

NASA Astrophysics Data System (ADS)

This paper proposes a robust omnidirectional audio-visual (AV) talker localizer for AV applications. The proposed localizer consists of two innovations. One of them is robust omnidirectional audio and visual features. The direction of arrival (DOA) estimation using an equilateral triangular microphone array, and human position estimation using an omnidirectional video camera extract the AV features. The other is a dynamic fusion of the AV features. The validity criterion, called the audioor visual-localization counter, validates each audio- or visual-feature. The reliability criterion, called the speech arriving evaluator, acts as a dynamic weight to eliminate any prior statistical properties from its fusion procedure. The proposed localizer can compatibly achieve talker localization in a speech activity and user localization in a non-speech activity under the identical fusion rule. Talker localization experiments were conducted in an actual room to evaluate the effectiveness of the proposed localizer. The results confirmed that the talker localization performance of the proposed AV localizer using the validity and reliability criteria is superior to that of conventional localizers.

Denda, Yuki; Nishiura, Takanobu; Yamashita, Yoichi

379

Guidelines for the integration of audio cues into computer user interfaces  

SciTech Connect

Throughout the history of computers, vision has been the main channel through which information is conveyed to the computer user. As the complexities of man-machine interactions increase, more and more information must be transferred from the computer to the user and then successfully interpreted by the user. A logical next step in the evolution of the computer-user interface is the incorporation of sound and thereby using the sense of ''hearing'' in the computer experience. This allows our visual and auditory capabilities to work naturally together in unison leading to more effective and efficient interpretation of all information received by the user from the computer. This thesis presents an initial set of guidelines to assist interface developers in designing an effective sight and sound user interface. This study is a synthesis of various aspects of sound, human communication, computer-user interfaces, and psychoacoustics. We introduce the notion of an earcon. Earcons are audio cues used in the computer-user interface to provide information and feedback to the user about some computer object, operation, or interaction. A possible construction technique for earcons, the use of earcons in the interface, how earcons are learned and remembered, and the affects of earcons on their users are investigated. This study takes the point of view that earcons are a language and human/computer communication issue and are therefore analyzed according to the three dimensions of linguistics; syntactics, semantics, and pragmatics.

Sumikawa, D.A.

1985-06-01

380

High Capacity Reversible Watermarking for Audio by Histogram Shifting and Predicted Error Expansion  

PubMed Central

Being reversible, the watermarking information embedded in audio signals can be extracted while the original audio data can achieve lossless recovery. Currently, the few reversible audio watermarking algorithms are confronted with following problems: relatively low SNR (signal-to-noise) of embedded audio; a large amount of auxiliary embedded location information; and the absence of accurate capacity control capability. In this paper, we present a novel reversible audio watermarking scheme based on improved prediction error expansion and histogram shifting. First, we use differential evolution algorithm to optimize prediction coefficients and then apply prediction error expansion to output stego data. Second, in order to reduce location map bits length, we introduced histogram shifting scheme. Meanwhile, the prediction error modification threshold according to a given embedding capacity can be computed by our proposed scheme. Experiments show that this algorithm improves the SNR of embedded audio signals and embedding capacity, drastically reduces location map bits length, and enhances capacity control capability. PMID:25097883

Wang, Fei; Chen, Zuo

2014-01-01

381

The sawing machine is a machine SAWING MACHINES  

E-print Network

main parts, which are discussed below. The head is the large unit at the top of the contour machine), and the adjustable post which supports the upper saw guide. The job selector dial is also located on the head the speed in feet per minute (FPM). The butt welder is also mounted on the column. different positions

Gellman, Andrew J.

382

Machine tool locator  

DOEpatents

Machine tools can be accurately measured and positioned on manufacturing machines within very small tolerances by use of an autocollimator on a 3-axis mount on a manufacturing machine and positioned so as to focus on a reference tooling ball or a machine tool, a digital camera connected to the viewing end of the autocollimator, and a marker and measure generator for receiving digital images from the camera, then displaying or measuring distances between the projection reticle and the reference reticle on the monitoring screen, and relating the distances to the actual position of the autocollimator relative to the reference tooling ball. The images and measurements are used to set the position of the machine tool and to measure the size and shape of the machine tool tip, and examine cutting edge wear. patent

Hanlon, John A. (Los Alamos, NM); Gill, Timothy J. (Stanley, NM)

2001-01-01

383

Killer pop machines.  

PubMed

Abuse and misuse of soda vending machines has resulted in a considerable number of injuries and deaths. The machines fall forward when rocked or tilted and crush those in front. These accidents are all preventable by a simple and cheap device. There should be a law compelling safety requirements for these machines. This is a report of a recent case in which a young man was crushed to death. PMID:2329341

Spitz, D J; Spitz, W U

1990-03-01

384

Quantum Learning Machine  

E-print Network

We propose a novel notion of a quantum learning machine for automatically controlling quantum coherence and for developing quantum algorithms. A quantum learning machine can be trained to learn a certain task with no a priori knowledge on its algorithm. As an example, it is demonstrated that the quantum learning machine learns Deutsch's task and finds itself a quantum algorithm, that is different from but equivalent to the original one.

Jeongho Bang; James Lim; M. S. Kim; Jinhyoung Lee

2008-03-20

385

Algorithms for multiplex scheduling of object-based audio-visual presentations  

Microsoft Academic Search

With the emergence of standards such as MPEG-4, systems and applications using object- based audio-visual presentations will become more and more common. The main distinguishing feature of object-based audio-visual presentations is the scene composition at the user terminal. In this paper we discuss the problem of scheduling object-based audio- visual presentations with resource constraints. We explore the similarities with the

Hari Kalva; Alexandros Eleftheriadis

2004-01-01

386

Spatial Audio on the Web: Or Why Can't I hear Anything Over There?  

NASA Technical Reports Server (NTRS)

Auditory complexity, freedom of movement and interactivity is not always possible in a "true" virtual environment, much less in web-based audio. However, a lot of the perceptual and engineering constraints (and frustrations) that researchers, engineers and listeners have experienced in virtual audio are relevant to spatial audio on the web. My talk will discuss some of these engineering constraints and their perceptual consequences, and attempt to relate these issues to implementation on the web.

Wenzel, Elizabeth M.; Schlickenmaier, Herbert (Technical Monitor); Johnson, Gerald (Technical Monitor); Frey, Mary Anne (Technical Monitor); Schneider, Victor S. (Technical Monitor); Ahunada, Albert J. (Technical Monitor)

1997-01-01

387

Parallel Kinematic Machines (PKM)  

SciTech Connect

The purpose of this 3-year cooperative research project was to develop a parallel kinematic machining (PKM) capability for complex parts that normally require expensive multiple setups on conventional orthogonal machine tools. This non-conventional, non-orthogonal machining approach is based on a 6-axis positioning system commonly referred to as a hexapod. Sandia National Laboratories/New Mexico (SNL/NM) was the lead site responsible for a multitude of projects that defined the machining parameters and detailed the metrology of the hexapod. The role of the Kansas City Plant (KCP) in this project was limited to evaluating the application of this unique technology to production applications.

Henry, R.S.

2000-03-17

388

Asymmetric universal entangling machine  

E-print Network

We give a definition of asymmetric universal entangling machine which entangles a system in an unknown state to a specially prepared ancilla. The machine produces a fixed state-independent amount of entanglement in exchange to a fixed degradation of the system state fidelity. We describe explicitly such a machine for any quantum system having $d$ levels and prove its optimality. We show that a $d^2$-dimensional ancilla is sufficient for reaching optimality. The introduced machine is a generalization to a number of widely investigated universal quantum devices such as the symmetric and asymmetric quantum cloners, the symmetric quantum entangler, the quantum information distributor and the universal-NOT gate.

Dmitri Horoshko; Mikhail Kolobov; Sergei Kilin

2004-12-23

389

What Can Turing Machines Do? How powerful are Turing machines?  

E-print Network

1 What Can Turing Machines Do? How powerful are Turing machines? Turing machines can accept any regular or context- free language. Turing machines can perform basic arithmetic computations. Sequencing can be implemented on Turing ma- chines. Subroutine calls can be implemented on Turing machines

Bylander, Tom

390

Circular tests for HSM machine tools: Bore machining application  

E-print Network

Circular tests for HSM machine tools: Bore machining application Abstract: Today's High was ultimately validated on a bore machining simulation. Keywords: circular interpolation, HSM, kinematic machine machining features can be extracted [3] [4] such as bores, holes, drillings, tapered holes and planes

Paris-Sud XI, Université de

391

Analysis of Machining Stability for a Parallel Machine Tool  

Microsoft Academic Search

Machine tool chatter is a self-excited vibration generated by chip thickness variation. It severely degrades the quality of the machined surface. The incidence of chatter is greatly affected by the dynamic characteristics of machine tool structure. This article extends chatter stability analysis to a machine tool equipped with a parallel mechanism. The vibration model of a parallel machine tool is

D. Hong; S. Kim; W. C. Choi; J.-B. Song

2003-01-01

392

RICE UNIVERSITY The design and assessment of an accessible audio voting system  

E-print Network

RICE UNIVERSITY CHILVote: The design and assessment of an accessible audio voting system by Gillian, Psychology _______________________________ David M. Lane, Ph. D. Associate Professor, Psychology, Statistics

Byrne, Mike

393

Music and audio - oh how they can stress your network  

NASA Astrophysics Data System (ADS)

Nearly ten years ago a paper written by the Audio Engineering Society (AES)[1] made a number of interesting statements: 1. 2. The current Internet is inadequate for transmitting music and professional audio. Performance and collaboration across a distance stress beyond acceptable bounds the quality of service Audio and music provide test cases in which the bounds of the network are quickly reached and through which the defects in a network are readily perceived. Given these key points, where are we now? Have we started to solve any of the problems from the musician's point of view? What is it that musician would like to do that can cause the network so many problems? To understand this we need to appreciate that a trained musician's ears are extremely sensitive to very subtle shifts in temporal materials and localisation information. A shift of a few milliseconds can cause difficulties. So, can modern networks provide the temporal accuracy demanded at this level? The sample and bit rates needed to represent music in the digital domain is still contentious, but a general consensus in the professional world is for 96 KHz and IEEE 64-bit floating point. If this was to be run between two points on the network across 24 channels in near real time to allow for collaborative composition/production/performance, with QOS settings to allow as near to zero latency and jitter, it can be seen that the network indeed has to perform very well. Lighting the Blue Touchpaper for UK e-Science - Closing Conference of ESLEA Project The George Hotel, Edinburgh, UK 26-28 March, 200

Fletcher, R.

394

Diamond machine tool face lapping machine  

DOEpatents

An apparatus for shaping, sharpening and polishing diamond-tipped single-point machine tools. The isolation of a rotating grinding wheel from its driving apparatus using an air bearing and causing the tool to be shaped, polished or sharpened to be moved across the surface of the grinding wheel so that it does not remain at one radius for more than a single rotation of the grinding wheel has been found to readily result in machine tools of a quality which can only be obtained by the most tedious and costly processing procedures, and previously unattainable by simple lapping techniques.

Yetter, H.H.

1985-05-06

395

Lost Audio Packets Steganography: The First Practical Evaluation  

E-print Network

This paper presents first experimental results for an IP telephony-based steganographic method called LACK (Lost Audio PaCKets steganography). This method utilizes the fact that in typical multimedia communication protocols like RTP (Real-Time Transport Protocol), excessively delayed packets are not used for the reconstruction of transmitted data at the receiver, i.e. these packets are considered useless and discarded. The results presented in this paper were obtained basing on a functional LACK prototype and show the method's impact on the quality of voice transmission. Achievable steganographic bandwidth for the different IP telephony codecs is also calculated.

Mazurczyk, Wojciech

2011-01-01

396

Incorporating Auditory Models in Speech/Audio Applications  

NASA Astrophysics Data System (ADS)

Following the success in incorporating perceptual models in audio coding algorithms, their application in other speech/audio processing systems is expanding. In general, all perceptual speech/audio processing algorithms involve minimization of an objective function that directly/indirectly incorporates properties of human perception. This dissertation primarily investigates the problems associated with directly embedding an auditory model in the objective function formulation and proposes possible solutions to overcome high complexity issues for use in real-time speech/audio algorithms. Specific problems addressed in this dissertation include: 1) the development of approximate but computationally efficient auditory model implementations that are consistent with the principles of psychoacoustics, 2) the development of a mapping scheme that allows synthesizing a time/frequency domain representation from its equivalent auditory model output. The first problem is aimed at addressing the high computational complexity involved in solving perceptual objective functions that require repeated application of auditory model for evaluation of different candidate solutions. In this dissertation, a frequency pruning and a detector pruning algorithm is developed that efficiently implements the various auditory model stages. The performance of the pruned model is compared to that of the original auditory model for different types of test signals in the SQAM database. Experimental results indicate only a 4-7% relative error in loudness while attaining up to 80-90 % reduction in computational complexity. Similarly, a hybrid algorithm is developed specifically for use with sinusoidal signals and employs the proposed auditory pattern combining technique together with a look-up table to store representative auditory patterns. The second problem obtains an estimate of the auditory representation that minimizes a perceptual objective function and transforms the auditory pattern back to its equivalent time/frequency representation. This avoids the repeated application of auditory model stages to test different candidate time/frequency vectors in minimizing perceptual objective functions. In this dissertation, a constrained mapping scheme is developed by linearizing certain auditory model stages that ensures obtaining a time/frequency mapping corresponding to the estimated auditory representation. This paradigm was successfully incorporated in a perceptual speech enhancement algorithm and a sinusoidal component selection task.

Krishnamoorthi, Harish

397

Lecture outline Support vector machines  

E-print Network

Lecture outline · Support vector machines #12;Support Vector Machines · Find a linear hyperplane (decision boundary) that will separate the data #12;Support Vector Machines · One Possible Solution #12;Support Vector Machines · Another possible solution #12;Support Vector Machines · Other possible solutions

Terzi, Evimaria

398

Lecture outline Support vector machines  

E-print Network

Lecture outline · Support vector machines #12;Support Vector Machines · Find a linear hyperplane (decision boundary) that will separate the data #12;Support Vector Machines · One Possible Solution B1 #12;Support Vector Machines · Another possible solution B2 #12;Support Vector Machines · Other possible

Terzi, Evimaria

399

14. Interior, Machine Shop, Roundhouse Machine Shop Extension, Southern Pacific ...  

Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

14. Interior, Machine Shop, Roundhouse Machine Shop Extension, Southern Pacific Railroad Carlin Shops, view to north (90mm lens). - Southern Pacific Railroad, Carlin Shops, Roundhouse Machine Shop Extension, Foot of Sixth Street, Carlin, Elko County, NV

400

Technique for Machining Glass  

NASA Technical Reports Server (NTRS)

Process for machining glass with conventional carbide tools requires a small quantity of a lubricant for aluminum applied to area of glass to be machined. A carbide tool is then placed against workpiece with light pressure. Tool is raised periodically to clear work of glass dust and particles. Additional lubricant is applied as it is displaced.

Rice, S. H.

1982-01-01

401

Imagining machines with imagination  

Microsoft Academic Search

The author reviews and comments upon George Morton's predictive paper “Machines with Imagination”. He argues that Morton's paper demonstrates a significant insight into the progress of machine intelligence, particularly because that progress was hardly accomplished in a linear progression of systematic advances. Indeed, the forecasts laid down in Morton's paper have become pertinent only after a series of relative failures

DAVID B. FOGEL

2000-01-01

402

Stirling machine operating experience  

SciTech Connect

Numerous Stirling machines have been built and operated, but the operating experience of these machines is not well known. It is important to examine this operating experience in detail, because it largely substantiates the claim that stirling machines are capable of reliable and lengthy operating lives. The amount of data that exists is impressive, considering that many of the machines that have been built are developmental machines intended to show proof of concept, and are not expected to operate for lengthy periods of time. Some Stirling machines (typically free-piston machines) achieve long life through non-contact bearings, while other Stirling machines (typically kinematic) have achieved long operating lives through regular seal and bearing replacements. In addition to engine and system testing, life testing of critical components is also considered. The record in this paper is not complete, due to the reluctance of some organizations to release operational data and because several organizations were not contacted. The authors intend to repeat this assessment in three years, hoping for even greater participation.

Ross, B. [Stirling Technology Co., Richland, WA (United States); Dudenhoefer, J.E. [Lewis Research Center, Cleveland, OH (United States)

1994-09-01

403

Simple Machines Made Simple.  

ERIC Educational Resources Information Center

Simple machines have become a lost point of study in elementary schools as teachers continue to have more material to cover. This manual provides hands-on, cooperative learning activities for grades three through eight concerning the six simple machines: wheel and axle, inclined plane, screw, pulley, wedge, and lever. Most activities can be…

St. Andre, Ralph E.

404

Cellular Multihead Turing Machine.  

National Technical Information Service (NTIS)

The Turing machine is a mathematical model for describing procedures. It can do any calculation that can be done by modern day computers. One disadvantage of the Turing machine is its very slow speed. One way to increase the speed is to increase the numbe...

L. Librizzi

1970-01-01

405

The Chemical Abstract Machine  

Microsoft Academic Search

We introduce a new kind of abstract machine based on the chemical metaphor used in the ? language of Banâtre & al. States of a machine are chemical solutions where floating molecules can interact according to reaction rules. Solutions can be stratified by encapsulating subsolutions within membranes that force reactions to occur locally. We illustrate the use of this model

Gérard Berry; Gérard Boudol

1990-01-01

406

From Reduction Machines To Narrowing Machines  

Microsoft Academic Search

Narrowing, the evaluation mechanism of functional logic languages, can be seen as a generalizationof reduction, the evaluation mechanism of purely functional languages. The unidirectionalpattern matching, which is used for parameter passing in functional languages, is simplyreplaced by the bidirectional unification known from logic programming languages. We showin this paper, how to extend a reduction machine, that has been designed for

Rita Loogen; RWTH Aachen

1991-01-01

407

Audio-tactile integration and the influence of musical training.  

PubMed

Perception of our environment is a multisensory experience; information from different sensory systems like the auditory, visual and tactile is constantly integrated. Complex tasks that require high temporal and spatial precision of multisensory integration put strong demands on the underlying networks but it is largely unknown how task experience shapes multisensory processing. Long-term musical training is an excellent model for brain plasticity because it shapes the human brain at functional and structural levels, affecting a network of brain areas. In the present study we used magnetoencephalography (MEG) to investigate how audio-tactile perception is integrated in the human brain and if musicians show enhancement of the corresponding activation compared to non-musicians. Using a paradigm that allowed the investigation of combined and separate auditory and tactile processing, we found a multisensory incongruency response, generated in frontal, cingulate and cerebellar regions, an auditory mismatch response generated mainly in the auditory cortex and a tactile mismatch response generated in frontal and cerebellar regions. The influence of musical training was seen in the audio-tactile as well as in the auditory condition, indicating enhanced higher-order processing in musicians, while the sources of the tactile MMN were not influenced by long-term musical training. Consistent with the predictive coding model, more basic, bottom-up sensory processing was relatively stable and less affected by expertise, whereas areas for top-down models of multisensory expectancies were modulated by training. PMID:24465675

Kuchenbuch, Anja; Paraskevopoulos, Evangelos; Herholz, Sibylle C; Pantev, Christo

2014-01-01

408

Effects of stimulus duration on audio-visual synchrony perception.  

PubMed

The integration of visual and auditory inputs in the human brain occurs only if the components are perceived in temporal proximity, that is, when the intermodal time difference falls within the so-called subjective synchrony range. We used the midpoint of this range to estimate the point of subjective simultaneity (PSS). We measured the PSS for audio-visual (AV) stimuli in a synchrony judgment task, in which subjects had to judge a given AV stimulus using three response categories (audio first, synchronous, video first). The relevant stimulus manipulation was the duration of the auditory and visual components. Results for unimodal auditory and visual stimuli have shown that the perceived onset shifts to relatively later positions with increasing stimulus duration. These unimodal shifts should be reflected in changing PSS values, when AV stimuli with different durations of the auditory and visual components are used. The results for 17 subjects showed indeed a significant shift of the PSS for different duration combinations of the stimulus components. Because the shifts were approximately equal for duration changes in either of the components, no net shift of the PSS was observed as long as the durations of the two components were equal. This result indicates the need to appropriately account for unimodal timing effects when quantifying intermodal synchrony perception. PMID:22821079

Kuling, I A; van Eijk, R L J; Juola, J F; Kohlrausch, A

2012-09-01

409

Audio-Tactile Integration and the Influence of Musical Training  

PubMed Central

Perception of our environment is a multisensory experience; information from different sensory systems like the auditory, visual and tactile is constantly integrated. Complex tasks that require high temporal and spatial precision of multisensory integration put strong demands on the underlying networks but it is largely unknown how task experience shapes multisensory processing. Long-term musical training is an excellent model for brain plasticity because it shapes the human brain at functional and structural levels, affecting a network of brain areas. In the present study we used magnetoencephalography (MEG) to investigate how audio-tactile perception is integrated in the human brain and if musicians show enhancement of the corresponding activation compared to non-musicians. Using a paradigm that allowed the investigation of combined and separate auditory and tactile processing, we found a multisensory incongruency response, generated in frontal, cingulate and cerebellar regions, an auditory mismatch response generated mainly in the auditory cortex and a tactile mismatch response generated in frontal and cerebellar regions. The influence of musical training was seen in the audio-tactile as well as in the auditory condition, indicating enhanced higher-order processing in musicians, while the sources of the tactile MMN were not influenced by long-term musical training. Consistent with the predictive coding model, more basic, bottom-up sensory processing was relatively stable and less affected by expertise, whereas areas for top-down models of multisensory expectancies were modulated by training. PMID:24465675

Kuchenbuch, Anja; Paraskevopoulos, Evangelos; Herholz, Sibylle C.; Pantev, Christo

2014-01-01

410

A compact electroencephalogram recording device with integrated audio stimulation system.  

PubMed

A compact (96 x 128 x 32 mm(3), 374 g), battery-powered, eight-channel electroencephalogram recording device with an integrated audio stimulation system and a wireless interface is presented. The recording device is capable of producing high-quality data, while the operating time is also reasonable for evoked potential studies. The effective measurement resolution is about 4 nV at 200 Hz sample rate, typical noise level is below 0.7 microV(rms) at 0.16-70 Hz, and the estimated operating time is 1.5 h. An embedded audio decoder circuit reads and plays wave sound files stored on a memory card. The activities are controlled by an 8 bit main control unit which allows accurate timing of the stimuli. The interstimulus interval jitter measured is less than 1 ms. Wireless communication is made through bluetooth and the data recorded are transmitted to an external personal computer (PC) interface in real time. The PC interface is implemented with LABVIEW and in addition to data acquisition it also allows online signal processing, data storage, and control of measurement activities such as contact impedance measurement, for example. The practical application of the device is demonstrated in mismatch negativity experiment with three test subjects. PMID:20590254

Paukkunen, Antti K O; Kurttio, Anttu A; Leminen, Miika M; Sepponen, Raimo E

2010-06-01

411

On parallel machine scheduling 1  

E-print Network

On parallel machine scheduling 1 machines with setup times. The setup has to be performed by a single server. The objective is to minimize even for the case of two identical parallel machines. This paper presents a pseudopolynomial

Magdeburg, Universität

412

2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics October 16-19, 2011, New Paltz, NY MODEL BASED MULTIPLE AUDIO SEQUENCE ALIGNMENT  

E-print Network

. There exist robust audio fingerprinting methodologies that achieve high matching performances under very noisy in genetics that is called shotgun sequencing where the long DNA strands are assembled from shorter sequences

Cemgil, A. Taylan

413

Increasing Robustness of LSB Audio Steganography Using a Novel Embedding Method  

Microsoft Academic Search

In this paper, we present a novel high bit rate LSB audio watermarking method. The basic idea of the proposed LSB algorithm is watermark embedding that causes minimal embedding distortion of the host audio. Using the proposed two-step algorithm, watermark bits are embedded into higher LSB layers, resulting in increased robustness against noise addition or MPEG compression. Listening tests showed

Nedeljko Cvejic; Tapio Seppänen

2004-01-01

414

On the human ability to discriminate audio ambiances from similar locations of an urban environment  

E-print Network

on visual aspects only. Ex- amples include the online services Nokia Maps 3D [14] and Google Street View [6 and movies the audio is effectively used for conveying more in- formation about the environment to the user location-based systems augmented with audio ambiances, it would be cost- effective to use a few

Virtanen, Tuomas

415

Gesture Interaction with Spatial Audio Displays: Effects of Target Size and Inter-Target Separation  

Microsoft Academic Search

This paper presents the results of an experiment comparing two spatial audio display segmentation techniques by investi- gating the relative salience of target width versus distance to target in a gesture based spatial audio selection task. The first technique, MINIMAL, occupies as little of the display area as possible with sounds placed as close to each other as possible. The

Georgios Marentakis; Stephen A. Brewster

2005-01-01

416

"Listen to This!" Utilizing Audio Recordings to Improve Instructor Feedback on Writing in Mathematics  

ERIC Educational Resources Information Center

Providing audio files in lieu of written remarks on graded assignments is arguably a more effective means of feedback, allowing students to better process and understand the critique and improve their future work. With emerging technologies and software, this audio feedback alternative to the traditional paradigm of providing written comments…

Weld, Christopher

2014-01-01

417

IEEE SIGNAL PROCESSING MAGAZINE [84] MARCH 2009 he field of audio forensics involves many topics  

E-print Network

IEEE SIGNAL PROCESSING MAGAZINE [84] MARCH 2009 T he field of audio forensics involves many topics- falls encountered by signal processing experts when dealing with typical forensic material due processing research field. HISTORY Over the preceding 40 years, forensic audio examination has gradually

Maher, Robert C.

418

Phantom Materialization: A Novel Method to Enhance Stereo Audio Reproduction on Headphones  

Microsoft Academic Search

EDICs : AUD-SMCA Abstract— Loudspeaker reproduction systems are sub- ject to a compromise between spatial realism and cost. By simulating loudspeaker reproduction on headphones, the resulting spatial realism is limited accordingly, despite the virtually unlimited spatial imaging capabilities of binaural audio rendering technology. More particularly, phantom imaging as often used for stereo audio mate- rial intended for loudspeaker reproduction is

Jeroen Breebaart; Erik Schuijers

2008-01-01

419

Multimedia Information Retrieval: MIDI as a format for Content Based Retrieval of Audio  

E-print Network

1 Multimedia Information Retrieval: MIDI as a format for Content Based Retrieval of Audio John Mc in the arena of Multimedia Information Retrieval. MIDI is an encoding format which encodes and relays. Index terms: Multimedia Information Retrieval, MIDI, Content­Based Retrieval of Audio. 1 INTRODUCTION

Smeaton, Alan F.

420

A 3D audio only interactive Web browser: using spatialization to convey hypermedia document structure  

Microsoft Academic Search

Interactive audio browsers provide both sighted and visually impaired users with access to the WWW. In addition to the desktop PC, audio browsing technology can be deployed that enable users to browse the WWW using a telephone or while driving a car. This paper describes a new conceptual model of the HTML document structure and its mapping to a 3D

Stuart Goose; Carsten Möller

1999-01-01

421

Reaching Out: The Role of Audio Cassette Communication in Rural Development. Occasional Paper 19.  

ERIC Educational Resources Information Center

This report describes the state-of-the-art of audio cassette technology (ACT) and reports findings from field tests, case studies, and pilot projects in several countries which demonstrate the potential of audio cassettes as a medium for communicating with rural people. Specific guidance is also offered on how a project can use cassettes as a…

Adhikarya, Ronny; Colle, Royal D.

422

AUDIO-BASED COVER SONG RETRIEVAL USING APPROXIMATE CHORD SEQUENCES: TESTING SHIFTS, GAPS, SWAPS AND BEATS  

Microsoft Academic Search

This paper presents a variation on the theme of using string alignment for MIR in the context of cover song identifi- cation in audio collections. Here, the strings are derived from audio by means of HMM-based chord estimation. The characteristics of the cover-song ID problem and the nature of common chord estimation errors are carefully considered. As a result strategies

Juan Pablo Bello

2007-01-01

423

HOMOGENEOUS SEGMENTATION AND CLASSIFIER ENSEMBLE FOR AUDIO TAG ANNOTATION AND RETRIEVAL  

E-print Network

for recommendation. Music tags are free text labels associated with artists, genre, emotion, mood, instruments, etc: {hungyi, asriver, whm}@iis.sinica.edu.tw ABSTRACT Audio tags describe different types of musical information such as genre, mood, and instrument. This paper aims to automatically annotate audio clips

Wang, Hsin-Min

424

Unsupervised feature learning for audio classification using convolutional deep belief networks  

Microsoft Academic Search

In recent years, deep learning approaches have gained significant interest as a way of building hierarchical representations from unlabeled data. However, to our knowledge, these deep learning approaches have not been extensively stud- ied for auditory data. In this paper, we apply convolutional deep belief net- works to audio data and empirically evaluate them on various audio classification tasks. In

Honglak Lee; Yan Largman; Peter Pham; Andrew Y. Ng

425

Guess Who's Speaking: Audio Segmentation for the Automated Transcription of Large Spoken Archives  

Microsoft Academic Search

This paper addresses the influence of audio segmentation on automatic transcription accuracy for large spoken archives. The work forms part of the ongoing MALACH project, which is developing advanced techniques for supporting access to the world's largest digital archive of video oral histories, collected by VHF from over 52000 survivors and witnesses of the Holo- caust. We present several audio-only

Bhuvana Ramabhadran; Jing Huang; Upendra Chaudhari; Giridharan Iyengar; Harriet J. Nock

426

Report to the Legislature: Audio-Digital MCAS Pilot Program. Line Item 7061-0012  

ERIC Educational Resources Information Center

This paper presents the Final Report on the Audio-Digital MCAS Pilot Program. The Department and Recording For the Blind & Dyslexic (RFB&D) have collaborated to provide audio-digital read-aloud editions of the Grade 10 English Language Arts and Mathematics MCAS tests for a small number of students with disabilities such as dyslexia and/or vision…

Massachusetts Department of Elementary and Secondary Education, 2008

2008-01-01

427

Detecting proximity from personal audio recordings Daniel P. W. Ellis1  

E-print Network

' daily lives and activities. Smartphones already include high-quality microphones and audio input elec to continue operating as a phone. Of the many possible applications of continuous personal audio stream radius every 5 min. They used this data to map the daily behavior patterns and social networks

Ellis, Dan

428

WP9.06 Skew Detection and Compensation for Internet Audio Applications  

Microsoft Academic Search

Long lived audio streams, such as music broadcasts, and small differences in clock rates lead to buffer underflo w or overflo w events in receiving applications that manifest themselves as au- dible interruptions. We present a low complexity algorithm for detecting clock skew in network audio applications that function with local clocks and in the absence of a synchronization mech-

Orion Hodson; Colin Perkins; Vicky Hardman

2000-01-01

429

INTRODUCING A SIMPLE FUSION FRAMEWORK FOR AUDIO SOURCE SEPARATION Xabier Jaureguiberry  

E-print Network

INTRODUCING A SIMPLE FUSION FRAMEWORK FOR AUDIO SOURCE SEPARATION Xabier Jaureguiberry , Ga fusion framework for un- derdetermined audio source separation. This framework can be applied to a wide variety of source separation algorithms providing that they estimate time-frequency masks. Fusion

Boyer, Edmond

430

Using Audio-Visual Study Lessons to Teach the Unprepared Student  

ERIC Educational Resources Information Center

Describes supplementary audio-visual lessons in inorganic chemistry designed to correct for the wide diversity in educational background by giving the poorly-prepared student additional help in mastering subject matter. In ranking various study aids for their effectiveness, 88 percent of the students ranked audio-visual tapes as most effective.…

Suter, Patricia H.

1977-01-01

431

Reduced Rate Ultra Low Delay Audio Coder using Multistage Vector Quantization  

Microsoft Academic Search

Communication applications are usually delay restricted, especially for the instance of musicians playing over the Internet. This requires a one-way delay of maximum 25 msec and also a high audio quality is desired at feasible bit rates. The ultra low delay (ULD) audio coding structure is well suited to this application and we investigate further the application of multistage vector

T. V. Sreenivas; S. Wabnik; G. Schuller

2007-01-01

432

Packet Loss Concealment for Multichannel Audio Using the Multiband Source/Filter Model  

E-print Network

streaming through the Internet or wireless channels. 1 Introduction Multichannel audio recordings offer multichannel audio coding methods, our approach is suitable for applications such as distributed musicians col streaming through the Internet or wireless channels. In such cases, it is often possible that some packets

Tsakalides, Panagiotis

433

Real-Time Streaming of Multichannel Audio Data over Internet Aoxiang Xu1  

E-print Network

1 Real-Time Streaming of Multichannel Audio Data over Internet Aoxiang Xu1 , Wieslaw Woszczyk2 at McGill University, was transmitted to an audience at New York University, over the Internet. While Internet streaming audio technologies have been in use for several years, what made this event unique

Cooperstock, Jeremy R.

434

Adaptive Signal Models: Theory, Algorithms, and Audio Applications Michael Mark Goodwin  

E-print Network

Adaptive Signal Models: Theory, Algorithms, and Audio Applications by Michael Mark Goodwin S dissertation of Michael Mark Goodwin is approved: Chair Date Date Date University of California, Berkeley Fall Mark Goodwin #12;#12;1 Abstract Adaptive Signal Models: Theory, Algorithms, and Audio Applications

435

Adaptive Signal Models: Theory, Algorithms, and Audio Applications Michael Mark Goodwin  

E-print Network

Adaptive Signal Models: Theory, Algorithms, and Audio Applications by Michael Mark Goodwin S; The dissertation of Michael Mark Goodwin is approved: Chair Date Date Date University of California, Berkeley Fall Mark Goodwin #12; #12; 1 Abstract Adaptive Signal Models: Theory, Algorithms, and Audio Applications

436

Chalk Sounds: The Effects of Dynamic Synthesized Audio on Workspace Awareness in Distributed Groupware  

E-print Network

Workspace awareness ­ the up-to-the-moment knowledge of who is in a shared space, where they are, and what that can overcome the drawbacks of visual awareness displays: audio takes no space and does not needChalk Sounds: The Effects of Dynamic Synthesized Audio on Workspace Awareness in Distributed

Williamson, John

437

Audio Music Genre Classification Using Different Classifiers and Feature Selection Methods  

E-print Network

Audio Music Genre Classification Using Different Classifiers and Feature Selection Methods Yusuf classifiers on different audio feature sets to determine the genre of a given music piece. For each classifier genre set of 1000 music pieces. This performance is better than 71.1 ± 7.3% which is the best that has

Cataltepe, Zehra

438

Integration of Text and Audio Features for Genre Classification in Music Information  

E-print Network

Integration of Text and Audio Features for Genre Classification in Music Information Retrieval the use of textual data on top of low level audio features for music genre classification. Further, we musical genres, such as e.g. Christmas candles, can only be detected by including textual features

Rauber,Andreas

439

Polycom Ceiling Microphone Array Extraordinary room coverage with superior audio pickup  

E-print Network

as our tabletop microphone arrays. Unobtrusively, the microphone elements are located in the room whileDATA SHEET Polycom� Ceiling Microphone Array Extraordinary room coverage with superior audio pickup Polycom's digital microphone array is the only audio pickup device for video conferencing systems

Lu, Chang

440

ASSESSMENT OF AUDIO FEATURES FOR AUTOMATIC COUGH DETECTION Thomas Drugman, Jerome Urbain, Thierry Dutoit  

E-print Network

ASSESSMENT OF AUDIO FEATURES FOR AUTOMATIC COUGH DETECTION Thomas Drugman, Jerome Urbain, Thierry://tcts.fpms.ac.be/drugman/ ABSTRACT This paper addresses the issue of cough detection using only audio recordings, with the ultimate. 1. INTRODUCTION For children as well as for adults, cough is in pneumology the commonest syndrom

Dupont, Stéphane

441

TOWARDS ALIGNMENT OF SCORE AND AUDIO RECORDINGS OF OTTOMAN-TURKISH MAKAM MUSIC  

E-print Network

in the score and a audio prominent pitch from the audio recording. Then it identifies the performed tonic the state-of-the-art performs in makam musics. The results show that our method is able to handle the tonic or even entire sections. It is also common to trans- pose the tonic of a performance due to instrument

442

Energetic and informational masking effects in an audio-visual speech recognition system.  

E-print Network

1 Energetic and informational masking effects in an audio-visual speech recognition system. Jon Barker and Xu Shao Abstract The paper presents a robust audio-visual speech recognition technique called on a small vocabulary simultaneous speech recognition task in conditions that promote two contrasting types

Barker, Jon

443

Sensor and Data Systems, Audio-Assisted Cameras and Acoustic Doppler Kaustubh Kalgaonkar  

E-print Network

Sensor and Data Systems, Audio-Assisted Cameras and Acoustic Doppler Sensors Kaustubh Kalgaonkar for sens- ing and surveillance ­ audio-assisted cameras and acoustic Doppler sensors for gait recognition density for the vector (component-wise) ratio of the magnitude spectra of the signals captured. We augment

Zhu, Zhigang

444

Extraction of Pinna Features for Customized Binaural Audio Delivery on Mobile Devices  

E-print Network

Extraction of Pinna Features for Customized Binaural Audio Delivery on Mobile Devices Simone@dei.unipd.it ABSTRACT The paper presents a system for customized binaural au- dio delivery based on the extraction. The purpose of the desired system is to pro- vide low-tech custom binaural audio to any user without the need

Avanzini, Federico

445

CUAVE: A new audio-visual database for multimodal human-computer interface research  

Microsoft Academic Search

Multimodal signal processing has become an important topic of research for overcoming certain problems of audio-only speech processing. Audio-visual speech recognition is one area with great potential. Difficulties due to background noise and multiple speakers are significantly reduced by the additional information provided by extra visual features. Despite a few efforts to create databases in this area, none has emerged

E. K. Patterson; S. Gurbuz; Z. Tufekci; J. N. Gowdy

2002-01-01

446

3D Audio Perception System for Humanoid Robots Norbert Schmitz, Carsten Spranger, Karsten Berns  

E-print Network

3D Audio Perception System for Humanoid Robots Norbert Schmitz, Carsten Spranger, Karsten Berns as attention system for the vision system. In this paper the audio perception module of the robot RO- MAN system. Often persons hear a sound like the shutting of a door and turn toward the sound source to see

Berns, Karsten

447

HMM based structuring of tennis videos using visual and audio cues  

Microsoft Academic Search

This paper focuses on the use of hidden Markov models (HMMs) for structure analysis of videos, and demonstrates how they can be efficiently applied to merge audio and visual cues. Our approach is validated in the particular domain of tennis videos. The basic temporal unit is the video shot. Visual features describe the audio events within a video shot. The

E. Kijak; G. Gravier; P. Gros; L. Oisel; F. Bimbot

2003-01-01

448

ISSN0249-0803ISRNINRIA/RT--0401--FR+ENG Audio, Speech, and Language Processing  

E-print Network

of phoneme variants as a parameter. As no database of infant-directed speech1 containing rich phoneticapport technique ISSN0249-0803ISRNINRIA/RT--0401--FR+ENG Audio, Speech, and Language Processing 39 63 53 30 A note on the generation of allophonic rules Luc Boruta Theme : Audio, Speech

Paris-Sud XI, Université de

449

ISSN0249-0803ISRNINRIA/RT--0401--FR+ENG Audio, Speech, and Language Processing  

E-print Network

to vary the number of phoneme variants as a parameter. As no database of infant-directed speech1apport technique ISSN0249-0803ISRNINRIA/RT--0401--FR+ENG Audio, Speech, and Language Processing on the generation of allophonic rules Luc Boruta Theme : Audio, Speech, and Language Processing Perception

Paris-Sud XI, Université de

450

A Portable Audio/Video Recorder for Longitudinal Study of Child Development  

E-print Network

A Portable Audio/Video Recorder for Longitudinal Study of Child Development Soroush Vosoughi MIT- istic, longitudinal recordings of child development. Categories and Subject Descriptors H.5.2 [User, privacy man- agement, video recorder, audio recorder, child development 1. INTRODUCTION Collection

Roy, Deb

451

Professional Development and Audio-Assisted Independent Study in Mental Health.  

ERIC Educational Resources Information Center

Discusses a continuing education program which incorporates effective staff development techniques and utilization of instructional resources. Considers use of audio tapes in audio-assisted independent study as an appropriate delivery system, emphasizing staff growth and improvement as a community service function jointly planned with other…

Geib, Peter; McMeen, George R.

1984-01-01

452

FINGERPRINTING TO IDENTIFY REPEATED SOUND EVENTS IN LONG-DURATION PERSONAL AUDIO RECORDINGS  

E-print Network

, Acoustic signal analysis, Fingerprint identification. 1. INTRODUCTION Recent technology advances in digitalFINGERPRINTING TO IDENTIFY REPEATED SOUND EVENTS IN LONG-DURATION PERSONAL AUDIO RECORDINGS James P for this data is a challenge; in this paper we investigate the use of an audio fingerprinting technique

Ellis, Dan

453

The Basic Anaesthesia Machine  

PubMed Central

After WTG Morton's first public demonstration in 1846 of use of ether as an anaesthetic agent, for many years anaesthesiologists did not require a machine to deliver anaesthesia to the patients. After the introduction of oxygen and nitrous oxide in the form of compressed gases in cylinders, there was a necessity for mounting these cylinders on a metal frame. This stimulated many people to attempt to construct the anaesthesia machine. HEG Boyle in the year 1917 modified the Gwathmey's machine and this became popular as Boyle anaesthesia machine. Though a lot of changes have been made for the original Boyle machine still the basic structure remains the same. All the subsequent changes which have been brought are mainly to improve the safety of the patients. Knowing the details of the basic machine will make the trainee to understand the additional improvements. It is also important for every practicing anaesthesiologist to have a thorough knowledge of the basic anaesthesia machine for safe conduct of anaesthesia. PMID:24249876

Gurudatt, CL

2013-01-01

454

DNA-Based Machines.  

PubMed

The base sequence in nucleic acids encodes substantial structural and functional information into the biopolymer. This encoded information provides the basis for the tailoring and assembly of DNA machines. A DNA machine is defined as a molecular device that exhibits the following fundamental features. (1) It performs a fuel-driven mechanical process that mimics macroscopic machines. (2) The mechanical process requires an energy input, "fuel." (3) The mechanical operation is accompanied by an energy consumption process that leads to "waste products." (4) The cyclic operation of the DNA devices, involves the use of "fuel" and "anti-fuel" ingredients. A variety of DNA-based machines are described, including the construction of "tweezers," "walkers," "robots," "cranes," "transporters," "springs," "gears," and interlocked cyclic DNA structures acting as reconfigurable catenanes, rotaxanes, and rotors. Different "fuels", such as nucleic acid strands, pH (H(+)/OH(-)), metal ions, and light, are used to trigger the mechanical functions of the DNA devices. The operation of the devices in solution and on surfaces is described, and a variety of optical, electrical, and photoelectrochemical methods to follow the operations of the DNA machines are presented. We further address the possible applications of DNA machines and the future perspectives of molecular DNA devices. These include the application of DNA machines as functional structures for the construction of logic gates and computing, for the programmed organization of metallic nanoparticle structures and the control of plasmonic properties, and for controlling chemical transformations by DNA machines. We further discuss the future applications of DNA machines for intracellular sensing, controlling intracellular metabolic pathways, and the use of the functional nanostructures for drug delivery and medical applications. PMID:24647836

Wang, Fuan; Willner, Bilha; Willner, Itamar

2014-01-01

455

76 FR 57923 - Establishment of Rules and Policies for the Satellite Digital Audio Radio Service in the 2310...  

Federal Register 2010, 2011, 2012, 2013

...FEDERAL COMMUNICATIONS COMMISSION 47 CFR Part 25...Policies for the Satellite Digital Audio Radio Service in the...Frequency Band AGENCY: Federal Communications Commission. ACTION: Final...contained in the Satellite Digital Audio Radio Service...

2011-09-19

456

Virtual Turing Machine 2  

NSDL National Science Digital Library

A Turing machine is theoretical computer consisting of a finite set of internal states, a finite alphabet that includes a blank symbol, and a finite set of instructions. It has a physical head and a physical infinitely long tape, which is divided into cells. The cell values consist of the alphabet. The tape has a finite number of non-blank cells. The head can read and write to the cells and move the tape one cell to the left and one cell to the right. The Virtual Turing Machine lets you input tape values and an instruction set to see the output of a turing machine.

Ming, Paul R.

457

Audio-based queries for video retrieval over Java enabled mobile devices  

NASA Astrophysics Data System (ADS)

In this paper we propose a generic framework for efficient retrieval of audiovisual media based on its audio content. This framework is implemented in a client-server architecture where the client application is developed in Java to be platform independent whereas the server application is implemented for the PC platform. The client application adapts to the characteristics of the mobile device where it runs such as screen size and commands. The entire framework is designed to take advantage of the high-level segmentation and classification of audio content to improve speed and accuracy of audio-based media retrieval. Therefore, the primary objective of this framework is to provide an adaptive basis for performing efficient video retrieval operations based on the audio content and types (i.e. speech, music, fuzzy and silence). Experimental results approve that such an audio based video retrieval scheme can be used from mobile devices to search and retrieve video clips efficiently over wireless networks.

Ahmad, Iftikhar; Cheikh, Faouzi Alaya; Kiranyaz, Serkan; Gabbouj, Moncef

2006-02-01

458

Theory of Molecular Machines. I. Channel Capacity of Molecular Machines  

E-print Network

Theory of Molecular Machines. I. Channel Capacity of Molecular Machines running title: Channel Capacity of Molecular Machines Thomas D. Schneider version = 5.76 of ccmm.tex 2004 Feb 3 Version 5.67 was submitted 1990 December 5 Schneider, T. D. (1991). Theory of molecular machines. I. Channel capacity

Schneider, Thomas D.

459

Speech Recognition in Machines 785 Speech Recognition in Machines  

E-print Network

Speech Recognition in Machines 785 Speech Recognition in Machines Over the past several decades (speech recognition systems) human speech. We concentrate on speech recognition systems in this section. Speech recognition by machine refers to the capability of a machine to convert human speech to a textual

Liebling, Michael

460

16. Interior, Machine Shop, Roundhouse Machine Shop Extension, Southern Pacific ...  

Library of Congress Historic Buildings Survey, Historic Engineering Record, Historic Landscapes Survey

16. Interior, Machine Shop, Roundhouse Machine Shop Extension, Southern Pacific Railroad Carlin Shops, view to south (90mm lens). Note the large segmental-arched doorway to move locomotives in and out of Machine Shop. - Southern Pacific Railroad, Carlin Shops, Roundhouse Machine Shop Extension, Foot of Sixth Street, Carlin, Elko County, NV

461

Universal Developer Platform for MachinePlatform for Machine  

E-print Network

1 Universal Developer Platform for MachinePlatform for Machine Control Applications Hansjoerg will need some things: · Senses · Means June 24-26, 2008 ETH Zurich #12;2 And for Machine Control? June 24 ETH Zurich Conclusion: big effort for things that have little to do with Machine Control Multi

462

Curious and Creative Machines  

Microsoft Academic Search

st -grade elementary school children. In the school's gymnasium hall, a few dozen 6-year-olds gathered enthusiastically around a few shiny machines with plenty of sensors and actuators, demonstrating patterns of locomotion. \\

Hod Lipson

2006-01-01

463

Design of Vitrification Machine  

E-print Network

and unreliable process. Moreover, there is a lack of standardization in the methods for preparing cells and biological material for the vitrification process. The purpose of the Vitrification Machine is to greatly simplify the process by making it faster, more...

McDonald, Todd William

2009-11-30

464

Protein Thin Film Machines  

E-print Network

We report the first example of microcantilever beams that are reversibly driven by protein thin film machines fuelled by cycling the salt concentration of the surrounding solution. We also show that upon the same salinity ...

Federici, Stefania

465

Machine For Grinding Fog  

E-print Network

Machine For Grinding Fog, an exhibition of stop motion animated films, revolves around three primary concerns: relationships, psychological or subjective reality, and relativity. Rooted within concerns of faith, personal ethics, and a re...

Ward, Nicholas Adam

2010-04-30

466

Positive Linear Function Machine  

NSDL National Science Digital Library

Students investigate linear functions with positive slopes by trying to guess the slope and intercept from inputs and outputs. Positive Linear Function Machine is one of the Interactivate assessment explorers.

467

Designing Sociable Machines  

Microsoft Academic Search

Sociable machines are a blend of art, science, and engineering. We highlight how insights from these disciplines have helped us to address a few key design issues for building expressive humanoid robots that interact with people in a social manner.

Cynthia Breazeal

468

Real Time Streaming of Multi-channel Audio Data over Internet 5120 (I -3) Aoxiang Xu and Jeremy Cooperstock,  

E-print Network

Real Time Streaming of Multi-channel Audio Data over Internet 5120 (I -3) Aoxiang Xu and Jeremy AND COOPERSTOCK PREPRINT 5120 REAL TIME STREAMING OF Real-Time Streaming of Multichannel Audio Data over Internet, in real time, to an audience at New York University, over the Internet. While Internet streaming audio

Cooperstock, Jeremy R.

469

Fill out this application, have it signed, and mail or fax. Or call Audio-Reader toll-free,  

E-print Network

, and technology to share the gift of sight through sound. For your convenience, a donation form is enclosed. You for Audio-Reader Service The Kansas Audio-Reader Network is a reading and information service for blind of charge to anyone in our listening area who is unable to read normal printed material. Support Audio

470

Human-machine interactions  

SciTech Connect

Digital technology utilizing a cognitive model based on human naturalistic decision-making processes, including pattern recognition and episodic memory, can reduce the dependency of human-machine interactions on the abilities of a human user and can enable a machine to more closely emulate human-like responses. Such a cognitive model can enable digital technology to use cognitive capacities fundamental to human-like communication and cooperation to interact with humans.

Forsythe, J. Chris (Sandia Park, NM); Xavier, Patrick G. (Albuquerque, NM); Abbott, Robert G. (Albuquerque, NM); Brannon, Nathan G. (Albuquerque, NM); Bernard, Michael L. (Tijeras, NM); Speed, Ann E. (Albuquerque, NM)

2009-04-28

471

The Lisp Machine manual  

Microsoft Academic Search

This 471-page, softcover manual describes the programming language and software environment of the Lisp Machine developed at M.I.T.'s Artificial Intelligence Laboratory over the past 8 years. The Lisp Machine is the result of a successful experiment in computer science: a distributed computing system consisting of a network of powerful 32-bit personal computers, implemented with custom hardware and software as complete,

Daniel Weinreb; David Moon

1981-01-01

472

Working with simple machines  

NASA Astrophysics Data System (ADS)

A set of examples is provided that illustrate the use of work as applied to simple machines. The ramp, pulley, lever and hydraulic press are common experiences in the life of a student, and their theoretical analysis therefore makes the abstract concept of work more real. The mechanical advantage of each of these systems is also discussed so that students can evaluate their usefulness as machines.

Norbury, John W.

2006-11-01

473

Design a Flying Machine  

NSDL National Science Digital Library

The purpose of this activity is for the students to draw a design for their own flying machine. They will apply their knowledge of aircraft design and the forces acting on them. The students will start with a brainstorming activity where they come up with creative uses for every day objects. They will then use their creativity and knowledge of airplanes to design their own flying machine.

Integrated Teaching And Learning Program

474

Metalworking and machining fluids  

Microsoft Academic Search

Improved boron-based metal working and machining fluids. Boric acid and boron-based additives that, when mixed with certain carrier fluids, such as water, cellulose and\\/or cellulose derivatives, polyhydric alcohol, polyalkylene glycol, polyvinyl alcohol, starch, dextrin, in solid and\\/or solvated forms result in improved metalworking and machining of metallic work pieces. Fluids manufactured with boric acid or boron-based additives effectively reduce friction,

Ali Erdemir; Frank Sykora; Mark Dorbeck

2010-01-01

475

Weka Machine Learning Project  

NSDL National Science Digital Library

If you are inspired to try the process, the Weka Machine Learning Project from Waikato University offers open source software that can be used for data mining tasks. Visitors can also find the projects 1993 to 2006 publications, many of which are available for free in as PDFs. The "related" section offers a number of links to further information on topics such as artificial intelligence and machine learning.

2008-01-10

476

Metalworking and machining fluids  

DOEpatents

Improved boron-based metal working and machining fluids. Boric acid and boron-based additives that, when mixed with certain carrier fluids, such as water, cellulose and/or cellulose derivatives, polyhydric alcohol, polyalkylene glycol, polyvinyl alcohol, starch, dextrin, in solid and/or solvated forms result in improved metalworking and machining of metallic work pieces. Fluids manufactured with boric acid or boron-based additives effectively reduce friction, prevent galling and severe wear problems on cutting and forming tools.

Erdemir, Ali (Naperville, IL); Sykora, Frank (Caledon, ON, CA); Dorbeck, Mark (Brighton, MI)

2010-10-12

477

Driven Wave Machine Model  

NSDL National Science Digital Library

The Driven Wave Machine model simulates a wave machine driven by a sinusoidal oscillator. The machine consists of horizontal bars welded to a torsion rod that is perpendicular to the bars. The simulation allows the user to change the number of bars, the drive frequency, and the boundary conditions. The simulation also allows the user to change the lengths of the bars, thereby simulating the effect of a wave propagating in a non-uniform medium. The Driven Wave Machine model is a supplemental simulation for the article "Standing Waves in a Nonuniform Medium" by Paul Gluck in The Physics Teacher 49(2), 76-77 (2011) and has been approved by the authors and The Physics Teacher editor. The Driven Wave Machine model was created using the Easy Java Simulations (EJS) modeling tool. It is distributed as a ready-to-run (compiled) Java archive. Double clicking the ejs_mech_osc_chains_DrivenWaveMachine.jar file will run the program if Java is installed.

Christian, Wolfgang

2010-11-09

478

Wave Machine Model  

NSDL National Science Digital Library

This model simulates the wave generating machine created by John Shive at Bell Laboratories and made famous by the Similarities in Wave Behavior film. The machine consists of horizontal cross-bars welded to a central wire spine that is perpendicular to the bars. The spine was constructed so that it can freely twist, allowing the cross-bars to produce wave-like patterns. The simulation allows various pulse shapes to be sent down the machine by selecting a function for the twist of the first rod or by dragging the first rod. The far end of the wave machine can be free or fixed, which changes the nature of the reflected wave. Change the lengths of the bars to simulate the effect of a wave propagating in a non-uniform medium. The original film that introduced Dr. Shive's wave generating machine can be viewed at no cost: Film: Similarities of Wave Behavior (AT&T Archives and History Collection). The Wave Machine model was created using the Easy Java Simulations (EJS) modeling tool. It is distributed as a ready-to-run (compiled) Java archive.

Christian, Wolfgang

2014-06-13

479

Audio-visual speech perception: a developmental ERP investigation  

PubMed Central

Being able to see a talking face confers a considerable advantage for speech perception in adulthood. However, behavioural data currently suggest that children fail to make full use of these available visual speech cues until age 8 or 9. This is particularly surprising given the potential utility of multiple informational cues during language learning. We therefore explored this at the neural level. The event-related potential (ERP) technique has been used to assess the mechanisms of audio-visual speech perception in adults, with visual cues reliably modulating auditory ERP responses to speech. Previous work has shown congruence-dependent shortening of auditory N1/P2 latency and congruence-independent attenuation of amplitude in the presence of auditory and visual speech signals, compared to auditory alone. The aim of this study was to chart the development of these well-established modulatory effects over mid-to-late childhood. Experiment 1 employed an adult sample to validate a child-friendly stimulus set and paradigm by replicating previously observed effects of N1/P2 amplitude and latency modulation by visual speech cues; it also revealed greater attenuation of component amplitude given incongruent audio-visual stimuli, pointing to a new interpretation of the amplitude modulation effect. Experiment 2 used the same paradigm to map cross-sectional developmental change in these ERP responses between 6 and 11 years of age. The effect of amplitude modulation by visual cues emerged over development, while the effect of latency modulation was stable over the child sample. These data suggest that auditory ERP modulation by visual speech represents separable underlying cognitive processes, some of which show earlier maturation than others over the course of development. PMID:24176002

Knowland, Victoria CP; Mercure, Evelyne; Karmiloff-Smith, Annette; Dick, Fred; Thomas, Michael SC

2014-01-01

480

Audio-visual speech perception: a developmental ERP investigation.  

PubMed

Being able to see a talking face confers a considerable advantage for speech perception in adulthood. However, behavioural data currently suggest that children fail to make full use of these available visual speech cues until age 8 or 9. This is particularly surprising given the potential utility of multiple informational cues during language learning. We therefore explored this at the neural level. The event-related potential (ERP) technique has been used to assess the mechanisms of audio-visual speech perception in adults, with visual cues reliably modulating auditory ERP responses to speech. Previous work has shown congruence-dependent shortening of auditory N1/P2 latency and congruence-independent attenuation of amplitude in the presence of auditory and visual speech signals, compared to auditory alone. The aim of this study was to chart the development of these well-established modulatory effects over mid-to-late childhood. Experiment 1 employed an adult sample to validate a child-friendly stimulus set and paradigm by replicating previously observed effects of N1/P2 amplitude and latency modulation by visual speech cues; it also revealed greater attenuation of component amplitude given incongruent audio-visual stimuli, pointing to a new interpretation of the amplitude modulation effect. Experiment 2 used the same paradigm to map cross-sectional developmental change in these ERP responses between 6 and 11 years of age. The effect of amplitude modulation by visual cues emerged over development, while the effect of latency modulation was stable over the child sample. These data suggest that auditory ERP modulation by visual speech represents separable underlying cognitive processes, some of which show earlier maturation than others over the course of development. PMID:24176002

Knowland, Victoria C P; Mercure, Evelyne; Karmiloff-Smith, Annette; Dick, Fred; Thomas, Michael S C

2014-01-01

481

The Knife Machine. Module 15.  

ERIC Educational Resources Information Center

This module on the knife machine, one in a series dealing with industrial sewing machines, their attachments, and operation, covers one topic: performing special operations on the knife machine (a single needle or multi-needle machine which sews and cuts at the same time). These components are provided: an introduction, directions, an objective,…

South Carolina State Dept. of Education, Columbia. Office of Vocational Education.

482

Applications of the Connection Machine  

Microsoft Academic Search

The Connection Machine development effort was initiated in the belief that parallel processing and artificial intelligence could together accelerate the rate of progress toward truly intelligent machines. This progress is the result of the ease with which the machine can be programmed and the dramatic increase in compute power that the machine can bring to bear. The authors have been

David L. Waltz

1987-01-01

483

Harvesting Machine Census 1999 & 2001  

E-print Network

1 Harvesting Machine Census 1999 & 2001 231 Corstorphine Road Edinburgh EH12 7AT www.forestry.gov.uk FCTN001 SUMMARY This Technical Note contains information on the 1999 and 2001 harvesting machine machines, converted forwarders, etc., account for the remaining machines. In the 2001 census, 65

484

MACHINE MONITORING USING PROBABILITY THRESHOLDS  

E-print Network

MACHINE MONITORING USING PROBABILITY THRESHOLDS AND SYSTEM OPERATING CHARACTERISTICS Stephen M a discrete-part production machine, with the objective of effectively determining when to shut the machine analysis: a) There is an underlying time interval that characterizes the operation of the machine, most

Pollock, Stephen

485

Machine Learning Thomas G. Dietterich  

E-print Network

Machine Learning Thomas G. Dietterich Department of Computer Science Oregon State University Corvallis, OR 97331 1 Introduction Machine Learning is the study of methods for programming computers a computer system. A machine learning system can study recorded data and subsequent machine failures

486

Stochastic Optimization for Machine Learning  

E-print Network

Stochastic Optimization for Machine Learning ICML 2010, Haifa, Israel Tutorial by Nati Srebro Descent: formulation, analysis and use in machine learning · Learn about extensions and generalizations, and their Machine Learning counterparts Main Goal: Machine Learning is Stochastic Optimization #12;Outline

Powell, Warren B.

487

Interactive Machine Learning Mustafa Bilgic  

E-print Network

Interactive Machine Learning Mustafa Bilgic "Who We Are" Seminars IIT 9/24/2013 #12;Bio Bachelor Komurlu 2 #12;Research Interests Machine learning Interactive machine learning Statistical relational Diagnosis Recommender Systems Search Engines 4 #12;How Does a Machine Learn? There are many approaches

Heller, Barbara

488

Non-traditional machining techniques  

SciTech Connect

During the course of machining targets for various experiments it sometimes becomes necessary to adapt fixtures or machines, which are designed for one function, to another function. When adapting a machine or fixture is not adequate, it may be necessary to acquire a machine specifically designed to produce the component required. In addition to the above scenarios, the features of a component may dictate that multi-step machining processes are necessary to produce the component. This paper discusses the machining of four components where adaptation, specialized machine design, or multi-step processes were necessary to produce the components.

Day, Robert D [Los Alamos National Laboratory; Fierro, Frank [Los Alamos National Laboratory; Garcia, Felix P [Los Alamos National Laboratory; Hatch, Douglass J [Los Alamos National Laboratory; Randolph, Randall B [Los Alamos National Laboratory; Reardon, Patrick T [Los Alamos National Laboratory; Rivera, Gerald [Los Alamos National Laboratory

2008-01-01

489

Audio representations of multi-channel EEG: a new tool for diagnosis of brain disorders  

PubMed Central

Objective: The objective of this paper is to develop audio representations of electroencephalographic (EEG) multichannel signals, useful for medical practitioners and neuroscientists. The fundamental question explored in this paper is whether clinically valuable information contained in the EEG, not available from the conventional graphical EEG representation, might become apparent through audio representations. Methods and Materials: Music scores are generated from sparse time-frequency maps of EEG signals. Specifically, EEG signals of patients with mild cognitive impairment (MCI) and (healthy) control subjects are considered. Statistical differences in the audio representations of MCI patients and control subjects are assessed through mathematical complexity indexes as well as a perception test; in the latter, participants try to distinguish between audio sequences from MCI patients and control subjects. Results: Several characteristics of the audio sequences, including sample entropy, number of notes, and synchrony, are significantly different in MCI patients and control subjects (Mann-Whitney p < 0.01). Moreover, the participants of the perception test were able to accurately classify the audio sequences (89% correctly classified). Conclusions: The proposed audio representation of multi-channel EEG signals helps to understand the complex structure of EEG. Promising results were obtained on a clinical EEG data set. PMID:23383399

Vialatte, Francois B; Dauwels, Justin; Musha, Toshimitsu; Cichocki, Andrzej

2012-01-01

490

TECHNICAL NOTE: Portable audio electronics for impedance-based measurements in microfluidics  

NASA Astrophysics Data System (ADS)

We demonstrate the use of audio electronics-based signals to perform on-chip electrochemical measurements. Cell phones and portable music players are examples of consumer electronics that are easily operated and are ubiquitous worldwide. Audio output (play) and input (record) signals are voltage based and contain frequency and amplitude information. A cell phone, laptop soundcard and two compact audio players are compared with respect to frequency response; the laptop soundcard provides the most uniform frequency response, while the cell phone performance is found to be insufficient. The audio signals in the common portable music players and laptop soundcard operate in the range of 20 Hz to 20 kHz and are found to be applicable, as voltage input and output signals, to impedance-based electrochemical measurements in microfluidic systems. Validated impedance-based measurements of concentration (0.1-50 mM), flow rate (2-120 µL min-1) and particle detection (32 µm diameter) are demonstrated. The prevailing, lossless, wave audio file format is found to be suitable for data transmission to and from external sources, such as a centralized lab, and the cost of all hardware (in addition to audio devices) is ~10 USD. The utility demonstrated here, in combination with the ubiquitous nature of portable audio electronics, presents new opportunities for impedance-based measurements in portable microfluidic systems.

Wood, Paul; Sinton, David

2010-08-01

491

Audio-visual onset differences are used to determine syllable identity for ambiguous audio-visual stimulus pairs  

PubMed Central

Content and temporal cues have been shown to interact during audio-visual (AV) speech identification. Typically, the most reliable unimodal cue is used more strongly to identify specific speech features; however, visual cues are only used if the AV stimuli are presented within a certain temporal window of integration (TWI). This suggests that temporal cues denote whether unimodal stimuli belong together, that is, whether they should be integrated. It is not known whether temporal cues also provide information about the identity of a syllable. Since spoken syllables have naturally varying AV onset asynchronies, we hypothesize that for suboptimal AV cues presented within the TWI, information about the natural AV onset differences can aid in speech identification. To test this, we presented low-intensity auditory syllables concurrently with visual speech signals, and varied the stimulus onset asynchronies (SOA) of the AV pair, while participants were instructed to identify the auditory syllables. We revealed that specific speech features (e.g., voicing) were identified by relying primarily on one modality (e.g., auditory). Additionally, we showed a wide window in which visual information influenced auditory perception, that seemed even wider for congruent stimulus pairs. Finally, we found a specific response pattern across the SOA range for syllables that were not reliably identified by the unimodal cues, which we explained as the result of the use of natural onset differences between AV speech signals. This indicates that temporal cues not only provide information about the temporal integration of AV stimuli, but additionally convey information about the identity of AV pairs. These results provide a detailed behavioral basis for further neuro-imaging and stimulation studies to unravel the neurofunctional mechanisms of the audio-visual-temporal interplay within speech perception. PMID:23805110

ten Oever, Sanne; Sack, Alexander T.; Wheat, Katherine L.; Bien, Nina; van Atteveldt, Nienke

2013-01-01

492

Enhanced virtual machining for sculptured surfaces by integrating machine tool error models into NC machining simulation  

Microsoft Academic Search

Sculptured surface machining is a time-consuming and costly process. It requires simultaneously controlled motion of the machine axes. However, positioning inaccuracies or errors exist in machine tools. The combination of error motions of the machine axes will result in a complicated pattern of part geometry errors. In order to quantitatively predict these part geometry errors, a new application framework ‘enhanced

Yizhen Lin; Yin-Lin Shen

2004-01-01

493

Circular tests for HSM machine tools: Bore machining application  

E-print Network

Today's High-Speed Machining (HSM) machine tool combines productivity and part quality. The difficulty inherent in HSM operations lies in understanding the impact of machine tool behaviour on machining time and part quality. Analysis of some of the relevant ISO standards (230-1998, 10791-1998) and a complementary protocol for better understanding HSM technology are presented in the first part of this paper. These ISO standards are devoted to the procedures implemented in order to study the behavior of machine tool. As these procedures do not integrate HSM technology, the need for HSM machine tool tests becomes critical to improving the trade-off between machining time and part quality. A new protocol for analysing the HSM technology impact during circular interpolation is presented in the second part of the paper. This protocol which allows evaluating kinematic machine tool behaviour during circular interpolation was designed from tests without machining. These tests are discussed and their results analysed i...

Tapie, Laurent; Anselmetti, Bernard

2009-01-01

494

Method for Reading Sensors and Controlling Actuators Using Audio Interfaces of Mobile Devices  

PubMed Central

This article presents a novel closed loop control architecture based on audio channels of several types of computing devices, such as mobile phones and tablet computers, but not restricted to them. The communication is based on an audio interface that relies on the exchange of audio tones, allowing sensors to be read and actuators to be controlled. As an application example, the presented technique is used to build a low cost mobile robot, but the system can also be used in a variety of mechatronics applications and sensor networks, where smartphones are the basic building blocks. PMID:22438726

Aroca, Rafael V.; Burlamaqui, Aquiles F.; Goncalves, Luiz M. G.

2012-01-01

495

The Bearingless Electrical Machine  

NASA Technical Reports Server (NTRS)

Electromagnetic bearings allow the suspension of solids. For rotary applications, the most important physical effect is the force of a magnetic circuit to a high permeable armature, called the MAXWELL force. Contrary to the commonly used MAXWELL bearings, the bearingless electrical machine will take advantage of the reaction force of a conductor carrying a current in a magnetic field. This kind of force, called Lorentz force, generates the torque in direct current, asynchronous and synchronous machines. The magnetic field, which already exists in electrical machines and helps to build up the torque, can also be used for the suspension of the rotor. Besides the normal winding of the stator, a special winding was added, which generates forces for levitation. So a radial bearing, which is integrated directly in the active part of the machine, and the motor use the laminated core simultaneously. The winding was constructed for the levitating forces in a special way so that commercially available standard ac inverters for drives can be used. Besides wholly magnetic suspended machines, there is a wide range of applications for normal drives with ball bearings. Resonances of the rotor, especially critical speeds, can be damped actively.

Bichsel, J.

1992-01-01

496

Swinging Atwood's Machine Model  

NSDL National Science Digital Library

The Swinging Atwood's Machine Model is an extension of the traditional Atwood's machine where one hanging mass is allowed to swing like a pendulum. One assumes that all motions for the swinging mass are possible including positions above the second peg which requires that the string remains rigid at all times. Unlike the traditional Atwood's machine, orbits or cycles are possible. In other words, for certain conditions, the motion of the Atwood's machine can continue indefinitely. In changing the ratio of the two masses and the initial angle of displacement, the trajectory of the swinging mass changes. The object of this simulation is to model that system and allow the user a large amount of control over initial conditions so that they may fully explore the system. The Swinging Atwood's Machine Model was created using the Easy Java Simulations (EJS) modeling tool. It is distributed as a ready-to-run (compiled) Java archive. Double clicking the ejs_comp_phys_swinging_atwoods.jar file will run the program if Java is installed.

Ruckle, Leah

2011-06-15

497

Effects of augmentative visual training on audio-motor mapping  

E-print Network

surface electromyography (sEMG) to control a human�machine interface (HMI) for which the output was vowel history: Available online 12 February 2014 Keywords: Auditory�motor Visual feedback Surface electromyography a b s t r a c t The purpose of this study was to determine the effect of augmentative visual

Stepp, Cara E.

498

Monitoring frog communities: An application of machine learning  

SciTech Connect

Automatic recognition of animal vocalizations would be a valuable tool for a variety of biological research and environmental monitoring applications. We report the development of a software system which can recognize the vocalizations of 22 species of frogs which occur in an area of northern Australia. This software system will be used in unattended operation to monitor the effect on frog populations of the introduced Cane Toad. The system is based around classification of local peaks in the spectrogram of the audio signal using Quinlan`s machine learning system, C4.5. Unreliable identifications of peaks are aggregated together using a hierarchical structure of segments based on the typical temporal vocalization species` patterns. This produces robust system performance.

Taylor, A.; Watson, G.; Grigg, G.; McCallum, H.

1996-12-31

499

The Connection Machine  

SciTech Connect

This book describes what history may judge to be the second stage in the evolution of digital computers. Up to now, all computers have had basically the same architecture: one or a few large memory banks. Challenging that conception, the Connection Machine links together thousands or millions of extremely small processors and memories. From each moment to the next, the programs for this singularly versatile machine can refigure its architecture to suit the natural requirements of problems to be solved. The resulting configurations have already been applied to diverse data-structures required for processes involved with artificial intelligence, cellular physics, database inference, and number crunching computation.

Hillis, W.D.

1985-01-01

500

Precision Robotic Assembly Machine  

ScienceCinema

The world's largest laser system is the National Ignition Facility (NIF), located at Lawrence Livermore National Laboratory. NIF's 192 laser beams are amplified to extremely high energy, and then focused onto a tiny target about the size of a BB, containing frozen hydrogen gas. The target must be perfectly machined to incredibly demanding specifications. The Laboratory's scientists and engineers have developed a device called the "Precision Robotic Assembly Machine" for this purpose. Its unique design won a prestigious R&D-100 award from R&D Magazine.

None

2010-09-01